Preface to the Third Edition


The principal change from the second edition is the addition of Gröbner bases to this
edition. The basic theory is introduced in a new Section 9.6. Applications to solving 
systems of polynomial equations (elimination theory) appear at the end of this section, 
rounding it out as a self-contained foundation in the topic. Additional applications and 
examples are then woven into the treatment of affine algebraic sets and k-algebra homo- 
morphisms in Chapter 15. Although the theory in the latter chapter remains independent 
of Gröbner bases, the new applications, examples and computational techniques
significantly enhance the development, and we recommend that Section 9.6 be read either
as a segue to or in parallel with Chapter 15. A wealth of exercises involving Gröbner
bases, both computational and theoretical in nature, have been added in Section 9.6
and Chapter 15. Preliminary exercises on Gröbner bases can (and should, as an aid to
understanding the algorithms) be done by hand, but more extensive computations, and
in particular most of the use of Gröbner bases in the exercises in Chapter 15, will likely
require computer-assisted computation.

Other changes include a streamlining of the classification of simple groups of order
168 (Section 6.2), with the addition of a uniqueness proof via the projective plane of 
order 2. Some other proofs or portions of the text have been revised slightly. A number 
of new exercises have been added throughout the book, primarily at the ends of sections 
in order to preserve as much as possible the numbering schemes of earlier editions. 
In particular, exercises have been added on free modules over noncommutative rings 
(10.3), on Krull dimension (15.3), and on flat modules (10.5 and 17.1). 

As with previous editions, the text contains substantially more than can normally 
be covered in a one year course. A basic introductory (one year) course should probably 
include Part I up through Section 5.3, Part II through Section 9.5, Sections 10.1, 10.2,
10.3, 11.1, 11.2 and Part IV. Chapter 12 should also be covered, either before or after
Part IV. Additional topics from Chapters 5, 6, 9, 10 and 11 may be interspersed in such
a course, or covered at the end as time permits. 

Sections 10.4 and 10.5 are at a slightly higher level of difficulty than the initial 
sections of Chapter 10, and can be deferred on a first reading for those following the text 
sequentially. The latter section on properties of exact sequences, although quite long, 
maintains coherence through a parallel treatment of three basic functors in respective 
subsections. 

Beyond the core material, the third edition provides significant flexibility for students
and instructors wishing to pursue a number of important areas of modern algebra,
either in the form of independent study or courses. For example, well integrated
one-semester courses for students with some prior algebra background might include the
following: Section 9.6 and Chapters 15 and 16; or Chapters 10 and 17; or Chapters 5, 
6 and Part VI. Each of these would also provide a solid background for a follow-up 
course delving more deeply into one of many possible areas: algebraic number theory, 
algebraic topology, algebraic geometry, representation theory, Lie groups, etc.

The choice of new material and the style for developing and integrating it into the 
text are in consonance with a basic theme in the book: the power and beauty that accrues 
from a rich interplay between different areas of mathematics. The emphasis throughout 
has been to motivate the introduction and development of important algebraic concepts 
using as many examples as possible. We have not attempted to be encyclopedic, but 
have tried to touch on many of the central themes in elementary algebra in a manner 
suggesting the very natural development of these ideas. 

A number of important ideas and results appear in the exercises. This is not because 
they are not significant, rather because they did not fit easily into the flow of the text 
but were too important to leave out entirely. Sequences of exercises on one topic 
are prefaced with some remarks and are structured so that they may be read without 
actually doing the exercises. In some instances, new material is introduced first in 
the exercises — often a few sections before it appears in the text — so that students may 
obtain an easier introduction to it by doing these exercises (e.g., Lagrange’s Theorem 
appears in the exercises in Section 1.7 and in the text in Section 3.2). All the exercises
are within the scope of the text and hints are given [in brackets] where we felt they were
needed. Exercises we felt might be less straightforward are usually phrased so as to 
provide the answer to the exercise; as well many exercises have been broken down into 
a sequence of more routine exercises in order to make them more accessible. 

We have also purposely minimized the functorial language in the text in order to 
keep the presentation as elementary as possible. We have refrained from providing 
specific references for additional reading when there are many fine choices readily 
available. Also, while we have endeavored to include as many fundamental topics as 
possible, we apologize if for reasons of space or personal taste we have neglected any 
of the reader’s particular favorites. 

We are deeply grateful to and would like here to thank the many students and
colleagues around the world who, over more than 15 years, have offered valuable 
comments, insights and encouragement — their continuing support and interest have 
motivated our writing of this third edition. 


David Dummit 
Richard Foote 
June, 2003 
Preliminaries


Some results and notation that are used throughout the text are collected in this chapter 
for convenience. Students may wish to review this chapter quickly at first and then read 
each section more carefully again as the concepts appear in the course of the text.

0.1 BASICS 

The basics of set theory: sets, $\cap$, $\cup$, $\in$, etc. should be familiar to the reader. Our
notation for subsets of a given set A will be

$$B = \{a \in A \mid \ldots \text{ (conditions on } a\text{) } \ldots\}.$$

The order or cardinality of a set A will be denoted by $|A|$. If A is a finite set the order
of A is simply the number of elements of A.

It is important to understand how to test whether a particular $x \in A$ lies in a subset
B of A (cf. Exercises 1-4). The Cartesian product of two sets A and B is the collection
$A \times B = \{(a, b) \mid a \in A,\ b \in B\}$ of ordered pairs of elements from A and B.

We shall use the following notation for some common sets of numbers: 

(1) $\mathbb{Z} = \{0, \pm 1, \pm 2, \pm 3, \ldots\}$ denotes the integers (the Z is for the German word for
numbers: “Zahlen”).

(2) $\mathbb{Q} = \{a/b \mid a, b \in \mathbb{Z},\ b \neq 0\}$ denotes the rational numbers (or rationals).

(3) $\mathbb{R} = \{\text{all decimal expansions } \pm d_1 d_2 \ldots d_n . a_1 a_2 a_3 \ldots\}$ denotes the real numbers
(or reals).

(4) $\mathbb{C} = \{a + bi \mid a, b \in \mathbb{R},\ i^2 = -1\}$ denotes the complex numbers.

(5) $\mathbb{Z}^+$, $\mathbb{Q}^+$ and $\mathbb{R}^+$ will denote the positive (nonzero) elements in $\mathbb{Z}$, $\mathbb{Q}$ and $\mathbb{R}$, respectively.

We shall use the notation $f : A \to B$ or $A \xrightarrow{f} B$ to denote a function f from A
to B and the value of f at a is denoted f(a) (i.e., we shall apply all our functions on
the left). We use the words function and map interchangeably. The set A is called the
domain of f and B is called the codomain of f. The notation $f : a \mapsto b$ or $a \mapsto b$ if f
is understood indicates that f(a) = b, i.e., the function is being specified on elements.

If the function f is not specified on elements it is important in general to check
that f is well defined, i.e., is unambiguously determined. For example, if the set A
is the union of two subsets $A_1$ and $A_2$ then one can try to specify a function from A
to the set {0, 1} by declaring that f is to map everything in $A_1$ to 0 and is to map
everything in $A_2$ to 1. This unambiguously defines f unless $A_1$ and $A_2$ have elements
in common (in which case it is not clear whether these elements should map to 0 or to
1). Checking that this f is well defined therefore amounts to checking that $A_1$ and $A_2$
have no intersection.

The set 

$$f(A) = \{b \in B \mid b = f(a), \text{ for some } a \in A\}$$

is a subset of B, called the range or image of f (or the image of A under f). For each
subset C of B the set

$$f^{-1}(C) = \{a \in A \mid f(a) \in C\}$$

consisting of the elements of A mapping into C under f is called the preimage or inverse
image of C under f. For each $b \in B$, the preimage of $\{b\}$ under f is called the fiber of
f over b. Note that $f^{-1}$ is not in general a function and that the fibers of f generally
contain many elements since there may be many elements of A mapping to the element
b.

If $f : A \to B$ and $g : B \to C$, then the composite map $g \circ f : A \to C$ is defined
by

$$(g \circ f)(a) = g(f(a)).$$

Let $f : A \to B$.

(1) f is injective or is an injection if whenever $a_1 \neq a_2$, then $f(a_1) \neq f(a_2)$.

(2) f is surjective or is a surjection if for all $b \in B$ there is some $a \in A$ such that
f(a) = b, i.e., the image of f is all of B. Note that since a function always maps
onto its range (by definition) it is necessary to specify the codomain B in order for
the question of surjectivity to be meaningful.

(3) f is bijective or is a bijection if it is both injective and surjective. If such a bijection
f exists from A to B, we say A and B are in bijective correspondence.

(4) f has a left inverse if there is a function $g : B \to A$ such that $g \circ f : A \to A$ is
the identity map on A, i.e., $(g \circ f)(a) = a$, for all $a \in A$.

(5) f has a right inverse if there is a function $h : B \to A$ such that $f \circ h : B \to B$ is
the identity map on B.

Proposition 1. Let $f : A \to B$.

(1) The map f is injective if and only if f has a left inverse.

(2) The map f is surjective if and only if f has a right inverse.

(3) The map f is a bijection if and only if there exists $g : B \to A$ such that $f \circ g$
is the identity map on B and $g \circ f$ is the identity map on A.

(4) If A and B are finite sets with the same number of elements (i.e., $|A| = |B|$),
then $f : A \to B$ is bijective if and only if f is injective if and only if f is
surjective.

Proof: Exercise. 
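
For finite sets, the definitions above and parts (1) and (4) of Proposition 1 can be checked by direct computation. The following Python sketch is only an illustration: it models a function as a dict from elements of the domain to their values, and the helper names `is_injective`, `is_surjective` and `left_inverse` are assumptions of this example, not notation from the text.

```python
def is_injective(f):
    """f is a dict sending each element of the domain A to its value in B."""
    return len(set(f.values())) == len(f)


def is_surjective(f, codomain):
    """The codomain must be given explicitly, as remarked in definition (2)."""
    return set(f.values()) == set(codomain)


def left_inverse(f, codomain, default):
    """Build g : B -> A with g(f(a)) = a for all a in A.

    Assumes f is injective. Elements of B outside the image of f are sent
    to `default`, which may be any element of A.
    """
    if not is_injective(f):
        raise ValueError("only an injective map has a left inverse")
    g = {b: default for b in codomain}
    for a, b in f.items():
        g[b] = a
    return g


f = {1: "x", 2: "y"}          # a map from A = {1, 2} to B = {"x", "y", "z"}
B = ["x", "y", "z"]
g = left_inverse(f, B, default=1)
print(is_injective(f), is_surjective(f, B))   # True False
print(all(g[f[a]] == a for a in f))           # True
```

The choice of `default` is arbitrary, which reflects the fact that a left inverse of an injective map that is not surjective is not unique.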

In the situation of part (3) of the proposition above the map g is necessarily unique
and we shall say g is the 2-sided inverse (or simply the inverse) of f.

A permutation of a set A is simply a bijection from A to itself.

If $A \subseteq B$ and $f : B \to C$, we denote the restriction of f to A by $f|_A$. When the
domain we are considering is understood we shall occasionally denote $f|_A$ again simply
as f even though these are formally different functions (their domains are different).

If $A \subseteq B$ and $g : A \to C$ and there is a function $f : B \to C$ such that $f|_A = g$,
we shall say f is an extension of g to B (such a map f need not exist nor be unique).

Let A be a nonempty set. 

(1) A binary relation on a set A is a subset R of $A \times A$ and we write $a \sim b$ if $(a, b) \in R$.

(2) The relation $\sim$ on A is said to be:

(a) reflexive if $a \sim a$, for all $a \in A$,

(b) symmetric if $a \sim b$ implies $b \sim a$ for all $a, b \in A$,

(c) transitive if $a \sim b$ and $b \sim c$ implies $a \sim c$ for all $a, b, c \in A$.

A relation is an equivalence relation if it is reflexive, symmetric and transitive.

(3) If $\sim$ defines an equivalence relation on A, then the equivalence class of $a \in A$ is
defined to be $\{x \in A \mid x \sim a\}$. Elements of the equivalence class of a are said
to be equivalent to a. If C is an equivalence class, any element of C is called a
representative of the class C.

(4) A partition of A is any collection $\{A_i \mid i \in I\}$ of nonempty subsets of A (I some
indexing set) such that

(a) $A = \bigcup_{i \in I} A_i$, and

(b) $A_i \cap A_j = \emptyset$, for all $i, j \in I$ with $i \neq j$,

i.e., A is the disjoint union of the sets in the partition.

The notions of an equivalence relation on A and a partition of A are the same:

Proposition 2. Let A be a nonempty set.

(1) If $\sim$ defines an equivalence relation on A then the set of equivalence classes of
$\sim$ form a partition of A.

(2) If $\{A_i \mid i \in I\}$ is a partition of A then there is an equivalence relation on A
whose equivalence classes are precisely the sets $A_i$, $i \in I$.

Proof: Omitted. 
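
Part (1) of Proposition 2 can be illustrated on a finite set. The sketch below groups the elements of a finite set into equivalence classes by comparing each element with one representative of each class found so far; this is valid only because the relation is assumed to be an equivalence relation. The function name `equivalence_classes` and the sample relation (congruence mod 3 on {0, ..., 9}) are choices made for this example.

```python
def equivalence_classes(elements, related):
    """Partition `elements` into classes of the equivalence relation `related`.

    `related(x, y)` must be reflexive, symmetric and transitive; under that
    assumption it suffices to test x against a single representative.
    """
    classes = []
    for x in elements:
        for cls in classes:
            if related(x, cls[0]):      # cls[0] is a representative of cls
                cls.append(x)
                break
        else:
            classes.append([x])         # x starts a new class
    return classes


classes = equivalence_classes(range(10), lambda a, b: (a - b) % 3 == 0)
print(classes)   # [[0, 3, 6, 9], [1, 4, 7], [2, 5, 8]]
```

The classes are nonempty, pairwise disjoint, and their union is the whole set, which is exactly the definition of a partition.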

Finally, we shall assume the reader is familiar with proofs by induction. 

EXERCISES 

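In Exercises 1 to 4 let A be the set of $2 \times 2$ matrices with real number entries. Recall that
matrix multiplication is defined by

$$\begin{pmatrix} a & b \\ c & d \end{pmatrix} \begin{pmatrix} p & q \\ r & s \end{pmatrix} = \begin{pmatrix} ap + br & aq + bs \\ cp + dr & cq + ds \end{pmatrix}.$$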

Let

$$M = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$$
and let

$$B = \{X \in A \mid MX = XM\}.$$


1. Determine which of the following elements of A lie in B:

(i :)■ o 0- c :)• c ;)• c :)■ c ;)■ 


2. Prove that if $P, Q \in B$, then $P + Q \in B$ (where + denotes the usual sum of two matrices).

3. Prove that if $P, Q \in B$, then $P \cdot Q \in B$ (where $\cdot$ denotes the usual product of two matrices).

4. Find conditions on p, q, r, s which determine precisely when $\begin{pmatrix} p & q \\ r & s \end{pmatrix} \in B$.

5. Determine whether the following functions f are well defined:

(a) $f : \mathbb{Q} \to \mathbb{Z}$ defined by $f(a/b) = a$.

(b) $f : \mathbb{Q} \to \mathbb{Q}$ defined by $f(a/b) = a^2/b^2$.

6. Determine whether the function $f : \mathbb{R}^+ \to \mathbb{Z}$ defined by mapping a real number r to the
first digit to the right of the decimal point in a decimal expansion of r is well defined.

7. Let $f : A \to B$ be a surjective map of sets. Prove that the relation

$$a \sim b \text{ if and only if } f(a) = f(b)$$

is an equivalence relation whose equivalence classes are the fibers of f.


0.2 PROPERTIES OF THE INTEGERS 

The following properties of the integers Z (many familiar from elementary arithmetic) 
will be proved in a more general context in the ring theory of Chapter 8, but it will
be necessary to use them in Part I (of course, none of the ring theory proofs of these
properties will rely on the group theory). 

(1) (Well Ordering of $\mathbb{Z}$) If A is any nonempty subset of $\mathbb{Z}^+$, there is some element
$m \in A$ such that $m \le a$, for all $a \in A$ (m is called a minimal element of A).

(2) If $a, b \in \mathbb{Z}$ with $a \neq 0$, we say a divides b if there is an element $c \in \mathbb{Z}$ such that
b = ac. In this case we write $a \mid b$; if a does not divide b we write $a \nmid b$.

(3) If $a, b \in \mathbb{Z} - \{0\}$, there is a unique positive integer d, called the greatest common
divisor of a and b (or g.c.d. of a and b), satisfying:

(a) $d \mid a$ and $d \mid b$ (so d is a common divisor of a and b), and

(b) if $e \mid a$ and $e \mid b$, then $e \mid d$ (so d is the greatest such divisor).

The g.c.d. of a and b will be denoted by (a, b). If (a, b) = 1, we say that a and b
are relatively prime.

(4) If $a, b \in \mathbb{Z} - \{0\}$, there is a unique positive integer l, called the least common
multiple of a and b (or l.c.m. of a and b), satisfying:

(a) $a \mid l$ and $b \mid l$ (so l is a common multiple of a and b), and

(b) if $a \mid m$ and $b \mid m$, then $l \mid m$ (so l is the least such multiple).

The connection between the greatest common divisor d and the least common
multiple l of two integers a and b is given by dl = ab.

(5) The Division Algorithm: if $a, b \in \mathbb{Z} - \{0\}$, then there exist unique $q, r \in \mathbb{Z}$ such
that

$$a = qb + r \quad \text{and} \quad 0 \le r < |b|,$$

where q is the quotient and r the remainder. This is the usual “long division”
familiar from elementary arithmetic.

(6) The Euclidean Algorithm is an important procedure which produces a greatest
common divisor of two integers a and b by iterating the Division Algorithm: if
$a, b \in \mathbb{Z} - \{0\}$, then we obtain a sequence of quotients and remainders

$$\begin{aligned}
a &= q_0 b + r_0 && (0) \\
b &= q_1 r_0 + r_1 && (1) \\
r_0 &= q_2 r_1 + r_2 && (2) \\
r_1 &= q_3 r_2 + r_3 && (3) \\
&\ \ \vdots \\
r_{n-2} &= q_n r_{n-1} + r_n && (n) \\
r_{n-1} &= q_{n+1} r_n && (n+1)
\end{aligned}$$

where $r_n$ is the last nonzero remainder. Such an $r_n$ exists since $|b| > |r_0| > |r_1| > \cdots > |r_n|$
is a decreasing sequence of strictly positive integers if the remainders
are nonzero and such a sequence cannot continue indefinitely. Then $r_n$ is the g.c.d.
(a, b) of a and b.


Example 

Suppose a = 57970 and b = 10353. Then applying the Euclidean Algorithm we obtain: 

57970 = (5)10353 + 6205
10353 = (1)6205 + 4148 
6205 = (1)4148 + 2057 
4148 = (2)2057 + 34 
2057 = (60)34 + 17 
34 = (2)17 

which shows that (57970, 10353) = 17. 
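
The computation above can be reproduced mechanically. The following is a minimal Python sketch of the Euclidean Algorithm as described in (6), assuming positive inputs so that `divmod` returns a remainder in the range required by the Division Algorithm; the function name `euclid_steps` and the tuple layout of the recorded steps are choices made for this illustration.

```python
def euclid_steps(a, b):
    """Run the Euclidean Algorithm on positive integers a and b.

    Returns (g, steps), where g is the last nonzero remainder and each step
    is a tuple (dividend, quotient, divisor, remainder) recording one
    application of the Division Algorithm.
    """
    steps = []
    while b != 0:
        q, r = divmod(a, b)          # a = q*b + r with 0 <= r < b
        steps.append((a, q, b, r))
        a, b = b, r
    return a, steps


g, steps = euclid_steps(57970, 10353)
for dividend, q, divisor, r in steps:
    print(f"{dividend} = ({q}){divisor} + {r}")
print("gcd =", g)                    # gcd = 17
```

The six printed lines match the six equations of the example, with the final remainder 0 shown explicitly.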


(7) One consequence of the Euclidean Algorithm which we shall use regularly is the
following: if $a, b \in \mathbb{Z} - \{0\}$, then there exist $x, y \in \mathbb{Z}$ such that

$$(a, b) = ax + by,$$

that is, the g.c.d. of a and b is a $\mathbb{Z}$-linear combination of a and b. This follows
by recursively writing the element $r_n$ in the Euclidean Algorithm in terms of the
previous remainders (namely, use equation (n) above to solve for $r_n = r_{n-2} - q_n r_{n-1}$
in terms of the remainders $r_{n-1}$ and $r_{n-2}$, then use equation $(n-1)$ to write $r_n$ in
terms of the remainders $r_{n-2}$ and $r_{n-3}$, etc., eventually writing $r_n$ in terms of a and
b).
Example

Suppose a = 57970 and b = 10353, whose greatest common divisor we computed above to
be 17. From the fifth equation (the next to last equation) in the Euclidean Algorithm applied
to these two integers we solve for their greatest common divisor: 17 = 2057 - (60)34.
The fourth equation then shows that 34 = 4148 - (2)2057, so substituting this expression
for the previous remainder 34 gives the equation 17 = 2057 - (60)[4148 - (2)2057], i.e.,
17 = (121)2057 - (60)4148. Solving the third equation for 2057 and substituting gives
17 = (121)[6205 - (1)4148] - (60)4148 = (121)6205 - (181)4148. Using the second
equation to solve for 4148 and then the first equation to solve for 6205 we finally obtain

17 = (302)57970 - (1691)10353

as can easily be checked directly. Hence the equation ax + by = (a, b) for the greatest
common divisor of a and b in this example has the solution x = 302 and y = -1691. Note
that it is relatively unlikely that this relation would have been found simply by guessing.

The integers x and y in (7) above are not unique. In the example with a = 57970
and b = 10353 we determined one solution to be x = 302 and y = -1691, for
instance, and it is relatively simple to check that x = -307 and y = 1719 also
satisfy 57970x + 10353y = 17. The general solution for x and y is known (cf. the
exercises below and in Chapter 8).
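
The back-substitution in (7) can also be carried out forwards, updating the coefficients of a and b alongside each remainder. The sketch below uses this standard iterative form of the extended Euclidean Algorithm, which is a reorganization of the hand computation rather than a literal transcription of it; the name `extended_gcd` is an assumption of this example, and the inputs are assumed positive.

```python
def extended_gcd(a, b):
    """Return (d, x, y) with d = (a, b) and d = a*x + b*y, for positive a, b.

    Invariant: old_r = a*old_x + b*old_y and r = a*x + b*y at every step.
    """
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


d, x, y = extended_gcd(57970, 10353)
print(d, x, y)                       # 17 302 -1691

# A second solution, obtained by shifting x by b/d and y by a/d:
x2, y2 = x - 10353 // d, y + 57970 // d
print(x2, y2, 57970 * x2 + 10353 * y2)   # -307 1719 17
```

For this pair of integers the iteration returns the same coefficients as the hand computation, and the shifted pair is the second solution quoted in the text.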

(8) An element p of $\mathbb{Z}^+$ is called a prime if p > 1 and the only positive divisors of p are
1 and p (initially, the word prime will refer only to positive integers). An integer
n > 1 which is not prime is called composite. For example, 2, 3, 5, 7, 11, 13, 17, 19, ...
are primes and 4, 6, 8, 9, 10, 12, 14, 15, 16, 18, ... are composite.

An important property of primes (which in fact can be used to define the primes
(cf. Exercise 3)) is the following: if p is a prime and $p \mid ab$, for some $a, b \in \mathbb{Z}$,
then either $p \mid a$ or $p \mid b$.

(9) The Fundamental Theorem of Arithmetic says: if $n \in \mathbb{Z}$, n > 1, then n can
be factored uniquely into the product of primes, i.e., there are distinct primes
$p_1, p_2, \ldots, p_s$ and positive integers $\alpha_1, \alpha_2, \ldots, \alpha_s$ such that

$$n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_s^{\alpha_s}.$$

This factorization is unique in the sense that if $q_1, q_2, \ldots, q_t$ are any distinct primes
and $\beta_1, \beta_2, \ldots, \beta_t$ positive integers such that

$$n = q_1^{\beta_1} q_2^{\beta_2} \cdots q_t^{\beta_t},$$

then s = t and if we arrange the two sets of primes in increasing order, then $q_i = p_i$
and $\alpha_i = \beta_i$, $1 \le i \le s$. For example, $n = 1852423848 = 2^3 \cdot 3^2 \cdot 11^2 \cdot 19^3 \cdot 31$ and this
decomposition into the product of primes is unique.

Suppose the positive integers a and b are expressed as products of prime powers:

$$a = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_s^{\alpha_s}, \qquad b = p_1^{\beta_1} p_2^{\beta_2} \cdots p_s^{\beta_s},$$

where $p_1, p_2, \ldots, p_s$ are distinct and the exponents are $\ge 0$ (we allow the exponents
to be 0 here so that the products are taken over the same set of primes; the exponent
will be 0 if that prime is not actually a divisor). Then the greatest common divisor
of a and b is

$$(a, b) = p_1^{\min(\alpha_1, \beta_1)} p_2^{\min(\alpha_2, \beta_2)} \cdots p_s^{\min(\alpha_s, \beta_s)}$$

(and the least common multiple is obtained by instead taking the maximum of the
$\alpha_i$ and $\beta_i$ instead of the minimum).

Example 

In the example above, a = 57970 and b = 10353 can be factored as $a = 2 \cdot 5 \cdot 11 \cdot 17 \cdot 31$
and $b = 3 \cdot 7 \cdot 17 \cdot 29$, from which we can immediately conclude that their greatest common
divisor is 17. Note, however, that for large integers it is extremely difficult to determine 
their prime factorizations (several common codes in current use are based on this difficulty, 
in fact), so that this is not an effective method to determine greatest common divisors in 
general. The Euclidean Algorithm will produce greatest common divisors quite rapidly 
without the need for the prime factorization of a and b. 
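
For small integers the prime-factorization formula for the g.c.d. is easy to try out. The sketch below factors by trial division, a method chosen here only for simplicity and one that becomes impractical for large inputs, which is the point made in the example. The names `factorize` and `gcd_from_factorizations` are assumptions of this illustration.

```python
from collections import Counter


def factorize(n):
    """Factor n > 1 by trial division; returns a Counter {prime: exponent}."""
    factors = Counter()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] += 1
            n //= p
        p += 1
    if n > 1:                        # whatever remains is a single prime
        factors[n] += 1
    return factors


def gcd_from_factorizations(a, b):
    """Multiply each common prime raised to the minimum of its two exponents."""
    fa, fb = factorize(a), factorize(b)
    g = 1
    for p in fa.keys() & fb.keys():  # primes missing from one side have exponent 0
        g *= p ** min(fa[p], fb[p])
    return g


print(sorted(factorize(57970).items()))   # [(2, 1), (5, 1), (11, 1), (17, 1), (31, 1)]
print(sorted(factorize(10353).items()))   # [(3, 1), (7, 1), (17, 1), (29, 1)]
print(gcd_from_factorizations(57970, 10353))   # 17
```

Replacing `min` by `max` and taking the product over all primes that divide either integer gives the least common multiple in the same way.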

(10) The Euler $\varphi$-function is defined as follows: for $n \in \mathbb{Z}^+$ let $\varphi(n)$ be the number of
positive integers $a \le n$ with a relatively prime to n, i.e., (a, n) = 1. For example,
$\varphi(12) = 4$ since 1, 5, 7 and 11 are the only positive integers less than or equal
to 12 which have no factors in common with 12. Similarly, $\varphi(1) = 1$, $\varphi(2) = 1$,
$\varphi(3) = 2$, $\varphi(4) = 2$, $\varphi(5) = 4$, $\varphi(6) = 2$, etc. For primes p, $\varphi(p) = p - 1$, and,
more generally, for all $a \ge 1$ we have the formula

$$\varphi(p^a) = p^a - p^{a-1} = p^{a-1}(p - 1).$$

The function $\varphi$ is multiplicative in the sense that

$$\varphi(ab) = \varphi(a)\varphi(b) \quad \text{if } (a, b) = 1$$

(note that it is important here that a and b be relatively prime). Together with the
formula above this gives a general formula for the values of $\varphi$: if $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_s^{\alpha_s}$,
then

$$\begin{aligned}
\varphi(n) &= \varphi(p_1^{\alpha_1})\, \varphi(p_2^{\alpha_2}) \cdots \varphi(p_s^{\alpha_s}) \\
&= p_1^{\alpha_1 - 1}(p_1 - 1)\, p_2^{\alpha_2 - 1}(p_2 - 1) \cdots p_s^{\alpha_s - 1}(p_s - 1).
\end{aligned}$$

For example, $\varphi(12) = \varphi(2^2)\varphi(3) = 2^1(2 - 1) \cdot 3^0(3 - 1) = 4$. The reader should
note that we shall use the letter $\varphi$ for many different functions throughout the text
so when we want this letter to denote Euler’s function we shall be careful to indicate
this explicitly.
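
The definition of Euler's function and the product formula can be compared numerically. In the sketch below, `phi_by_definition` counts the integers directly and `phi_by_formula` applies the product formula, finding the primes by trial division; both function names are assumptions of this illustration. The formula is used in the equivalent form in which n is multiplied by (1 - 1/p) for each prime p dividing n.

```python
from math import gcd


def phi_by_definition(n):
    """Count the integers a with 1 <= a <= n and (a, n) = 1."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)


def phi_by_formula(n):
    """Apply the product formula over the distinct primes dividing n."""
    result = n
    m = n
    p = 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p    # multiply result by (1 - 1/p)
        p += 1
    if m > 1:                        # one prime factor may remain
        result -= result // m
    return result


print([phi_by_definition(n) for n in range(1, 7)])   # [1, 1, 2, 2, 4, 2]
print(phi_by_formula(12))                            # 4
```

The first line of output reproduces the values listed in the text for n = 1, ..., 6.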


EXERCISES 

1. For each of the following pairs of integers a and b, determine their greatest common 
divisor, their least common multiple, and write their greatest common divisor in the form 
ax + by for some integers x and y. 

(a) a = 20, b = 13.

(b) a = 69, b = 372.

(c) a = 792, b = 275.

(d) a = 11391, b = 5673.

(e) a = 1761, b = 1567.

(f) a = 507885, b = 60808.

2. Prove that if the integer k divides the integers a and b then k divides as + bt for every pair
of integers s and t.
3. Prove that if n is composite then there are integers a and b such that n divides ab but n
does not divide either a or b.

4. Let a, b and N be fixed integers with a and b nonzero and let d = (a, b) be the greatest
common divisor of a and b. Suppose $x_0$ and $y_0$ are particular solutions to ax + by = N
(i.e., $ax_0 + by_0 = N$). Prove for any integer t that the integers

$$x = x_0 + \frac{b}{d}\, t \quad \text{and} \quad y = y_0 - \frac{a}{d}\, t$$

are also solutions to ax + by = N (this is in fact the general solution).

5. Determine the value $\varphi(n)$ for each integer $n \le 30$ where $\varphi$ denotes the Euler $\varphi$-function.

6. Prove the Well Ordering Property of $\mathbb{Z}$ by induction and prove the minimal element is
unique.

7. If p is a prime prove that there do not exist nonzero integers a and b such that $a^2 = pb^2$
(i.e., $\sqrt{p}$ is not a rational number).

8. Let p be a prime, $n \in \mathbb{Z}^+$. Find a formula for the largest power of p which divides
$n! = n(n-1)(n-2) \cdots 2 \cdot 1$ (it involves the greatest integer function).

9. Write a computer program to determine the greatest common divisor (a, b) of two integers
a and b and to express (a, b) in the form ax + by for some integers x and y.

10. Prove for any given positive integer N there exist only finitely many integers n with
$\varphi(n) = N$ where $\varphi$ denotes Euler’s $\varphi$-function. Conclude in particular that $\varphi(n)$ tends to
infinity as n tends to infinity.

11. Prove that if d divides n then $\varphi(d)$ divides $\varphi(n)$ where $\varphi$ denotes Euler’s $\varphi$-function.

0.3 Z/nZ: THE INTEGERS MODULO n

Let n be a fixed positive integer. Define a relation on $\mathbb{Z}$ by

$$a \sim b \text{ if and only if } n \mid (b - a).$$

Clearly $a \sim a$, and $a \sim b$ implies $b \sim a$ for any integers a and b, so this
relation is trivially reflexive and symmetric. If $a \sim b$ and $b \sim c$ then n divides a - b
and n divides b - c so n also divides the sum of these two integers, i.e., n divides
(a - b) + (b - c) = a - c, so $a \sim c$ and the relation is transitive. Hence this is an
equivalence relation. Write $a \equiv b \pmod{n}$ (read: a is congruent to b mod n) if $a \sim b$.
For any $a \in \mathbb{Z}$ we shall denote the equivalence class of a by $\bar{a}$; this is called the
congruence class or residue class of a mod n and consists of the integers which differ
from a by an integral multiple of n, i.e.,

$$\begin{aligned}
\bar{a} &= \{a + kn \mid k \in \mathbb{Z}\} \\
&= \{a, a \pm n, a \pm 2n, a \pm 3n, \ldots\}.
\end{aligned}$$

There are precisely n distinct equivalence classes mod n, namely

$$\bar{0}, \bar{1}, \bar{2}, \ldots, \overline{n-1}$$


determined by the possible remainders after division by n and these residue classes 
partition the integers Z. The set of equivalence classes under this equivalence relation 
will be denoted by Z/nZ and called the integers modulo n (or the integers mod n).
The motivation for this notation will become clearer when we discuss quotient groups 
and quotient rings. Note that for different n's the equivalence relation and equivalence 
classes are different so we shall always be careful to fix n first before using the bar 
notation. The process of finding the equivalence class mod n of some integer a is often 
referred to as reducing a mod n. This terminology also frequently refers to finding the 
smallest nonnegative integer congruent to a mod n (the least residue of a mod n). 

We can define an addition and a multiplication for the elements of Z/nZ, defining 
modular arithmetic as follows: for $\bar{a}, \bar{b} \in \mathbb{Z}/n\mathbb{Z}$, define their sum and product by

$$\bar{a} + \bar{b} = \overline{a + b} \quad \text{and} \quad \bar{a} \cdot \bar{b} = \overline{ab}.$$

What this means is the following: given any two elements $\bar{a}$ and $\bar{b}$ in Z/nZ, to compute
their sum (respectively, their product) take any representative integer a in the class
$\bar{a}$ and any representative integer b in the class $\bar{b}$ and add (respectively, multiply) the
integers a and b as usual in Z and then take the equivalence class containing the result.
The following Theorem 3 asserts that this is well defined, i.e., does not depend on the
choice of representatives taken for the elements $\bar{a}$ and $\bar{b}$ of Z/nZ.

Example 

Suppose n = 12 and consider Z/12Z, which consists of the twelve residue classes

$$\bar{0}, \bar{1}, \bar{2}, \ldots, \overline{11}$$

determined by the twelve possible remainders of an integer after division by 12. The
elements in the residue class $\bar{5}$, for example, are the integers which leave a remainder of 5
when divided by 12 (the integers congruent to 5 mod 12). Any integer congruent to 5 mod
12 (such as 5, 17, 29, ... or -7, -19, ...) will serve as a representative for the residue class
$\bar{5}$. Note that Z/12Z consists of the twelve elements above (and each of these elements of
Z/12Z consists of an infinite number of usual integers).

Suppose now that $\bar{a} = \bar{5}$ and $\bar{b} = \bar{8}$. The most obvious representative for $\bar{a}$ is the integer
5 and similarly 8 is the most obvious representative for $\bar{b}$. Using these representatives for
the residue classes we obtain $\bar{5} + \bar{8} = \overline{13} = \bar{1}$ since 13 and 1 lie in the same class modulo
n = 12. Had we instead taken the representative 17, say, for $\bar{a}$ (note that 5 and 17 do lie in
the same residue class modulo 12) and the representative -28, say, for $\bar{b}$, we would obtain
$\bar{5} + \bar{8} = \overline{(17 - 28)} = \overline{-11} = \bar{1}$ and as we mentioned the result does not depend on the
choice of representatives chosen. The product of these two classes is $\bar{a} \cdot \bar{b} = \overline{5 \cdot 8} = \overline{40} = \bar{4}$,
also independent of the representatives chosen.
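
The independence of representatives in this example can be checked numerically. The sketch below is an illustration only: it identifies each residue class with its least residue, computed with Python's `%` operator (which returns a value between 0 and n - 1 for positive n), and tries several representatives of each class. The helper name `residue` and the particular lists of representatives are choices made for this example.

```python
n = 12


def residue(a, n):
    """Least nonnegative residue of a mod n, for n > 0."""
    return a % n


reps_of_5 = [5, 17, 29, -7, -19]     # all congruent to 5 mod 12
reps_of_8 = [8, -28, 20, -4]         # all congruent to 8 mod 12

# Every choice of representatives gives the same class for the sum and product.
sums = {residue(a + b, n) for a in reps_of_5 for b in reps_of_8}
products = {residue(a * b, n) for a in reps_of_5 for b in reps_of_8}
print(sums, products)                # {1} {4}
```

Each set has a single element, in agreement with the values 1 and 4 found above. A finite check of this kind is not a proof, of course.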


Theorem 3. The operations of addition and multiplication on Z/nZ defined above
are both well defined, that is, they do not depend on the choices of representatives for
the classes involved. More precisely, if $a_1, a_2 \in \mathbb{Z}$ and $b_1, b_2 \in \mathbb{Z}$ with $\bar{a}_1 = \bar{b}_1$ and
$\bar{a}_2 = \bar{b}_2$, then $\overline{a_1 + a_2} = \overline{b_1 + b_2}$ and $\overline{a_1 a_2} = \overline{b_1 b_2}$, i.e., if

$$a_1 \equiv b_1 \pmod{n} \quad \text{and} \quad a_2 \equiv b_2 \pmod{n}$$

then

$$a_1 + a_2 \equiv b_1 + b_2 \pmod{n} \quad \text{and} \quad a_1 a_2 \equiv b_1 b_2 \pmod{n}.$$

Proof: Suppose $a_1 \equiv b_1 \pmod{n}$, i.e., $a_1 - b_1$ is divisible by n. Then $a_1 = b_1 + sn$
for some integer s. Similarly, $a_2 \equiv b_2 \pmod{n}$ means $a_2 = b_2 + tn$ for some integer t.
Then $a_1 + a_2 = (b_1 + b_2) + (s + t)n$ so that $a_1 + a_2 \equiv b_1 + b_2 \pmod{n}$, which shows that
the sum of the residue classes is independent of the representatives chosen. Similarly,
$a_1 a_2 = (b_1 + sn)(b_2 + tn) = b_1 b_2 + (b_1 t + b_2 s + stn)n$ shows that $a_1 a_2 \equiv b_1 b_2 \pmod{n}$
and so the product of the residue classes is also independent of the representatives
chosen, completing the proof.

We shall see later that the process of adding equivalence classes by adding their 
representatives is a special case of a more general construction (the construction of 
a quotient). This notion of adding equivalence classes is already a familiar one in 
the context of adding rational numbers: each rational number a/b is really a class of
expressions: a/b = 2a/2b = -3a/-3b etc. and we often change representatives
(for instance, take common denominators) in order to add two fractions (for example 
1/2 + 1/3 is computed by taking instead the equivalent representatives 3/6 for 1/2 
and 2/6 for 1/3 to obtain 1/2 + 1/3 = 3/6 + 2/6 = 5/6). The notion of modular 
arithmetic is also familiar: to find the hour of day after adding or subtracting some
number of hours we reduce mod 12 and find the least residue. 

It is important to be able to think of the equivalence classes of some equivalence 
relation as elements which can be manipulated (as we do, for example, with fractions) 
rather than as sets. Consistent with this attitude, we shall frequently denote the elements 
of Z/nZ simply by {0, 1, ..., n - 1} where addition and multiplication are reduced mod
n. It is important to remember, however, that the elements of Z/nZ are not integers, but
rather collections of usual integers, and the arithmetic is quite different. For example,
5 + 8 is not 1 in the integers Z as it was in the example of Z/12Z above.

The fact that one can define arithmetic in Z/nZ has many important applications 
in elementary number theory. As one simple example we compute the last two digits in 
the number $2^{1000}$. First observe that the last two digits give the remainder of $2^{1000}$ after
we divide by 100 so we are interested in the residue class mod 100 containing $2^{1000}$.
We compute $2^{10} = 1024 \equiv 24 \pmod{100}$, so then $2^{20} = (2^{10})^2 \equiv 24^2 = 576 \equiv 76 \pmod{100}$.
Then $2^{40} = (2^{20})^2 \equiv 76^2 = 5776 \equiv 76 \pmod{100}$. Similarly
$2^{80} \equiv 2^{160} \equiv 2^{320} \equiv 2^{640} \equiv 76 \pmod{100}$. Finally, $2^{1000} = 2^{640}\, 2^{320}\, 2^{40} \equiv 76 \cdot 76 \cdot 76 \equiv 76$
(mod 100) so the final two digits are 76.
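
The repeated squaring used in this computation is easy to automate. The following sketch reduces mod the modulus after every multiplication, so no intermediate value grows large; the function name `power_mod` is an assumption of this example, and Python's built-in three-argument `pow` performs the same computation.

```python
def power_mod(base, exponent, modulus):
    """Compute base**exponent mod modulus by repeated squaring.

    Assumes exponent >= 0 and modulus > 0. The binary digits of the exponent
    select which of the successive squares are multiplied into the result.
    """
    result = 1 % modulus
    base %= modulus
    while exponent > 0:
        if exponent & 1:                     # current binary digit is 1
            result = (result * base) % modulus
        base = (base * base) % modulus       # next square
        exponent >>= 1
    return result


print(power_mod(2, 10, 100))      # 24
print(power_mod(2, 20, 100))      # 76
print(power_mod(2, 1000, 100))    # 76
print(pow(2, 1000, 100))          # 76, using the built-in
```

The hand computation splits 1000 as 640 + 320 + 40, while the code uses the binary expansion of 1000; either way only a handful of squarings mod 100 are needed.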

An important subset of Z/nZ consists of the collection of residue classes which 
have a multiplicative inverse in Z/nZ: 

$$(\mathbb{Z}/n\mathbb{Z})^\times = \{\bar{a} \in \mathbb{Z}/n\mathbb{Z} \mid \text{there exists } \bar{c} \in \mathbb{Z}/n\mathbb{Z} \text{ with } \bar{a} \cdot \bar{c} = \bar{1}\}.$$

Some of the following exercises outline a proof that $(\mathbb{Z}/n\mathbb{Z})^\times$ is also the collection
of residue classes whose representatives are relatively prime to n, which proves the
following proposition.

Proposition 4. $(\mathbb{Z}/n\mathbb{Z})^\times = \{\bar{a} \in \mathbb{Z}/n\mathbb{Z} \mid (a, n) = 1\}$.

It is easy to see that if any representative of $\bar{a}$ is relatively prime to n then all
representatives are relatively prime to n so that the set on the right in the proposition is
well defined.
Example

For n = 9 we obtain $(\mathbb{Z}/9\mathbb{Z})^\times = \{\bar{1}, \bar{2}, \bar{4}, \bar{5}, \bar{7}, \bar{8}\}$ from the proposition. The multiplicative
inverses of these elements are $\{\bar{1}, \bar{5}, \bar{7}, \bar{2}, \bar{4}, \bar{8}\}$, respectively.

If a is an integer relatively prime to n then the Euclidean Algorithm produces integers
x and y satisfying ax + ny = 1, hence $ax \equiv 1 \pmod{n}$, so that $\bar{x}$ is the multiplicative
inverse of $\bar{a}$ in Z/nZ. This gives an efficient method for computing multiplicative
inverses in Z/nZ.

Example 

Suppose $n = 60$ and $a = 17$. Applying the Euclidean Algorithm we obtain

60 = (3)17 + 9
17 = (1)9 + 8
9 = (1)8 + 1

so that $a$ and $n$ are relatively prime, and $(-7)17 + (2)60 = 1$. Hence $\overline{-7} = \overline{53}$ is the
multiplicative inverse of $\overline{17}$ in $\mathbb{Z}/60\mathbb{Z}$.
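The back-substitution in this example can be carried along with the divisions themselves. The following Python sketch is one standard way to organize the computation (the extended Euclidean Algorithm); the names `extended_gcd` and `inverse_mod` are hypothetical and chosen here only for illustration.

```python
def extended_gcd(a, n):
    """Return (g, x, y) with g = gcd(a, n) and a*x + n*y == g."""
    old_r, r = a, n
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        # Each step keeps the invariants a*old_x + n*old_y == old_r
        # and a*x + n*y == r.
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


def inverse_mod(a, n):
    """Least residue of the inverse of a mod n; requires (a, n) = 1."""
    g, x, _ = extended_gcd(a, n)
    if g != 1:
        raise ValueError("a is not relatively prime to n")
    return x % n


print(extended_gcd(17, 60))   # (1, -7, 2): (-7)17 + (2)60 = 1
print(inverse_mod(17, 60))    # 53
```

The number of loop iterations is the number of divisions in the Euclidean Algorithm, which is why this is far faster than searching through all residues.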


EXERCISES 

1. Write down explicitly all the elements in the residue classes of Z/18Z. 

2. Prove that the distinct equivalence classes in $\mathbb{Z}/n\mathbb{Z}$ are precisely $\bar{0}, \bar{1}, \bar{2}, \ldots, \overline{n-1}$ (use
the Division Algorithm).

3. Prove that if $a = a_n 10^n + a_{n-1} 10^{n-1} + \cdots + a_1 10 + a_0$ is any positive integer then
$a \equiv a_n + a_{n-1} + \cdots + a_1 + a_0 \pmod 9$ (note that this is the usual arithmetic rule that
the remainder after division by 9 is the same as the sum of the decimal digits mod 9; in
particular an integer is divisible by 9 if and only if the sum of its digits is divisible by 9)
[note that $10 \equiv 1 \pmod 9$].

4. Compute the remainder when $37^{100}$ is divided by 29.

5. Compute the last two digits of $9^{1500}$.

6. Prove that the squares of the elements in $\mathbb{Z}/4\mathbb{Z}$ are just $\bar{0}$ and $\bar{1}$.

7. Prove for any integers $a$ and $b$ that $a^2 + b^2$ never leaves a remainder of 3 when divided by
4 (use the previous exercise).

8. Prove that the equation $a^2 + b^2 = 3c^2$ has no solutions in nonzero integers $a$, $b$ and $c$.
[Consider the equation mod 4 as in the previous two exercises and show that $a$, $b$ and $c$
would all have to be divisible by 2. Then each of $a^2$, $b^2$ and $c^2$ has a factor of 4 and by
dividing through by 4 show that there would be a smaller set of solutions to the original
equation. Iterate to reach a contradiction.]

9. Prove that the square of any odd integer always leaves a remainder of 1 when divided by 8.

10. Prove that the number of elements of $(\mathbb{Z}/n\mathbb{Z})^\times$ is $\varphi(n)$ where $\varphi$ denotes the Euler $\varphi$-function.

11. Prove that if $\bar{a}, \bar{b} \in (\mathbb{Z}/n\mathbb{Z})^\times$ then $\bar{a} \cdot \bar{b} \in (\mathbb{Z}/n\mathbb{Z})^\times$.

12. Let $n \in \mathbb{Z}$, $n > 1$, and let $a \in \mathbb{Z}$ with $1 < a < n$. Prove if $a$ and $n$ are not relatively prime,
there exists an integer $b$ with $1 < b < n$ such that $ab \equiv 0 \pmod n$ and deduce that there
cannot be an integer $c$ such that $ac \equiv 1 \pmod n$.

13. Let $n \in \mathbb{Z}$, $n > 1$, and let $a \in \mathbb{Z}$ with $1 < a < n$. Prove that if $a$ and $n$ are relatively prime
then there is an integer $c$ such that $ac \equiv 1 \pmod n$ [use the fact that the g.c.d. of two
integers is a $\mathbb{Z}$-linear combination of the integers].

14. Conclude from the previous two exercises that $(\mathbb{Z}/n\mathbb{Z})^\times$ is the set of elements $\bar{a}$ of $\mathbb{Z}/n\mathbb{Z}$
with $(a, n) = 1$ and hence prove Proposition 4. Verify this directly in the case $n = 12$.

15 . For each of the following pairs of integers a and n, show that a is relatively prime to n and 
determine the multiplicative inverse of a in Z/nZ. 

(a) a = 13, n = 20. 

(b) a = 69, n = 89. 

(c) a = 1891, n = 3797. 

(d) $a = 6003722857$, $n = 77695236973$. [The Euclidean Algorithm requires only 3
steps for these integers.]

16. Write a computer program to add and multiply mod $n$, for any $n$ given as input. The output
of these operations should be the least residues of the sums and products of two integers.
Also include the feature that if $(a, n) = 1$, an integer $c$ between 1 and $n - 1$ such that
$\bar{a} \cdot \bar{c} = \bar{1}$ may be printed on request. (Your program should not, of course, simply quote
“mod” functions already built into many systems).


Part I


GROUP THEORY 


The modern treatment of abstract algebra begins with the disarmingly simple abstract 
definition of a group. This simple definition quickly leads to difficult questions involving 
the structure of such objects. There are many specific examples of groups and the power 
of the abstract point of view becomes apparent when results for all of these examples 
are obtained by proving a single result for the abstract group. 

The notion of a group did not simply spring into existence, however, but is rather the 
culmination of a long period of mathematical investigation, the first formal definition 
of an abstract group in the form in which we use it appearing in 1882.¹ The definition
of an abstract group has its origins in extremely old problems in algebraic equations,
number theory, and geometry, and arose because very similar techniques were found
to be applicable in a variety of situations. As Otto Hölder (1859-1937) observed, one
of the essential characteristics of mathematics is that after applying a certain algorithm
or method of proof one then considers the scope and limits of the method. As a result,
properties possessed by a number of interesting objects are frequently abstracted and 
the question raised: can one determine all the objects possessing these properties? 
Attempting to answer such a question also frequently adds considerable understanding 
of the original objects under consideration. It is in this fashion that the definition of an 
abstract group evolved into what is, for us, the starting point of abstract algebra. 

We illustrate with a few of the disparate situations in which the ideas later formalized
into the notion of an abstract group were used.

(1) In number theory the very object of study, the set of integers, is an example of a
group. Consider for example what we refer to as “Euler’s Theorem” (cf. Exercise
22 of Section 3.2), one extremely simple example of which is that $a^{40}$ has last two
digits 01 if $a$ is any integer not divisible by 2 nor by 5. This was proved in 1761
by Leonhard Euler (1707-1783) using “group-theoretic” ideas of Joseph Louis 
Lagrange (1736-1813), long before the first formal definition of a group. From 
our perspective, one now proves “Lagrange’s Theorem” (cf. Theorem 8 of Section 
3.2), applying these techniques abstracted to an arbitrary group, and then recovers 
Euler’s Theorem (and many others) as a special case. 

¹ For most of the historical comments below, see the excellent book A History of Algebra, by B. L.
van der Waerden, Springer-Verlag, 1980 and the references there, particularly The Genesis of the Abstract
Group Concept: A Contribution to the History of the Origin of Abstract Group Theory (translated from
the German by Abe Shenitzer), by H. Wussing, MIT Press, 1984. See also Number Theory, An Approach
Through History from Hammurapi to Legendre, by A. Weil, Birkhäuser, 1984.
(2) Investigations into the question of rational solutions to algebraic equations of the
form $y^2 = x^3 - 2x$ (there are infinitely many, for example $(0, 0)$, $(-1, 1)$, $(2, 2)$,
$(9/4, -21/8)$, $(-1/169, 239/2197)$) showed that connecting any two solutions by
a straight line and computing the intersection of this line with the curve $y^2 = x^3 - 2x$
produces another solution. Such “Diophantine equations,” among others,
were considered by Pierre de Fermat (1601-1655) (this one was solved by him in
1644), by Euler, by Lagrange around 1777, and others. In 1730 Euler raised the
question of determining the indefinite integral $\int dx/\sqrt{1 - x^4}$ of the “lemniscatic
differential” $dx/\sqrt{1 - x^4}$, used in determining the arc length along an ellipse (the
question had also been considered by Gottfried Wilhelm Leibniz (1646-1716) and
Johannes Bernoulli (1667-1748)). In 1752 Euler proved a “multiplication formula”
for such elliptic integrals (using ideas of G.C. di Fagnano (1682-1766), received
by Euler in 1751), which shows how two elliptic integrals give rise to a third,
bringing into existence the theory of elliptic functions in analysis. In 1834 Carl
Gustav Jacob Jacobi (1804-1851) observed that the work of Euler on solving certain
Diophantine equations amounted to writing the multiplication formula for certain
elliptic integrals. Today the curve above is referred to as an “elliptic curve” and
these questions are viewed as two different aspects of the same thing: the fact
that this geometric operation on points can be used to give the set of points on an
elliptic curve the structure of a group. The study of the “arithmetic” of these groups
is an active area of current research.²

(3) By 1824 it was known that there are formulas giving the roots of quadratic, cubic 
and quartic equations (extending the familiar quadratic formula for the roots of 
$ax^2 + bx + c = 0$). In 1824, however, Niels Henrik Abel (1802-1829) proved
that such a formula for the roots of a quintic is impossible (cf. Corollary 40 of 
Section 14.7). The proof is based on the idea of examining what happens when 
the roots are permuted amongst themselves (for example, interchanging two of the 
roots). The collection of such permutations has the structure of a group (called, 
naturally enough, a “permutation group”). This idea culminated in the beautiful 
work of Evariste Galois (1811-1832) in 1830-32, working with explicit groups 
of “substitutions.” Today this work is referred to as Galois Theory (and is the 
subject of the fourth part of this text). Similar explicit groups were being used 
in geometry as collections of geometric transformations (translations, reflections, 
etc.) by Arthur Cayley (1821-1895) around 1850, Camille Jordan (1838-1922)
around 1867, Felix Klein (1849-1925) around 1870, etc., and the application of 
groups to geometry is still extremely active in current research into the structure of 
3-space, 4-space, etc. The same group arising in the study of the solvability of the 
quintic arises in the study of the rigid motions of an icosahedron in geometry and 
in the study of elliptic functions in analysis. 

The precursors of today’s abstract group can be traced back many years, even 
before the groups of “substitutions” of Galois. The formal definition of an abstract 
group which is our starting point appeared in 1882 in the work of Walter Dyck (1856- 
1934), an assistant to Felix Klein, and also in the work of Heinrich Weber (1842-1913)
in the same year.

² See The Arithmetic of Elliptic Curves by J. Silverman, Springer-Verlag, 1986.

It is frequently the case in mathematics research to find specific application of 
an idea before having that idea extracted and presented as an item of interest in its 
own right (for example, Galois used the notion of a “quotient group” implicitly in his 
investigations in 1830 and the definition of an abstract quotient group is due to Hölder in
1889). It is important to realize, with or without the historical context, that the reason the 
abstract definitions are made is because it is useful to isolate specific characteristics and 
consider what structure is imposed on an object having these characteristics. The notion 
of the structure of an algebraic object (which is made more precise by the concept of 
an isomorphism — which considers when two apparently different objects are in some 
sense the same) is a major theme which will recur throughout the text. 
CHAPTER 1


Introduction to Groups 


1.1 BASIC AXIOMS AND EXAMPLES 

In this section the basic algebraic structure to be studied in Part I is introduced and some 
examples are given. 

Definition. 

(1) A binary operation $\star$ on a set $G$ is a function $\star : G \times G \to G$. For any $a, b \in G$
we shall write $a \star b$ for $\star(a, b)$.

(2) A binary operation $\star$ on a set $G$ is associative if for all $a, b, c \in G$ we have
$a \star (b \star c) = (a \star b) \star c$.

(3) If $\star$ is a binary operation on a set $G$ we say elements $a$ and $b$ of $G$ commute if
$a \star b = b \star a$. We say $\star$ (or $G$) is commutative if for all $a, b \in G$, $a \star b = b \star a$.

Examples 

(1) + (usual addition) is a commutative binary operation on Z (or on Q, R, or C respec- 
tively). 

(2) $\times$ (usual multiplication) is a commutative binary operation on $\mathbb{Z}$ (or on $\mathbb{Q}$, $\mathbb{R}$, or $\mathbb{C}$
respectively).

(3) $-$ (usual subtraction) is a noncommutative binary operation on $\mathbb{Z}$, where
$-(a, b) = a - b$. The map $a \mapsto -a$ is not a binary operation (not binary).

(4) $-$ is not a binary operation on $\mathbb{Z}^+$ (nor $\mathbb{Q}^+$, $\mathbb{R}^+$) because for $a, b \in \mathbb{Z}^+$ with $a < b$,
$a - b \notin \mathbb{Z}^+$, that is, $-$ does not map $\mathbb{Z}^+ \times \mathbb{Z}^+$ into $\mathbb{Z}^+$.

(5) Taking the vector cross-product of two vectors in 3-space R 3 is a binary operation 
which is not associative and not commutative. 
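A single counterexample is enough to show that an operation fails to be associative or commutative. The Python sketch below exhibits such counterexamples for subtraction and for the cross-product; the helper `cross` and the particular vectors are choices made here for illustration, not taken from the text.

```python
def cross(u, v):
    """Cross-product of two vectors in 3-space, given as 3-tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


# Subtraction on the integers is not commutative.
print(5 - 3, 3 - 5)                 # 2 -2

# Standard basis vectors i and j.
i, j = (1, 0, 0), (0, 1, 0)

# Not commutative: i x j and j x i differ by a sign.
print(cross(i, j), cross(j, i))     # (0, 0, 1) (0, 0, -1)

# Not associative: i x (i x j) = -j, whereas (i x i) x j = 0.
print(cross(i, cross(i, j)))        # (0, -1, 0)
print(cross(cross(i, i), j))        # (0, 0, 0)
```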

Suppose that $\star$ is a binary operation on a set $G$ and $H$ is a subset of $G$. If the
restriction of $\star$ to $H$ is a binary operation on $H$, i.e., for all $a, b \in H$, $a \star b \in H$,
then $H$ is said to be closed under $\star$. Observe that if $\star$ is an associative (respectively,
commutative) binary operation on $G$ and $\star$ restricted to some subset $H$ of $G$ is a binary
operation on $H$, then $\star$ is automatically associative (respectively, commutative) on $H$
as well.

Definition. 

(1) A group is an ordered pair $(G, \star)$ where $G$ is a set and $\star$ is a binary operation
on $G$ satisfying the following axioms:

(i) $(a \star b) \star c = a \star (b \star c)$, for all $a, b, c \in G$, i.e., $\star$ is associative,

(ii) there exists an element $e$ in $G$, called an identity of $G$, such that for all
$a \in G$ we have $a \star e = e \star a = a$,

(iii) for each $a \in G$ there is an element $a^{-1}$ of $G$, called an inverse of $a$,
such that $a \star a^{-1} = a^{-1} \star a = e$.

(2) The group $(G, \star)$ is called abelian (or commutative) if $a \star b = b \star a$ for all
$a, b \in G$.

We shall immediately become less formal and say G is a group under ★ if (G, ★) is 
a group (or just G is a group when the operation ★ is clear from the context). Also, we 
say G is a finite group if in addition G is a finite set. Note that axiom (ii) ensures that 
a group is always nonempty. 

Examples 

(1) $\mathbb{Z}$, $\mathbb{Q}$, $\mathbb{R}$ and $\mathbb{C}$ are groups under $+$ with $e = 0$ and $a^{-1} = -a$, for all $a$.

(2) $\mathbb{Q} - \{0\}$, $\mathbb{R} - \{0\}$, $\mathbb{C} - \{0\}$, $\mathbb{Q}^+$, $\mathbb{R}^+$ are groups under $\times$ with $e = 1$ and $a^{-1} = 1/a$
for all $a$. Note however that $\mathbb{Z} - \{0\}$ is not a group under $\times$ because although $\times$ is an
associative binary operation on $\mathbb{Z} - \{0\}$, the element 2 (for instance) does not have an
inverse in $\mathbb{Z} - \{0\}$.

We have glossed over the fact that the associative law holds in these familiar ex- 
amples. For Z under + this is a consequence of the axiom of associativity for addition 
of natural numbers. The associative law for Q under + follows from the associative 
law for $\mathbb{Z}$; a proof of this will be outlined later when we rigorously construct $\mathbb{Q}$ from
$\mathbb{Z}$ (cf. Section 7.5). The associative laws for $\mathbb{R}$ and, in turn, $\mathbb{C}$ under $+$ are proved
in elementary analysis courses when $\mathbb{R}$ is constructed by completing $\mathbb{Q}$; ultimately,
associativity is again a consequence of associativity for $\mathbb{Z}$. The associative axiom for
multiplication may be established via a similar development, starting first with $\mathbb{Z}$. Since
$\mathbb{R}$ and $\mathbb{C}$ will be used largely for illustrative purposes and we shall not construct $\mathbb{R}$ from
$\mathbb{Q}$ (although we shall construct $\mathbb{C}$ from $\mathbb{R}$) we shall take the associative laws (under $+$
and $\times$) for $\mathbb{R}$ and $\mathbb{C}$ as given.

Examples (continued) 

(3) The axioms for a vector space V include those axioms which specify that (V, +) is an 
abelian group (the operation + is called vector addition). Thus any vector space such 
as $\mathbb{R}^n$ is, in particular, an additive group.

(4) For $n \in \mathbb{Z}^+$, $\mathbb{Z}/n\mathbb{Z}$ is an abelian group under the operation $+$ of addition of residue
classes as described in Chapter 0. We shall prove in Chapter 3 (in a more general
context) that this binary operation $+$ is well defined and associative; for now we take
this for granted. The identity in this group is the element $\bar{0}$ and for each $\bar{a} \in \mathbb{Z}/n\mathbb{Z}$,
the inverse of $\bar{a}$ is $\overline{-a}$. Henceforth, when we talk about the group $\mathbb{Z}/n\mathbb{Z}$ it will be
understood that the group operation is addition of classes mod $n$.

(5) For $n \in \mathbb{Z}^+$, the set $(\mathbb{Z}/n\mathbb{Z})^\times$ of equivalence classes $\bar{a}$ which have multiplicative
inverses mod $n$ is an abelian group under multiplication of residue classes as described
in Chapter 0. Again, we shall take for granted (for the moment) that this operation
is well defined and associative. The identity of this group is the element $\bar{1}$ and, by
definition of $(\mathbb{Z}/n\mathbb{Z})^\times$, each element has a multiplicative inverse. Henceforth, when
we talk about the group $(\mathbb{Z}/n\mathbb{Z})^\times$ it will be understood that the group operation is
multiplication of classes mod $n$.

(6) If $(A, \star)$ and $(B, \diamond)$ are groups, we can form a new group $A \times B$, called their direct
product, whose elements are those in the Cartesian product

$A \times B = \{(a, b) \mid a \in A,\ b \in B\}$

and whose operation is defined componentwise:

$(a_1, b_1)(a_2, b_2) = (a_1 \star a_2,\ b_1 \diamond b_2)$.

For example, if we take $A = B = \mathbb{R}$ (both operations addition), $\mathbb{R} \times \mathbb{R}$ is the familiar
Euclidean plane. The proof that the direct product of two groups is again a group is
left as a straightforward exercise (later): the proof that each group axiom holds in
$A \times B$ is a consequence of that axiom holding in both $A$ and $B$ together with the fact
that the operation in $A \times B$ is defined componentwise.
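For finite groups the componentwise operation is easy to experiment with. The sketch below builds the direct product of two small groups and checks the group axioms by exhaustive search; it is an illustration only, not a proof, and the names `direct_product` and `satisfies_group_axioms` as well as the choice of $\mathbb{Z}/2\mathbb{Z}$ and $\mathbb{Z}/3\mathbb{Z}$ (represented by least residues) are assumptions made here.

```python
from itertools import product


def direct_product(elements_a, op_a, elements_b, op_b):
    """Return (elements, op) for A x B with the componentwise operation."""
    elements = list(product(elements_a, elements_b))

    def op(p, q):
        # (a1, b1)(a2, b2) = (a1 * a2, b1 o b2)
        return (op_a(p[0], q[0]), op_b(p[1], q[1]))

    return elements, op


def satisfies_group_axioms(elements, op):
    """Brute-force check of closure, associativity, identity and inverses."""
    if not all(op(x, y) in elements for x in elements for y in elements):
        return False
    associative = all(op(op(x, y), z) == op(x, op(y, z))
                      for x in elements for y in elements for z in elements)
    identities = [e for e in elements
                  if all(op(e, x) == x and op(x, e) == x for x in elements)]
    if not (associative and identities):
        return False
    e = identities[0]
    return all(any(op(x, y) == e and op(y, x) == e for y in elements)
               for x in elements)


# Illustrative choice: A = Z/2Z and B = Z/3Z, both under addition of residues.
elements, op = direct_product(range(2), lambda a, b: (a + b) % 2,
                              range(3), lambda a, b: (a + b) % 3)
print(len(elements))                         # 6 ordered pairs
print(op((1, 2), (1, 2)))                    # (0, 1)
print(satisfies_group_axioms(elements, op))  # True
```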

There should be no confusion between the groups $\mathbb{Z}/n\mathbb{Z}$ (under addition) and
$(\mathbb{Z}/n\mathbb{Z})^\times$ (under multiplication), even though the latter is a subset of the former; the
superscript $\times$ will always indicate that the operation is multiplication.

Before continuing with more elaborate examples we prove two basic results which
in particular enable us to talk about the identity and the inverse of an element.

Proposition 1. If $G$ is a group under the operation $\star$, then

(1) the identity of $G$ is unique

(2) for each $a \in G$, $a^{-1}$ is uniquely determined

(3) $(a^{-1})^{-1} = a$ for all $a \in G$

(4) $(a \star b)^{-1} = (b^{-1}) \star (a^{-1})$

(5) for any $a_1, a_2, \ldots, a_n \in G$ the value of $a_1 \star a_2 \star \cdots \star a_n$ is independent of how
the expression is bracketed (this is called the generalized associative law).


Proof: (1) If $f$ and $g$ are both identities, then by axiom (ii) of the definition of a
group $f \star g = f$ (take $a = f$ and $e = g$). By the same axiom $f \star g = g$ (take $a = g$
and $e = f$). Thus $f = g$, and the identity is unique.

(2) Assume $b$ and $c$ are both inverses of $a$ and let $e$ be the identity of $G$. By axiom
(iii), $a \star b = e$ and $c \star a = e$. Thus

\begin{align*}
c &= c \star e && \text{(definition of $e$, axiom (ii))} \\
  &= c \star (a \star b) && \text{(since $e = a \star b$)} \\
  &= (c \star a) \star b && \text{(associative law)} \\
  &= e \star b && \text{(since $e = c \star a$)} \\
  &= b && \text{(axiom (ii)).}
\end{align*}

(3) To show $(a^{-1})^{-1} = a$ is exactly the problem of showing $a$ is the inverse of $a^{-1}$
(since by part (2) $a$ has a unique inverse). Reading the definition of $a^{-1}$, with the roles
of $a$ and $a^{-1}$ mentally interchanged shows that $a$ satisfies the defining property for the
inverse of $a^{-1}$, hence $a$ is the inverse of $a^{-1}$.
(4) Let $c = (a \star b)^{-1}$ so by definition of $c$, $(a \star b) \star c = e$. By the associative law

$a \star (b \star c) = e.$

Multiply both sides on the left by $a^{-1}$ to get

$a^{-1} \star (a \star (b \star c)) = a^{-1} \star e.$

The associative law on the left hand side and the definition of $e$ on the right give

$(a^{-1} \star a) \star (b \star c) = a^{-1}$

so

$e \star (b \star c) = a^{-1}$

hence

$b \star c = a^{-1}.$

Now multiply both sides on the left by $b^{-1}$ and simplify similarly:

\begin{align*}
b^{-1} \star (b \star c) &= b^{-1} \star a^{-1} \\
(b^{-1} \star b) \star c &= b^{-1} \star a^{-1} \\
e \star c &= b^{-1} \star a^{-1} \\
c &= b^{-1} \star a^{-1},
\end{align*}

as claimed.

(5) This is left as a good exercise using induction on $n$. First show the result is true
for $n = 1, 2$, and 3. Next assume for any $k < n$ that any bracketing of a product of $k$
elements, $b_1 \star b_2 \star \cdots \star b_k$ can be reduced (without altering the value of the product) to
an expression of the form

$b_1 \star (b_2 \star (b_3 \star (\cdots \star b_k)) \ldots ).$

Now argue that any bracketing of the product $a_1 \star a_2 \star \cdots \star a_n$ must break into 2
sub-products, say $(a_1 \star a_2 \star \cdots \star a_k) \star (a_{k+1} \star a_{k+2} \star \cdots \star a_n)$, where each sub-product
is bracketed in some fashion. Apply the induction assumption to each of these two
sub-products and finally reduce the result to the form $a_1 \star (a_2 \star (a_3 \star (\cdots \star a_n)) \ldots )$ to
complete the induction.

Note that throughout the proof of Proposition 1 we were careful not to change 
the order of any products (unless permitted by axioms (ii) and (iii)) since G may be 
non-abelian. 

Notation: 

(1) For an abstract group $G$ it is tiresome to keep writing the operation $\star$ throughout
our calculations. Henceforth (except when necessary) our abstract groups $G$, $H$,
etc. will always be written with the operation as $\cdot$ and $a \cdot b$ will always be written
as $ab$. In view of the generalized associative law, products of three or more group
elements will not be bracketed (although the operation is still a binary operation).
Finally, for an abstract group $G$ (operation $\cdot$) we denote the identity of $G$ by 1.

(2) For any group $G$ (operation $\cdot$ implied) and $x \in G$ and $n \in \mathbb{Z}^+$ since the product
$xx \cdots x$ ($n$ terms) does not depend on how it is bracketed, we shall denote it by $x^n$.
Denote $x^{-1}x^{-1} \cdots x^{-1}$ ($n$ terms) by $x^{-n}$. Let $x^0 = 1$, the identity of $G$.

This new notation is pleasantly concise. Of course, when we are dealing with
specific groups, we shall use the natural (given) operation. For example, when the
operation is $+$, the identity will be denoted by 0 and for any element $a$, the inverse $a^{-1}$
will be written $-a$ and $a + a + \cdots + a$ ($n > 0$ terms) will be written $na$; $-a - a - \cdots - a$
($n$ terms) will be written $-na$ and $0a = 0$.

Proposition 2. Let $G$ be a group and let $a, b \in G$. The equations $ax = b$ and $ya = b$
have unique solutions for $x, y \in G$. In particular, the left and right cancellation laws
hold in $G$, i.e.,

(1) if $au = av$, then $u = v$, and

(2) if $ub = vb$, then $u = v$.

Proof: We can solve $ax = b$ by multiplying both sides on the left by $a^{-1}$ and
simplifying to get $x = a^{-1}b$. The uniqueness of $x$ follows because $a^{-1}$ is unique.
Similarly, if $ya = b$, $y = ba^{-1}$. If $au = av$, multiply both sides on the left by $a^{-1}$ and
simplify to get $u = v$. Similarly, the right cancellation law holds.

One consequence of Proposition 2 is that if $a$ is any element of $G$ and for some
$b \in G$, $ab = e$ or $ba = e$, then $b = a^{-1}$, i.e., we do not have to show both equations
hold. Also, if for some $b \in G$, $ab = a$ (or $ba = a$), then $b$ must be the identity of $G$,
i.e., we do not have to check $bx = xb = x$ for all $x \in G$.

Definition. For $G$ a group and $x \in G$ define the order of $x$ to be the smallest positive
integer $n$ such that $x^n = 1$, and denote this integer by $|x|$. In this case $x$ is said to be of
order $n$. If no positive power of $x$ is the identity, the order of $x$ is defined to be infinity
and $x$ is said to be of infinite order.

The symbol for the order of $x$ should not be confused with the absolute value symbol
(when $G \subseteq \mathbb{R}$ we shall be careful to distinguish the two). It may seem injudicious to
choose the same symbol for order of an element as the one used to denote the cardinality
(or order) of a set, however, we shall see that the order of an element in a group is the
same as the cardinality of the set of all its (distinct) powers so the two uses of the word
“order” are naturally related.

Examples 

(1) An element of a group has order 1 if and only if it is the identity. 

(2) In the additive groups Z, Q, R or C every nonzero (i.e., nonidentity) element has 
infinite order. 

(3) In the multiplicative groups $\mathbb{R} - \{0\}$ or $\mathbb{Q} - \{0\}$ the element $-1$ has order 2 and all
other nonidentity elements have infinite order.

(4) In the additive group $\mathbb{Z}/9\mathbb{Z}$ the element $\bar{6}$ has order 3, since $\bar{6} \neq \bar{0}$, $\bar{6} + \bar{6} = \overline{12} = \bar{3} \neq \bar{0}$,
but $\bar{6} + \bar{6} + \bar{6} = \overline{18} = \bar{0}$, the identity in this group. Recall that in an additive group the
powers of an element are the integer multiples of the element. Similarly, the order of
the element $\bar{5}$ is 9, since 45 is the smallest positive multiple of 5 that is divisible by 9.

(5) In the multiplicative group $(\mathbb{Z}/7\mathbb{Z})^\times$, the powers of the element $\bar{2}$ are $\bar{2}$, $\bar{4}$, $\bar{8} = \bar{1}$, the
identity in this group, so $\bar{2}$ has order 3. Similarly, the element $\bar{3}$ has order 6, since $3^6$
is the smallest positive power of 3 that is congruent to 1 modulo 7.
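The orders in Examples (4) and (5) can be found by simply listing multiples or powers until the identity appears. The Python sketch below does this with least residues standing in for residue classes; the function names `additive_order` and `multiplicative_order` are hypothetical and used here only for illustration.

```python
from math import gcd


def additive_order(a, n):
    """Order of the class of a in the additive group Z/nZ."""
    k, total = 1, a % n
    while total != 0:                # stop at the first multiple equal to 0
        total = (total + a) % n
        k += 1
    return k


def multiplicative_order(a, n):
    """Order of the class of a in (Z/nZ)^x; requires (a, n) = 1."""
    if gcd(a, n) != 1:
        raise ValueError("a has no multiplicative inverse mod n")
    k, power = 1, a % n
    while power != 1 % n:            # stop at the first power equal to 1
        power = (power * a) % n
        k += 1
    return k


print(additive_order(6, 9), additive_order(5, 9))               # 3 9
print(multiplicative_order(2, 7), multiplicative_order(3, 7))   # 3 6
```

Both loops terminate because the groups involved are finite; in an infinite group such as the additive group of integers no such search would stop for a nonzero element.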

Definition. Let $G = \{g_1, g_2, \ldots, g_n\}$ be a finite group with $g_1 = 1$. The multiplication
table or group table of $G$ is the $n \times n$ matrix whose $i, j$ entry is the group element
$g_i g_j$.

For a finite group the multiplication table contains, in some sense, all the information 
about the group. Computationally, however, it is an unwieldy object (being of size the
square of the group order) and visually it is not a very useful object for determining 
properties of the group. One might think of a group table as the analogue of having a 
table of all the distances between pairs of cities in the country. Such a table is useful 
and, in essence, captures all the distance relationships, yet a map (better yet, a map with 
all the distances labelled on it) is a much easier tool to work with. Part of our initial 
development of the theory of groups (finite groups in particular) is directed towards a 
more conceptual way of visualizing the internal structure of groups. 
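As a small illustration of the definition, the following Python sketch builds the group table of a finite group from a list of its elements and its operation. The function name `group_table` and the choice of $(\mathbb{Z}/5\mathbb{Z})^\times$ (with least residues as representatives) are assumptions made here for the example.

```python
def group_table(elements, op):
    """Return the matrix whose i, j entry is op(elements[i], elements[j])."""
    return [[op(g, h) for h in elements] for g in elements]


# Illustrative choice: (Z/5Z)^x with elements listed so that g_1 = 1.
elements = [1, 2, 3, 4]
table = group_table(elements, lambda a, b: (a * b) % 5)
for row in table:
    print(row)
# [1, 2, 3, 4]
# [2, 4, 1, 3]
# [3, 1, 4, 2]
# [4, 3, 2, 1]
```

Each element appears exactly once in every row and every column, as the cancellation laws of Proposition 2 require. The table has $n^2$ entries, which is the sense in which it becomes unwieldy as the group order grows.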

EXERCISES 


Let G be a group. 

1. Determine which of the following binary operations are associative: 

(a) the operation $\star$ on $\mathbb{Z}$ defined by $a \star b = a - b$

(b) the operation $\star$ on $\mathbb{R}$ defined by $a \star b = a + b + ab$

(c) the operation $\star$ on $\mathbb{Q}$ defined by $a \star b = \frac{a + b}{5}$

(d) the operation $\star$ on $\mathbb{Z} \times \mathbb{Z}$ defined by $(a, b) \star (c, d) = (ad + bc, bd)$

(e) the operation $\star$ on $\mathbb{Q} - \{0\}$ defined by $a \star b = \frac{a}{b}$.

2. Decide which of the binary operations in the preceding exercise are commutative.

3. Prove that addition of residue classes in Z/nZ is associative (you may assume it is well 
defined). 

4. Prove that multiplication of residue classes in Z/nZ is associative (you may assume it is 
well defined). 

5. Prove for all n > 1 that Z/nZ is not a group under multiplication of residue classes. 

6. Determine which of the following sets are groups under addition: 

(a) the set of rational numbers (including 0 = 0/1) in lowest terms whose denominators 
are odd 

(b) the set of rational numbers (including 0 = 0/1) in lowest terms whose denominators 
are even 

(c) the set of rational numbers of absolute value < 1 

(d) the set of rational numbers of absolute value > 1 together with 0 

(e) the set of rational numbers with denominators equal to 1 or 2 

(f) the set of rational numbers with denominators equal to 1, 2 or 3. 

7. Let $G = \{x \in \mathbb{R} \mid 0 \le x < 1\}$ and for $x, y \in G$ let $x \star y$ be the fractional part of $x + y$
(i.e., $x \star y = x + y - [x + y]$ where $[a]$ is the greatest integer less than or equal to $a$).
Prove that $\star$ is a well defined binary operation on $G$ and that $G$ is an abelian group under
$\star$ (called the real numbers mod 1).

8. Let $G = \{z \in \mathbb{C} \mid z^n = 1 \text{ for some } n \in \mathbb{Z}^+\}$.

(a) Prove that $G$ is a group under multiplication (called the group of roots of unity in $\mathbb{C}$).

(b) Prove that $G$ is not a group under addition.

9. Let $G = \{a + b\sqrt{2} \in \mathbb{R} \mid a, b \in \mathbb{Q}\}$.

(a) Prove that G is a group under addition. 

(b) Prove that the nonzero elements of G are a group under multiplication. [“Rationalize 
the denominators” to find multiplicative inverses.] 

10. Prove that a finite group is abelian if and only if its group table is a symmetric matrix. 

11. Find the orders of each element of the additive group Z/12Z. 

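12. Find the orders of the following elements of the multiplicative group $(\mathbb{Z}/12\mathbb{Z})^\times$: $\bar{1}$, $\overline{-1}$,
$\bar{5}$, $\bar{7}$, $\overline{-7}$, $\overline{13}$.

13. Find the orders of the following elements of the additive group $\mathbb{Z}/36\mathbb{Z}$: $\bar{1}$, $\bar{2}$, $\bar{6}$, $\bar{9}$, $\overline{10}$, $\overline{12}$,
$\overline{-1}$, $\overline{-10}$, $\overline{-18}$.

14. Find the orders of the following elements of the multiplicative group $(\mathbb{Z}/36\mathbb{Z})^\times$: $\bar{1}$, $\overline{-1}$,
$\bar{5}$, $\overline{13}$, $\overline{-13}$, $\overline{17}$.

15. Prove that $(a_1 a_2 \cdots a_n)^{-1} = a_n^{-1} a_{n-1}^{-1} \cdots a_1^{-1}$ for all $a_1, a_2, \ldots, a_n \in G$.

16. Let $x$ be an element of $G$. Prove that $x^2 = 1$ if and only if $|x|$ is either 1 or 2.

17. Let $x$ be an element of $G$. Prove that if $|x| = n$ for some positive integer $n$ then $x^{-1} = x^{n-1}$.

18. Let $x$ and $y$ be elements of $G$. Prove that $xy = yx$ if and only if $y^{-1}xy = x$ if and only if
$x^{-1}y^{-1}xy = 1$.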

19. Let $x \in G$ and let $a, b \in \mathbb{Z}^+$.

(a) Prove that $x^{a+b} = x^a x^b$ and $(x^a)^b = x^{ab}$.

(b) Prove that $(x^a)^{-1} = x^{-a}$.

(c) Establish part (a) for arbitrary integers $a$ and $b$ (positive, negative or zero).

20. For $x$ an element in $G$ show that $x$ and $x^{-1}$ have the same order.

21. Let $G$ be a finite group and let $x$ be an element of $G$ of order $n$. Prove that if $n$ is odd, then
$x = (x^2)^k$ for some $k$.

22. If $x$ and $g$ are elements of the group $G$, prove that $|x| = |g^{-1}xg|$. Deduce that $|ab| = |ba|$
for all $a, b \in G$.

23. Suppose $x \in G$ and $|x| = n < \infty$. If $n = st$ for some positive integers $s$ and $t$, prove that
$|x^s| = t$.

24. If $a$ and $b$ are commuting elements of $G$, prove that $(ab)^n = a^n b^n$ for all $n \in \mathbb{Z}$. [Do this
by induction for positive $n$ first.]

25. Prove that if $x^2 = 1$ for all $x \in G$ then $G$ is abelian.

26. Assume $H$ is a nonempty subset of $(G, \star)$ which is closed under the binary operation on
$G$ and is closed under inverses, i.e., for all $h$ and $k \in H$, $hk$ and $h^{-1} \in H$. Prove that $H$ is
a group under the operation $\star$ restricted to $H$ (such a subset $H$ is called a subgroup of $G$).

27. Prove that if $x$ is an element of the group $G$ then $\{x^n \mid n \in \mathbb{Z}\}$ is a subgroup (cf. the
preceding exercise) of $G$ (called the cyclic subgroup of $G$ generated by $x$).

28. Let $(A, \star)$ and $(B, \diamond)$ be groups and let $A \times B$ be their direct product (as defined in Example
6). Verify all the group axioms for $A \times B$:

(a) prove that the associative law holds: for all $(a_i, b_i) \in A \times B$, $i = 1, 2, 3$,
$(a_1, b_1)[(a_2, b_2)(a_3, b_3)] = [(a_1, b_1)(a_2, b_2)](a_3, b_3)$,

(b) prove that $(1, 1)$ is the identity of $A \times B$, and

(c) prove that the inverse of $(a, b)$ is $(a^{-1}, b^{-1})$.

29. Prove that A x B is an abelian group if and only if both A and B are abelian. 

30. Prove that the elements $(a, 1)$ and $(1, b)$ of $A \times B$ commute and deduce that the order of
$(a, b)$ is the least common multiple of $|a|$ and $|b|$.

31. Prove that any finite group $G$ of even order contains an element of order 2. [Let $t(G)$ be
the set $\{g \in G \mid g \neq g^{-1}\}$. Show that $t(G)$ has an even number of elements and every
nonidentity element of $G - t(G)$ has order 2.]

32. If $x$ is an element of finite order $n$ in $G$, prove that the elements $1, x, x^2, \ldots, x^{n-1}$ are all
distinct. Deduce that $|x| \le |G|$.

33. Let $x$ be an element of finite order $n$ in $G$.

(a) Prove that if $n$ is odd then $x^i \neq x^{-i}$ for all $i = 1, 2, \ldots, n - 1$.

(b) Prove that if $n = 2k$ and $1 \le i < n$ then $x^i = x^{-i}$ if and only if $i = k$.

34. If $x$ is an element of infinite order in $G$, prove that the elements $x^n$, $n \in \mathbb{Z}$ are all distinct.

35. If $x$ is an element of finite order $n$ in $G$, use the Division Algorithm to show that any
integral power of $x$ equals one of the elements in the set $\{1, x, x^2, \ldots, x^{n-1}\}$ (so these are
all the distinct elements of the cyclic subgroup (cf. Exercise 27 above) of $G$ generated by
$x$).

36. Assume $G = \{1, a, b, c\}$ is a group of order 4 with identity 1. Assume also that $G$ has no
elements of order 4 (so by Exercise 32, every element has order $\le 3$). Use the cancellation
laws to show that there is a unique group table for $G$. Deduce that $G$ is abelian.


1.2 DIHEDRAL GROUPS 

An important family of examples of groups is the class of groups whose elements are 
symmetries of geometric objects. The simplest subclass is when the geometric objects 
are regular planar figures. 

For each $n \in \mathbb{Z}^+$, $n \ge 3$ let $D_{2n}$ be the set of symmetries of a regular $n$-gon, where
a symmetry is any rigid motion of the n-gon which can be effected by taking a copy 
of the n-gon, moving this copy in any fashion in 3-space and then placing the copy 
back on the original n-gon so it exactly covers it. More precisely, we can describe the 
symmetries by first choosing a labelling of the n vertices, for example as shown in the 
following figure. 



[Figure: a regular $n$-gon with its $n$ vertices labelled]


Then each symmetry $s$ can be described uniquely by the corresponding permutation $\sigma$
of $\{1, 2, 3, \ldots, n\}$ where if the symmetry $s$ puts vertex $i$ in the place where vertex $j$
was originally, then $\sigma$ is the permutation sending $i$ to $j$. For instance, if $s$ is a rotation
of $2\pi/n$ radians clockwise about the center of the $n$-gon, then $\sigma$ is the permutation
sending $i$ to $i + 1$, $1 \le i \le n - 1$, and $\sigma(n) = 1$. Now make $D_{2n}$ into a group by
defining $st$ for $s, t \in D_{2n}$ to be the symmetry obtained by first applying $t$ then $s$ to
the $n$-gon (note that we are viewing symmetries as functions on the $n$-gon, so $st$ is just
function composition, read as usual from right to left). If $s$, $t$ effect the permutations
$\sigma$, $\tau$, respectively on the vertices, then $st$ effects $\sigma \circ \tau$. The binary operation on $D_{2n}$
is associative since composition of functions is associative. The identity of $D_{2n}$ is the
identity symmetry (which leaves all vertices fixed), denoted by 1, and the inverse of
$s \in D_{2n}$ is the symmetry which reverses all rigid motions of $s$ (so if $s$ effects the permutation
$\sigma$ on the vertices, $s^{-1}$ effects $\sigma^{-1}$). In the next paragraph we show

$|D_{2n}| = 2n$

and so $D_{2n}$ is called the dihedral group of order $2n$. In some texts this group is written
$D_n$; however, $D_{2n}$ (where the subscript gives the order of the group rather than the
number of vertices) is more common in the group theory literature.

To find the order $|D_{2n}|$ observe that given any vertex $i$, there is a symmetry which
sends vertex 1 into position $i$. Since vertex 2 is adjacent to vertex 1, vertex 2 must
end up in position $i+1$ or $i-1$ (where $n+1$ is 1 and $1-1$ is $n$, i.e., the integers
labelling the vertices are read mod $n$). Moreover, by following the first symmetry by a
reflection about the line through vertex $i$ and the center of the $n$-gon one sees that vertex
2 can be sent to either position $i+1$ or $i-1$ by some symmetry. Thus there are $n \cdot 2$
positions the ordered pair of vertices 1, 2 may be sent to upon applying symmetries.
Since symmetries are rigid motions one sees that once the position of the ordered pair
of vertices 1, 2 has been specified, the action of the symmetry on all remaining vertices
is completely determined. Thus there are exactly $2n$ symmetries of a regular $n$-gon. We
can, moreover, explicitly exhibit $2n$ symmetries. These symmetries are the $n$ rotations
about the center through $2\pi i/n$ radians, $0 \le i \le n-1$, and the $n$ reflections through the
$n$ lines of symmetry (if $n$ is odd, each symmetry line passes through a vertex and the
mid-point of the opposite side; if $n$ is even, there are $n/2$ lines of symmetry which pass
through 2 opposite vertices and $n/2$ which perpendicularly bisect two opposite sides).
For example, if $n = 4$ and we draw a square at the origin in an $x, y$ plane, the lines of
symmetry are the lines $x = 0$ ($y$-axis), $y = 0$ ($x$-axis), $y = x$ and $y = -x$ (note that “reflection”
through the origin is not a reflection but a rotation of $\pi$ radians).

Since dihedral groups will be used extensively as an example throughout the text
we fix some notation and mention some calculations which will simplify future
computations and assist in viewing $D_{2n}$ as an abstract group (rather than having to return to
the geometric setting at every instance). Fix a regular $n$-gon centered at the origin in an
$x, y$ plane and label the vertices consecutively from 1 to $n$ in a clockwise manner. Let $r$
be the rotation clockwise about the origin through $2\pi/n$ radians. Let $s$ be the reflection
about the line of symmetry through vertex 1 and the origin (we use the same letters for
each $n$, but the context will always make $n$ clear). We leave the details of the following
calculations as an exercise (for the most part we shall be working with $D_6$ and $D_8$, so
the reader may wish to try these exercises for $n = 3$ and $n = 4$ first):

(1) $1, r, r^2, \dots, r^{n-1}$ are all distinct and $r^n = 1$, so $|r| = n$.

(2) $|s| = 2$.

(3) $s \ne r^i$ for any $i$.

(4) $sr^i \ne sr^j$, for all $0 \le i, j \le n-1$ with $i \ne j$, so

$$D_{2n} = \{1, r, r^2, \dots, r^{n-1}, s, sr, sr^2, \dots, sr^{n-1}\}$$

i.e., each element can be written uniquely in the form $s^k r^i$ for some $k = 0$ or
1 and $0 \le i \le n-1$.

(5) $rs = sr^{-1}$. [First work out what permutation $s$ effects on $\{1, 2, \dots, n\}$ and
then work out separately what each side in this equation does to vertices 1
and 2.] This shows in particular that $r$ and $s$ do not commute so that $D_{2n}$ is
non-abelian.

(6) $r^i s = sr^{-i}$, for all $0 \le i \le n$. [Proceed by induction on $i$ and use the fact that
$r^{i+1}s = r(r^i s)$ together with the preceding calculation.] This indicates how to
commute $s$ with powers of $r$.
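
Calculations (1) to (6) can be checked by machine for small $n$. The following Python sketch is not part of the argument and does not replace the exercise. It assumes the vertex-permutation description given earlier, with $r$ sending $i$ to $i+1$ (and $n$ to 1) and $s$ fixing vertex 1 while interchanging vertices 2 and $n$, 3 and $n-1$, and so on. The names `compose`, `power` and `dihedral` are our own.

```python
def compose(a, b):
    """Return a∘b (apply b first, then a).

    A permutation of {1, ..., n} is stored as a tuple p with p[i-1] = image of i.
    """
    return tuple(a[b[i] - 1] for i in range(len(a)))


def power(p, k):
    """Return p composed with itself k times (k >= 0); power(p, 0) is the identity."""
    result = tuple(range(1, len(p) + 1))
    for _ in range(k):
        result = compose(p, result)
    return result


def dihedral(n):
    """Return (r, s, elements), where elements is the set of all s^k r^i."""
    r = tuple(i % n + 1 for i in range(1, n + 1))        # i -> i+1, n -> 1
    s = tuple((1 - i) % n + 1 for i in range(1, n + 1))   # fixes 1, swaps 2 and n, ...
    elements = {compose(power(s, k), power(r, i))
                for k in (0, 1) for i in range(n)}
    return r, s, elements


if __name__ == "__main__":
    for n in (3, 4, 5, 12):
        r, s, elements = dihedral(n)
        identity = tuple(range(1, n + 1))
        print(n,
              len(elements) == 2 * n,                        # the 2n forms are distinct
              power(r, n) == identity,                       # r^n = 1
              power(s, 2) == identity,                       # s^2 = 1
              compose(r, s) == compose(s, power(r, n - 1)))  # rs = s r^{-1}
```

A check of this kind only confirms the relations for the particular values of $n$ tried; the general statements still require the arguments indicated in (1) to (6).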

Having done these calculations, we now observe that the complete multiplication
table of $D_{2n}$ can be written in terms of $r$ and $s$ alone, that is, all the elements of $D_{2n}$ have a
(unique) representation in the form $s^k r^i$, $k = 0$ or 1 and $0 \le i \le n-1$, and any product
of two elements in this form can be reduced to another in the same form using only
“relations” (1), (2) and (6) (reducing all exponents mod $n$). For example, if $n = 12$,

$$(sr^9)(sr^6) = s(r^9 s)r^6 = s(sr^{-9})r^6 = s^2 r^{-9+6} = r^{-3} = r^9.$$
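
The reduction just described can be written as a short rule on exponents. In the sketch below (an illustration under our own conventions, not notation used in the text) the pair `(k, i)` stands for the element $s^k r^i$, and the hypothetical helper `multiply` applies relation (6) to move $s$ to the left and then reduces exponents using (1) and (2).

```python
def multiply(x, y, n):
    """Multiply x = s^a r^i and y = s^b r^j in D_{2n}; return the pair for s^k r^m.

    If b = 0 the powers of r simply add.  If b = 1 we use r^i s = s r^{-i}:
    (s^a r^i)(s r^j) = s^a (r^i s) r^j = s^{a+1} r^{j-i}.
    """
    (a, i), (b, j) = x, y
    if b == 0:
        return (a, (i + j) % n)
    return ((a + 1) % 2, (j - i) % n)


if __name__ == "__main__":
    # The worked example with n = 12: (s r^9)(s r^6) = r^9.
    print(multiply((1, 9), (1, 6), 12))
```

Storing only the pair $(k, i)$ relies on the uniqueness statement in (4); without it the same element could have several names and the rule would not be well defined.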


Generators and Relations 

The use of the generators r and s for the dihedral group provides a simple and succinct 
way of computing in $D_{2n}$. We can similarly introduce the notions of generators and
relations for arbitrary groups. It is useful to have these concepts early (before their 
formal justification) since they provide simple ways of describing and computing in 
many groups. Generators will be discussed in greater detail in Section 2.4, and both 
concepts will be treated rigorously in Section 6.3 when we introduce the notion of free 
groups. 

A subset $S$ of elements of a group $G$ with the property that every element of $G$
can be written as a (finite) product of elements of $S$ and their inverses is called a set of
generators of $G$. We shall indicate this notationally by writing $G = \langle S \rangle$ and say $G$
is generated by $S$ or $S$ generates $G$. For example, the integer 1 is a generator for the
additive group $\mathbb{Z}$ of integers since every integer is a sum of a finite number of $+1$’s and
$-1$’s, so $\mathbb{Z} = \langle 1 \rangle$. By property (4) of $D_{2n}$ the set $S = \{r, s\}$ is a set of generators of
$D_{2n}$, so $D_{2n} = \langle r, s \rangle$. We shall see later that in a finite group $G$ the set $S$ generates
$G$ if every element of $G$ is a finite product of elements of $S$ (i.e., it is not necessary to
include the inverses of the elements of $S$ as well).

Any equations in a general group G that the generators satisfy are called relations 
in $G$. Thus in $D_{2n}$ we have relations: $r^n = 1$, $s^2 = 1$ and $rs = sr^{-1}$. Moreover, in
$D_{2n}$ these three relations have the additional property that any other relation between
elements of the group may be derived from these three (this is not immediately obvious; 
it follows from the fact that we can determine exactly when two group elements are 
equal by using only these three relations). 

In general, if some group $G$ is generated by a subset $S$ and there is some collection
of relations, say $R_1, R_2, \dots, R_m$ (here each $R_i$ is an equation in the elements from
$S \cup \{1\}$) such that any relation among the elements of $S$ can be deduced from these, we
shall call these generators and relations a presentation of $G$ and write

$$G = \langle\, S \mid R_1, R_2, \dots, R_m \,\rangle.$$

One presentation for the dihedral group $D_{2n}$ (using the generators and relations above)
is then

$$D_{2n} = \langle\, r, s \mid r^n = s^2 = 1,\ rs = sr^{-1} \,\rangle. \qquad (1.1)$$

We shall see that using this presentation to describe $D_{2n}$ (rather than always reverting
to the original geometric description) will greatly simplify working with these groups.

Presentations give an easy way of describing many groups, but there are a number of 
subtleties that need to be considered. One of these is that in an arbitrary presentation it 
may be difficult (or even impossible) to tell when two elements of the group (expressed 
in terms of the given generators) are equal. As a result it may not be evident what the 
order of the presented group is, or even whether the group is finite or infinite! For 
example, one can show that $\langle\, x_1, y_1 \mid x_1^2 = y_1^2 = (x_1y_1)^2 = 1 \,\rangle$ is a presentation of a
group of order 4, whereas $\langle\, x_2, y_2 \mid x_2^3 = y_2^3 = (x_2y_2)^3 = 1 \,\rangle$ is a presentation of an
infinite group (cf. the exercises).

Another subtlety is that even in quite simple presentations, some “collapsing” may
occur because the relations are intertwined in some unobvious way, i.e., there may be 
“hidden,” or implicit, relations that are not explicitly given in the presentation but rather 
are consequences of the specified ones. This collapsing makes it difficult in general to 
determine even a lower bound for the size of the group being presented. For example, 
suppose one mimicked the presentation of $D_{2n}$ in an attempt to create another group by
defining:

$$X_{2n} = \langle\, x, y \mid x^n = y^2 = 1,\ xy = yx^2 \,\rangle. \qquad (1.2)$$

The “commutation” relation $xy = yx^2$ determines how to commute $y$ and $x$ (i.e., how
to “move” $y$ from the right of $x$ to the left), so that just as in the group $D_{2n}$ every element
in this group can be written in the form $y^k x^i$ with all the powers of $y$ on the left and all
the powers of $x$ on the right. Also, by the first two relations any powers of $x$ and $y$ can be
reduced so that $i$ lies between 0 and $n-1$ and $k$ is 0 or 1. One might therefore suppose
that $X_{2n}$ is again a group of order $2n$. This is not the case because in this group there is
a “hidden” relation obtained from the relation $x = xy^2$ (since $y^2 = 1$) by applying the
commutation relation and the associative law repeatedly to move the $y$’s to the left:

$$x = xy^2 = (xy)y = (yx^2)y = (yx)(xy) = (yx)(yx^2)$$

$$= y(xy)x^2 = y(yx^2)x^2 = y^2x^4 = x^4.$$

Since $x^4 = x$ it follows by the cancellation laws that $x^3 = 1$ in $X_{2n}$, and from the
discussion above it follows that $X_{2n}$ has order at most 6 for any $n$. Even more collapsing
may occur, depending on the value of $n$ (see the exercises).

As another example, consider the presentation 

$$Y = \langle\, u, v \mid u^4 = v^3 = 1,\ uv = v^2u^2 \,\rangle. \qquad (1.3)$$

In this case it is tempting to guess that Y is a group of order 12, but again there are 
additional implicit relations. In fact this group Y degenerates to the trivial group of 
order 1, i.e., u and v satisfy the additional relations u = 1 and v = 1 (a proof is outlined 
in the exercises). 

This kind of collapsing does not occur for the presentation of $D_{2n}$ because we
showed by independent (geometric) means that there is a group of order $2n$ with
generators $r$ and $s$ and satisfying the relations in (1). As a result, a group with only these
relations must have order at least $2n$. On the other hand, it is easy to see (using the
same sort of argument for $X_{2n}$ above and the commutation relation $rs = sr^{-1}$) that any
group defined by the generators and relations in (1) has order at most $2n$. It follows that
the group with presentation (1) has order exactly $2n$ and also that this group is indeed
the group of symmetries of the regular $n$-gon.

The additional information we have for the presentation (1) is the existence of a 
group of known order satisfying this information. In contrast, we have no independent 
knowledge about any groups satisfying the relations in either (2) or (3). Without such 
independent “lower bound” information we might not even be able to determine whether
a given presentation just describes the trivial group, as in (3). 

While in general it is necessary to be extremely careful in prescribing groups by 
presentations, the use of presentations for known groups is a powerful conceptual and 
computational tool. Additional results about presentations, including more elaborate 
examples, appear in Section 6.3. 


EXERCISES 

In these exercises, $D_{2n}$ has the usual presentation $D_{2n} = \langle\, r, s \mid r^n = s^2 = 1,\ rs = sr^{-1} \,\rangle$.

1. Compute the order of each of the elements in the following groups:

(a) $D_6$ (b) $D_8$ (c) $D_{10}$.

2. Use the generators and relations above to show that if $x$ is any element of $D_{2n}$ which is
not a power of $r$, then $rx = xr^{-1}$.

3. Use the generators and relations above to show that every element of $D_{2n}$ which is not a
power of $r$ has order 2. Deduce that $D_{2n}$ is generated by the two elements $s$ and $sr$, both
of which have order 2.

4. If $n = 2k$ is even and $n \ge 4$, show that $z = r^k$ is an element of order 2 which commutes
with all elements of $D_{2n}$. Show also that $z$ is the only nonidentity element of $D_{2n}$ which
commutes with all elements of $D_{2n}$. [cf. Exercise 33 of Section 1.]

5. If $n$ is odd and $n \ge 3$, show that the identity is the only element of $D_{2n}$ which commutes
with all elements of $D_{2n}$. [cf. Exercise 33 of Section 1.]

6. Let $x$ and $y$ be elements of order 2 in any group $G$. Prove that if $t = xy$ then $tx = xt^{-1}$
(so that if $n = |xy| < \infty$ then $x, t$ satisfy the same relations in $G$ as $s, r$ do in $D_{2n}$).

7. Show that $\langle\, a, b \mid a^2 = b^2 = (ab)^n = 1 \,\rangle$ gives a presentation for $D_{2n}$ in terms of the two
generators $a = s$ and $b = sr$ of order 2 computed in Exercise 3 above. [Show that the
relations for $r$ and $s$ follow from the relations for $a$ and $b$ and, conversely, the relations for
$a$ and $b$ follow from those for $r$ and $s$.]

8. Find the order of the cyclic subgroup of $D_{2n}$ generated by $r$ (cf. Exercise 27 of Section 1).

In each of Exercises 9 to 13 you can find the order of the group of rigid motions in $\mathbb{R}^3$ (also called
the group of rotations) of the given Platonic solid by following the proof for the order of $D_{2n}$:
find the number of positions to which an adjacent pair of vertices can be sent. Alternatively,
you can find the number of places to which a given face may be sent and, once a face is fixed,
the number of positions to which a vertex on that face may be sent.

9. Let $G$ be the group of rigid motions in $\mathbb{R}^3$ of a tetrahedron. Show that $|G| = 12$.

10. Let $G$ be the group of rigid motions in $\mathbb{R}^3$ of a cube. Show that $|G| = 24$.

11. Let $G$ be the group of rigid motions in $\mathbb{R}^3$ of an octahedron. Show that $|G| = 24$.

12. Let $G$ be the group of rigid motions in $\mathbb{R}^3$ of a dodecahedron. Show that $|G| = 60$.

13. Let $G$ be the group of rigid motions in $\mathbb{R}^3$ of an icosahedron. Show that $|G| = 60$.

14. Find a set of generators for $\mathbb{Z}$.

15. Find a set of generators and relations for $\mathbb{Z}/n\mathbb{Z}$.

16. Show that the group $\langle\, x_1, y_1 \mid x_1^2 = y_1^2 = (x_1y_1)^2 = 1 \,\rangle$ is the dihedral group $D_4$ (where
$x_1$ may be replaced by the letter $r$ and $y_1$ by $s$). [Show that the last relation is the same as:
$x_1y_1 = y_1x_1^{-1}$.]

17. Let $X_{2n}$ be the group whose presentation is displayed in (1.2).

(a) Show that if $n = 3k$, then $X_{2n}$ has order 6, and it has the same generators and relations
as $D_6$ when $x$ is replaced by $r$ and $y$ by $s$.

(b) Show that if $(3, n) = 1$, then $x$ satisfies the additional relation: $x = 1$. In this case
deduce that $X_{2n}$ has order 2. [Use the facts that $x^n = 1$ and $x^3 = 1$.]

18. Let $Y$ be the group whose presentation is displayed in (1.3).

(a) Show that $v^2 = v^{-1}$. [Use the relation: $v^3 = 1$.]

(b) Show that $v$ commutes with $u^3$. [Show that $v^2u^3v = u^3$ by writing the left hand side
as $(v^2u^2)(uv)$ and using the relations to reduce this to the right hand side. Then use
part (a).]

(c) Show that $v$ commutes with $u$. [Show that $u^9 = u$ and then use part (b).]

(d) Show that $uv = 1$. [Use part (c) and the last relation.]

(e) Show that $u = 1$, deduce that $v = 1$, and conclude that $Y = 1$. [Use part (d) and the
equation $u^4v^3 = 1$.]


1.3 SYMMETRIC GROUPS


Let $\Omega$ be any nonempty set and let $S_\Omega$ be the set of all bijections from $\Omega$ to itself (i.e.,
the set of all permutations of $\Omega$). The set $S_\Omega$ is a group under function composition: $\circ$.
Note that $\circ$ is a binary operation on $S_\Omega$ since if $\sigma : \Omega \to \Omega$ and $\tau : \Omega \to \Omega$ are both
bijections, then $\sigma \circ \tau$ is also a bijection from $\Omega$ to $\Omega$. Since function composition is
associative in general, $\circ$ is associative. The identity of $S_\Omega$ is the permutation 1 defined
by $1(a) = a$, for all $a \in \Omega$. For every permutation $\sigma$ there is a (2-sided) inverse
function, $\sigma^{-1} : \Omega \to \Omega$ satisfying $\sigma \circ \sigma^{-1} = \sigma^{-1} \circ \sigma = 1$. Thus, all the group axioms
hold for $(S_\Omega, \circ)$. This group is called the symmetric group on the set $\Omega$. It is important
to recognize that the elements of $S_\Omega$ are the permutations of $\Omega$, not the elements of $\Omega$
itself.

In the special case when $\Omega = \{1, 2, 3, \dots, n\}$, the symmetric group on $\Omega$ is
denoted $S_n$, the symmetric group of degree $n$.¹ The group $S_n$ will play an important role
throughout the text both as a group of considerable interest in its own right and as a
means of illustrating and motivating the general theory.

First we show that the order of $S_n$ is $n!$. The permutations of $\{1, 2, 3, \dots, n\}$ are
precisely the injective functions of this set to itself because it is finite (Proposition 0.1)
and we can count the number of injective functions. An injective function $\sigma$ can send
the number 1 to any of the $n$ elements of $\{1, 2, 3, \dots, n\}$; $\sigma(2)$ can then be any one of
the elements of this set except $\sigma(1)$ (so there are $n-1$ choices for $\sigma(2)$); $\sigma(3)$ can be
any element except $\sigma(1)$ or $\sigma(2)$ (so there are $n-2$ choices for $\sigma(3)$), and so on. Thus
there are precisely $n \cdot (n-1) \cdot (n-2) \cdots 2 \cdot 1 = n!$ possible injective functions from
$\{1, 2, 3, \dots, n\}$ to itself. Hence there are precisely $n!$ permutations of $\{1, 2, 3, \dots, n\}$
so there are precisely $n!$ elements in $S_n$.
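
For very small $n$ the count can be confirmed by listing every function from $\{1, \dots, n\}$ to itself and keeping the injective ones. The Python sketch below is only an illustration of the counting argument; the function name `count_injective_self_maps` is our own, and the enumeration of all $n^n$ functions is practical only for small $n$.

```python
from itertools import product
from math import factorial


def count_injective_self_maps(n):
    """Count the injective functions from {1, ..., n} to itself by brute force."""
    points = range(1, n + 1)
    count = 0
    for images in product(points, repeat=n):   # images[i-1] plays the role of sigma(i)
        if len(set(images)) == n:              # injective iff no image is repeated
            count += 1
    return count


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, count_injective_self_maps(n), factorial(n))
```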

We now describe an efficient notation for writing elements $\sigma$ of $S_n$ which we shall
use throughout the text and which is called the cycle decomposition.

A cycle is a string of integers which represents the element of $S_n$ which cyclically
permutes these integers (and fixes all other integers). The cycle $(a_1\ a_2\ \dots\ a_m)$ is the
permutation which sends $a_i$ to $a_{i+1}$, $1 \le i \le m-1$ and sends $a_m$ to $a_1$. For example
(2 1 3) is the permutation which maps 2 to 1, 1 to 3 and 3 to 2. In general, for each
$\sigma \in S_n$ the numbers from 1 to $n$ will be rearranged and grouped into $k$ cycles of the
form

$$(a_1\ a_2\ \dots\ a_{m_1})(a_{m_1+1}\ a_{m_1+2}\ \dots\ a_{m_2}) \cdots (a_{m_{k-1}+1}\ a_{m_{k-1}+2}\ \dots\ a_{m_k})$$

from which the action of $\sigma$ on any number from 1 to $n$ can easily be read, as follows.
For any $x \in \{1, 2, 3, \dots, n\}$ first locate $x$ in the above expression. If $x$ is not followed
immediately by a right parenthesis (i.e., $x$ is not at the right end of one of the $k$ cycles),
then $\sigma(x)$ is the integer appearing immediately to the right of $x$. If $x$ is followed by a
right parenthesis, then $\sigma(x)$ is the number which is at the start of the cycle ending with
$x$ (i.e., if $x = a_{m_i}$, for some $i$, then $\sigma(x) = a_{m_{i-1}+1}$ (where $m_0$ is taken to be 0)). We
can represent this description of $\sigma$ by


[Figure: each of the $k$ cycles drawn as a closed loop of arrows.]

The product of all the cycles is called the cycle decomposition of $\sigma$.

¹ We shall see in Section 6 that the structure of $S_\Omega$ depends only on the cardinality of $\Omega$, not on the
particular elements of $\Omega$ itself, so if $\Omega$ is any finite set with $n$ elements, then $S_\Omega$ “looks like” $S_n$.

We now give an algorithm for computing the cycle decomposition of an element $\sigma$
of $S_n$ and work through the algorithm with a specific permutation. We defer the proof
of this algorithm and full analysis of the uniqueness aspects of the cycle decomposition
until Chapter 4.

Let $n = 13$ and let $\sigma \in S_{13}$ be defined by

$\sigma(1) = 12$, $\sigma(2) = 13$, $\sigma(3) = 3$, $\sigma(4) = 1$, $\sigma(5) = 11$,

$\sigma(6) = 9$, $\sigma(7) = 5$, $\sigma(8) = 10$, $\sigma(9) = 6$, $\sigma(10) = 4$,

$\sigma(11) = 7$, $\sigma(12) = 8$, $\sigma(13) = 2$.

Cycle Decomposition Algorithm

Step 1.
Method: To start a new cycle pick the smallest element of $\{1, 2, \dots, n\}$ which has not yet
appeared in a previous cycle; call it $a$ (if you are just starting, $a = 1$); begin the new
cycle: $(a$
Example: $(1$

Step 2.
Method: Read off $\sigma(a)$ from the given description of $\sigma$; call it $b$. If $b = a$, close the cycle
with a right parenthesis (without writing $b$ down); this completes a cycle, so return to
step 1. If $b \ne a$, write $b$ next to $a$ in this cycle: $(a\ b$
Example: $\sigma(1) = 12 = b$, $12 \ne 1$ so write: $(1\ 12$

Step 3.
Method: Read off $\sigma(b)$ from the given description of $\sigma$; call it $c$. If $c = a$, close the cycle
with a right parenthesis to complete the cycle, and return to step 1. If $c \ne a$, write $c$ next
to $b$ in this cycle: $(a\ b\ c$. Repeat this step using the number $c$ as the new value for $b$
until the cycle closes.
Example: $\sigma(12) = 8$, $8 \ne 1$ so continue the cycle as: $(1\ 12\ 8$

Naturally this process stops when all the numbers from $\{1, 2, \dots, n\}$ have appeared
in some cycle. For the particular $\sigma$ in the example this gives

$$\sigma = (1\ 12\ 8\ 10\ 4)(2\ 13)(3)(5\ 11\ 7)(6\ 9).$$

The length of a cycle is the number of integers which appear in it. A cycle of length 
t is called a t-cycle. Two cycles are called disjoint if they have no numbers in common. 
Thus the element $\sigma$ above is the product of 5 (pairwise) disjoint cycles: a 5-cycle, a
2-cycle, a 1-cycle, a 3-cycle, and another 2-cycle.

Henceforth we adopt the convention that 1-cycles will not be written. Thus if some
integer, $i$, does not appear in the cycle decomposition of a permutation $\tau$ it is understood
that $\tau(i) = i$, i.e., that $\tau$ fixes $i$. The identity permutation of $S_n$ has cycle decomposition
$(1)(2) \dots (n)$ and will be written simply as 1. Hence the final step of the algorithm is:

Cycle Decomposition Algorithm (cont.) 


Final Step: Remove all cycles of length 1 


The cycle decomposition for the particular $\sigma$ in the example is therefore

$$\sigma = (1\ 12\ 8\ 10\ 4)(2\ 13)(5\ 11\ 7)(6\ 9).$$
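
The algorithm, including its final step, translates directly into a short program. The Python sketch below is one possible rendering under our own conventions: a permutation is stored as a dictionary sending each of $1, \dots, n$ to its image, and the function name `cycle_decomposition` and the flag `keep_fixed_points` are hypothetical names introduced for the illustration.

```python
def cycle_decomposition(sigma, keep_fixed_points=False):
    """Return the cycles of sigma as a list of tuples.

    sigma is a dict mapping each of 1, ..., n to its image.  Each new cycle starts
    at the smallest number not yet used, as in the first step of the algorithm.
    Cycles of length 1 are dropped unless keep_fixed_points is True (the final step).
    """
    seen = set()
    cycles = []
    for a in sorted(sigma):
        if a in seen:
            continue
        cycle = [a]            # begin the new cycle: (a
        seen.add(a)
        b = sigma[a]
        while b != a:          # keep reading off images until we return to a
            cycle.append(b)
            seen.add(b)
            b = sigma[b]
        if len(cycle) > 1 or keep_fixed_points:
            cycles.append(tuple(cycle))
    return cycles


# The permutation of {1, ..., 13} used in the worked example.
sigma = {1: 12, 2: 13, 3: 3, 4: 1, 5: 11, 6: 9, 7: 5,
         8: 10, 9: 6, 10: 4, 11: 7, 12: 8, 13: 2}

if __name__ == "__main__":
    print(cycle_decomposition(sigma, keep_fixed_points=True))
    print(cycle_decomposition(sigma))
```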

This convention has the advantage that the cycle decomposition of an element $\tau$ of
$S_n$ is also the cycle decomposition of the permutation in $S_m$ for $m \ge n$ which acts as $\tau$
on $\{1, 2, 3, \dots, n\}$ and fixes each element of $\{n+1, n+2, \dots, m\}$. Thus, for example,
(1 2) is the permutation which interchanges 1 and 2 and fixes all larger integers whether
viewed in $S_2$, $S_3$ or $S_4$, etc.

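As another example, the 6 elements of $S_3$ have the following cycle decompositions:

The group $S_3$

Values of $\sigma_i$                                          Cycle Decomposition of $\sigma_i$

$\sigma_1(1) = 1$, $\sigma_1(2) = 2$, $\sigma_1(3) = 3$       1

$\sigma_2(1) = 1$, $\sigma_2(2) = 3$, $\sigma_2(3) = 2$       (2 3)

$\sigma_3(1) = 3$, $\sigma_3(2) = 2$, $\sigma_3(3) = 1$       (1 3)

$\sigma_4(1) = 2$, $\sigma_4(2) = 1$, $\sigma_4(3) = 3$       (1 2)

$\sigma_5(1) = 2$, $\sigma_5(2) = 3$, $\sigma_5(3) = 1$       (1 2 3)

$\sigma_6(1) = 3$, $\sigma_6(2) = 1$, $\sigma_6(3) = 2$       (1 3 2)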

For any $\sigma \in S_n$, the cycle decomposition of $\sigma^{-1}$ is obtained by writing the
numbers in each cycle of the cycle decomposition of $\sigma$ in reverse order. For example, if
$\sigma = (1\ 12\ 8\ 10\ 4)(2\ 13)(5\ 11\ 7)(6\ 9)$ is the element of $S_{13}$ described before then

$$\sigma^{-1} = (4\ 10\ 8\ 12\ 1)(13\ 2)(7\ 11\ 5)(9\ 6).$$

Computing products in $S_n$ is straightforward, keeping in mind that when computing
$\sigma \circ \tau$ in $S_n$ one reads the permutations from right to left. One simply “follows” the
elements under the successive permutations. For example, in the product $(1\ 2\ 3) \circ
(1\ 2)(3\ 4)$ the number 1 is sent to 2 by the first permutation, then 2 is sent to 3 by
the second permutation, hence the composite maps 1 to 3. To compute the cycle
decomposition of the product we need next to see what happens to 3. It is sent first to 4,
then 4 is fixed, so 3 is mapped to 4 by the composite map. Similarly, 4 is first mapped to
3 then 3 is mapped to 1, completing this cycle in the product: (1 3 4). Finally, 2 is sent
to 1, then 1 is sent to 2 so 2 is fixed by this product and so $(1\ 2\ 3) \circ (1\ 2)(3\ 4) = (1\ 3\ 4)$
is the cycle decomposition of the product.

As additional examples, 

$$(1\ 2) \circ (1\ 3) = (1\ 3\ 2) \quad \text{and} \quad (1\ 3) \circ (1\ 2) = (1\ 2\ 3).$$

In particular this shows that

$S_n$ is a non-abelian group for all $n \ge 3$.
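
The products above can be reproduced mechanically by following each number through the right-hand permutation first and then the left-hand one. The Python sketch below is an illustration only. The helper names `from_cycles`, `compose` and `to_cycles` are our own, permutations are stored as dictionaries on $\{1, \dots, n\}$, and `from_cycles` assumes the cycles it is given are pairwise disjoint.

```python
def from_cycles(cycles, n):
    """Build the permutation of {1, ..., n} given by pairwise disjoint cycles."""
    perm = {i: i for i in range(1, n + 1)}
    for cycle in cycles:
        for pos, a in enumerate(cycle):
            perm[a] = cycle[(pos + 1) % len(cycle)]   # a goes to its right neighbour
    return perm


def compose(sigma, tau):
    """Return sigma∘tau: apply tau first, then sigma (read right to left)."""
    return {i: sigma[tau[i]] for i in tau}


def to_cycles(perm):
    """Cycle decomposition of perm with 1-cycles omitted."""
    seen, cycles = set(), []
    for a in sorted(perm):
        if a in seen:
            continue
        cycle, b = [a], perm[a]
        seen.add(a)
        while b != a:
            cycle.append(b)
            seen.add(b)
            b = perm[b]
        if len(cycle) > 1:
            cycles.append(tuple(cycle))
    return cycles


if __name__ == "__main__":
    n = 4
    print(to_cycles(compose(from_cycles([(1, 2, 3)], n),
                            from_cycles([(1, 2), (3, 4)], n))))
    print(to_cycles(compose(from_cycles([(1, 2)], n), from_cycles([(1, 3)], n))))
    print(to_cycles(compose(from_cycles([(1, 3)], n), from_cycles([(1, 2)], n))))
```

The last two lines of the demonstration give different results, which is the computation behind the statement that $S_n$ is non-abelian for $n \ge 3$.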

Each cycle $(a_1\ a_2\ \dots\ a_m)$ in a cycle decomposition can be viewed as the permutation
which cyclically permutes $a_1, a_2, \dots, a_m$ and fixes all other integers. Since disjoint
cycles permute numbers which lie in disjoint sets it follows that 

disjoint cycles commute. 

Thus rearranging the cycles in any product of disjoint cycles (in particular, in a cycle 
decomposition) does not change the permutation. 

Also, since a given cycle, $(a_1\ a_2\ \dots\ a_m)$, permutes $\{a_1, a_2, \dots, a_m\}$ cyclically, the
numbers in the cycle itself can be cyclically permuted without altering the permutation,
i.e.,

$$(a_1\ a_2\ \dots\ a_m) = (a_2\ a_3\ \dots\ a_m\ a_1) = (a_3\ a_4\ \dots\ a_m\ a_1\ a_2) = \dots$$

$$= (a_m\ a_1\ a_2\ \dots\ a_{m-1}).$$

Thus, for instance, (1 2) = (2 1) and (1 2 3 4) = (3 4 1 2). By convention, the smallest 
number appearing in the cycle is usually written first. 

One must exercise some care working with cycles since a permutation may be 
written in many ways as an arbitrary product of cycles. For instance, in $S_3$, $(1\ 2\ 3) =
(1\ 2)(2\ 3) = (1\ 3)(1\ 3\ 2)(1\ 3)$ etc. But, (as we shall prove) the cycle decomposition of
each permutation is the unique way of expressing a permutation as a product of disjoint 
cycles (up to rearranging its cycles and cyclically permuting the numbers within each 
cycle). Reducing an arbitrary product of cycles to a product of disjoint cycles allows 
us to determine at a glance whether or not two permutations are the same. Another 
advantage to this notation is that it is an exercise (outlined below) to prove that the order 
of a permutation is the l.c.m. of the lengths of the cycles in its cycle decomposition.
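
As a numerical illustration of the l.c.m. statement (not a proof, which is left to the exercises), the sketch below computes the order of a permutation in two ways: from the cycle lengths, and by repeated composition until the identity is reached. The function names are our own, and `math.lcm` requires Python 3.9 or later.

```python
from math import lcm  # available in Python 3.9 and later


def cycle_lengths(perm):
    """Lengths of all cycles of perm (1-cycles included); perm is a dict on {1, ..., n}."""
    seen, lengths = set(), []
    for a in sorted(perm):
        if a in seen:
            continue
        length, b = 1, perm[a]
        seen.add(a)
        while b != a:
            seen.add(b)
            b = perm[b]
            length += 1
        lengths.append(length)
    return lengths


def order_by_lcm(perm):
    """Order predicted by the l.c.m. of the cycle lengths."""
    return lcm(*cycle_lengths(perm))


def order_by_powers(perm):
    """Smallest k >= 1 with perm^k equal to the identity, found by brute force."""
    current, k = dict(perm), 1
    while any(current[i] != i for i in current):
        current = {i: perm[current[i]] for i in current}
        k += 1
    return k


# The element (1 12 8 10 4)(2 13)(5 11 7)(6 9) of S_13 from the text.
sigma = {1: 12, 2: 13, 3: 3, 4: 1, 5: 11, 6: 9, 7: 5,
         8: 10, 9: 6, 10: 4, 11: 7, 12: 8, 13: 2}

if __name__ == "__main__":
    print(order_by_lcm(sigma), order_by_powers(sigma))
```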

EXERCISES 

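1. Let $\sigma$ be the permutation

$$1 \mapsto 3 \quad 2 \mapsto 4 \quad 3 \mapsto 5 \quad 4 \mapsto 2 \quad 5 \mapsto 1$$

and let $\tau$ be the permutation

$$1 \mapsto 5 \quad 2 \mapsto 3 \quad 3 \mapsto 2 \quad 4 \mapsto 4 \quad 5 \mapsto 1.$$

Find the cycle decompositions of each of the following permutations: $\sigma$, $\tau$, $\sigma^2$, $\sigma\tau$, $\tau\sigma$,
and $\tau^2\sigma$.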
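
2. Let $\sigma$ be the permutation

$$1 \mapsto 13 \quad 2 \mapsto 2 \quad 3 \mapsto 15 \quad 4 \mapsto 14 \quad 5 \mapsto 10$$
$$6 \mapsto 6 \quad 7 \mapsto 12 \quad 8 \mapsto 3 \quad 9 \mapsto 4 \quad 10 \mapsto 1$$
$$11 \mapsto 7 \quad 12 \mapsto 9 \quad 13 \mapsto 5 \quad 14 \mapsto 11 \quad 15 \mapsto 8$$

and let $\tau$ be the permutation

$$1 \mapsto 14 \quad 2 \mapsto 9 \quad 3 \mapsto 10 \quad 4 \mapsto 2 \quad 5 \mapsto 12$$
$$6 \mapsto 6 \quad 7 \mapsto 5 \quad 8 \mapsto 11 \quad 9 \mapsto 15 \quad 10 \mapsto 3$$
$$11 \mapsto 8 \quad 12 \mapsto 7 \quad 13 \mapsto 4 \quad 14 \mapsto 1 \quad 15 \mapsto 13.$$

Find the cycle decompositions of the following permutations: $\sigma$, $\tau$, $\sigma^2$, $\sigma\tau$, $\tau\sigma$, and $\tau^2\sigma$.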

3. For each of the permutations whose cycle decompositions were computed in the preceding 
two exercises compute its order. 

4. Compute the order of each of the elements in the following groups: (a) $S_3$ (b) $S_4$.

5. Find the order of (1 12 8 10 4)(2 13)(5 11 7)(6 9).

6. Write out the cycle decomposition of each element of order 4 in $S_4$.

7. Write out the cycle decomposition of each element of order 2 in $S_4$.

8. Prove that if $\Omega = \{1, 2, 3, \dots\}$ then $S_\Omega$ is an infinite group (do not say $\infty! = \infty$).

9. (a) Let $\sigma$ be the 12-cycle (1 2 3 4 5 6 7 8 9 10 11 12). For which positive integers $i$ is
$\sigma^i$ also a 12-cycle?

(b) Let $\tau$ be the 8-cycle (1 2 3 4 5 6 7 8). For which positive integers $i$ is $\tau^i$ also an
8-cycle?

(c) Let $\omega$ be the 14-cycle (1 2 3 4 5 6 7 8 9 10 11 12 13 14). For which positive integers
$i$ is $\omega^i$ also a 14-cycle?

10. Prove that if $\sigma$ is the $m$-cycle $(a_1\ a_2\ \dots\ a_m)$, then for all $i \in \{1, 2, \dots, m\}$, $\sigma^i(a_k) = a_{k+i}$,
where $k+i$ is replaced by its least residue mod $m$ when $k+i > m$. Deduce that $|\sigma| = m$.

11. Let $\sigma$ be the $m$-cycle $(1\ 2\ \dots\ m)$. Show that $\sigma^i$ is also an $m$-cycle if and only if $i$ is
relatively prime to $m$.

12. (a) If $\tau = (1\ 2)(3\ 4)(5\ 6)(7\ 8)(9\ 10)$ determine whether there is an $n$-cycle $\sigma$ ($n \ge 10$)
with $\tau = \sigma^k$ for some integer $k$.

(b) If $\tau = (1\ 2)(3\ 4\ 5)$ determine whether there is an $n$-cycle $\sigma$ ($n \ge 5$) with $\tau = \sigma^k$ for
some integer $k$.

13. Show that an element has order 2 in $S_n$ if and only if its cycle decomposition is a product
of commuting 2-cycles.

14. Let $p$ be a prime. Show that an element has order $p$ in $S_n$ if and only if its cycle
decomposition is a product of commuting $p$-cycles. Show by an explicit example that this need
not be the case if $p$ is not prime.

15. Prove that the order of an element in $S_n$ equals the least common multiple of the lengths
of the cycles in its cycle decomposition. [Use Exercise 10 and Exercise 24 of Section 1.]

16. Show that if $n \ge m$ then the number of $m$-cycles in $S_n$ is given by

$$\frac{n(n-1)(n-2)\cdots(n-m+1)}{m}.$$

[Count the number of ways of forming an $m$-cycle and divide by the number of
representations of a particular $m$-cycle.]

17. Show that if $n \ge 4$ then the number of permutations in $S_n$ which are the product of two
disjoint 2-cycles is $n(n-1)(n-2)(n-3)/8$.

18. Find all numbers $n$ such that $S_5$ contains an element of order $n$. [Use Exercise 15.]

19. Find all numbers $n$ such that $S_7$ contains an element of order $n$. [Use Exercise 15.]

20. Find a set of generators and relations for $S_3$.


1.4 MATRIX GROUPS 

In this section we introduce the notion of matrix groups where the coefficients come 
from fields. This example of a family of groups will be used for illustrative purposes 
in Part I and will be studied in more detail in the chapters on vector spaces. 

A field is the “smallest” mathematical structure in which we can perform all the
arithmetic operations $+$, $-$, $\times$, and $\div$ (division by nonzero elements), so in particular
every nonzero element must have a multiplicative inverse. We shall study fields more
thoroughly later and in this part of the text the only fields $F$ we shall encounter will
be $\mathbb{Q}$, $\mathbb{R}$ and $\mathbb{Z}/p\mathbb{Z}$, where $p$ is a prime. The example $\mathbb{Z}/p\mathbb{Z}$ is a finite field, which, to
emphasize that it is a field, we shall denote by $\mathbb{F}_p$. For the sake of completeness we
include here the precise definition of a field.

Definition. 

(1) A field is a set $F$ together with two binary operations $+$ and $\cdot$ on $F$ such that
$(F, +)$ is an abelian group (call its identity 0) and $(F - \{0\}, \cdot)$ is also an abelian
group, and the following distributive law holds:

$$a \cdot (b + c) = (a \cdot b) + (a \cdot c), \quad \text{for all } a, b, c \in F.$$

(2) For any field $F$ let $F^\times = F - \{0\}$.

All the vector space theory, the theory of matrices and linear transformations and
the theory of determinants when the scalars come from $\mathbb{R}$ is true, mutatis mutandis,
when the scalars come from an arbitrary field $F$. When we use this theory in Part I we
shall state explicitly what facts on fields we are assuming.

For each $n \in \mathbb{Z}^+$ let $GL_n(F)$ be the set of all $n \times n$ matrices whose entries come
from $F$ and whose determinant is nonzero, i.e.,

$$GL_n(F) = \{ A \mid A \text{ is an } n \times n \text{ matrix with entries from } F \text{ and } \det(A) \ne 0 \},$$

where the determinant of any matrix $A$ with entries from $F$ can be computed by the
same formulas used when $F = \mathbb{R}$. For arbitrary $n \times n$ matrices $A$ and $B$ let $AB$ be the
product of these matrices as computed by the same rules as when $F = \mathbb{R}$. This product
is associative. Also, since $\det(AB) = \det(A) \cdot \det(B)$, it follows that if $\det(A) \ne 0$
and $\det(B) \ne 0$, then $\det(AB) \ne 0$, so $GL_n(F)$ is closed under matrix multiplication.
Furthermore, $\det(A) \ne 0$ if and only if $A$ has a matrix inverse (and this inverse can be
computed by the same adjoint formula used when $F = \mathbb{R}$), so each $A \in GL_n(F)$ has
an inverse, $A^{-1}$, in $GL_n(F)$:

$$AA^{-1} = A^{-1}A = I,$$

where $I$ is the $n \times n$ identity matrix. Thus $GL_n(F)$ is a group under matrix
multiplication, called the general linear group of degree $n$.

The following results will be proved in Part III but are recorded now for convenience: 

(1) if $F$ is a field and $|F| < \infty$, then $|F| = p^m$ for some prime $p$ and integer $m$

(2) if $|F| = q < \infty$, then $|GL_n(F)| = (q^n - 1)(q^n - q)(q^n - q^2) \cdots (q^n - q^{n-1})$.
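
For $n = 2$ and $F = \mathbb{F}_p$ with $p$ a small prime, the order formula in (2) can be compared with a direct count. The Python sketch below is an illustration under stated assumptions: entries of $\mathbb{F}_p$ are represented by the integers $0, \dots, p-1$ with arithmetic mod $p$, only the $2 \times 2$ case is counted by brute force, and the names `count_gl2` and `gl_order` are our own.

```python
from itertools import product


def count_gl2(p):
    """Count 2x2 matrices (a b; c d) over Z/pZ, p prime, with ad - bc nonzero mod p."""
    return sum(1 for a, b, c, d in product(range(p), repeat=4)
               if (a * d - b * c) % p != 0)


def gl_order(n, q):
    """Evaluate (q^n - 1)(q^n - q)(q^n - q^2)...(q^n - q^(n-1))."""
    result = 1
    for k in range(n):
        result *= q**n - q**k
    return result


if __name__ == "__main__":
    for p in (2, 3, 5):
        print(p, count_gl2(p), gl_order(2, p))
```

Agreement for a few primes is consistent with the formula but does not establish it; the proof is deferred as stated above.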


EXERCISES 


Let $F$ be a field and let $n \in \mathbb{Z}^+$.

1. Prove that $|GL_2(\mathbb{F}_2)| = 6$.

2. Write out all the elements of $GL_2(\mathbb{F}_2)$ and compute the order of each element.

3. Show that $GL_2(\mathbb{F}_2)$ is non-abelian.

4. Show that if $n$ is not prime then $\mathbb{Z}/n\mathbb{Z}$ is not a field.

5. Show that $GL_n(F)$ is a finite group if and only if $F$ has a finite number of elements.

6. If $|F| = q$ is finite prove that $|GL_n(F)| < q^{n^2}$.

7. Let $p$ be a prime. Prove that the order of $GL_2(\mathbb{F}_p)$ is $p^4 - p^3 - p^2 + p$ (do not just quote
the order formula in this section). [Subtract the number of $2 \times 2$ matrices which are not
invertible from the total number of $2 \times 2$ matrices over $\mathbb{F}_p$. You may use the fact that a
$2 \times 2$ matrix is not invertible if and only if one row is a multiple of the other.]

8. Show that $GL_n(F)$ is non-abelian for any $n \ge 2$ and any $F$.


9. Prove that the binary operation of matrix multiplication of 2 x 2 matrices with real number 
entries is associative. 


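10. Let $G = \left\{ \begin{pmatrix} a & b \\ 0 & c \end{pmatrix} \;\middle|\; a, b, c \in \mathbb{R},\ a \neq 0,\ c \neq 0 \right\}$.

(a) Compute the product of $\begin{pmatrix} a_1 & b_1 \\ 0 & c_1 \end{pmatrix}$ and $\begin{pmatrix} a_2 & b_2 \\ 0 & c_2 \end{pmatrix}$ to show that G is closed under
matrix multiplication.

(b) Find the matrix inverse of $\begin{pmatrix} a & b \\ 0 & c \end{pmatrix}$ and deduce that G is closed under inverses.

(c) Deduce that G is a subgroup of $GL_2(\mathbb{R})$ (cf. Exercise 26, Section 1).

(d) Prove that the set of elements of G whose two diagonal entries are equal (i.e., a = c)
is also a subgroup of $GL_2(\mathbb{R})$.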


The next exercise introduces the Heisenberg group over the field F and develops some of its
basic properties. When $F = \mathbb{R}$ this group plays an important role in quantum mechanics
and signal theory by giving a group theoretic interpretation (due to H. Weyl) of Heisenberg's
Uncertainty Principle. Note also that the Heisenberg group may be defined more generally,
for example, with entries in $\mathbb{Z}$.


11. Let $H(F) = \left\{ \begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 1 \end{pmatrix} \;\middle|\; a, b, c \in F \right\}$, called the Heisenberg group over F. Let
$X = \begin{pmatrix} 1 & a & b \\ 0 & 1 & c \\ 0 & 0 & 1 \end{pmatrix}$ and $Y = \begin{pmatrix} 1 & d & e \\ 0 & 1 & f \\ 0 & 0 & 1 \end{pmatrix}$ be elements of H(F).

(a) Compute the matrix product XY and deduce that H(F) is closed under matrix
multiplication. Exhibit explicit matrices such that $XY \neq YX$ (so that H(F) is always
non-abelian).

(b) Find an explicit formula for the matrix inverse $X^{-1}$ and deduce that H(F) is closed
under inverses.

(c) Prove the associative law for H(F) and deduce that H(F) is a group of order $|F|^3$.
(Do not assume that matrix multiplication is associative.)

(d) Find the order of each element of the finite group $H(\mathbb{Z}/2\mathbb{Z})$.

(e) Prove that every nonidentity element of the group $H(\mathbb{R})$ has infinite order.


1.5 THE QUATERNION GROUP 

The quaternion group, $Q_8$, is defined by

$Q_8 = \{1, -1, i, -i, j, -j, k, -k\}$

with product $\cdot$ computed as follows:

$1 \cdot a = a \cdot 1 = a$, for all $a \in Q_8$

$(-1) \cdot (-1) = 1$, $(-1) \cdot a = a \cdot (-1) = -a$, for all $a \in Q_8$

$i \cdot i = j \cdot j = k \cdot k = -1$

$i \cdot j = k$, $j \cdot i = -k$

$j \cdot k = i$, $k \cdot j = -i$

$k \cdot i = j$, $i \cdot k = -j$.

As usual, we shall henceforth write ab for $a \cdot b$. It is tedious to check the associative
law (we shall prove this later by less computational means), but the other axioms are
easily checked. Note that $Q_8$ is a non-abelian group of order 8.
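The associative law, tedious by hand, is a finite check (8 × 8 × 8 = 512 triples) that a short program can carry out. The sketch below assumes an encoding that is not in the text: each element is a pair (sign, symbol), and a sign is carried through every product, so that for example (−i)·j is taken to be −(i·j). The names TABLE, Q8, mul and is_associative are illustrative.

```python
from itertools import product

UNITS = "1ijk"

# Products of the unsigned symbols, stored as (sign, symbol),
# read off from the defining relations of the quaternion group.
TABLE = {}
for u in UNITS:
    TABLE[("1", u)] = (1, u)   # 1 * u = u
    TABLE[(u, "1")] = (1, u)   # u * 1 = u
for u in "ijk":
    TABLE[(u, u)] = (-1, "1")  # i*i = j*j = k*k = -1
for x, y, z in [("i", "j", "k"), ("j", "k", "i"), ("k", "i", "j")]:
    TABLE[(x, y)] = (1, z)     # e.g. i*j = k
    TABLE[(y, x)] = (-1, z)    # e.g. j*i = -k

# The eight elements: (1, "i") stands for i and (-1, "i") for -i.
Q8 = [(sign, u) for sign in (1, -1) for u in UNITS]


def mul(a, b):
    """Product of two signed symbols; signs multiply (assumed encoding)."""
    (sa, ua), (sb, ub) = a, b
    sign, u = TABLE[(ua, ub)]
    return (sa * sb * sign, u)


def is_associative():
    """Check (ab)c == a(bc) for all 512 triples."""
    return all(mul(mul(a, b), c) == mul(a, mul(b, c))
               for a, b, c in product(Q8, repeat=3))


I, J = (1, "i"), (1, "j")
print(is_associative())
print(mul(I, J), mul(J, I))
```

Such a check confirms associativity for this particular table but gives no insight into why it holds, which is the point of the less computational proof promised above.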

EXERCISES 

1. Compute the order of each of the elements in $Q_8$.

2. Write out the group tables for $S_3$, $D_8$ and $Q_8$.

3. Find a set of generators and relations for $Q_8$.

1.6 HOMOMORPHISMS AND ISOMORPHISMS 

In this section we make precise the notion of when two groups “look the same,” that is, 
have exactly the same group-theoretic structure. This is the notion of an isomorphism 
between two groups. We first define the notion of a homomorphism about which we 
shall have a great deal more to say later. 

Definition. Let $(G, \star)$ and $(H, \diamond)$ be groups. A map $\varphi : G \to H$ such that
$\varphi(x \star y) = \varphi(x) \diamond \varphi(y)$, for all $x, y \in G$
is called a homomorphism.

When the group operations for G and H are not explicitly written, the homomorphism
condition becomes simply

$\varphi(xy) = \varphi(x)\varphi(y)$

but it is important to keep in mind that the product xy on the left is computed in G
and the product $\varphi(x)\varphi(y)$ on the right is computed in H. Intuitively, a map $\varphi$ is a
homomorphism if it respects the group structures of its domain and codomain.

Definition. The map $\varphi : G \to H$ is called an isomorphism and G and H are said to
be isomorphic or of the same isomorphism type, written $G \cong H$, if

(1) $\varphi$ is a homomorphism (i.e., $\varphi(xy) = \varphi(x)\varphi(y)$), and

(2) $\varphi$ is a bijection.

In other words, the groups G and H are isomorphic if there is a bijection between
them which preserves the group operations. Intuitively, G and H are the same group
except that the elements and the operations may be written differently in G and H.
Thus any property which G has which depends only on the group structure of G (i.e.,
can be derived from the group axioms, for example commutativity of the group) also
holds in H. Note that this formally justifies writing all our group operations as $\cdot$ since
changing the symbol of the operation does not change the isomorphism type.
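For finite groups both conditions in the definition can be tested exhaustively. The following sketch assumes groups are given as a list of elements together with a Python function for the operation; the helper names is_homomorphism and is_bijection and the two sample maps are illustrative choices, not taken from the text.

```python
def is_homomorphism(phi, G, op_G, op_H):
    """Check phi(x * y) == phi(x) o phi(y) for every pair x, y in G."""
    return all(phi(op_G(x, y)) == op_H(phi(x), phi(y)) for x in G for y in G)


def is_bijection(phi, G, H):
    """Check that phi maps the list G one-to-one onto the list H."""
    images = [phi(x) for x in G]
    return len(set(images)) == len(G) and set(images) == set(H)


# Sample 1: (Z/4Z, +) and the subgroup {1, 3, 9, 7} of the units mod 10.
Z4 = [0, 1, 2, 3]
U = [1, 3, 9, 7]
add4 = lambda x, y: (x + y) % 4
mul10 = lambda x, y: (x * y) % 10
phi = lambda x: pow(3, x, 10)          # x -> 3^x mod 10

# Sample 2: reduction mod 3 from (Z/6Z, +) to (Z/3Z, +).
Z6 = list(range(6))
Z3 = list(range(3))
add6 = lambda x, y: (x + y) % 6
add3 = lambda x, y: (x + y) % 3
reduce_mod_3 = lambda x: x % 3

print(is_homomorphism(phi, Z4, add4, mul10), is_bijection(phi, Z4, U))
print(is_homomorphism(reduce_mod_3, Z6, add6, add3),
      is_bijection(reduce_mod_3, Z6, Z3))
```

The first map turns addition into multiplication and is an isomorphism; the second respects the operations but is not injective, so it is a homomorphism that is not an isomorphism.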

Examples 

(1) For any group G, $G \cong G$. The identity map provides an obvious isomorphism but
not, in general, the only isomorphism from G to itself. More generally, let $\mathcal{G}$ be
any nonempty collection of groups. It is easy to check that the relation $\cong$ is an
equivalence relation on $\mathcal{G}$ and the equivalence classes are called isomorphism classes.
This accounts for the somewhat symmetric wording of the definition of “isomorphism.”

(2) The exponential map $\exp : \mathbb{R} \to \mathbb{R}^+$ defined by $\exp(x) = e^x$, where e is the base of
the natural logarithm, is an isomorphism from $(\mathbb{R}, +)$ to $(\mathbb{R}^+, \times)$. Exp is a bijection
since it has an inverse function (namely $\log_e$) and exp preserves the group operations
since $e^{x+y} = e^x e^y$. In this example both the elements and the operations are different
yet the two groups are isomorphic, that is, as groups they have identical structures.

(3) In this example we show that the isomorphism type of a symmetric group depends
only on the cardinality of the underlying set being permuted.

Let $\Delta$ and $\Omega$ be nonempty sets. The symmetric groups $S_\Delta$ and $S_\Omega$ are isomorphic
if $|\Delta| = |\Omega|$. We can see this intuitively as follows: given that $|\Delta| = |\Omega|$, there is a
bijection $\theta$ from $\Delta$ onto $\Omega$. Think of the elements of $\Delta$ and $\Omega$ as being glued together
via $\theta$, i.e., each $x \in \Delta$ is glued to $\theta(x) \in \Omega$. To obtain a map $\varphi : S_\Delta \to S_\Omega$ let $\sigma \in S_\Delta$
be a permutation of $\Delta$ and let $\varphi(\sigma)$ be the permutation of $\Omega$ which moves the elements
of $\Omega$ in the same way $\sigma$ moves the corresponding glued elements of $\Delta$; that is, if
$\sigma(x) = y$, for some $x, y \in \Delta$, then $\varphi(\sigma)(\theta(x)) = \theta(y)$ in $\Omega$. Since the set bijection $\theta$
has an inverse, one can easily check that the map between symmetric groups also has
an inverse. The precise technical definition of the map $\varphi$ and the straightforward, albeit
tedious, checking of the properties which ensure $\varphi$ is an isomorphism are relegated to
the following exercises.

Conversely, if $S_\Delta \cong S_\Omega$, then $|\Delta| = |\Omega|$; we prove this only when the underlying
sets are finite (when both $\Delta$ and $\Omega$ are infinite sets the proof is harder and will be
given as an exercise in Chapter 4). Since any isomorphism between two groups G
and H is, a priori, a bijection between them, a necessary condition for isomorphism
is $|S_\Delta| = |S_\Omega|$. When $\Delta$ is a finite set of order n, then $|S_\Delta| = n!$. We actually only
proved this for $S_n$, however the same reasoning applies for $S_\Delta$. Similarly, if $\Omega$ is a
finite set of order m, then $|S_\Omega| = m!$. Thus if $S_\Delta$ and $S_\Omega$ are isomorphic then $n! = m!$,
so m = n, i.e., $|\Delta| = |\Omega|$.
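The gluing construction in Example (3) can be tried out on two small sets. This is only a sketch under stated assumptions: the two three-element sets, the particular bijection theta, and the representation of a permutation as a Python dict are all choices made here for illustration, and the map is written as the composite of theta, the permutation, and the inverse of theta.

```python
from itertools import permutations

Delta = ["a", "b", "c"]
Omega = [10, 20, 30]

# A chosen bijection theta from Delta onto Omega, and its inverse.
theta = dict(zip(Delta, Omega))
theta_inv = {w: x for x, w in theta.items()}


def sym(S):
    """All permutations of the list S, each stored as a dict x -> sigma(x)."""
    return [dict(zip(S, image)) for image in permutations(S)]


def compose(f, g):
    """The composite f o g (apply g first, then f)."""
    return {x: f[g[x]] for x in g}


def phi(sigma):
    """Transport sigma from Delta to Omega: theta o sigma o theta^(-1)."""
    return {w: theta[sigma[theta_inv[w]]] for w in Omega}


S_Delta = sym(Delta)
S_Omega = sym(Omega)

# The gluing property: if sigma(x) = y then phi(sigma)(theta(x)) = theta(y).
glued = all(phi(s)[theta[x]] == theta[s[x]] for s in S_Delta for x in Delta)

# phi preserves composition.
is_hom = all(phi(compose(s, t)) == compose(phi(s), phi(t))
             for s in S_Delta for t in S_Delta)

# phi hits each permutation of Omega exactly once.
images = {tuple(sorted(phi(s).items())) for s in S_Delta}
is_bij = len(images) == len(S_Delta) == len(S_Omega)

print(glued, is_hom, is_bij)
```

Any other bijection theta would serve equally well; different choices generally give different isomorphisms between the same two groups.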

Many more examples of isomorphisms will appear throughout the text. When 
we study different structures (rings, fields, vector spaces, etc.) we shall formulate 
corresponding notions of isomorphisms between respective structures. One of the 
central problems in mathematics is to determine what properties of a structure specify
its isomorphism type (i.e., to prove that if G is an object with some structure (such as a
group) and G has property $\mathcal{P}$, then any other similarly structured object (group) X with
property $\mathcal{P}$ is isomorphic to G). Theorems of this type are referred to as classification
theorems. For example, we shall prove that

any non-abelian group of order 6 is isomorphic to $S_3$

(so here G is the group $S_3$ and $\mathcal{P}$ is the property “non-abelian and of order 6”). From
this classification theorem we obtain $D_6 \cong S_3$ and $GL_2(\mathbb{F}_2) \cong S_3$ without having to
find explicit maps between these groups. Note that it is not true that any group of order
6 is isomorphic to $S_3$. In fact we shall prove that up to isomorphism there are precisely
two groups of order 6: $S_3$ and $\mathbb{Z}/6\mathbb{Z}$ (i.e., any group of order 6 is isomorphic to one
of these two groups and $S_3$ is not isomorphic to $\mathbb{Z}/6\mathbb{Z}$). Note that the conclusion is
less specific (there are two possible types); however, the hypotheses are easier to check
(namely, check to see if the order is 6). Results of the latter type are also referred to as
classifications. Generally speaking it is subtle and difficult, even in specific instances,
to determine whether or not two groups (or other mathematical objects) are isomorphic:
constructing an explicit map between them which preserves the group operations
or proving no such map exists is, except in tiny cases, computationally unfeasible as
indicated already in trying to prove the above classification of groups of order 6 without
further theory.

It is occasionally easy to see that two given groups are not isomorphic. For example,
the exercises below assert that if $\varphi : G \to H$ is an isomorphism, then, in particular,

(a) $|G| = |H|$

(b) G is abelian if and only if H is abelian

(c) for all $x \in G$, $|x| = |\varphi(x)|$.

Thus $S_3$ and $\mathbb{Z}/6\mathbb{Z}$ are not isomorphic (as indicated above) since one is abelian and the
other is not. Also, $(\mathbb{R} - \{0\}, \times)$ and $(\mathbb{R}, +)$ cannot be isomorphic because in $(\mathbb{R} - \{0\}, \times)$
the element $-1$ has order 2 whereas $(\mathbb{R}, +)$ has no element of order 2, contrary to (c).
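Invariants (a), (b) and (c) are easy to compute for small finite groups. As a hedged illustration, the sketch below models $\mathbb{Z}/6\mathbb{Z}$ as integers mod 6 and $S_3$ as tuples on the points 0, 1, 2 (both encodings are assumptions made here), then compares commutativity and the number of elements of each order; the function names are illustrative.

```python
from collections import Counter
from itertools import permutations


def element_orders(elements, op, identity):
    """Count how many elements have each order."""
    orders = []
    for x in elements:
        n, power = 1, x
        while power != identity:
            power = op(power, x)
            n += 1
        orders.append(n)
    return Counter(orders)


def is_abelian(elements, op):
    """Check xy == yx for every pair of elements."""
    return all(op(x, y) == op(y, x) for x in elements for y in elements)


# Z/6Z under addition mod 6.
Z6 = list(range(6))
add6 = lambda x, y: (x + y) % 6

# S_3 as tuples f, where f[i] is the image of i; the product fg applies g first.
S3 = list(permutations(range(3)))
compose = lambda f, g: tuple(f[g[i]] for i in range(3))

orders_Z6 = element_orders(Z6, add6, 0)
orders_S3 = element_orders(S3, compose, (0, 1, 2))

print(len(Z6) == len(S3))                       # same order, so (a) alone cannot decide
print(is_abelian(Z6, add6), is_abelian(S3, compose))
print(dict(orders_Z6), dict(orders_S3))
```

Either (b) or the order counts coming from (c) already separates the two groups; matching invariants, on the other hand, would not by themselves prove that two groups are isomorphic.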

Finally, we record one very useful fact that we shall prove later (when we discuss 
free groups) dealing with the question of homomorphisms and isomorphisms between 
two groups given by generators and relations:

Let G be a finite group of order n for which we have a presentation and let
$S = \{s_1, \ldots, s_m\}$ be the generators. Let H be another group and $\{r_1, \ldots, r_m\}$ be
elements of H. Suppose that any relation satisfied in G by the $s_i$ is also satisfied in H
when each $s_i$ is replaced by $r_i$. Then there is a (unique) homomorphism $\varphi : G \to H$
which maps $s_i$ to $r_i$. If we have a presentation for G, then we need only check the
relations specified by this presentation (since, by definition of a presentation, every
relation can be deduced from the relations given in the presentation). If H is generated
by the elements $\{r_1, \ldots, r_m\}$, then $\varphi$ is surjective (any product of the $r_i$'s is the image
of the corresponding product of the $s_i$'s). If, in addition, H has the same (finite) order
as G, then any surjective map is necessarily injective, i.e., $\varphi$ is an isomorphism:
$G \cong H$. Intuitively, we can map the generators of G to any elements of H and obtain
a homomorphism provided that the relations in G are still satisfied.

Readers may already be familiar with the corresponding statement for vector spaces. 
Suppose V is a finite dimensional vector space of dimension n with basis S and W is 
another vector space. Then we can specify a linear transformation from V to W by 
mapping the elements of S to arbitrary vectors in W (here there are no relations to
satisfy). If W is also of dimension n and the chosen vectors in W span W (and so are a 
basis for W) then this linear transformation is invertible (a vector space isomorphism). 

Examples 

(1) Recall that $D_{2n} = \langle r, s \mid r^n = s^2 = 1,\ sr = r^{-1}s \rangle$. Suppose H is a group containing
elements a and b with $a^n = 1$, $b^2 = 1$ and $ba = a^{-1}b$. Then there is a homomorphism
from $D_{2n}$ to H mapping r to a and s to b. For instance, let k be an integer dividing n
with $k \geq 3$ and let $D_{2k} = \langle r_1, s_1 \mid r_1^k = s_1^2 = 1,\ s_1 r_1 = r_1^{-1} s_1 \rangle$. Define

$\varphi : D_{2n} \to D_{2k}$ by $\varphi(r) = r_1$ and $\varphi(s) = s_1$.

If we write n = km, then since $r_1^k = 1$, also $r_1^n = (r_1^k)^m = 1$. Thus the three relations
satisfied by r, s in $D_{2n}$ are satisfied by $r_1, s_1$ in $D_{2k}$. Thus $\varphi$ extends (uniquely) to a
homomorphism from $D_{2n}$ to $D_{2k}$. Since $\{r_1, s_1\}$ generates $D_{2k}$, $\varphi$ is surjective. This
homomorphism is not an isomorphism if k < n.

(2) Following up on the preceding example, let $G = D_6$ be as presented above. Check
that in $H = S_3$ the elements a = (1 2 3) and b = (1 2) satisfy the relations: $a^3 = 1$,
$b^2 = 1$ and $ba = a^{-1}b$. Thus there is a homomorphism from $D_6$ to $S_3$ which sends
$r \mapsto a$ and $s \mapsto b$. One may further check that $S_3$ is generated by a and b, so this
homomorphism is surjective. Since $D_6$ and $S_3$ both have order 6, this homomorphism
is an isomorphism: $D_6 \cong S_3$.

Note that the element a in the examples above need not have order n (i.e., n need
not be the smallest power of a giving the identity in H) and similarly b need not have
order 2 (for example b could well be the identity if $a = a^{-1}$). This allows us to more
easily construct homomorphisms and is in keeping with the idea that the generators and
relations for a group G constitute a complete set of data for the group structure of G.
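The two checks requested in Example (2), that a and b satisfy the defining relations and that they generate $S_3$, can be sketched computationally. The encoding is an assumption made here: permutations of {1, 2, 3} are stored as tuples on the points 0, 1, 2, and the product fg applies g first. The helper names are illustrative.

```python
IDENTITY = (0, 1, 2)
a = (1, 2, 0)   # the 3-cycle (1 2 3), written on the points 0, 1, 2
b = (1, 0, 2)   # the transposition (1 2)


def compose(f, g):
    """The product fg: apply g first, then f."""
    return tuple(f[g[i]] for i in range(3))


def inverse(f):
    """The inverse permutation of f."""
    inv = [0, 0, 0]
    for i, image in enumerate(f):
        inv[image] = i
    return tuple(inv)


def power(f, n):
    """The n-th power of f for n >= 0."""
    result = IDENTITY
    for _ in range(n):
        result = compose(result, f)
    return result


def generated(gens):
    """All products of the generators (enough to give the generated subgroup in a finite group)."""
    seen = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in seen:
                seen.add(y)
                frontier.append(y)
    return seen


# The relations of D_6 with r replaced by a and s replaced by b.
relations_hold = (
    power(a, 3) == IDENTITY
    and power(b, 2) == IDENTITY
    and compose(b, a) == compose(inverse(a), b)
)

subgroup = generated([a, b])
print(relations_hold, len(subgroup))
```

With the relations verified and all six elements of $S_3$ reached, the argument in the text gives the isomorphism without ever writing out the map on all of $D_6$.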

EXERCISES 


Let G and H be groups. 

1. Let $\varphi : G \to H$ be a homomorphism.

(a) Prove that $\varphi(x^n) = \varphi(x)^n$ for all $n \in \mathbb{Z}^+$.

(b) Do part (a) for n = −1 and deduce that $\varphi(x^n) = \varphi(x)^n$ for all $n \in \mathbb{Z}$.

2. If $\varphi : G \to H$ is an isomorphism, prove that $|\varphi(x)| = |x|$ for all $x \in G$. Deduce that any
two isomorphic groups have the same number of elements of order n for each $n \in \mathbb{Z}^+$. Is
the result true if $\varphi$ is only assumed to be a homomorphism?

3. If $\varphi : G \to H$ is an isomorphism, prove that G is abelian if and only if H is abelian. If
$\varphi : G \to H$ is a homomorphism, what additional conditions on $\varphi$ (if any) are sufficient to
ensure that if G is abelian, then so is H?

4. Prove that the multiplicative groups $\mathbb{R} - \{0\}$ and $\mathbb{C} - \{0\}$ are not isomorphic.

5. Prove that the additive groups $\mathbb{R}$ and $\mathbb{Q}$ are not isomorphic.

6. Prove that the additive groups $\mathbb{Z}$ and $\mathbb{Q}$ are not isomorphic.

7. Prove that $D_8$ and $Q_8$ are not isomorphic.

8. Prove that if $n \neq m$, $S_n$ and $S_m$ are not isomorphic.

9. Prove that $D_{24}$ and $S_4$ are not isomorphic.

10. Fill in the details of the proof that the symmetric groups $S_\Delta$ and $S_\Omega$ are isomorphic if
$|\Delta| = |\Omega|$ as follows: let $\theta : \Delta \to \Omega$ be a bijection. Define

$\varphi : S_\Delta \to S_\Omega$ by $\varphi(\sigma) = \theta \circ \sigma \circ \theta^{-1}$ for all $\sigma \in S_\Delta$

and prove the following:

(a) $\varphi$ is well defined, that is, if $\sigma$ is a permutation of $\Delta$ then $\theta \circ \sigma \circ \theta^{-1}$ is a permutation
of $\Omega$.

(b) $\varphi$ is a bijection from $S_\Delta$ onto $S_\Omega$. [Find a 2-sided inverse for $\varphi$.]

(c) $\varphi$ is a homomorphism, that is, $\varphi(\sigma \circ \tau) = \varphi(\sigma) \circ \varphi(\tau)$.

Note the similarity to the change of basis or similarity transformations for matrices (we
shall see the connections between these later in the text).

11. Let A and B be groups. Prove that $A \times B \cong B \times A$.

12. Let A, B, and C be groups and let $G = A \times B$ and $H = B \times C$. Prove that $G \times C \cong A \times H$.

13. Let G and H be groups and let $\varphi : G \to H$ be a homomorphism. Prove that the image
of $\varphi$, $\varphi(G)$, is a subgroup of H (cf. Exercise 26 of Section 1). Prove that if $\varphi$ is injective
then $G \cong \varphi(G)$.

14. Let G and H be groups and let $\varphi : G \to H$ be a homomorphism. Define the kernel of
$\varphi$ to be $\{g \in G \mid \varphi(g) = 1_H\}$ (so the kernel is the set of elements in G which map to
the identity of H, i.e., is the fiber over the identity of H). Prove that the kernel of $\varphi$ is a
subgroup (cf. Exercise 26 of Section 1) of G. Prove that $\varphi$ is injective if and only if the
kernel of $\varphi$ is the identity subgroup of G.

15. Define a map $\pi : \mathbb{R}^2 \to \mathbb{R}$ by $\pi((x, y)) = x$. Prove that $\pi$ is a homomorphism and find
the kernel of $\pi$ (cf. Exercise 14).

16. Let A and B be groups and let G be their direct product, $A \times B$. Prove that the maps
$\pi_1 : G \to A$ and $\pi_2 : G \to B$ defined by $\pi_1((a, b)) = a$ and $\pi_2((a, b)) = b$ are
homomorphisms and find their kernels (cf. Exercise 14).

17. Let G be any group. Prove that the map from G to itself defined by $g \mapsto g^{-1}$ is a
homomorphism if and only if G is abelian.

18. Let G be any group. Prove that the map from G to itself defined by $g \mapsto g^2$ is a
homomorphism if and only if G is abelian.

19. Let $G = \{z \in \mathbb{C} \mid z^n = 1 \text{ for some } n \in \mathbb{Z}^+\}$. Prove that for any fixed integer k > 1
the map from G to itself defined by $z \mapsto z^k$ is a surjective homomorphism but is not an
isomorphism.

20. Let G be a group and let Aut(G) be the set of all isomorphisms from G onto G. Prove that
Aut(G) is a group under function composition (called the automorphism group of G and
the elements of Aut(G) are called automorphisms of G).

21. Prove that for each fixed nonzero $k \in \mathbb{Q}$ the map from $\mathbb{Q}$ to itself defined by $q \mapsto kq$ is an
automorphism of $\mathbb{Q}$ (cf. Exercise 20).

22. Let A be an abelian group and fix some $k \in \mathbb{Z}$. Prove that the map $a \mapsto a^k$ is a
homomorphism from A to itself. If k = −1 prove that this homomorphism is an isomorphism (i.e.,
is an automorphism of A).

23. Let G be a finite group which possesses an automorphism $\sigma$ (cf. Exercise 20) such that
$\sigma(g) = g$ if and only if g = 1. If $\sigma^2$ is the identity map from G to G, prove that G is
abelian (such an automorphism $\sigma$ is called fixed point free of order 2). [Show that every
element of G can be written in the form $x^{-1}\sigma(x)$ and apply $\sigma$ to such an expression.]

24. Let G be a finite group and let x and y be distinct elements of order 2 in G that generate
G. Prove that $G \cong D_{2n}$, where n = |xy|. [See Exercise 6 in Section 2.]

25. Let $n \in \mathbb{Z}^+$, let r and s be the usual generators of $D_{2n}$ and let $\theta = 2\pi/n$.

(a) Prove that the matrix $\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$ is the matrix of the linear transformation
which rotates the x, y plane about the origin in a counterclockwise direction by $\theta$
radians.

(b) Prove that the map $\varphi : D_{2n} \to GL_2(\mathbb{R})$ defined on generators by

$\varphi(r) = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$ and $\varphi(s) = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$

extends to a homomorphism of $D_{2n}$ into $GL_2(\mathbb{R})$.

(c) Prove that the homomorphism $\varphi$ in part (b) is injective.

26. Let i and j be the generators of $Q_8$ described in Section 5. Prove that the map $\varphi$ from $Q_8$
to $GL_2(\mathbb{C})$ defined on generators by



extends to a homomorphism. Prove that $\varphi$ is injective.


1.7 GROUP ACTIONS 

In this section we introduce the precise definition of a group acting on a set and present 
some examples. Group actions will be a powerful tool which we shall use both for 
proving theorems for abstract groups and for unravelling the structure of specific ex- 
amples. Moreover, the concept of an “action” is a theme which will recur throughout 
the text as a method for studying an algebraic object by seeing how it can act on other 
structures. 

Definition. A group action of a group G on a set A is a map from $G \times A$ to A (written
as $g \cdot a$, for all $g \in G$ and $a \in A$) satisfying the following properties:

(1) $g_1 \cdot (g_2 \cdot a) = (g_1 g_2) \cdot a$, for all $g_1, g_2 \in G$, $a \in A$, and

(2) $1 \cdot a = a$, for all $a \in A$.

We shall immediately become less formal and say G is a group acting on a set A.
The expression $g \cdot a$ will usually be written simply as ga when there is no danger of
confusing this map with, say, the group operation (remember, $\cdot$ is not a binary operation
and ga is always a member of A). Note that on the left hand side of the equation in
property (1) $g_2 \cdot a$ is an element of A so it makes sense to act on this by $g_1$. On the
right hand side of this equation the product $(g_1 g_2)$ is taken in G and the resulting group
element acts on the set element a.

Before giving some examples of group actions we make some observations. Let 
the group G act on the set A. For each fixed $g \in G$ we get a map $\sigma_g$ defined by

$\sigma_g : A \to A$

$\sigma_g(a) = g \cdot a$.

We prove two important facts:

(i) for each fixed $g \in G$, $\sigma_g$ is a permutation of A, and

(ii) the map from G to $S_A$ defined by $g \mapsto \sigma_g$ is a homomorphism.

To see that $\sigma_g$ is a permutation of A we show that as a set map from A to A it has a
2-sided inverse, namely $\sigma_{g^{-1}}$ (it is then a permutation by Proposition 1 of Section 0.1).
For all $a \in A$

$(\sigma_{g^{-1}} \circ \sigma_g)(a) = \sigma_{g^{-1}}(\sigma_g(a))$   (by definition of function composition)
$= g^{-1} \cdot (g \cdot a)$   (by definition of $\sigma_{g^{-1}}$ and $\sigma_g$)
$= (g^{-1} g) \cdot a$   (by property (1) of an action)
$= 1 \cdot a = a$   (by property (2) of an action).

This proves $\sigma_{g^{-1}} \circ \sigma_g$ is the identity map from A to A. Since g was arbitrary, we may
interchange the roles of g and $g^{-1}$ to obtain $\sigma_g \circ \sigma_{g^{-1}}$ is also the identity map on A.
Thus $\sigma_g$ has a 2-sided inverse, hence is a permutation of A.

To check assertion (ii) above let $\varphi : G \to S_A$ be defined by $\varphi(g) = \sigma_g$. Note that
part (i) shows that $\sigma_g$ is indeed an element of $S_A$. To see that $\varphi$ is a homomorphism
we must prove $\varphi(g_1 g_2) = \varphi(g_1) \circ \varphi(g_2)$ (recall that $S_A$ is a group under function
composition). The permutations $\varphi(g_1 g_2)$ and $\varphi(g_1) \circ \varphi(g_2)$ are equal if and only if their
values agree on every element $a \in A$. For all $a \in A$

$\varphi(g_1 g_2)(a) = \sigma_{g_1 g_2}(a)$   (by definition of $\varphi$)
$= (g_1 g_2) \cdot a$   (by definition of $\sigma_{g_1 g_2}$)
$= g_1 \cdot (g_2 \cdot a)$   (by property (1) of an action)
$= \sigma_{g_1}(\sigma_{g_2}(a))$   (by definition of $\sigma_{g_1}$ and $\sigma_{g_2}$)
$= (\varphi(g_1) \circ \varphi(g_2))(a)$   (by definition of $\varphi$).

This proves assertion (ii) above.

Intuitively, a group action of G on a set A just means that every element g in G acts
as a permutation on A in a manner consistent with the group operations in G; assertions
(i) and (ii) above make this precise. The homomorphism from G to $S_A$ given above is
called the permutation representation associated to the given action. It is easy to see
that this process is reversible in the sense that if $\varphi : G \to S_A$ is any homomorphism
from a group G to the symmetric group on a set A, then the map from $G \times A$ to A
defined by

$g \cdot a = \varphi(g)(a)$ for all $g \in G$, and all $a \in A$

satisfies the properties of a group action of G on A. Thus actions of a group G on a
set A and the homomorphisms from G into the symmetric group $S_A$ are in bijective
correspondence (i.e., are essentially the same notion, phrased in different terminology).
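The passage from an action to its permutation representation can be traced on a small example. The sketch below makes assumptions that are not in the text: G is the additive group of integers mod 6, A is the set {0, 1, 2}, and g acts on a by sending it to (g + a) mod 3. It checks properties (1) and (2), builds the maps induced by each g as tuples, and verifies assertions (i) and (ii). All names are illustrative.

```python
G = list(range(6))   # Z/6Z under addition mod 6 (identity element 0)
A = list(range(3))   # the set being acted on


def op(g, h):
    """The group operation in G."""
    return (g + h) % 6


def act(g, a):
    """The assumed action: g . a = (g + a) mod 3."""
    return (g + a) % 3


def sigma(g):
    """The map induced by g, stored as a tuple whose entry a is g . a."""
    return tuple(act(g, a) for a in A)


def compose(f, h):
    """Composition f o h of maps A -> A stored as tuples (apply h first)."""
    return tuple(f[h[a]] for a in A)


# Properties (1) and (2) of a group action.
axiom1 = all(act(g1, act(g2, a)) == act(op(g1, g2), a)
             for g1 in G for g2 in G for a in A)
axiom2 = all(act(0, a) == a for a in A)

# Assertion (i): every induced map is a permutation of A.
all_permutations = all(sorted(sigma(g)) == A for g in G)

# Assertion (ii): g -> sigma(g) respects the operations.
is_hom = all(sigma(op(g1, g2)) == compose(sigma(g1), sigma(g2))
             for g1 in G for g2 in G)

# Group elements that induce the identity permutation of A.
acts_as_identity = [g for g in G if sigma(g) == tuple(A)]

print(axiom1, axiom2, all_permutations, is_hom, acts_as_identity)
```

In this example two different group elements induce the identity permutation, so the permutation representation need not be injective.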

We should also note that the definition of an action might have been more precisely 
named a left action since the group elements appear on the left of the set elements. We 
could similarly define the notion of a right action. 

Examples 

Let G be a group and A a nonempty set. In each of the following examples the checks of
properties (1) and (2) of an action are left as exercises.

(1) Let ga = a, for all $g \in G$, $a \in A$. Properties (1) and (2) of a group action follow
immediately. This action is called the trivial action and G is said to act trivially on
A. Note that distinct elements of G induce the same permutation on A (in this case
the identity permutation). The associated permutation representation $G \to S_A$ is the
trivial homomorphism which maps every element of G to the identity.

If G acts on a set B and distinct elements of G induce distinct permutations of
B, the action is said to be faithful. A faithful action is therefore one in which the
associated permutation representation is injective.

The kernel of the action of G on B is defined to be $\{g \in G \mid gb = b \text{ for all } b \in B\}$,
namely the elements of G which fix all the elements of B. For the trivial action, the
kernel of the action is all of G and this action is not faithful when |G| > 1.

(2) The axioms for a vector space V over a field F include the two axioms that the
multiplicative group $F^\times$ act on the set V. Thus vector spaces are familiar examples
of actions of multiplicative groups of fields where there is even more structure (in
particular, V must be an abelian group) which can be exploited. In the special case
when $V = \mathbb{R}^n$ and $F = \mathbb{R}$ the action is specified by

$\alpha(r_1, r_2, \ldots, r_n) = (\alpha r_1, \alpha r_2, \ldots, \alpha r_n)$

for all $\alpha \in \mathbb{R}$, $(r_1, r_2, \ldots, r_n) \in \mathbb{R}^n$, where $\alpha r_i$ is just multiplication of two real
numbers.

(3) For any nonempty set A the symmetric group $S_A$ acts on A by $\sigma \cdot a = \sigma(a)$, for all
$\sigma \in S_A$, $a \in A$. The associated permutation representation is the identity map from
$S_A$ to itself.

(4) If we fix a labelling of the vertices of a regular n-gon, each element $\alpha$ of $D_{2n}$ gives
rise to a permutation $\sigma_\alpha$ of $\{1, 2, \ldots, n\}$ by the way the symmetry $\alpha$ permutes the
corresponding vertices. The map of $D_{2n} \times \{1, 2, \ldots, n\}$ onto $\{1, 2, \ldots, n\}$ defined
by $(\alpha, i) \to \sigma_\alpha(i)$ defines a group action of $D_{2n}$ on $\{1, 2, \ldots, n\}$. In keeping with
our notation for group actions we can now dispense with the formal and cumbersome
notation $\sigma_\alpha(i)$ and write $\alpha i$ in its place. Note that this action is faithful: distinct
symmetries of a regular n-gon induce distinct permutations of the vertices.

When n = 3 the action of $D_6$ on the three (labelled) vertices of a triangle gives
an injective homomorphism from $D_6$ to $S_3$. Since these groups have the same order,
this map must also be surjective, i.e., is an isomorphism: $D_6 \cong S_3$. This is another
proof of the same fact we established via generators and relations in the preceding
section. Geometrically it says that any permutation of the vertices of a triangle is a
symmetry. The analogous statement is not true for any n-gon with $n \geq 4$ (just by order
considerations we cannot have $D_{2n}$ isomorphic to $S_n$ for any $n \geq 4$).

(5) Let G be any group and let A = G. Define a map from $G \times A$ to A by $g \cdot a = ga$,
for each $g \in G$ and $a \in A$, where ga on the right hand side is the product of g and
a in the group G. This gives a group action of G on itself, where each (fixed) $g \in G$
permutes the elements of G by left multiplication:

$g : a \mapsto ga$ for all $a \in G$

(or, if G is written additively, we get $a \mapsto g + a$ and call this left translation). This
action is called the left regular action of G on itself. By the cancellation laws, this
action is faithful (check this).

Other examples of actions are given in the exercises. 

EXERCISES 

1. Let F be a field. Show that the multiplicative group of nonzero elements of F (denoted
by $F^\times$) acts on the set F by $g \cdot a = ga$, where $g \in F^\times$, $a \in F$ and ga is the usual product
in F of the two field elements (state clearly which axioms in the definition of a field are
used).

2. Show that the additive group $\mathbb{Z}$ acts on itself by $z \cdot a = z + a$ for all $z, a \in \mathbb{Z}$.

3. Show that the additive group $\mathbb{R}$ acts on the x, y plane $\mathbb{R} \times \mathbb{R}$ by $r \cdot (x, y) = (x + ry, y)$.

4. Let G be a group acting on a set A and fix some $a \in A$. Show that the following sets are
subgroups of G (cf. Exercise 26 of Section 1):

(a) the kernel of the action,

(b) $\{g \in G \mid ga = a\}$; this subgroup is called the stabilizer of a in G.

5. Prove that the kernel of an action of the group G on the set A is the same as the kernel of
the corresponding permutation representation $G \to S_A$ (cf. Exercise 14 in Section 6).

6. Prove that a group G acts faithfully on a set A if and only if the kernel of the action is the
set consisting only of the identity.

7. Prove that in Example 2 in this section the action is faithful.

8. Let A be a nonempty set and let k be a positive integer with $k \leq |A|$. The symmetric group
$S_A$ acts on the set B consisting of all subsets of A of cardinality k by $\sigma \cdot \{a_1, \ldots, a_k\} =
\{\sigma(a_1), \ldots, \sigma(a_k)\}$.

(a) Prove that this is a group action.

(b) Describe explicitly how the elements (1 2) and (1 2 3) act on the six 2-element subsets
of {1, 2, 3, 4}.

9. Do both parts of the preceding exercise with “ordered k-tuples” in place of “k-element
subsets,” where the action on k-tuples is defined as above but with set braces replaced by
parentheses (note that, for example, the 2-tuples (1,2) and (2,1) are different even though
the sets {1,2} and {2,1} are the same, so the sets being acted upon are different).

10. With reference to the preceding two exercises determine:

(a) for which values of k the action of $S_n$ on k-element subsets is faithful, and

(b) for which values of k the action of $S_n$ on ordered k-tuples is faithful.
11. Write out the cycle decomposition of the eight permutations in $S_4$ corresponding to the
elements of $D_8$ given by the action of $D_8$ on the vertices of a square (where the vertices
of the square are labelled as in Section 2).

12. Assume n is an even positive integer and show that $D_{2n}$ acts on the set consisting of pairs
of opposite vertices of a regular n-gon. Find the kernel of this action (label vertices as
usual).

13. Find the kernel of the left regular action.

14. Let G be a group and let A = G. Show that if G is non-abelian then the maps defined by
$g \cdot a = ag$ for all $g, a \in G$ do not satisfy the axioms of a (left) group action of G on itself.

15. Let G be any group and let A = G. Show that the maps defined by $g \cdot a = ag^{-1}$ for all
$g, a \in G$ do satisfy the axioms of a (left) group action of G on itself.

16. Let G be any group and let A = G. Show that the maps defined by $g \cdot a = gag^{-1}$ for all
$g, a \in G$ do satisfy the axioms of a (left) group action (this action of G on itself is called
conjugation).

17. Let G be a group and let G act on itself by left conjugation, so each $g \in G$ maps G to G
by

$x \mapsto gxg^{-1}$.

For fixed $g \in G$, prove that conjugation by g is an isomorphism from G onto itself (i.e.,
is an automorphism of G; cf. Exercise 20, Section 6). Deduce that x and $gxg^{-1}$ have
the same order for all x in G and that for any subset A of G, $|A| = |gAg^{-1}|$ (here
$gAg^{-1} = \{gag^{-1} \mid a \in A\}$).

18. Let H be a group acting on a set A. Prove that the relation $\sim$ on A defined by

$a \sim b$ if and only if a = hb for some $h \in H$

is an equivalence relation. (For each $x \in A$ the equivalence class of x under $\sim$ is called
the orbit of x under the action of H. The orbits under the action of H partition the set A.)

19. Let H be a subgroup (cf. Exercise 26 of Section 1) of the finite group G and let H act on
G (here A = G) by left multiplication. Let $x \in G$ and let $\mathcal{O}$ be the orbit of x under the
action of H. Prove that the map

$H \to \mathcal{O}$ defined by $h \mapsto hx$

is a bijection (hence all orbits have cardinality |H|). From this and the preceding exercise
deduce Lagrange's Theorem:

if G is a finite group and H is a subgroup of G then |H| divides |G|.

20. Show that the group of rigid motions of a tetrahedron is isomorphic to a subgroup (cf.
Exercise 26 of Section 1) of $S_4$.

21. Show that the group of rigid motions of a cube is isomorphic to $S_4$. [This group acts on
the set of four pairs of opposite vertices.]

22. Show that the group of rigid motions of an octahedron is isomorphic to a subgroup (cf.
Exercise 26 of Section 1) of $S_4$. [This group acts on the set of four pairs of opposite faces.]
Deduce that the groups of rigid motions of a cube and an octahedron are isomorphic. (These
groups are isomorphic because these solids are “dual”; see Introduction to Geometry
by H. Coxeter, Wiley, 1961. We shall see later that the groups of rigid motions of the
dodecahedron and icosahedron are isomorphic as well, since these solids are also dual.)

23. Explain why the action of the group of rigid motions of a cube on the set of three pairs of
opposite faces is not faithful. Find the kernel of this action.
CHAPTER 2


Subgroups 


2.1 DEFINITION AND EXAMPLES 

One basic method for unravelling the structure of any mathematical object which is 
defined by a set of axioms is to study subsets of that object which also satisfy the 
same axioms. We begin this program by discussing subgroups of a group. A second
basic method for unravelling structure is to study quotients of an object; the notion of 
a quotient group, which is a way (roughly speaking) of collapsing one group onto a 
smaller group, will be dealt with in the next chapter. Both of these themes will recur 
throughout the text as we study subgroups and quotient groups of a group, subrings and 
quotient rings of a ring, subspaces and quotient spaces of a vector space, etc. 


Definition. Let G be a group. The subset H of G is a subgroup of G if H is nonempty 
and H is closed under products and inverses (i.e., x,y e H implies x~ ] e H and 
xy e H). If H is a subgroup of G we shall write H <G. 


Subgroups of $G$ are just subsets of $G$ which are themselves groups with respect
to the operation defined in $G$, i.e., the binary operation on $G$ restricts to give a binary
operation on $H$ which is associative, has an identity in $H$, and has inverses in $H$ for all
the elements of $H$.

When we say that $H$ is a subgroup of $G$ we shall always mean that the operation
for the group $H$ is the operation on $G$ restricted to $H$ (in general it is possible that the
subset $H$ has the structure of a group with respect to some operation other than the
operation on $G$ restricted to $H$, cf. Example 5(a) following). As we have been doing for
functions restricted to a subset, we shall denote the operation for $G$ and the operation
for the subgroup $H$ by the same symbol. If $H \le G$ and $H \ne G$ we shall write $H < G$
to emphasize that the containment is proper.

If $H$ is a subgroup of $G$ then, since the operation for $H$ is the operation for $G$
restricted to $H$, any equation in the subgroup $H$ may also be viewed as an equation in
the group $G$. Thus the cancellation laws for $G$ imply that the identity for $H$ is the same
as the identity of $G$ (in particular, every subgroup must contain 1, the identity of $G$)
and the inverse of an element $x$ in $H$ is the same as the inverse of $x$ when considered
as an element of $G$ (so the notation $x^{-1}$ is unambiguous).
Examples

(1) $\mathbb{Z} \le \mathbb{Q}$ and $\mathbb{Q} \le \mathbb{R}$ with the operation of addition.

(2) Any group $G$ has two subgroups: $H = G$ and $H = \{1\}$; the latter is called the trivial
subgroup and will henceforth be denoted by 1.

(3) If $G = D_{2n}$ is the dihedral group of order $2n$, let $H$ be $\{1, r, r^2, \ldots, r^{n-1}\}$, the set of
all rotations in $G$. Since the product of two rotations is again a rotation and the inverse
of a rotation is also a rotation it follows that $H$ is a subgroup of $D_{2n}$ of order $n$.

(4) The set of even integers is a subgroup of the group of all integers under addition. 

(5) Some examples of subsets which are not subgroups: 

(a) $\mathbb{Q} - \{0\}$ under multiplication is not a subgroup of $\mathbb{R}$ under addition even though
both are groups and $\mathbb{Q} - \{0\}$ is a subset of $\mathbb{R}$; the operation of multiplication on
$\mathbb{Q} - \{0\}$ is not the restriction of the operation of addition on $\mathbb{R}$.

(b) $\mathbb{Z}^+$ (under addition) is not a subgroup of $\mathbb{Z}$ (under addition) because although
$\mathbb{Z}^+$ is closed under $+$, it does not contain the identity, 0, of $\mathbb{Z}$ and although each
$x \in \mathbb{Z}^+$ has an additive inverse, $-x$, in $\mathbb{Z}$, $-x \notin \mathbb{Z}^+$, i.e., $\mathbb{Z}^+$ is not closed under
the operation of taking inverses (in particular, $\mathbb{Z}^+$ is not a group under addition).
For analogous reasons, $(\mathbb{Z} - \{0\}, \times)$ is not a subgroup of $(\mathbb{Q} - \{0\}, \times)$.

(c) $D_6$ is not a subgroup of $D_8$ since the former is not even a subset of the latter.

(6) The relation “is a subgroup of” is transitive: if $H$ is a subgroup of a group $G$ and $K$
is a subgroup of $H$, then $K$ is also a subgroup of $G$.

As we saw in Chapter 1, even for easy examples checking that all the group axioms 
(especially the associative law) hold for any given binary operation can be tedious at 
best. Once we know that we have a group, however, checking that a subset of it is (or 
is not) a subgroup is a much easier task, since all we need to check is closure under 
multiplication and under taking inverses. The next proposition shows that these can be 
amalgamated into a single test and also shows that for finite groups it suffices to check 
for closure under multiplication. 

Proposition 1. (The Subgroup Criterion) A subset $H$ of a group $G$ is a subgroup if and
only if

(1) $H \ne \emptyset$, and

(2) for all $x, y \in H$, $xy^{-1} \in H$.

Furthermore, if $H$ is finite, then it suffices to check that $H$ is nonempty and closed
under multiplication.

Proof: If $H$ is a subgroup of $G$, then certainly (1) and (2) hold because $H$ contains
the identity of $G$ and the inverse of each of its elements and because $H$ is closed under
multiplication.

It remains to show conversely that if $H$ satisfies both (1) and (2), then $H \le G$. Let
$x$ be any element in $H$ (such $x$ exists by property (1)). Let $y = x$ and apply property
(2) to deduce that $1 = xx^{-1} \in H$, so $H$ contains the identity of $G$. Then, again by (2),
since $H$ contains 1 and $x$, $H$ contains the element $1x^{-1}$, i.e., $x^{-1} \in H$ and $H$ is closed
under taking inverses. Finally, if $x$ and $y$ are any two elements of $H$, then $H$ contains
$x$ and $y^{-1}$ by what we have just proved, so by (2), $H$ also contains $x(y^{-1})^{-1} = xy$.
Hence $H$ is also closed under multiplication, which proves $H$ is a subgroup of $G$.

Suppose now that $H$ is finite and closed under multiplication and let $x$ be any
element in $H$. Then there are only finitely many distinct elements among $x, x^2, x^3, \ldots$
and so $x^a = x^b$ for some integers $a, b$ with $b > a$. If $n = b - a$, then $x^n = 1$ so in
particular every element $x \in H$ is of finite order. Then $x^{n-1} = x^{-1}$ is an element of $H$,
so $H$ is automatically also closed under inverses.
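The two tests in Proposition 1 can be tried out on a small example. The following is a minimal sketch, assuming we model the additive group $\mathbb{Z}/12\mathbb{Z}$ with Python integers; the function names `is_subgroup` and `is_closed` and the sample subsets are our own illustrative choices, not part of the text.

```python
from itertools import product

def is_subgroup(H, op, inv):
    """One-step criterion: H is nonempty and x * y^{-1} lies in H for all x, y in H."""
    H = set(H)
    if not H:
        return False
    return all(op(x, inv(y)) in H for x, y in product(H, repeat=2))

def is_closed(H, op):
    """Finite test: H is nonempty and closed under the group operation."""
    H = set(H)
    return bool(H) and all(op(x, y) in H for x, y in product(H, repeat=2))

# Assumed example group: Z/12Z under addition, so "x * y^{-1}" reads x - y mod 12.
n = 12
add = lambda x, y: (x + y) % n
neg = lambda x: (-x) % n

evens = {0, 2, 4, 6, 8, 10}   # a subgroup
not_closed = {0, 1, 2}        # contains 0 but is not closed under addition

print(is_subgroup(evens, add, neg))       # True
print(is_closed(evens, add))              # True
print(is_subgroup(not_closed, add, neg))  # False
print(is_closed(not_closed, add))         # False
```

For a finite subset the two functions always agree, which is the content of the last sentence of the proposition; for an infinite subset such as $\mathbb{Z}^+$ in $\mathbb{Z}$ closure alone is not enough, as Example 5(b) shows.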

EXERCISES 

Let G be a group. 

1. In each of (a) - (e) prove that the specified subset is a subgroup of the given group:

(a) the set of complex numbers of the form $a + ai$, $a \in \mathbb{R}$ (under addition)

(b) the set of complex numbers of absolute value 1, i.e., the unit circle in the complex
plane (under multiplication)

(c) for fixed $n \in \mathbb{Z}^+$ the set of rational numbers whose denominators divide $n$ (under
addition)

(d) for fixed $n \in \mathbb{Z}^+$ the set of rational numbers whose denominators are relatively prime
to $n$ (under addition)

(e) the set of nonzero real numbers whose square is a rational number (under multiplication).

2. In each of (a) - (e) prove that the specified subset is not a subgroup of the given group:

(a) the set of 2-cycles in $S_n$ for $n \ge 3$

(b) the set of reflections in $D_{2n}$ for $n \ge 3$

(c) for $n$ a composite integer $> 1$ and $G$ a group containing an element of order $n$, the set
$\{x \in G \mid |x| = n\} \cup \{1\}$

(d) the set of (positive and negative) odd integers in $\mathbb{Z}$ together with 0

(e) the set of real numbers whose square is a rational number (under addition).

3. Show that the following subsets of the dihedral group $D_8$ are actually subgroups:

(a) (b) $\{1, r^2, sr, sr^3\}$.

4. Give an explicit example of a group G and an infinite subset H of G that is closed under 
the group operation but is not a subgroup of G. 

5. Prove that $G$ cannot have a subgroup $H$ with $|H| = n - 1$, where $n = |G| > 2$.

6. Let $G$ be an abelian group. Prove that $\{g \in G \mid |g| < \infty\}$ is a subgroup of $G$ (called the
torsion subgroup of $G$). Give an explicit example where this set is not a subgroup when
$G$ is non-abelian.

7. Fix some $n \in \mathbb{Z}$ with $n > 1$. Find the torsion subgroup (cf. the previous exercise) of
$\mathbb{Z} \times (\mathbb{Z}/n\mathbb{Z})$. Show that the set of elements of infinite order together with the identity is
not a subgroup of this direct product.

8. Let $H$ and $K$ be subgroups of $G$. Prove that $H \cup K$ is a subgroup if and only if either
$H \subseteq K$ or $K \subseteq H$.

9. Let $G = GL_n(F)$, where $F$ is any field. Define

$SL_n(F) = \{A \in GL_n(F) \mid \det(A) = 1\}$

(called the special linear group). Prove that $SL_n(F) \le GL_n(F)$.

10. (a) Prove that if $H$ and $K$ are subgroups of $G$ then so is their intersection $H \cap K$.

(b) Prove that the intersection of an arbitrary nonempty collection of subgroups of $G$ is
again a subgroup of $G$ (do not assume the collection is countable).

11. Let $A$ and $B$ be groups. Prove that the following sets are subgroups of the direct product
$A \times B$:

(a) $\{(a, 1) \mid a \in A\}$

(b) $\{(1, b) \mid b \in B\}$

(c) $\{(a, a) \mid a \in A\}$, where here we assume $B = A$ (called the diagonal subgroup).

12. Let $A$ be an abelian group and fix some $n \in \mathbb{Z}$. Prove that the following sets are subgroups
of $A$:

(a) $\{a^n \mid a \in A\}$

(b) $\{a \in A \mid a^n = 1\}$.

13. Let $H$ be a subgroup of the additive group of rational numbers with the property that
$1/x \in H$ for every nonzero element $x$ of $H$. Prove that $H = 0$ or $\mathbb{Q}$.

14. Show that $\{x \in D_{2n} \mid x^2 = 1\}$ is not a subgroup of $D_{2n}$ (here $n \ge 3$).

15. Let $H_1 \le H_2 \le \cdots$ be an ascending chain of subgroups of $G$. Prove that $\bigcup_{i=1}^{\infty} H_i$ is a
subgroup of $G$.

16. Let $n \in \mathbb{Z}^+$ and let $F$ be a field. Prove that the set $\{(a_{ij}) \in GL_n(F) \mid a_{ij} = 0 \text{ for all } i > j\}$
is a subgroup of $GL_n(F)$ (called the group of upper triangular matrices).

17. Let $n \in \mathbb{Z}^+$ and let $F$ be a field. Prove that the set $\{(a_{ij}) \in GL_n(F) \mid a_{ij} = 0 \text{ for all } i > j, \text{ and } a_{ii} = 1 \text{ for all } i\}$
is a subgroup of $GL_n(F)$.


2.2 CENTRALIZERS AND NORMALIZERS, STABILIZERS AND KERNELS

We now introduce some important families of subgroups of an arbitrary group G which 
in particular provide many examples of subgroups. Let A be any nonempty subset 
of G. 

Definition. Define $C_G(A) = \{g \in G \mid gag^{-1} = a \text{ for all } a \in A\}$. This subset of $G$
is called the centralizer of $A$ in $G$. Since $gag^{-1} = a$ if and only if $ga = ag$, $C_G(A)$ is
the set of elements of $G$ which commute with every element of $A$.

We show $C_G(A)$ is a subgroup of $G$. First of all, $C_G(A) \ne \emptyset$ because $1 \in C_G(A)$:
the definition of the identity specifies that $1a = a1$, for all $a \in G$ (in particular, for
all $a \in A$) so 1 satisfies the defining condition for membership in $C_G(A)$. Secondly,
assume $x, y \in C_G(A)$, that is, for all $a \in A$, $xax^{-1} = a$ and $yay^{-1} = a$ (note that
this does not mean $xy = yx$). Observe first that since $yay^{-1} = a$, multiplying both
sides of this first on the left by $y^{-1}$, then on the right by $y$ and then simplifying gives
$a = y^{-1}ay$, i.e., $y^{-1} \in C_G(A)$ so that $C_G(A)$ is closed under taking inverses. Now

$(xy)a(xy)^{-1} = (xy)a(y^{-1}x^{-1})$   (by Proposition 1.1(4) applied to $(xy)^{-1}$)

$= x(yay^{-1})x^{-1}$   (by the associative law)

$= xax^{-1}$   (since $y \in C_G(A)$)

$= a$   (since $x \in C_G(A)$)

so $xy \in C_G(A)$ and $C_G(A)$ is closed under products, hence $C_G(A) \le G$.

In the special case when $A = \{a\}$ we shall write simply $C_G(a)$ instead of $C_G(\{a\})$.
In this case $a^n \in C_G(a)$ for all $n \in \mathbb{Z}$.
For example, in an abelian group $G$, $C_G(A) = G$, for all subsets $A$. One can
check by inspection that $C_{Q_8}(i) = \{\pm 1, \pm i\}$. Some other examples are specified in the
exercises.

We shall shortly discuss how to minimize the calculation of commutativities between
single group elements which appears to be inherent in the computation of centralizers
(and other subgroups of a similar nature).

Definition. Define $Z(G) = \{g \in G \mid gx = xg \text{ for all } x \in G\}$, the set of elements
commuting with all the elements of $G$. This subset of $G$ is called the center of $G$.

Note that $Z(G) = C_G(G)$, so the argument above proves $Z(G) \le G$ as a special
case. As an exercise, the reader may wish to prove $Z(G)$ is a subgroup directly.

Definition. Define $gAg^{-1} = \{gag^{-1} \mid a \in A\}$. Define the normalizer of $A$ in $G$ to
be the set $N_G(A) = \{g \in G \mid gAg^{-1} = A\}$.

Notice that if $g \in C_G(A)$, then $gag^{-1} = a \in A$ for all $a \in A$ so $C_G(A) \le N_G(A)$.
The proof that $N_G(A)$ is a subgroup of $G$ follows the same steps which demonstrated
that $C_G(A) \le G$ with appropriate modifications.

Examples 

(1) If $G$ is abelian then all the elements of $G$ commute, so $Z(G) = G$. Similarly,
$C_G(A) = N_G(A) = G$ for any subset $A$ of $G$ since $gag^{-1} = gg^{-1}a = a$ for every $g \in G$ and
every $a \in A$.

(2) Let $G = D_8$ be the dihedral group of order 8 with the usual generators $r$ and $s$ and
let $A = \{1, r, r^2, r^3\}$ be the subgroup of rotations in $D_8$. We show that $C_{D_8}(A) = A$.
Since all powers of $r$ commute with each other, $A \le C_{D_8}(A)$. Since $sr = r^{-1}s \ne rs$
the element $s$ does not commute with all members of $A$, i.e., $s \notin C_{D_8}(A)$. Finally, the
elements of $D_8$ that are not in $A$ are all of the form $sr^i$ for some $i \in \{0, 1, 2, 3\}$. If
the element $sr^i$ were in $C_{D_8}(A)$ then since $C_{D_8}(A)$ is a subgroup which contains $r$ we
would also have the element $s = (sr^i)(r^{-i})$ in $C_{D_8}(A)$, a contradiction. This shows
$C_{D_8}(A) = A$.

(3) As in the preceding example let $G = D_8$ and let $A = \{1, r, r^2, r^3\}$. We show that
$N_{D_8}(A) = D_8$. Since, in general, the centralizer of a subset is contained in its normalizer,
$A \le N_{D_8}(A)$. Next compute that

$sAs^{-1} = \{s1s^{-1}, srs^{-1}, sr^2s^{-1}, sr^3s^{-1}\} = \{1, r^3, r^2, r\} = A,$

so that $s \in N_{D_8}(A)$. (Note that the set $sAs^{-1}$ equals the set $A$ even though the elements
in these two sets appear in different orders; this is because $s$ is in the normalizer of
$A$ but not in the centralizer of $A$.) Now both $r$ and $s$ belong to the subgroup $N_{D_8}(A)$
and hence $s^i r^j \in N_{D_8}(A)$ for all integers $i$ and $j$, that is, every element of $D_8$ is in
$N_{D_8}(A)$ (recall that $r$ and $s$ generate $D_8$). Since $D_8 \le N_{D_8}(A)$ we have $N_{D_8}(A) = D_8$
(the reverse containment being obvious from the definition of a normalizer).

(4) We show that the center of $D_8$ is the subgroup $\{1, r^2\}$. First observe that the center
of any group $G$ is contained in $C_G(A)$ for any subset $A$ of $G$. Thus by Example 2
above $Z(D_8) \le C_{D_8}(A) = A$, where $A = \{1, r, r^2, r^3\}$. The calculation in Example
2 shows that $r$ and similarly $r^3$ are not in $Z(D_8)$, so $Z(D_8) \le \{1, r^2\}$. To show the
reverse inclusion note that $r$ commutes with $r^2$ and calculate that $s$ also commutes
with $r^2$. Since $r$ and $s$ generate $D_8$, every element of $D_8$ commutes with $r^2$ (and 1),
hence $\{1, r^2\} \le Z(D_8)$ and so equality holds.

(5) Let $G = S_3$ and let $A$ be the subgroup $\{1, (1\,2)\}$. We explain why $C_{S_3}(A) = N_{S_3}(A) = A$.
One can compute directly that $C_{S_3}(A) = A$, using the ideas in Example 2 above to
minimize the calculations. Alternatively, since an element commutes with its powers,
$A \le C_{S_3}(A)$. By Lagrange’s Theorem (Exercise 19 in Section 1.7) the order of the
subgroup $C_{S_3}(A)$ of $S_3$ divides $|S_3| = 6$. Also by Lagrange’s Theorem applied to the
subgroup $A$ of the group $C_{S_3}(A)$ we have that $2 \mid |C_{S_3}(A)|$. The only possibilities
are: $|C_{S_3}(A)| = 2$ or 6. If the latter occurs, $C_{S_3}(A) = S_3$, i.e., $A \le Z(S_3)$; this is a
contradiction because $(1\,2)$ does not commute with $(1\,2\,3)$. Thus $|C_{S_3}(A)| = 2$ and so
$A = C_{S_3}(A)$.

Next note that $N_{S_3}(A) = A$ because $\sigma \in N_{S_3}(A)$ if and only if

$\{\sigma 1 \sigma^{-1}, \sigma(1\,2)\sigma^{-1}\} = \{1, (1\,2)\}$.

Since $\sigma 1 \sigma^{-1} = 1$, this equality of sets occurs if and only if $\sigma(1\,2)\sigma^{-1} = (1\,2)$ as
well, i.e., if and only if $\sigma \in C_{S_3}(A)$.

The center of $S_3$ is the identity because $Z(S_3) \le C_{S_3}(A) = A$ and $(1\,2) \notin Z(S_3)$.
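The conclusions of Examples 2 to 4 can be confirmed by brute force. The sketch below assumes we realize $D_8$ as permutations of the vertex indices 0, 1, 2, 3 of a square, with `r` a quarter turn and `s` the reflection fixing indices 0 and 2; this encoding and all helper names are our own choices for illustration.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """All products of the generators; in a finite group this is the generated subgroup."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for h in gens:
            gh = compose(g, h)
            if gh not in elems:
                elems.add(gh)
                frontier.append(gh)
    return elems

def conj(g, a):
    """Return g a g^{-1}."""
    return compose(compose(g, a), inverse(g))

def centralizer(G, A):
    return {g for g in G if all(conj(g, a) == a for a in A)}

def normalizer(G, A):
    A = set(A)
    return {g for g in G if {conj(g, a) for a in A} == A}

r = (1, 2, 3, 0)          # rotation: index i goes to i + 1 mod 4
s = (0, 3, 2, 1)          # reflection fixing indices 0 and 2
D8 = generate([r, s])
e = (0, 1, 2, 3)
r2 = compose(r, r)
r3 = compose(r2, r)
A = {e, r, r2, r3}        # the subgroup of rotations

print(len(D8))                          # 8
print(centralizer(D8, A) == A)          # True  (Example 2)
print(normalizer(D8, A) == D8)          # True  (Example 3)
print(centralizer(D8, D8) == {e, r2})   # True  (Example 4: the center is {1, r^2})
```

Such exhaustive checks are only feasible for small groups; the arguments in the examples are what generalize.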


Stabilizers and Kernels of Group Actions 

The fact that the normalizer of $A$ in $G$, the centralizer of $A$ in $G$, and the center of $G$
are all subgroups can be deduced as special cases of results on group actions, indicating
that the structure of $G$ is reflected by the sets on which it acts, as follows: if $G$ is a
group acting on a set $S$ and $s$ is some fixed element of $S$, the stabilizer of $s$ in $G$ is the
set

$G_s = \{g \in G \mid g \cdot s = s\}$

(see Exercise 4 in Section 1.7). We show briefly that $G_s \le G$: first $1 \in G_s$ by axiom
(2) of an action. Also, if $y \in G_s$,

$s = 1 \cdot s = (y^{-1}y) \cdot s$

$= y^{-1} \cdot (y \cdot s)$   (by axiom (1) of an action)

$= y^{-1} \cdot s$   (since $y \in G_s$)

so $y^{-1} \in G_s$ as well. Finally, if $x, y \in G_s$, then

$(xy) \cdot s = x \cdot (y \cdot s)$   (by axiom (1) of an action)

$= x \cdot s$   (since $y \in G_s$)

$= s$   (since $x \in G_s$).

This proves $G_s$ is a subgroup¹ of $G$. A similar (but easier) argument proves that the
kernel of an action is a subgroup, where the kernel of the action of $G$ on $S$ is defined as

$\{g \in G \mid g \cdot s = s, \text{ for all } s \in S\}$

(see Exercise 1 in Section 1.7).

¹Notice how the steps to prove $G_s$ is a subgroup are the same as those to prove $C_G(A) \le G$ with
axiom (1) of an action taking the place of the associative law.
Examples

(1) The group $G = D_8$ acts on the set $A$ of four vertices of a square (cf. Example 4 in
Section 1.7). The stabilizer of any vertex $a$ is the subgroup $\{1, t\}$ of $D_8$, where $t$ is
the reflection about the line of symmetry passing through vertex $a$ and the center of
the square. The kernel of this action is the identity subgroup since only the identity
symmetry fixes every vertex.

(2) The group $G = D_8$ also acts on the set $A$ whose elements are the two unordered pairs
of opposite vertices (in the labelling of Figure 2 in Section 1.2, $A = \{\{1, 3\}, \{2, 4\}\}$).
The kernel of the action of $D_8$ on this set $A$ is the subgroup $\{1, s, r^2, sr^2\}$ and for either
element $a \in A$ the stabilizer of $a$ in $D_8$ equals the kernel of the action.
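Both examples can be checked mechanically. In the sketch below we assume the vertices 1, 2, 3, 4 are stored as indices 0, 1, 2, 3, that `r` is the quarter turn and `s` the reflection through vertices 1 and 3 (indices 0 and 2); the helper names `stabilizer`, `kernel`, `act_vertex` and `act_pair` are hypothetical and chosen only for this illustration.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens):
    """All products of the generators; in a finite group this is the generated subgroup."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for h in gens:
            gh = compose(g, h)
            if gh not in elems:
                elems.add(gh)
                frontier.append(gh)
    return elems

def stabilizer(G, act, point):
    """Elements of G fixing the given point under the action act(g, point)."""
    return {g for g in G if act(g, point) == point}

def kernel(G, act, points):
    """Elements of G fixing every point of the set acted on."""
    return {g for g in G if all(act(g, p) == p for p in points)}

r = (1, 2, 3, 0)
s = (0, 3, 2, 1)
e = (0, 1, 2, 3)
r2 = compose(r, r)
D8 = generate([r, s])

# Action on vertices, and the induced action on unordered pairs of opposite vertices.
act_vertex = lambda g, v: g[v]
act_pair = lambda g, pair: frozenset(g[v] for v in pair)

vertices = range(4)
pairs = [frozenset({0, 2}), frozenset({1, 3})]

print(stabilizer(D8, act_vertex, 0) == {e, s})                    # True
print(kernel(D8, act_vertex, vertices) == {e})                    # True
print(kernel(D8, act_pair, pairs) == {e, s, r2, compose(s, r2)})  # True
```

The first action is faithful (trivial kernel) while the second is not, matching the two examples.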

Finally, we observe that the fact that centralizers, normalizers and kernels are
subgroups is a special case of the facts that stabilizers and kernels of actions are subgroups
(this will be discussed further in Chapter 4). Let $S = \mathcal{P}(G)$, the collection of all subsets
of $G$, and let $G$ act on $S$ by conjugation, that is, for each $g \in G$ and each $B \subseteq G$ let

$g : B \mapsto gBg^{-1}$ where $gBg^{-1} = \{gbg^{-1} \mid b \in B\}$

(see Exercise 16 in Section 1.7). Under this action, it is easy to check that $N_G(A)$ is
precisely the stabilizer of $A$ in $G$ (i.e., $N_G(A) = G_s$ where $s = A \in \mathcal{P}(G)$), so $N_G(A)$
is a subgroup of $G$.

Next let the group $N_G(A)$ act on the set $S = A$ by conjugation, i.e., for all
$g \in N_G(A)$ and $a \in A$

$g : a \mapsto gag^{-1}$.

Note that this does map $A$ to $A$ by the definition of $N_G(A)$ and so gives an action on
$A$. Here it is easy to check that $C_G(A)$ is precisely the kernel of this action, hence
$C_G(A) \le N_G(A)$; by transitivity of the relation “$\le$”, $C_G(A) \le G$. Finally, $Z(G)$ is the
kernel of $G$ acting on $S = G$ by conjugation, so $Z(G) \le G$.

EXERCISES 

1. Prove that $C_G(A) = \{g \in G \mid g^{-1}ag = a \text{ for all } a \in A\}$.

2. Prove that $C_G(Z(G)) = G$ and deduce that $N_G(Z(G)) = G$.

3. Prove that if $A$ and $B$ are subsets of $G$ with $A \subseteq B$ then $C_G(B)$ is a subgroup of $C_G(A)$.

4. For each of $S_3$, $D_8$, and $Q_8$ compute the centralizers of each element and find the center of
each group. Does Lagrange’s Theorem (Exercise 19 in Section 1.7) simplify your work?

5. In each of parts (a) to (c) show that for the specified group $G$ and subgroup $A$ of $G$,
$C_G(A) = A$ and $N_G(A) = G$.

(a) $G = S_3$ and $A = \{1, (1\,2\,3), (1\,3\,2)\}$.

(b) $G = D_8$ and $A = \{1, s, r^2, sr^2\}$.

(c) $G = D_{10}$ and $A = \{1, r, r^2, r^3, r^4\}$.

6. Let $H$ be a subgroup of the group $G$.

(a) Show that $H \le N_G(H)$. Give an example to show that this is not necessarily true if
$H$ is not a subgroup.

(b) Show that $H \le C_G(H)$ if and only if $H$ is abelian.

7. Let $n \in \mathbb{Z}$ with $n \ge 3$. Prove the following:

(a) $Z(D_{2n}) = 1$ if $n$ is odd

(b) $Z(D_{2n}) = \{1, r^k\}$ if $n = 2k$.

8. Let $G = S_n$, fix an $i \in \{1, 2, \ldots, n\}$ and let $G_i = \{\sigma \in G \mid \sigma(i) = i\}$ (the stabilizer of $i$ in
$G$). Use group actions to prove that $G_i$ is a subgroup of $G$. Find $|G_i|$.

9. For any subgroup $H$ of $G$ and any nonempty subset $A$ of $G$ define $N_H(A)$ to be the set
$\{h \in H \mid hAh^{-1} = A\}$. Show that $N_H(A) = N_G(A) \cap H$ and deduce that $N_H(A)$ is a
subgroup of $H$ (note that $A$ need not be a subset of $H$).

10. Let $H$ be a subgroup of order 2 in $G$. Show that $N_G(H) = C_G(H)$. Deduce that if
$N_G(H) = G$ then $H \le Z(G)$.

11. Prove that $Z(G) \le N_G(A)$ for any subset $A$ of $G$.

12. Let $R$ be the set of all polynomials with integer coefficients in the independent variables
$x_1, x_2, x_3, x_4$, i.e., the members of $R$ are finite sums of elements of the form $a x_1^{r_1} x_2^{r_2} x_3^{r_3} x_4^{r_4}$,
where $a$ is any integer and $r_1, \ldots, r_4$ are nonnegative integers. For example,

$12x_1^5x_2^7x_4 - 18x_2^3x_3 + 11x_1^6x_2x_3^3x_4^{23}$   (*)

is a typical element of $R$. Each $\sigma \in S_4$ gives a permutation of $\{x_1, \ldots, x_4\}$ by defining
$\sigma \cdot x_i = x_{\sigma(i)}$. This may be extended to a map from $R$ to $R$ by defining

$\sigma \cdot p(x_1, x_2, x_3, x_4) = p(x_{\sigma(1)}, x_{\sigma(2)}, x_{\sigma(3)}, x_{\sigma(4)})$

for all $p(x_1, x_2, x_3, x_4) \in R$ (i.e., $\sigma$ simply permutes the indices of the variables). For
example, if $\sigma = (1\,2)(3\,4)$ and $p(x_1, \ldots, x_4)$ is the polynomial in (*) above, then

$\sigma \cdot p(x_1, x_2, x_3, x_4) = 12x_2^5x_1^7x_3 - 18x_1^3x_4 + 11x_2^6x_1x_4^3x_3^{23}$
$= 12x_1^7x_2^5x_3 - 18x_1^3x_4 + 11x_1x_2^6x_3^{23}x_4^3.$

(a) Let $p = p(x_1, \ldots, x_4)$ be the polynomial in (*) above, let $\sigma = (1\,2\,3\,4)$ and let
$\tau = (1\,2\,3)$. Compute $\sigma \cdot p$, $\tau \cdot (\sigma \cdot p)$, $(\tau \circ \sigma) \cdot p$, and $(\sigma \circ \tau) \cdot p$.

(b) Prove that these definitions give a (left) group action of $S_4$ on $R$.

(c) Exhibit all permutations in $S_4$ that stabilize $x_4$ and prove that they form a subgroup
isomorphic to $S_3$.

(d) Exhibit all permutations in $S_4$ that stabilize the element $x_1 + x_2$ and prove that they
form an abelian subgroup of order 4.

(e) Exhibit all permutations in $S_4$ that stabilize the element $x_1x_2 + x_3x_4$ and prove that
they form a subgroup isomorphic to the dihedral group of order 8.

(f) Show that the permutations in $S_4$ that stabilize the element $(x_1 + x_2)(x_3 + x_4)$ are
exactly the same as those found in part (e). (The two polynomials appearing in parts
(e) and (f) and the subgroup that stabilizes them will play an important role in the
study of roots of quartic equations in Section 14.6.)

13. Let $n$ be a positive integer and let $R$ be the set of all polynomials with integer coefficients in
the independent variables $x_1, x_2, \ldots, x_n$, i.e., the members of $R$ are finite sums of elements
of the form $a x_1^{r_1} x_2^{r_2} \cdots x_n^{r_n}$, where $a$ is any integer and $r_1, \ldots, r_n$ are nonnegative integers.
For each $\sigma \in S_n$ define a map

$\sigma : R \to R$ by $\sigma \cdot p(x_1, x_2, \ldots, x_n) = p(x_{\sigma(1)}, x_{\sigma(2)}, \ldots, x_{\sigma(n)})$.

Prove that this defines a (left) group action of $S_n$ on $R$.

14. Let $H(F)$ be the Heisenberg group over the field $F$ introduced in Exercise 11 of Section
1.4. Determine which matrices lie in the center of $H(F)$ and prove that $Z(H(F))$ is
isomorphic to the additive group $F$.
2.3 CYCLIC GROUPS AND CYCLIC SUBGROUPS


Let $G$ be any group and let $x$ be any element of $G$. One way of forming a subgroup $H$
of $G$ is by letting $H$ be the set of all integer (positive, negative and zero) powers of $x$
(this guarantees closure under inverses and products at least as far as $x$ is concerned).
In this section we study groups which are generated by one element.

Definition. A group $H$ is cyclic if $H$ can be generated by a single element, i.e., there
is some element $x \in H$ such that $H = \{x^n \mid n \in \mathbb{Z}\}$ (where as usual the operation is
multiplication).

In additive notation $H$ is cyclic if $H = \{nx \mid n \in \mathbb{Z}\}$. In both cases we shall write
$H = \langle x \rangle$ and say $H$ is generated by $x$ (and $x$ is a generator of $H$). A cyclic group
may have more than one generator. For example, if $H = \langle x \rangle$, then also $H = \langle x^{-1} \rangle$
because $(x^{-1})^n = x^{-n}$ and as $n$ runs over all integers so does $-n$ so that

$\{x^n \mid n \in \mathbb{Z}\} = \{(x^{-1})^n \mid n \in \mathbb{Z}\}$.

We shall shortly show how to determine all generators for a given cyclic group $H$. One
should note that the elements of $\langle x \rangle$ are powers of $x$ (or multiples of $x$, in groups
written additively) and not integers. It is not necessarily true that all powers of $x$ are
distinct. Also, by the laws for exponents (Exercise 19 in Section 1.1) cyclic groups are
abelian.


Examples 

(1) Let $G = D_{2n} = \langle r, s \mid r^n = s^2 = 1, rs = sr^{-1} \rangle$, $n \ge 3$ and let $H$ be the subgroup
of all rotations of the $n$-gon. Thus $H = \langle r \rangle$ and the distinct elements of $H$ are
$1, r, r^2, \ldots, r^{n-1}$ (these are all the distinct powers of $r$). In particular, $|H| = n$ and
the generator, $r$, of $H$ has order $n$. The powers of $r$ “cycle” (forward and backward)
with period $n$, that is,

$r^n = 1, \quad r^{n+1} = r, \quad r^{n+2} = r^2, \ldots$
$r^{-1} = r^{n-1}, \quad r^{-2} = r^{n-2}, \ldots$ etc.

In general, to write any power of $r$, say $r^t$, in the form $r^k$, for some $k$ between 0 and
$n - 1$ use the Division Algorithm to write

$t = nq + k$, where $0 \le k < n$,

so that

$r^t = r^{nq+k} = (r^n)^q r^k = 1^q r^k = r^k$.

For example, in $D_8$, $r^4 = 1$ so $r^{105} = r^{4(26)+1} = r$ and $r^{-42} = r^{4(-11)+2} = r^2$.
Observe that $D_{2n}$ itself is not a cyclic group since it is non-abelian.

(2) Let $H = \mathbb{Z}$ with operation $+$. Thus $H = \langle 1 \rangle$ (here 1 is the integer 1 and the identity
of $H$ is 0) and each element in $H$ can be written uniquely in the form $n \cdot 1$, for some
$n \in \mathbb{Z}$. In contrast to the preceding example, multiples of the generator are all distinct
and we need to take both positive, negative and zero multiples of the generator to
obtain all elements of $H$. In this example $|H|$ and the order of the generator 1 are
both $\infty$. Note also that $H = \langle -1 \rangle$ since each integer $x$ can be written (uniquely) as
$(-x)(-1)$.
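The reduction of exponents in Example 1 is just the Division Algorithm, and can be sketched in a few lines. The function name `reduce_exponent` is our own; the only fact assumed about Python is that `divmod(t, n)` returns a remainder in the range $0 \le k < n$ when $n > 0$, including for negative $t$.

```python
def reduce_exponent(t, n):
    """Return k with 0 <= k < n and t = n*q + k, so that r^t = r^k whenever r^n = 1."""
    q, k = divmod(t, n)      # for n > 0 the remainder k satisfies 0 <= k < n
    assert t == n * q + k    # the Division Algorithm identity
    return k

# The two computations from Example 1, in D_8 where r^4 = 1:
print(reduce_exponent(105, 4))   # 1, since 105 = 4(26) + 1
print(reduce_exponent(-42, 4))   # 2, since -42 = 4(-11) + 2
```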
Before discussing cyclic groups further we prove that the various properties of finite
and infinite cyclic groups we observed in the preceding two examples are generic. This
proposition also validates the claim (in Chapter 1) that the use of the terminology for
“order” of an element and the use of the symbol $|\ |$ are consistent with the notion of
order of a set.

Proposition 2. If $H = \langle x \rangle$, then $|H| = |x|$ (where if one side of this equality is
infinite, so is the other). More specifically

(1) if $|H| = n < \infty$, then $x^n = 1$ and $1, x, x^2, \ldots, x^{n-1}$ are all the distinct elements
of $H$, and

(2) if $|H| = \infty$, then $x^n \ne 1$ for all $n \ne 0$ and $x^a \ne x^b$ for all $a \ne b$ in $\mathbb{Z}$.

Proof: Let $|x| = n$ and first consider the case when $n < \infty$. The elements
$1, x, x^2, \ldots, x^{n-1}$ are distinct because if $x^a = x^b$, with, say, $0 \le a < b < n$, then
$x^{b-a} = x^0 = 1$, contrary to $n$ being the smallest positive power of $x$ giving the identity.
Thus $H$ has at least $n$ elements and it remains to show that these are all of them. As we
did in Example 1, if $x^t$ is any power of $x$, use the Division Algorithm to write $t = nq + k$,
where $0 \le k < n$, so

$x^t = x^{nq+k} = (x^n)^q x^k = 1^q x^k = x^k \in \{1, x, x^2, \ldots, x^{n-1}\}$,

as desired.

Next suppose $|x| = \infty$ so no positive power of $x$ is the identity. If $x^a = x^b$, for
some $a$ and $b$ with, say, $a < b$, then $x^{b-a} = 1$, a contradiction. Distinct powers of $x$
are distinct elements of $H$ so $|H| = \infty$. This completes the proof of the proposition.

Note that the proof of the proposition gives the method for reducing arbitrary
powers of a generator in a finite cyclic group to the “least residue” powers. It is not a
coincidence that the calculations of distinct powers of a generator of a cyclic group of
order $n$ are carried out via arithmetic in $\mathbb{Z}/n\mathbb{Z}$. Theorem 4 following proves that these
two groups are isomorphic.

First we need an easy proposition. 

Proposition 3. Let $G$ be an arbitrary group, $x \in G$ and let $m, n \in \mathbb{Z}$. If $x^n = 1$ and
$x^m = 1$, then $x^d = 1$, where $d = (m, n)$. In particular, if $x^m = 1$ for some $m \in \mathbb{Z}$, then
$|x|$ divides $m$.

Proof: By the Euclidean Algorithm (see Section 0.2 (6)) there exist integers $r$ and
$s$ such that $d = mr + ns$, where $d$ is the g.c.d. of $m$ and $n$. Thus

$x^d = x^{mr+ns} = (x^m)^r (x^n)^s = 1^r 1^s = 1$.

This proves the first assertion.

If $x^m = 1$, let $n = |x|$. If $m = 0$, certainly $n \mid m$, so we may assume $m \ne 0$. Since
some nonzero power of $x$ is the identity, $n < \infty$. Let $d = (m, n)$ so by the preceding
result $x^d = 1$. Since $0 < d \le n$ and $n$ is the smallest positive power of $x$ which gives
the identity, we must have $d = n$, that is, $n \mid m$, as asserted.
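The proof rests on writing $d = mr + ns$, which the extended Euclidean Algorithm produces explicitly. The following is a minimal sketch for positive $m$ and $n$; the function `extended_gcd` and the sample element $x = 2$ in the multiplicative group of nonzero residues mod 7 are our own illustrative choices, not taken from the text.

```python
def extended_gcd(m, n):
    """Return (d, r, s) with d = gcd(m, n) = m*r + n*s, for positive integers m, n."""
    old_rem, rem = m, n
    old_r, r = 1, 0
    old_s, s = 0, 1
    while rem != 0:
        q = old_rem // rem
        old_rem, rem = rem, old_rem - q * rem
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
    return old_rem, old_r, old_s

# Assumed example: x = 2 modulo 7 satisfies x^6 = 1 and x^9 = 1.
p, x = 7, 2
m, n = 6, 9
print(pow(x, m, p), pow(x, n, p))   # 1 1

d, r, s = extended_gcd(m, n)
print(d, r, s)                      # 3 -1 1, and indeed 6*(-1) + 9*1 = 3
print(pow(x, d, p))                 # 1, as Proposition 3 predicts
```

Here $|x| = 3$, which divides both 6 and 9, in line with the second assertion of the proposition.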
Theorem 4. Any two cyclic groups of the same order are isomorphic. More specifically,

(1) if $n \in \mathbb{Z}^+$ and $\langle x \rangle$ and $\langle y \rangle$ are both cyclic groups of order $n$, then the map

$\varphi : \langle x \rangle \to \langle y \rangle$
$x^k \mapsto y^k$

is well defined and is an isomorphism

(2) if $\langle x \rangle$ is an infinite cyclic group, the map

$\varphi : \mathbb{Z} \to \langle x \rangle$
$k \mapsto x^k$

is well defined and is an isomorphism.


Proof: Suppose $\langle x \rangle$ and $\langle y \rangle$ are both cyclic groups of order $n$. Let $\varphi : \langle x \rangle \to \langle y \rangle$
be defined by $\varphi(x^k) = y^k$; we must first prove $\varphi$ is well defined, that is,

if $x^r = x^s$, then $\varphi(x^r) = \varphi(x^s)$.

Since $x^{r-s} = 1$, Proposition 3 implies $n \mid r - s$. Write $r = tn + s$ so

$\varphi(x^r) = \varphi(x^{tn+s})$

$= y^{tn+s}$

$= (y^n)^t y^s$

$= y^s = \varphi(x^s)$.

This proves $\varphi$ is well defined. It is immediate from the laws of exponents that
$\varphi(x^a x^b) = \varphi(x^a)\varphi(x^b)$ (check this), that is, $\varphi$ is a homomorphism. Since the element $y^k$ of $\langle y \rangle$
is the image of $x^k$ under $\varphi$, this map is surjective. Since both groups have the same
finite order, any surjection from one to the other is a bijection, so $\varphi$ is an isomorphism
(alternatively, $\varphi$ has an obvious two-sided inverse).

If $\langle x \rangle$ is an infinite cyclic group, let $\varphi : \mathbb{Z} \to \langle x \rangle$ be defined by $\varphi(k) = x^k$. Note
that this map is already well defined since there is no ambiguity in the representation
of elements in the domain. Since (by Proposition 2) $x^a \ne x^b$, for all distinct $a, b \in \mathbb{Z}$,
$\varphi$ is injective. By definition of a cyclic group, $\varphi$ is surjective. As above, the laws of
exponents ensure $\varphi$ is a homomorphism, hence $\varphi$ is an isomorphism, completing the
proof.

We chose to use the rotation group $\langle r \rangle$ as our prototypical example of a finite cyclic
group of order $n$ (instead of the isomorphic group $\mathbb{Z}/n\mathbb{Z}$) since we shall usually write
our cyclic groups multiplicatively:

Notation: For each $n \in \mathbb{Z}^+$, let $Z_n$ be the cyclic group of order $n$ (written multiplicatively).

Up to isomorphism, $Z_n$ is the unique cyclic group of order $n$ and $Z_n \cong \mathbb{Z}/n\mathbb{Z}$. On
occasion when we find additive notation advantageous we shall use the latter group as
our representative of the isomorphism class of cyclic groups of order $n$. We shall
occasionally say “let $\langle x \rangle$ be the infinite cyclic group” (written multiplicatively), however
we shall always use $\mathbb{Z}$ (additively) to represent the infinite cyclic group.

As noted earlier, a given cyclic group may have more than one generator. The next
two propositions determine precisely which powers of $x$ generate the group $\langle x \rangle$.

Proposition 5. Let $G$ be a group, let $x \in G$ and let $a \in \mathbb{Z} - \{0\}$.

(1) If $|x| = \infty$, then $|x^a| = \infty$.

(2) If $|x| = n < \infty$, then $|x^a| = \frac{n}{(n,a)}$.

(3) In particular, if $|x| = n < \infty$ and $a$ is a positive integer dividing $n$, then $|x^a| = \frac{n}{a}$.


Proof: (1) By way of contradiction assume $|x| = \infty$ but $|x^a| = m < \infty$. By
definition of order

$1 = (x^a)^m = x^{am}$.

Also,

$x^{-am} = (x^{am})^{-1} = 1^{-1} = 1$.

Now one of $am$ or $-am$ is positive (since neither $a$ nor $m$ is 0) so some positive power of
$x$ is the identity. This contradicts the hypothesis $|x| = \infty$, so the assumption $|x^a| < \infty$
must be false, that is, (1) holds.

(2) Under the notation of (2) let

$y = x^a$, $(n, a) = d$ and write $n = db$, $a = dc$,

for suitable $b, c \in \mathbb{Z}$ with $b > 0$. Since $d$ is the greatest common divisor of $n$ and $a$,
the integers $b$ and $c$ are relatively prime:

$(b, c) = 1$.

To establish (2) we must show $|y| = b$. First note that

$y^b = x^{ab} = x^{dcb} = (x^{db})^c = (x^n)^c = 1^c = 1$

so, by Proposition 3 applied to $\langle y \rangle$, we see that $|y|$ divides $b$. Let $k = |y|$. Then

$x^{ak} = y^k = 1$

so by Proposition 3 applied to $\langle x \rangle$, $n \mid ak$, i.e., $db \mid dck$. Thus $b \mid ck$. Since $b$ and $c$
have no factors in common, $b$ must divide $k$. Since $b$ and $k$ are positive integers which
divide each other, $b = k$, which proves (2).

(3) This is a special case of (2) recorded for future reference.

Proposition 6. Let $H = \langle x \rangle$.

(1) Assume $|x| = \infty$. Then $H = \langle x^a \rangle$ if and only if $a = \pm 1$.

(2) Assume $|x| = n < \infty$. Then $H = \langle x^a \rangle$ if and only if $(a, n) = 1$. In particular,
the number of generators of $H$ is $\varphi(n)$ (where $\varphi$ is Euler’s $\varphi$-function).

Proof: We leave (1) as an exercise. In (2) if $|x| = n < \infty$, Proposition 2 says $x^a$
generates a subgroup of $H$ of order $|x^a|$. This subgroup equals all of $H$ if and only if
$|x^a| = |x|$. By Proposition 5,

$|x^a| = |x|$ if and only if $\frac{n}{(a,n)} = n$, i.e. if and only if $(a, n) = 1$.

Since $\varphi(n)$ is, by definition, the number of $a \in \{1, 2, \ldots, n\}$ such that $(a, n) = 1$, this
is the number of generators of $H$.

Example

Proposition 6 tells precisely which residue classes mod $n$ generate $\mathbb{Z}/n\mathbb{Z}$: namely, $a$
generates $\mathbb{Z}/n\mathbb{Z}$ if and only if $(a, n) = 1$. For instance, 1, 5, 7 and 11 are the generators of
$\mathbb{Z}/12\mathbb{Z}$ and $\varphi(12) = 4$.
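Propositions 5 and 6 are easy to test numerically in $\mathbb{Z}/n\mathbb{Z}$, where the "power" $x^a$ of the generator $x = 1$ is the residue class of $a$. The sketch below computes each order by brute force and compares it with $n/(n,a)$; the function name `order` is our own.

```python
from math import gcd

def order(a, n):
    """Additive order of the residue class of a in Z/nZ, found by repeated addition."""
    k, total = 1, a % n
    while total != 0:
        total = (total + a) % n
        k += 1
    return k

n = 12

# Proposition 5(2): the order of the class of a is n / gcd(n, a).
for a in range(1, n + 1):
    assert order(a, n) == n // gcd(n, a)

# Proposition 6(2): the generators are the classes of a with gcd(a, n) = 1.
generators = [a for a in range(1, n + 1) if order(a, n) == n]
print(generators)        # [1, 5, 7, 11]
print(len(generators))   # 4, the value of Euler's function at 12
```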

The final theorem in this section gives the complete subgroup structure of a cyclic 
group. 

Theorem 7. Let $H = \langle x \rangle$ be a cyclic group.

(1) Every subgroup of $H$ is cyclic. More precisely, if $K \le H$, then either $K = \{1\}$
or $K = \langle x^d \rangle$, where $d$ is the smallest positive integer such that $x^d \in K$.

(2) If $|H| = \infty$, then for any distinct nonnegative integers $a$ and $b$, $\langle x^a \rangle \ne \langle x^b \rangle$.
Furthermore, for every integer $m$, $\langle x^m \rangle = \langle x^{|m|} \rangle$, where $|m|$ denotes
the absolute value of $m$, so that the nontrivial subgroups of $H$ correspond
bijectively with the integers $1, 2, 3, \ldots$.

(3) If $|H| = n < \infty$, then for each positive integer $a$ dividing $n$ there is a unique
subgroup of $H$ of order $a$. This subgroup is the cyclic group $\langle x^d \rangle$, where $d = \frac{n}{a}$.
Furthermore, for every integer $m$, $\langle x^m \rangle = \langle x^{(n,m)} \rangle$, so that the subgroups of
$H$ correspond bijectively with the positive divisors of $n$.

Proof: (1) Let $K \leq H$. If $K = \{1\}$, the proposition is true for this subgroup, so we
assume $K \neq \{1\}$. Thus there exists some $a \neq 0$ such that $x^a \in K$. If $a < 0$ then since
$K$ is a group also $x^{-a} = (x^a)^{-1} \in K$. Hence $K$ always contains some positive power
of $x$. Let

$$V = \{ b \mid b \in \mathbb{Z}^+ \text{ and } x^b \in K \}.$$

By the above, $V$ is a nonempty set of positive integers. By the Well Ordering Principle
(Section 0.2) $V$ has a minimum element; call it $d$. Since $K$ is a subgroup and $x^d \in K$,
$\langle x^d \rangle \leq K$. Since $K$ is a subgroup of $H$, any element of $K$ is of the form $x^a$ for some
integer $a$. By the Division Algorithm write

$$a = qd + r, \qquad 0 \leq r < d.$$

Then $x^r = x^{(a - qd)} = x^a (x^d)^{-q}$ is an element of $K$ since both $x^a$ and $x^d$ are elements of
$K$. By the minimality of $d$ it follows that $r = 0$, i.e., $a = qd$ and so $x^a = (x^d)^q \in \langle x^d \rangle$.
This gives the reverse containment $K \leq \langle x^d \rangle$ which proves (1).

We leave the proof of (2) as an exercise (the reasoning is similar to and easier than 
the proof of (3) which follows). 
(3) Assume $|H| = n < \infty$ and $a \mid n$. Let $d = \frac{n}{a}$ and apply Proposition 5(3) to
obtain that $\langle x^d \rangle$ is a subgroup of order $a$, showing the existence of a subgroup of order
$a$. To show uniqueness, suppose $K$ is any subgroup of $H$ of order $a$. By part (1) we
have

$$K = \langle x^b \rangle$$

where $b$ is the smallest positive integer such that $x^b \in K$. By Proposition 5

$$\frac{n}{d} = a = |K| = |x^b| = \frac{n}{(n, b)}$$

so $d = (n, b)$. In particular, $d \mid b$. Since $b$ is a multiple of $d$, $x^b \in \langle x^d \rangle$, hence

$$K = \langle x^b \rangle \leq \langle x^d \rangle.$$

Since $|\langle x^d \rangle| = a = |K|$, we have $K = \langle x^d \rangle$.

The final assertion of (3) follows from the observation that $\langle x^m \rangle$ is a subgroup of
$\langle x^{(n,m)} \rangle$ (check this) and, by Proposition 5(2) and Proposition 2, they
have the same order. Since $(n, m)$ is certainly a divisor of $n$, this shows that every
subgroup of $H$ arises from a divisor of $n$, completing the proof.


Examples 

(1) We can use Proposition 6 and Theorem 7 to list all the subgroups of Z/nZ for any 
given n. For example, the subgroups of Z/12Z are 

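(a) $\mathbb{Z}/12\mathbb{Z} = \langle \bar{1} \rangle = \langle \bar{5} \rangle = \langle \bar{7} \rangle = \langle \overline{11} \rangle$ (order 12)

(b) $\langle \bar{2} \rangle = \langle \overline{10} \rangle$ (order 6)

(c) $\langle \bar{3} \rangle = \langle \bar{9} \rangle$ (order 4)

(d) $\langle \bar{4} \rangle = \langle \bar{8} \rangle$ (order 3)

(e) $\langle \bar{6} \rangle$ (order 2)

(f) $\langle \bar{0} \rangle$ (order 1).

The inclusions between them are given by

$$\langle \bar{a} \rangle \leq \langle \bar{b} \rangle \text{ if and only if } (b, 12) \mid (a, 12), \qquad 1 \leq a, b \leq 12.$$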
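
The list above and the inclusion rule can be reproduced by brute force. This is a sketch under the assumption that subgroups of Z/nZ are stored as Python frozensets of residues; `cyclic_subgroup` and `subgroups` are hypothetical helper names introduced only for this illustration.

```python
from math import gcd

def cyclic_subgroup(a, n):
    """The subgroup of Z/nZ generated by the class of a, as a frozenset of residues."""
    return frozenset((a * k) % n for k in range(n))

def subgroups(n):
    """Map each positive divisor d of n to the subgroup <d> of Z/nZ (order n // d)."""
    return {d: cyclic_subgroup(d, n) for d in range(1, n + 1) if n % d == 0}

n = 12
subs = subgroups(n)
for d, H in sorted(subs.items()):
    same = [a for a in range(1, n + 1) if cyclic_subgroup(a, n) == H]
    print(f"<{d}> has order {len(H)}; generated by each of {same}")

# Inclusion rule from the text: <a> <= <b> iff gcd(b, 12) divides gcd(a, 12).
rule_holds = all(
    (cyclic_subgroup(a, n) <= cyclic_subgroup(b, n)) == (gcd(a, n) % gcd(b, n) == 0)
    for a in range(1, n + 1) for b in range(1, n + 1)
)
print(rule_holds)
```

In this representation the trivial subgroup appears as the subgroup generated by 12, which is the same class as 0.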

(2) We can also combine the results of this section with those of the preceding one. For
example, we can obtain subgroups of a group $G$ by forming $C_G(\langle x \rangle)$ and $N_G(\langle x \rangle)$,
for each $x \in G$. One can check that an element $g$ in $G$ commutes with $x$ if and only
if $g$ commutes with all powers of $x$, hence

$$C_G(\langle x \rangle) = C_G(x).$$

As noted in Exercise 6, Section 2, $\langle x \rangle \leq N_G(\langle x \rangle)$ but equality need not hold. For
instance, if $G = Q_8$ and $x = i$,

$$C_G(\langle i \rangle) = \{\pm 1, \pm i\} = \langle i \rangle \quad \text{and} \quad N_G(\langle i \rangle) = Q_8.$$

Note that we already observed the first of the above two equalities and the second is
most easily computed using the result of Exercise 24 following.
EXERCISES


1. Find all subgroups of $Z_{45} = \langle x \rangle$, giving a generator for each. Describe the containments
between these subgroups.

2. If $x$ is an element of the finite group $G$ and $|x| = |G|$, prove that $G = \langle x \rangle$. Give an
explicit example to show that this result need not be true if $G$ is an infinite group.

3. Find all generators for $\mathbb{Z}/48\mathbb{Z}$.

4. Find all generators for $\mathbb{Z}/202\mathbb{Z}$.

5. Find the number of generators for $\mathbb{Z}/49000\mathbb{Z}$.

6. In $\mathbb{Z}/48\mathbb{Z}$ write out all elements of $\langle \bar{a} \rangle$ for every $\bar{a}$. Find all inclusions between subgroups
in $\mathbb{Z}/48\mathbb{Z}$.

7. Let $Z_{48} = \langle x \rangle$ and use the isomorphism $\mathbb{Z}/48\mathbb{Z} \cong Z_{48}$ given by $\bar{1} \mapsto x$ to list all subgroups
of $Z_{48}$ as computed in the preceding exercise.

8. Let $Z_{48} = \langle x \rangle$. For which integers $a$ does the map $\varphi_a$ defined by $\varphi_a : \bar{1} \mapsto x^a$ extend to
an isomorphism from $\mathbb{Z}/48\mathbb{Z}$ onto $Z_{48}$?

9. Let $Z_{36} = \langle x \rangle$. For which integers $a$ does the map $\psi_a$ defined by $\psi_a : \bar{1} \mapsto x^a$ extend
to a well defined homomorphism from $\mathbb{Z}/48\mathbb{Z}$ into $Z_{36}$? Can $\psi_a$ ever be a surjective
homomorphism?

10. What is the order of $\overline{30}$ in $\mathbb{Z}/54\mathbb{Z}$? Write out all of the elements and their orders in $\langle \overline{30} \rangle$.

11. Find all cyclic subgroups of $D_8$. Find a proper subgroup of $D_8$ which is not cyclic.

12. Prove that the following groups are not cyclic: 

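(a) $Z_2 \times Z_2$

(b) $Z_2 \times \mathbb{Z}$

(c) $\mathbb{Z} \times \mathbb{Z}$.

13. Prove that the following pairs of groups are not isomorphic:

(a) $\mathbb{Z} \times Z_2$ and $\mathbb{Z}$

(b) $\mathbb{Q} \times Z_2$ and $\mathbb{Q}$.

14. Let $\sigma = (1\ 2\ 3\ 4\ 5\ 6\ 7\ 8\ 9\ 10\ 11\ 12)$. For each of the following integers $a$ compute $\sigma^a$:
$a = 13, 65, 626, 1195, -6, -81, -570$ and $-1211$.

15. Prove that $\mathbb{Q} \times \mathbb{Q}$ is not cyclic.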

16. Assume $|x| = n$ and $|y| = m$. Suppose that $x$ and $y$ commute: $xy = yx$. Prove that
$|xy|$ divides the least common multiple of $m$ and $n$. Need this be true if $x$ and $y$ do not
commute? Give an example of commuting elements $x$, $y$ such that the order of $xy$ is not
equal to the least common multiple of $|x|$ and $|y|$.

17. Find a presentation for $Z_n$ with one generator.

18. Show that if $H$ is any group and $h$ is an element of $H$ with $h^n = 1$, then there is a unique
homomorphism from $Z_n = \langle x \rangle$ to $H$ such that $x \mapsto h$.

19. Show that if $H$ is any group and $h$ is an element of $H$, then there is a unique homomorphism
from $\mathbb{Z}$ to $H$ such that $1 \mapsto h$.

20. Let $p$ be a prime and let $n$ be a positive integer. Show that if $x$ is an element of the group
$G$ such that $x^{p^n} = 1$ then $|x| = p^m$ for some $m \leq n$.

21. Let $p$ be an odd prime and let $n$ be a positive integer. Use the Binomial Theorem to show
that $(1 + p)^{p^{n-1}} \equiv 1 \pmod{p^n}$ but $(1 + p)^{p^{n-2}} \not\equiv 1 \pmod{p^n}$. Deduce that $1 + p$ is an
element of order $p^{n-1}$ in the multiplicative group $(\mathbb{Z}/p^n\mathbb{Z})^\times$.

22. Let $n$ be an integer $\geq 3$. Use the Binomial Theorem to show that $(1 + 2^2)^{2^{n-2}} \equiv 1 \pmod{2^n}$
but $(1 + 2^2)^{2^{n-3}} \not\equiv 1 \pmod{2^n}$. Deduce that 5 is an element of order $2^{n-2}$ in the
multiplicative group $(\mathbb{Z}/2^n\mathbb{Z})^\times$.

23. Show that $(\mathbb{Z}/2^n\mathbb{Z})^\times$ is not cyclic for any $n \geq 3$. [Find two distinct subgroups of order 2.]

24. Let $G$ be a finite group and let $x \in G$.

(a) Prove that if $g \in N_G(\langle x \rangle)$ then $gxg^{-1} = x^a$ for some $a \in \mathbb{Z}$.

(b) Prove conversely that if $gxg^{-1} = x^a$ for some $a \in \mathbb{Z}$ then $g \in N_G(\langle x \rangle)$. [Show
first that $gx^kg^{-1} = (gxg^{-1})^k = x^{ak}$ for any integer $k$, so that $g \langle x \rangle g^{-1} \leq \langle x \rangle$. If
$x$ has order $n$, show the elements $gx^ig^{-1}$, $i = 0, 1, \ldots, n - 1$ are distinct, so that
$|g \langle x \rangle g^{-1}| = |\langle x \rangle| = n$ and conclude that $g \langle x \rangle g^{-1} = \langle x \rangle$.]

Note that this cuts down some of the work in computing normalizers of cyclic subgroups
since one does not have to check $ghg^{-1} \in \langle x \rangle$ for every $h \in \langle x \rangle$.

25. Let $G$ be a cyclic group of order $n$ and let $k$ be an integer relatively prime to $n$. Prove
that the map $x \mapsto x^k$ is surjective. Use Lagrange's Theorem (Exercise 19, Section 1.7)
to prove the same is true for any finite group of order $n$. (For such $k$ each element has a
$k$-th root in $G$. It follows from Cauchy’s Theorem in Section 3.2 that if $k$ is not relatively
prime to the order of $G$ then the map $x \mapsto x^k$ is not surjective.)

26. Let $Z_n$ be a cyclic group of order $n$ and for each integer $a$ let

$$\sigma_a : Z_n \to Z_n \quad \text{by} \quad \sigma_a(x) = x^a \text{ for all } x \in Z_n.$$

(a) Prove that $\sigma_a$ is an automorphism of $Z_n$ if and only if $a$ and $n$ are relatively prime
(automorphisms were introduced in Exercise 20, Section 1.6).

(b) Prove that $\sigma_a = \sigma_b$ if and only if $a \equiv b \pmod{n}$.

(c) Prove that every automorphism of $Z_n$ is equal to $\sigma_a$ for some integer $a$.

(d) Prove that $\sigma_a \circ \sigma_b = \sigma_{ab}$. Deduce that the map $\bar{a} \mapsto \sigma_a$ is an isomorphism of $(\mathbb{Z}/n\mathbb{Z})^\times$
onto the automorphism group of $Z_n$ (so $\mathrm{Aut}(Z_n)$ is an abelian group of order $\varphi(n)$).


2.4 SUBGROUPS GENERATED BY SUBSETS OF A GROUP 

The method of forming cyclic subgroups of a given group is a special case of the general
technique where one forms the subgroup generated by an arbitrary subset of a group. In
the case of cyclic subgroups one takes a singleton subset $\{x\}$ of the group $G$ and forms
all integral powers of $x$, which amounts to closing the set $\{x\}$ under the group operation
and the process of taking inverses. The resulting subgroup is the smallest subgroup of
$G$ which contains the set $\{x\}$ (smallest in the sense that if $H$ is any subgroup which
contains $\{x\}$, then $H$ contains $\langle x \rangle$). Another way of saying this is that $\langle x \rangle$ is the unique
minimal element of the set of subgroups of $G$ containing $x$ (ordered under inclusion).
In this section we investigate analogues of this when $\{x\}$ is replaced by an arbitrary
subset of $G$.

Throughout mathematics the following theme recurs: given an object $G$ (such as
a group, field, vector space, etc.) and a subset $A$ of $G$, is there a unique minimal
subobject of $G$ (subgroup, subfield, subspace, etc.) which contains $A$ and, if so, how
are the elements of this subobject computed? Students may already have encountered
this question in the study of vector spaces. When $G$ is a vector space (with, say, real
number scalars) and $A = \{v_1, v_2, \ldots, v_n\}$, then there is a unique smallest subspace of
$G$ which contains $A$, namely the (linear) span of $v_1, v_2, \ldots, v_n$, and each vector in this
span can be written as $k_1v_1 + k_2v_2 + \cdots + k_nv_n$, for some $k_1, \ldots, k_n \in \mathbb{R}$. When $A$ is
a single nonzero vector, $v$, the span of $\{v\}$ is simply the 1-dimensional subspace or line
containing $v$ and every element of this subspace is of the form $kv$ for some $k \in \mathbb{R}$. This
is the analogue in the theory of vector spaces of cyclic subgroups of a group. Note that
the 1-dimensional subspaces contain $kv$, where $k \in \mathbb{R}$, not just $kv$, where $k \in \mathbb{Z}$; the
reason being that a subspace must be closed under all the vector space operations (e.g.,
scalar multiplication) not just the group operation of vector addition.

Let $G$ be any group and let $A$ be any subset of $G$. We now make precise the notion
of the subgroup of $G$ generated by $A$. We prove that because the intersection of any set
of subgroups of $G$ is also a subgroup of $G$, the subgroup generated by $A$ is the unique
smallest subgroup of $G$ containing $A$; it is “smallest” in the sense of being the minimal
element of the set of all subgroups containing $A$. We show that the elements of this
subgroup are obtained by closing the given subset under the group operation (and taking
inverses). In succeeding parts of the text when we develop the theory of other algebraic
objects we shall refer to this section as the paradigm in proving that a given subset
is contained in a unique smallest subobject and that the elements of this subobject are
obtained by closing the subset under the operations which define the object. Since in the
later chapters the details will be omitted, students should acquire a solid understanding
of the process at this point.

In order to proceed we need only the following. 

Proposition 8. If $\mathcal{A}$ is any nonempty collection of subgroups of $G$, then the intersection
of all members of $\mathcal{A}$ is also a subgroup of $G$.

Proof: This is an easy application of the subgroup criterion (see also Exercise 10,
Section 1). Let

$$K = \bigcap_{H \in \mathcal{A}} H.$$

Since each $H \in \mathcal{A}$ is a subgroup, $1 \in H$, so $1 \in K$, that is, $K \neq \emptyset$. If $a, b \in K$,
then $a, b \in H$, for all $H \in \mathcal{A}$. Since each $H$ is a group, $ab^{-1} \in H$, for all $H$, hence
$ab^{-1} \in K$. Proposition 1 gives that $K \leq G$.

Definition. If $A$ is any subset of the group $G$ define

$$\langle A \rangle = \bigcap_{\substack{A \subseteq H \\ H \leq G}} H.$$

This is called the subgroup of $G$ generated by $A$.

Thus $\langle A \rangle$ is the intersection of all subgroups of $G$ containing $A$. It is a subgroup
of $G$ by Proposition 8 applied to the set $\mathcal{A} = \{H \leq G \mid A \subseteq H\}$ ($\mathcal{A}$ is nonempty since
$G \in \mathcal{A}$). Since $A$ lies in each $H \in \mathcal{A}$, $A$ is a subset of their intersection, $\langle A \rangle$. Note that
$\langle A \rangle$ is the unique minimal element of $\mathcal{A}$ as follows: $\langle A \rangle$ is a subgroup of $G$ containing
$A$, so $\langle A \rangle \in \mathcal{A}$; and any element of $\mathcal{A}$ contains the intersection of all elements in $\mathcal{A}$,
i.e., contains $\langle A \rangle$.
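
For a very small group the definition can be carried out literally. The sketch below assumes the additive group Z/12Z and finds every subgroup by testing all subsets against the subgroup criterion, which is only feasible for tiny groups; the function names are hypothetical and chosen for this illustration.

```python
from itertools import combinations

def is_subgroup(S, n):
    """Subgroup criterion in Z/nZ (additive): S is nonempty and a - b lies in S for all a, b in S."""
    return len(S) > 0 and all((a - b) % n in S for a in S for b in S)

def all_subgroups(n):
    """Every subgroup of Z/nZ, found by brute force over all nonempty subsets (fine for small n)."""
    elems = range(n)
    return [frozenset(c) for r in range(1, n + 1)
            for c in combinations(elems, r) if is_subgroup(frozenset(c), n)]

def generated_top_down(A, n):
    """<A> as the intersection of all subgroups H of Z/nZ with A contained in H."""
    A = frozenset(a % n for a in A)
    containing = [H for H in all_subgroups(n) if A <= H]   # nonempty: Z/nZ itself qualifies
    return frozenset.intersection(*containing)

print(sorted(generated_top_down({4, 6}, 12)))
```

When A is empty every subgroup contains A, so the intersection is the trivial subgroup.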
When $A$ is the finite set $\{a_1, a_2, \ldots, a_n\}$ we write $\langle a_1, a_2, \ldots, a_n \rangle$ for the group
generated by $a_1, a_2, \ldots, a_n$ instead of $\langle \{a_1, a_2, \ldots, a_n\} \rangle$. If $A$ and $B$ are two subsets
of $G$ we shall write $\langle A, B \rangle$ in place of $\langle A \cup B \rangle$.

This “top down” approach to defining $\langle A \rangle$ proves existence and uniqueness of the
smallest subgroup of $G$ containing $A$ but is not too enlightening as to how to construct
the elements in it. As the word “generates” suggests we now define the set which is the
closure of $A$ under the group operation (and the process of taking inverses) and prove
this set equals $\langle A \rangle$. Let

$$\bar{A} = \{ a_1^{\epsilon_1} a_2^{\epsilon_2} \cdots a_n^{\epsilon_n} \mid n \in \mathbb{Z},\ n \geq 0 \text{ and } a_i \in A,\ \epsilon_i = \pm 1 \text{ for each } i \}$$

where $\bar{A} = \{1\}$ if $A = \emptyset$, so that $\bar{A}$ is the set of all finite products (called words) of
elements of $A$ and inverses of elements of $A$. Note that the $a_i$’s need not be distinct,
so $a^2$ is written $aa$ in the notation defining $\bar{A}$. Note also that $A$ is not assumed to be a
finite (or even countable) set.

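Proposition 9. $\bar{A} = \langle A \rangle$.

Proof: We first prove $\bar{A}$ is a subgroup. Note that $\bar{A} \neq \emptyset$ (even if $A = \emptyset$). If
$a, b \in \bar{A}$ with $a = a_1^{\epsilon_1} a_2^{\epsilon_2} \cdots a_n^{\epsilon_n}$ and $b = b_1^{\delta_1} b_2^{\delta_2} \cdots b_m^{\delta_m}$, then

$$ab^{-1} = a_1^{\epsilon_1} a_2^{\epsilon_2} \cdots a_n^{\epsilon_n} \cdot b_m^{-\delta_m} b_{m-1}^{-\delta_{m-1}} \cdots b_1^{-\delta_1}$$

(where we used Exercise 15 of Section 1.1 to compute $b^{-1}$). Thus $ab^{-1}$ is a product
of elements of $A$ raised to powers $\pm 1$, hence $ab^{-1} \in \bar{A}$. Proposition 1 implies $\bar{A}$ is a
subgroup of $G$.

Since each $a \in A$ may be written $a^1$, it follows that $A \subseteq \bar{A}$, hence $\langle A \rangle \subseteq \bar{A}$. But
$\langle A \rangle$ is a group containing $A$ and, since it is closed under the group operation and the
process of taking inverses, $\langle A \rangle$ contains each element of the form $a_1^{\epsilon_1} a_2^{\epsilon_2} \cdots a_n^{\epsilon_n}$, that
is, $\bar{A} \subseteq \langle A \rangle$. This completes the proof of the proposition.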
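
The "bottom up" description in Proposition 9 translates directly into a closure computation for a finite group. The following sketch assumes permutations of the points 0, ..., n-1 stored as tuples of images; the helper names (`compose`, `inverse`, `generated_bottom_up`) are illustrative only.

```python
def compose(p, q):
    """Composite permutation on {0, ..., n-1}: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(p)))

def inverse(p):
    """Inverse of the permutation p given as a tuple of images."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generated_bottom_up(A, n):
    """All finite words in elements of A and their inverses, as a set of permutations of n points."""
    identity = tuple(range(n))
    letters = set(A) | {inverse(a) for a in A}
    words = {identity}            # the empty word; this is the whole answer when A is empty
    frontier = [identity]
    while frontier:
        w = frontier.pop()
        for a in letters:
            longer = compose(w, a)
            if longer not in words:
                words.add(longer)
                frontier.append(longer)
    return words

# Two distinct transpositions in S_3, written on the points 0, 1, 2.
t1 = (1, 0, 2)   # swaps 0 and 1
t2 = (0, 2, 1)   # swaps 1 and 2
print(len(generated_bottom_up({t1, t2}, 3)))
```

The loop stops once no word can be lengthened to a new element, which must happen because the ambient group is finite.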

We now use $\langle A \rangle$ in place of $\bar{A}$ and may take the definition of $\bar{A}$ as an equivalent
definition of $\langle A \rangle$. As noted above, in this equivalent definition of $\langle A \rangle$, products of the
form $a \cdot a$, $a \cdot a \cdot a$, $a \cdot a^{-1}$, etc. could have been simplified to $a^2$, $a^3$, $1$, etc. respectively,
so another way of writing $\langle A \rangle$ is

$$\langle A \rangle = \{ a_1^{\alpha_1} a_2^{\alpha_2} \cdots a_n^{\alpha_n} \mid \text{for each } i,\ a_i \in A,\ \alpha_i \in \mathbb{Z},\ a_i \neq a_{i+1} \text{ and } n \in \mathbb{Z}^+ \}.$$

In fact, when $A = \{x\}$ this was our definition of $\langle A \rangle$.

If $G$ is abelian, we could commute the $a_i$’s and so collect all powers of a given
generator together. For instance, if $A$ were the finite subset $\{a_1, a_2, \ldots, a_k\}$ of the
abelian group $G$, one easily checks that

$$\langle A \rangle = \{ a_1^{\alpha_1} a_2^{\alpha_2} \cdots a_k^{\alpha_k} \mid \alpha_i \in \mathbb{Z} \text{ for each } i \}.$$

If in this situation we further assume that each $a_i$ has finite order $d_i$, for all $i$, then
since there are exactly $d_i$ distinct powers of $a_i$, the total number of distinct products of
the form $a_1^{\alpha_1} a_2^{\alpha_2} \cdots a_k^{\alpha_k}$ is at most $d_1 d_2 \cdots d_k$, that is,

$$|\langle A \rangle| \leq d_1 d_2 \cdots d_k.$$

It may happen that $a^\alpha b^\beta = a^\gamma b^\delta$ even though $a^\alpha \neq a^\gamma$ and $b^\beta \neq b^\delta$. We shall explore
exactly when this happens when we study direct products in Chapter 5.
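
A small abelian example makes the bound and the possible coincidences concrete. This sketch assumes the additive group Z/12Z with the two generators 2 and 3 (so products become sums); the names `order_of` and `span` are hypothetical.

```python
from itertools import product
from math import gcd

def order_of(a, n):
    """Additive order of the class of a in Z/nZ."""
    return n // gcd(a, n)

def span(gens, n):
    """All sums alpha_1*a_1 + ... + alpha_k*a_k in Z/nZ with 0 <= alpha_i < |a_i|."""
    ranges = [range(order_of(a, n)) for a in gens]
    return {sum(alpha * a for alpha, a in zip(alphas, gens)) % n
            for alphas in product(*ranges)}

n, gens = 12, [2, 3]
bound = 1
for a in gens:
    bound *= order_of(a, n)       # d_1 * d_2 * ... * d_k
H = span(gens, n)
print(len(H), bound)              # the order of <A> and the bound d_1 d_2 ... d_k

# A coincidence: 3*2 + 0*3 and 0*2 + 2*3 are both 6, although 3*2 != 0*2 and 0*3 != 2*3 mod 12.
print((3 * 2 + 0 * 3) % n == (0 * 2 + 2 * 3) % n)
```

Here the generators have orders 6 and 4, so the bound is 24, while the subgroup they generate is all of Z/12Z.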

When $G$ is non-abelian the situation is much more complicated. For example, let
$G = D_8$ and let $r$ and $s$ be the usual generators of $D_8$ (note that the notation $D_8 = \langle r, s \rangle$
is consistent with the notation introduced in Section 1.2). Let $a = s$, let $b = rs$ and
let $A = \{a, b\}$. Since both $s$ and $r$ ($= rs \cdot s$) belong to $\langle a, b \rangle$, $G = \langle a, b \rangle$, i.e., $G$ is
also generated by $a$ and $b$. Both $a$ and $b$ have order 2, however $D_8$ has order 8. This
means that it is not possible to write every element of $D_8$ in the form $a^\alpha b^\beta$, $\alpha, \beta \in \mathbb{Z}$.
More specifically, the product $aba$ cannot be simplified to a product of the form $a^\alpha b^\beta$.
In fact, if $G = D_{2n}$ for any $n > 2$, and $r$, $s$, $a$, $b$ are defined in the same way as above,
it is still true that

$$|a| = |b| = 2, \quad D_{2n} = \langle a, b \rangle \quad \text{and} \quad |D_{2n}| = 2n.$$

This means that for large $n$, long products of the form $abab \cdots ab$ cannot be further
simplified. In particular, this illustrates that, unlike the abelian (or, better yet, cyclic)
group case, the order of a (finite) group cannot even be bounded once we know the
orders of the elements in some generating set.
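
The dihedral example can be verified for several values of n. The sketch below assumes the normal form r^i s^j for elements of the dihedral group of order 2n, stored as pairs (i, j), together with the relation s r = r^-1 s from Section 1.2; all function names are hypothetical.

```python
def dihedral_mul(x, y, n):
    """Product in D_2n, with (i, j) standing for r^i s^j and the relation s r = r^-1 s."""
    (i, j), (k, l) = x, y
    return ((i + (-1) ** j * k) % n, (j + l) % 2)

def element_order(x, n):
    """Order of x in D_2n, found by repeated multiplication."""
    identity, power, count = (0, 0), x, 1
    while power != identity:
        power = dihedral_mul(power, x, n)
        count += 1
    return count

def generated(gens, n):
    """Subgroup of D_2n generated by gens (closing under right multiplication suffices in a finite group)."""
    seen, frontier = {(0, 0)}, [(0, 0)]
    while frontier:
        w = frontier.pop()
        for g in gens:
            v = dihedral_mul(w, g, n)
            if v not in seen:
                seen.add(v)
                frontier.append(v)
    return seen

for n in (4, 10, 50):
    a, b = (0, 1), (1, 1)          # a = s and b = rs
    print(n, element_order(a, n), element_order(b, n), len(generated([a, b], n)))
```

Each printed row lists n, the orders of a and b, and the order of the subgroup they generate, so the last column grows with n while the middle two stay fixed.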

Another example of this phenomenon is $S_n$:

$$S_n = \langle (1\ 2),\ (1\ 2\ 3\ \ldots\ n) \rangle.$$

Thus $S_n$ is generated by an element of order 2 together with one of order $n$, yet $|S_n| = n!$
(we shall prove these statements later after developing some more techniques).

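One final example emphasizes the fact that if $G$ is non-abelian, subgroups of $G$
generated by more than one element of $G$ may be quite complicated. Let

$$G = GL_2(\mathbb{R}), \quad a = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad b = \begin{pmatrix} 0 & 2 \\ 1/2 & 0 \end{pmatrix}$$

so $a^2 = b^2 = 1$ but $ab = \begin{pmatrix} 1/2 & 0 \\ 0 & 2 \end{pmatrix}$. It is easy to see that $ab$ has infinite order, so
$\langle a, b \rangle$ is an infinite subgroup of $GL_2(\mathbb{R})$ which is generated by two elements of order 2.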
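
The phenomenon can be illustrated with exact rational arithmetic. The sketch below takes, as an assumption made for this illustration, the matrices a = (0 1; 1 0) and b = (0 2; 1/2 0), which satisfy the stated relations; the helper `matmul` is hypothetical, and the final loop is only a finite sanity check, not a proof that ab has infinite order.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

I2 = ((1, 0), (0, 1))
a = ((0, 1), (1, 0))
b = ((0, 2), (Fraction(1, 2), 0))     # illustrative choice, see the lead-in

ab = matmul(a, b)
print(matmul(a, a) == I2, matmul(b, b) == I2)   # both a and b square to the identity
print(ab)                                       # a diagonal matrix with entries 1/2 and 2

# The powers (ab)^k are diagonal with entries (1/2)^k and 2^k, so none of them is the identity.
power = I2
for k in range(1, 21):
    power = matmul(power, ab)
    assert power != I2
```

The actual argument is the one in the comment: a diagonal matrix with an entry of absolute value different from 1 cannot have a power equal to the identity.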


These examples illustrate that when $|A| \geq 2$ it is difficult, in general, to compute
even the order of the subgroup generated by $A$, let alone any other structural properties.
It is therefore impractical to gather much information about subgroups of a non-abelian
group created by taking random subsets $A$ and trying to write out the elements of (or
other information about) $\langle A \rangle$. For certain “well chosen” subsets $A$, even of a non-abelian
group $G$, we shall be able to make both theoretical and computational use of
the subgroup generated by $A$. One example of this might be when we want to find
a subgroup of $G$ which contains $\langle x \rangle$ properly; we might search for some element $y$
which commutes with $x$ (i.e., $y \in C_G(x)$) and form $\langle x, y \rangle$. It is easy to check that
the latter group is abelian, so its order is bounded by $|x||y|$. Alternatively, we might
instead take $y$ in $N_G(\langle x \rangle)$; in this case the same order bound holds and the structure
of $\langle x, y \rangle$ is again not too complicated (as we shall see in the next chapter).

The complications which arise for non-abelian groups are generally not quite as 
serious when we study other basic algebraic systems because of the additional algebraic 
structure imposed. 
EXERCISES


1. Prove that if $H$ is a subgroup of $G$ then $\langle H \rangle = H$.

2. Prove that if $A$ is a subset of $B$ then $\langle A \rangle \leq \langle B \rangle$. Give an example where $A \subseteq B$ with
$A \neq B$ but $\langle A \rangle = \langle B \rangle$.

3. Prove that if $H$ is an abelian subgroup of a group $G$ then $\langle H, Z(G) \rangle$ is abelian. Give an
explicit example of an abelian subgroup $H$ of a group $G$ such that $\langle H, C_G(H) \rangle$ is not
abelian.

4. Prove that if $H$ is a subgroup of $G$ then $H$ is generated by the set $H - \{1\}$.

5. Prove that the subgroup generated by any two distinct elements of order 2 in $S_3$ is all of
$S_3$.

6. Prove that the subgroup of $S_4$ generated by $(1\ 2)$ and $(1\ 2)(3\ 4)$ is a noncyclic group of
order 4.

7. Prove that the subgroup of $S_4$ generated by $(1\ 2)$ and $(1\ 3)(2\ 4)$ is isomorphic to the
dihedral group of order 8.

8. Prove that $S_4 = \langle (1\ 2\ 3\ 4),\ (1\ 2\ 4\ 3) \rangle$.

9. Prove that $SL_2(\mathbb{F}_3)$ is the subgroup of $GL_2(\mathbb{F}_3)$ generated by $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$ and $\begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix}$. [Recall
from Exercise 9 of Section 1 that $SL_2(\mathbb{F}_3)$ is the subgroup of matrices of determinant
1. You may assume this subgroup has order 24; this will be an exercise in Section 3.2.]

10. Prove that the subgroup of $SL_2(\mathbb{F}_3)$ generated by $\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}$ is isomorphic
to the quaternion group of order 8. [Use a presentation for $Q_8$.]

11. Show that $SL_2(\mathbb{F}_3)$ and $S_4$ are two nonisomorphic groups of order 24.

12. Prove that the subgroup of upper triangular matrices in $GL_3(\mathbb{F}_2)$ is isomorphic to the
dihedral group of order 8 (cf. Exercise 16, Section 1). [First find the order of this subgroup.]

13. Prove that the multiplicative group of positive rational numbers is generated by the set
$\{ \frac{1}{p} \mid p \text{ is a prime} \}$.

14. A group $H$ is called finitely generated if there is a finite set $A$ such that $H = \langle A \rangle$.

(a) Prove that every finite group is finitely generated.

(b) Prove that $\mathbb{Z}$ is finitely generated.

(c) Prove that every finitely generated subgroup of the additive group $\mathbb{Q}$ is cyclic. [If $H$
is a finitely generated subgroup of $\mathbb{Q}$, show that $H \leq \langle \frac{1}{k} \rangle$, where $k$ is the product of
all the denominators which appear in a set of generators for $H$.]

(d) Prove that $\mathbb{Q}$ is not finitely generated.

15. Exhibit a proper subgroup of $\mathbb{Q}$ which is not cyclic.

16. A subgroup $M$ of a group $G$ is called a maximal subgroup if $M \neq G$ and the only subgroups
of $G$ which contain $M$ are $M$ and $G$.

(a) Prove that if $H$ is a proper subgroup of the finite group $G$ then there is a maximal
subgroup of $G$ containing $H$.

(b) Show that the subgroup of all rotations in a dihedral group is a maximal subgroup.

(c) Show that if $G = \langle x \rangle$ is a cyclic group of order $n \geq 1$ then a subgroup $H$ is maximal
if and only if $H = \langle x^p \rangle$ for some prime $p$ dividing $n$.

17. This is an exercise involving Zorn’s Lemma (see Appendix I) to prove that every nontrivial
finitely generated group possesses maximal subgroups. Let $G$ be a finitely generated
group, say $G = \langle g_1, g_2, \ldots, g_n \rangle$, and let $\mathcal{S}$ be the set of all proper subgroups of $G$. Then
$\mathcal{S}$ is partially ordered by inclusion. Let $\mathcal{C}$ be a chain in $\mathcal{S}$.

(a) Prove that the union, $H$, of all the subgroups in $\mathcal{C}$ is a subgroup of $G$.

(b) Prove that $H$ is a proper subgroup. [If not, each $g_i$ must lie in $H$ and so must lie in
some element of the chain $\mathcal{C}$. Use the definition of a chain to arrive at a contradiction.]

(c) Use Zorn’s Lemma to show that $\mathcal{S}$ has a maximal element (which is, by definition, a
maximal subgroup).

18. Let $p$ be a prime and let $Z = \{ z \in \mathbb{C} \mid z^{p^n} = 1 \text{ for some } n \in \mathbb{Z}^+ \}$ (so $Z$ is the multiplicative
group of all $p$-power roots of unity in $\mathbb{C}$). For each $k \in \mathbb{Z}^+$ let $H_k = \{ z \in Z \mid z^{p^k} = 1 \}$
(the group of $p^k$-th roots of unity). Prove the following:

(a) $H_k \leq H_m$ if and only if $k \leq m$

(b) $H_k$ is cyclic for all $k$ (assume that for any $n \in \mathbb{Z}^+$, $\{ e^{2\pi i t/n} \mid t = 0, 1, \ldots, n - 1 \}$ is
the set of all $n$-th roots of 1 in $\mathbb{C}$)

(c) every proper subgroup of $Z$ equals $H_k$ for some $k \in \mathbb{Z}^+$ (in particular, every proper
subgroup of $Z$ is finite and cyclic)

(d) $Z$ is not finitely generated.

19. A nontrivial abelian group $A$ (written multiplicatively) is called divisible if for each element
$a \in A$ and each nonzero integer $k$ there is an element $x \in A$ such that $x^k = a$, i.e., each
element has a $k$-th root in $A$ (in additive notation, each element is the $k$-th multiple of some
element of $A$).

(a) Prove that the additive group of rational numbers, $\mathbb{Q}$, is divisible.

(b) Prove that no finite abelian group is divisible.

20. Prove that if $A$ and $B$ are nontrivial abelian groups, then $A \times B$ is divisible if and only if
both $A$ and $B$ are divisible groups.


2.5 THE LATTICE OF SUBGROUPS OF A GROUP 

In this section we describe a graph associated with a group which depicts the relationships
among its subgroups. This graph, called the lattice² of subgroups of the group, is
a good way of “visualizing” a group; it certainly illuminates the structure of a group
better than the group table. We shall be using lattice diagrams, or parts of them, to
describe both specific groups and certain properties of general groups throughout the
chapters on group theory. Moreover, the lattice of subgroups of a group will play an
important role in Galois Theory.

² The term “lattice” has a precise mathematical meaning in terms of partially ordered sets.

The lattice of subgroups of a given finite group $G$ is constructed as follows: plot
all subgroups of $G$ starting at the bottom with 1, ending at the top with $G$ and, roughly
speaking, with subgroups of larger order positioned higher on the page than those of
smaller order. Draw paths upwards between subgroups using the rule that there will
be a line upward from $A$ to $B$ if $A \leq B$ and there are no subgroups properly between
$A$ and $B$. Thus if $A \leq B$ there is a path (possibly many paths) upward from $A$ to $B$
passing through a chain of intermediate subgroups (and a path downward from $B$ to
$A$ if $B \geq A$). The initial positioning of the subgroups on the page, which is, a priori,
somewhat arbitrary, can often (with practice) be chosen to produce a simple picture.
Notice that for any pair of subgroups $H$ and $K$ of $G$ the unique smallest subgroup
which contains both of them, namely $\langle H, K \rangle$ (called the join of $H$ and $K$), may be
read off from the lattice as follows: trace paths upwards from $H$ and $K$ until a common
subgroup $A$ which contains $H$ and $K$ is reached (note that $G$ itself always contains all
subgroups so at least one such $A$ exists). To ensure that $A = \langle H, K \rangle$ make sure there is
no $A_1 \leq A$ (indicated by a downward path from $A$ to $A_1$) with both $H$ and $K$ contained
in $A_1$ (otherwise replace $A$ with $A_1$ and repeat the process to see if $A_1 = \langle H, K \rangle$). By
a symmetric process one can read off the largest subgroup of $G$ which is contained in
both $H$ and $K$, namely their intersection (which is a subgroup by Proposition 8).
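
The construction rule and the reading-off of joins and intersections can be sketched in code for a small cyclic group. The example assumes Z/12Z with subgroups stored as frozensets, and relies on Theorem 7 for the fact that every subgroup is generated by a single residue; the names `lattice_edges` and `join` are hypothetical.

```python
def cyclic_subgroup(a, n):
    """Subgroup of Z/nZ generated by the class of a."""
    return frozenset((a * k) % n for k in range(n))

def lattice_edges(subs):
    """Pairs (A, B) with A properly inside B and no subgroup strictly between them."""
    return [(A, B) for A in subs for B in subs
            if A < B and not any(A < C < B for C in subs)]

def join(H, K, subs):
    """Smallest subgroup in subs containing both H and K."""
    return min((A for A in subs if H <= A and K <= A), key=len)

n = 12
subs = {cyclic_subgroup(d, n) for d in range(1, n + 1)}   # all subgroups, by Theorem 7
name = {cyclic_subgroup(d, n): f"<{d}>" for d in range(1, n + 1) if n % d == 0}

for A, B in sorted(lattice_edges(subs), key=lambda e: (len(e[0]), len(e[1]))):
    print(name[A], "--", name[B])

H, K = cyclic_subgroup(4, n), cyclic_subgroup(6, n)
print(name[join(H, K, subs)], name[H & K])   # join and intersection of <4> and <6>
```

Choosing the containing subgroup of least order is legitimate here because the intersection of all subgroups containing H and K is itself such a subgroup, so the smallest one is unique.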

There are some limitations to this process, in particular it cannot be carried out per
se for infinite groups. Even for finite groups of relatively small order, lattices can be
quite complicated (see the book Groups of Order $2^n$, $n \leq 6$ by M. Hall and J. Senior,
Macmillan, 1964, for some hair-raising examples). At the end of this section we shall
describe how parts of a lattice may be drawn and used even for infinite groups.

Note that isomorphic groups have the same lattices (i.e., the same directed graphs). 
Nonisomorphic groups may also have identical lattices (this happens for two groups of 
order 16 — see the following exercises). Since the lattice of subgroups is only part of 
the data we shall carry in our descriptors of a group, this will not be a serious drawback 
(indeed, it might even be useful in seeing when two nonisomorphic groups have some 
common properties). 

Examples 

Except for the cyclic groups (Example 1) we have not proved that the following lattices
are correct (e.g., contain all subgroups of the given group or have the right joins and
intersections). For the moment we shall take these facts as given and, as we build up more
theory in the course of the text, we shall assign as exercises the proofs that these are indeed
correct.

(1) For $G = Z_n \cong \mathbb{Z}/n\mathbb{Z}$, by Theorem 7 the lattice of subgroups of $G$ is the lattice of
divisors of $n$ (that is, the divisors of $n$ are written on a page with $n$ at the bottom, 1 at
the top and paths upwards from $a$ to $b$ if $b \mid a$). Some specific examples for various
values of $n$ follow.


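    $\mathbb{Z}/2\mathbb{Z} = \langle \bar{1} \rangle$
        |
    $\langle \bar{2} \rangle = \{\bar{0}\}$


    $\mathbb{Z}/4\mathbb{Z} = \langle \bar{1} \rangle$    (note: $\langle \bar{1} \rangle = \langle \bar{3} \rangle$)
        |
    $\langle \bar{2} \rangle$
        |
    $\langle \bar{4} \rangle = \{\bar{0}\}$


    $\mathbb{Z}/8\mathbb{Z} = \langle \bar{1} \rangle$    (note: $\langle \bar{1} \rangle = \langle \bar{3} \rangle = \langle \bar{5} \rangle = \langle \bar{7} \rangle$)
        |
    $\langle \bar{2} \rangle$
        |
    $\langle \bar{4} \rangle$
        |
    $\langle \bar{8} \rangle = \{\bar{0}\}$

In general, if $p$ is a prime, the lattice of $\mathbb{Z}/p^n\mathbb{Z}$ is

    $\mathbb{Z}/p^n\mathbb{Z} = \langle \bar{1} \rangle$
        |
    $\langle \bar{p} \rangle$
        |
    $\langle \overline{p^2} \rangle$
        |
       ...
        |
    $\langle \overline{p^n} \rangle = \{\bar{0}\}$

[Figure: lattice diagram whose bottom subgroup is $\langle \overline{12} \rangle = \{\bar{0}\}$]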


(2) The Klein 4-group (Viergruppe), $V_4$, is the group of order 4 with multiplication table

[Figure: multiplication table of $V_4$]

Note that $V_4$ is abelian and is not isomorphic to $Z_4$ (why?). We shall see that $D_8$ has
an isomorphic copy of $V_4$ as a subgroup, so it will not be necessary to check that the
associative law holds for the binary operation defined above.
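(3) The lattice of $S_3$ is

[Figure: lattice of subgroups of $S_3$, with 1 at the bottom]

(4) Using our usual notation for $D_8 = \langle r, s \rangle$, the lattice of $D_8$ is

[Figure: lattice of subgroups of $D_8$, with $D_8$ at the top and 1 at the bottom]

(5) The lattice of subgroups of $Q_8$ is

[Figure: lattice of subgroups of $Q_8$, with $Q_8$ at the top, $\langle -1 \rangle$ above 1, and 1 at the bottom]

(6) The lattice of $D_{16}$ is not a planar graph (cannot be drawn on a plane without lines
crossing). One way of drawing it is

[Figure: lattice of subgroups of $D_{16}$, with $D_{16}$ at the top]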



In many instances in both theoretical proofs and specific examples we shall be
interested only in information concerning two (or some small number of) subgroups of
a given group and their interrelationships. To depict these graphically we shall draw a
sublattice of the entire group lattice which contains the relevant joins and intersections.
An unbroken line in such a sublattice will not, in general, mean that there is no subgroup
in between the endpoints of the line. These partial lattices for groups will also be used
when we are dealing with infinite groups. For example, if we wished to discuss only
the relationship between the subgroups $\langle sr^2, r^4 \rangle$ and $\langle r^2 \rangle$ of $D_{16}$ we would draw the
sublattice

                 $D_{16}$
                    |
              $\langle s, r^2 \rangle$
               /          \
    $\langle sr^2, r^4 \rangle$      $\langle r^2 \rangle$
               \          /
                $\langle r^4 \rangle$
                    |
                    1

Note that $\langle s, r^2 \rangle$ and $\langle r^4 \rangle$ are precisely the join and intersection, respectively, of these
two subgroups in $D_{16}$.

Finally, given the lattice of subgroups of a group, it is relatively easy to compute
normalizers and centralizers. For example, in $D_8$ we can see that $C_{D_8}(s) = \langle s, r^2 \rangle$
because we first calculate that $r^2 \in C_{D_8}(s)$ (see Section 2). This proves $\langle s, r^2 \rangle \leq C_{D_8}(s)$
(note that an element always belongs to its own centralizer). The only subgroups which
contain $\langle s, r^2 \rangle$ are that subgroup itself and all of $D_8$. We cannot have $C_{D_8}(s) = D_8$
because $r$ does not commute with $s$ (i.e., $r \notin C_{D_8}(s)$). This leaves only the claimed
possibility for $C_{D_8}(s)$.
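
The conclusion of this argument can be cross-checked by direct computation. The sketch below assumes the normal form r^i s^j for elements of the dihedral group of order 8, stored as pairs (i, j), with the relation s r = r^-1 s; the names `mul`, `centralizer` and `show` are hypothetical.

```python
def mul(x, y, n=4):
    """Product in D_2n (D_8 when n = 4); (i, j) stands for r^i s^j, with s r = r^-1 s."""
    (i, j), (k, l) = x, y
    return ((i + (-1) ** j * k) % n, (j + l) % 2)

D8 = [(i, j) for i in range(4) for j in range(2)]
r, s = (1, 0), (0, 1)

def centralizer(x, group):
    """All g in the group with g x = x g."""
    return {g for g in group if mul(g, x) == mul(x, g)}

def show(g):
    """Human-readable name r^i s^j for the pair (i, j)."""
    i, j = g
    return ("r^%d" % i if i else "") + ("s" if j else "") or "1"

print(sorted(show(g) for g in centralizer(s, D8)))
```

In this normal form the element written $sr^2$ in the text appears as $r^2 s$, since $sr^2 = r^{-2}s = r^2 s$ in the dihedral group of order 8.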


EXERCISES 

1. Let $H$ and $K$ be subgroups of $G$. Exhibit all possible sublattices which show only $G$, 1,
$H$, $K$ and their joins and intersections. What distinguishes the different drawings?

2. In each of (a) to (d) list all subgroups of $D_{16}$ that satisfy the given condition.

(a) Subgroups that are contained in $\langle sr^2, r^4 \rangle$

(b) Subgroups that are contained in $\langle sr^7, r^4 \rangle$

(c) Subgroups that contain $\langle r^4 \rangle$

(d) Subgroups that contain $\langle s \rangle$.

3. Show that the subgroup $\langle s, r^2 \rangle$ of $D_8$ is isomorphic to $V_4$.

4. Use the given lattice to find all pairs of elements that generate $D_8$ (there are 12 pairs).

5. Use the given lattice to find all elements $x \in D_{16}$ such that $D_{16} = \langle x, s \rangle$ (there are 16
such elements $x$).

6. Use the given lattices to help find the centralizers of every element in the following groups:
(a) $D_8$ (b) $Q_8$ (c) $S_3$ (d) $D_{16}$.

7. Find the center of $D_{16}$.

8. In each of the following groups find the normalizer of each subgroup:

(a) $S_3$ (b) $Q_8$.

9. Draw the lattices of subgroups of the following groups:

(a) $\mathbb{Z}/16\mathbb{Z}$ (b) $\mathbb{Z}/24\mathbb{Z}$ (c) $\mathbb{Z}/48\mathbb{Z}$. [See Exercise 6 in Section 3.]

10. Classify groups of order 4 by proving that if $|G| = 4$ then $G \cong Z_4$ or $G \cong V_4$. [See
Exercise 36, Section 1.1.]

11. Consider the group of order 16 with the following presentation: 

QDis = (cr, X I or 8 = T 2 = 1, Ot = TCT 3 ) 

(called the quasidihedral or semidihedral group of order 16). This group has three sub- 
groups of order 8: (z t o 2 ) = Dg, (cr ) = Zg and ( cr 2 . ax) = Qg and every proper 
subgroup is contained in one of these three subgroups. Fill in the missing subgroups in the 
lattice of all subgroups of the quasidihedral group on the following page, exhibiting each 
subgroup with at most two generators. (This is another example of a nonplanar lattice.) 

The next three examples lead to two nonisomorphic groups that have the same lattice of sub- 
groups. 

12. The group A = Z 2 x Z4 = (a, b \ a 2 = h 4 = 1, ab = ba) has order 8 and has 
three subgroups of order 4: ( a, b 2 ) = V4, ( b ) = Z4 and {ab ) = Z4 and every proper 
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subgroup is contained in one of these three. Draw the lattice of all subgroups of A, giving 
each subgroup in terms of at most two generators. 

13. The group G = Z2 x Zg = ( x, y \ x z = y 8 = 1, xy = yx ) has order 16 and has three 
subgroups of order 8: (x, y 2 ) = Z2 x Z4, (y ) = Zg and (xy) = Zg and every proper 
subgroup is contained in one of these three. Draw the lattice of all subgroups of G, giving 
each subgroup in terms of at most two generators (cf. Exercise 12). 

14. Let $M$ be the group of order 16 with the following presentation:

$\langle u, v \mid u^2 = v^8 = 1,\ vu = uv^5 \rangle$

(sometimes called the modular group of order 16). It has three subgroups of order 8:
$\langle u, v^2 \rangle$, $\langle v \rangle$ and $\langle uv \rangle$, and every proper subgroup is contained in one of these three.
Prove that $\langle u, v^2 \rangle \cong Z_2 \times Z_4$, $\langle v \rangle \cong Z_8$ and $\langle uv \rangle \cong Z_8$. Show that the lattice of
subgroups of $M$ is the same as the lattice of subgroups of $Z_2 \times Z_8$ (cf. Exercise 13) but
that these two groups are not isomorphic.

15. Describe the isomorphism type of each of the three subgroups of $D_{16}$ of order 8.

16. Use the lattice of subgroups of the quasidihedral group of order 16 to show that every
element of order 2 is contained in the proper subgroup $\langle \tau, \sigma^2 \rangle$ (cf. Exercise 11).

17. Use the lattice of subgroups of the modular group $M$ of order 16 to show that the set
$\{x \in M \mid x^2 = 1\}$ is a subgroup of $M$ isomorphic to the Klein 4-group (cf. Exercise 14).

18. Use the lattice to help find the centralizer of every element of $QD_{16}$ (cf. Exercise 11).

19. Use the lattice to help find $N_{D_{16}}(\langle s, r^4 \rangle)$.

20. Use the lattice of subgroups of $QD_{16}$ (cf. Exercise 11) to help find the normalizers

(a) $N_{QD_{16}}(\langle \tau\sigma \rangle)$ (b) $N_{QD_{16}}(\langle \tau, \sigma^4 \rangle)$.

CHAPTER 3


Quotient Groups and 
Homomorphisms 


3.1 DEFINITIONS AND EXAMPLES 

In this chapter we introduce the notion of a quotient group of a group G, which is 
another way of obtaining a “smaller” group from the group G and, as we did with 
subgroups, we shall use quotient groups to study the structure of G. The structure of 
the group G is reflected in the structure of the quotient groups and the subgroups of G. 
For example, we shall see that the lattice of subgroups for a quotient of G is reflected
at the “top” (in a precise sense) of the lattice for G whereas the lattice for a subgroup
of G occurs naturally at the “bottom.” One can therefore obtain information about the
group G by combining this information and we shall indicate how some classification 
theorems arise in this way. 

The study of the quotient groups of G is essentially equivalent to the study of the 
homomorphisms of G, i.e., the maps of the group G to another group which respect 
the group structures. If $\varphi$ is a homomorphism from $G$ to a group $H$ recall that the
fibers of $\varphi$ are the sets of elements of $G$ projecting to single elements of $H$, which we
can represent pictorially in Figure 1, where the vertical line in the box above a point $a$
represents the fiber of $\varphi$ over $a$.


[Fig. 1: the fibers of $\varphi$ drawn as vertical lines in $G$ above the points of $H$]

The group operation in $H$ provides a way to multiply two elements in the image
of $\varphi$ (i.e., two elements on the horizontal line in Figure 1). This suggests a natural
multiplication of the fibers lying above these two points making the set of fibers into
a group: if $X_a$ is the fiber above $a$ and $X_b$ is the fiber above $b$ then the product of $X_a$
with $X_b$ is defined to be the fiber $X_{ab}$ above the product $ab$, i.e., $X_a X_b = X_{ab}$. This
multiplication is associative since multiplication is associative in $H$, the identity is the
fiber over the identity of $H$, and the inverse of the fiber over $a$ is the fiber over $a^{-1}$,
as is easily checked from the definition. For example, the associativity is proved as
follows: $(X_a X_b)X_c = (X_{ab})X_c = X_{(ab)c}$ and $X_a(X_b X_c) = X_a(X_{bc}) = X_{a(bc)}$. Since
$(ab)c = a(bc)$ in $H$, $(X_a X_b)X_c = X_a(X_b X_c)$. Roughly speaking, the group $G$ is
partitioned into pieces (the fibers) and these pieces themselves have the structure of a
group, called a quotient group of $G$ (a formal definition follows the example below).

Since the multiplication of fibers is defined from the multiplication in $H$, by construction
the quotient group with this multiplication is naturally isomorphic to the image
of $G$ under the homomorphism $\varphi$ (fiber $X_a$ is identified with its image $a$ in $H$).

Example 

Let $G = \mathbb{Z}$, let $H = Z_n = \langle x \rangle$ be the cyclic group of order $n$ and define $\varphi : \mathbb{Z} \to Z_n$ by
$\varphi(a) = x^a$. Since

$\varphi(a + b) = x^{a+b} = x^a x^b = \varphi(a)\varphi(b)$

it follows that $\varphi$ is a homomorphism (note that the operation in $\mathbb{Z}$ is addition and the
operation in $Z_n$ is multiplication). Note also that $\varphi$ is surjective. The fiber of $\varphi$ over $x^a$ is
then

$\varphi^{-1}(x^a) = \{m \in \mathbb{Z} \mid x^m = x^a\} = \{m \in \mathbb{Z} \mid x^{m-a} = 1\}$

$= \{m \in \mathbb{Z} \mid n \text{ divides } m - a\}$ (by Proposition 2.3)

$= \{m \in \mathbb{Z} \mid m \equiv a \pmod{n}\} = \bar{a},$

i.e., the fibers of $\varphi$ are precisely the residue classes modulo $n$. Figure 1 here becomes:



[Fig. 2: Figure 1 for the homomorphism $\varphi : \mathbb{Z} \to Z_n$]

The multiplication in $Z_n$ is just $x^a x^b = x^{a+b}$. The corresponding fibers are $\bar{a}$, $\bar{b}$, and $\overline{a+b}$,
so the corresponding group operation for the fibers is $\bar{a} \cdot \bar{b} = \overline{a+b}$. This is just the group
$\mathbb{Z}/n\mathbb{Z}$ under addition, a group isomorphic to the image of $\varphi$ (all of $Z_n$).

The identity of this group (the fiber above the identity in $Z_n$) consists of all the multiples
of $n$ in $\mathbb{Z}$, namely $n\mathbb{Z}$, a subgroup of $\mathbb{Z}$, and the remaining fibers are just translates, $a + n\mathbb{Z}$,
of this subgroup. The group operation can also be defined directly by taking representatives
from these fibers, adding these representatives in $\mathbb{Z}$ and taking the fiber containing this sum
(this was the original definition of the group $\mathbb{Z}/n\mathbb{Z}$). From a computational point of view
computing the product of $\bar{a}$ and $\bar{b}$ by simply adding representatives $a$ and $b$ is much easier
than first computing the image of these fibers under $\varphi$ (namely, $x^a$ and $x^b$), multiplying
these in $H$ (obtaining $x^{a+b}$) and then taking the fiber over this product.
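The computational remark above can be tried on a small case. The following is a minimal sketch, not part of the text's development: it models $x^a \in Z_n$ by the exponent $a \bmod n$, and the names `phi`, `fiber`, `add_fibers` and the choice $n = 5$ are illustrative assumptions.

```python
# Model phi : Z -> Z_n, phi(a) = x^a, by recording the exponent a mod n
# (x has order n, so x^a is determined by a mod n).

def phi(a: int, n: int) -> int:
    """Exponent of x^a in Z_n = <x>, reduced to the range 0..n-1."""
    return a % n

def fiber(r: int, n: int, window: range) -> list[int]:
    """The integers in `window` that phi sends to x^r (a finite piece of r + nZ)."""
    return [m for m in window if phi(m, n) == phi(r, n)]

def add_fibers(a: int, b: int, n: int) -> int:
    """Add two fibers by adding arbitrary representatives a, b and reducing."""
    return phi(a + b, n)

n = 5
window = range(-10, 11)
fiber_of_2 = fiber(2, n, window)  # the part of 2 + 5Z lying in the window

# Other representatives of the same two fibers (7 for 2, -6 for 4) give the same sum.
same_sum = add_fibers(2, 4, n) == add_fibers(2 + 5, 4 - 10, n)
```

Here `fiber_of_2` lists the translates of $5\mathbb{Z}$ by 2 inside the window, and `same_sum` records that the sum of fibers does not depend on which representatives are added.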


We first consider some basic properties of homomorphisms and their fibers. The
fiber of a homomorphism $\varphi : G \to H$ lying above the identity of $H$ is given a name:

Definition. If $\varphi$ is a homomorphism $\varphi : G \to H$, the kernel of $\varphi$ is the set

$\{g \in G \mid \varphi(g) = 1\}$

and will be denoted by $\ker\varphi$ (here 1 is the identity of $H$).

Proposition 1. Let $G$ and $H$ be groups and let $\varphi : G \to H$ be a homomorphism.

(1) $\varphi(1_G) = 1_H$, where $1_G$ and $1_H$ are the identities of $G$ and $H$, respectively.

(2) $\varphi(g^{-1}) = \varphi(g)^{-1}$ for all $g \in G$.

(3) $\varphi(g^n) = \varphi(g)^n$ for all $n \in \mathbb{Z}$.

(4) $\ker\varphi$ is a subgroup of $G$.

(5) $\operatorname{im}(\varphi)$, the image of $G$ under $\varphi$, is a subgroup of $H$.

Proof: (1) Since $\varphi(1_G) = \varphi(1_G 1_G) = \varphi(1_G)\varphi(1_G)$, the cancellation laws show
that (1) holds.

(2) $\varphi(1_G) = \varphi(gg^{-1}) = \varphi(g)\varphi(g^{-1})$ and, by part (1), $\varphi(1_G) = 1_H$, hence

$1_H = \varphi(g)\varphi(g^{-1}).$

Multiplying both sides on the left by $\varphi(g)^{-1}$ and simplifying gives (2).

(3) This is an easy exercise in induction for $n \in \mathbb{Z}^+$. By part (2), conclusion (3)
holds for negative values of $n$ as well.

(4) Since $1_G \in \ker\varphi$, the kernel of $\varphi$ is not empty. Let $x, y \in \ker\varphi$, that is
$\varphi(x) = \varphi(y) = 1_H$. Then

$\varphi(xy^{-1}) = \varphi(x)\varphi(y^{-1}) = \varphi(x)\varphi(y)^{-1} = 1_H 1_H^{-1} = 1_H$

that is, $xy^{-1} \in \ker\varphi$. By the subgroup criterion, $\ker\varphi \le G$.

(5) Since $\varphi(1_G) = 1_H$, the identity of $H$ lies in the image of $\varphi$, so $\operatorname{im}(\varphi)$ is nonempty.
If $x$ and $y$ are in $\operatorname{im}(\varphi)$, say $x = \varphi(a)$, $y = \varphi(b)$, then $y^{-1} = \varphi(b^{-1})$ by (2) so that
$xy^{-1} = \varphi(a)\varphi(b^{-1}) = \varphi(ab^{-1})$ since $\varphi$ is a homomorphism. Hence also $xy^{-1}$ is in
the image of $\varphi$, so $\operatorname{im}(\varphi)$ is a subgroup of $H$ by the subgroup criterion.

We can now define some terminology associated with quotient groups.

Definition. Let $\varphi : G \to H$ be a homomorphism with kernel $K$. The quotient group
or factor group, $G/K$ (read $G$ modulo $K$ or simply $G$ mod $K$), is the group whose
elements are the fibers of $\varphi$ with group operation defined above: namely if $X$ is the
fiber above $a$ and $Y$ is the fiber above $b$ then the product of $X$ with $Y$ is defined to be
the fiber above the product $ab$.

The notation emphasizes the fact that the kernel K is a single element in the group 
G/K and we shall see below (Proposition 2) that, as in the case of Z/nZ above, the 
other elements of G/K are just the “translates” of the kernel K. Hence we may think 
of G/K as being obtained by collapsing or “dividing out” by K (or more precisely, by 
equivalence modulo K). This explains why G/K is referred to as a “quotient” group. 

The definition of the quotient group $G/K$ above requires the map $\varphi$ explicitly,
since the multiplication of the fibers is performed by first projecting the fibers to $H$
via $\varphi$, multiplying in $H$ and then determining the fiber over this product. Just as for
$\mathbb{Z}/n\mathbb{Z}$ above, it is also possible to define the multiplication of fibers directly in terms
of representatives from the fibers. This is computationally simpler and the map $\varphi$ does
not enter explicitly. We first show that the fibers of a homomorphism can be expressed
in terms of the kernel of the homomorphism just as in the example above (where the
kernel was $n\mathbb{Z}$ and the fibers were translates of the form $a + n\mathbb{Z}$).

Proposition 2. Let $\varphi : G \to H$ be a homomorphism of groups with kernel $K$. Let
$X \in G/K$ be the fiber above $a$, i.e., $X = \varphi^{-1}(a)$. Then

(1) For any $u \in X$, $X = \{uk \mid k \in K\}$.

(2) For any $u \in X$, $X = \{ku \mid k \in K\}$.

Proof: We prove (1) and leave the proof of (2) as an exercise. Let $u \in X$ so, by
definition of $X$, $\varphi(u) = a$. Let

$uK = \{uk \mid k \in K\}.$

We first prove $uK \subseteq X$. For any $k \in K$,

$\varphi(uk) = \varphi(u)\varphi(k)$ (since $\varphi$ is a homomorphism)

$= \varphi(u) \cdot 1$ (since $k \in \ker\varphi$)

$= a,$

that is, $uk \in X$. This proves $uK \subseteq X$. To establish the reverse inclusion suppose
$g \in X$ and let $k = u^{-1}g$. Then

$\varphi(k) = \varphi(u^{-1})\varphi(g) = \varphi(u)^{-1}\varphi(g)$ (by Proposition 1)

$= a^{-1}a = 1.$

Thus $k \in \ker\varphi$. Since $k = u^{-1}g$, $g = uk \in uK$, establishing the inclusion $X \subseteq uK$.
This proves (1).
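Proposition 2 can be checked by brute force on a small example. The sketch below is an illustration under assumptions not made in the text: it takes $G = S_3$, written as tuples giving the images of 0, 1, 2, and uses the sign map to $\{\pm 1\}$ (a standard homomorphism) as $\varphi$; the helper names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Product pq of permutations given as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def sign(p):
    """Sign of a permutation, computed by counting inversions."""
    n = len(p)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
    return -1 if inversions % 2 else 1

G = list(permutations(range(3)))        # the six elements of S_3
K = {g for g in G if sign(g) == 1}      # kernel of the sign homomorphism
X = {g for g in G if sign(g) == -1}     # the fiber above -1

# For every u in the fiber X, both uK and Ku should equal X.
left_ok = all({compose(u, k) for k in K} == X for u in X)
right_ok = all({compose(k, u) for k in K} == X for u in X)
```

Both flags come out true: whichever element $u$ of the fiber is chosen, the translates $uK$ and $Ku$ recover the whole fiber.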

The sets arising in Proposition 2 to describe the fibers of a homomorphism $\varphi$ are
defined for any subgroup $K$ of $G$, not necessarily the kernel of some homomorphism
(we shall determine necessary and sufficient conditions for a subgroup to be such a
kernel shortly) and are given a name:

Definition. For any $N \le G$ and any $g \in G$ let

$gN = \{gn \mid n \in N\}$ and $Ng = \{ng \mid n \in N\}$

called respectively a left coset and a right coset of $N$ in $G$. Any element of a coset is
called a representative for the coset.

We have already seen in Proposition 2 that if $N$ is the kernel of a homomorphism
and $g_1$ is any representative for the coset $gN$ then $g_1N = gN$ (and if $g_1 \in Ng$ then
$Ng_1 = Ng$). We shall see that this fact is valid for arbitrary subgroups $N$ in Proposition
4 below, which explains the terminology of a representative.

If G is an additive group we shall write g + N and N + g for the left and right 
cosets of N in G with representative g, respectively. In general we can think of the left 
coset, gN, of N in G as the left translate of N by g. (The reader may wish to review 
Exercise 18 of Section 1.7 which proves that the right cosets of N in G are precisely 
the orbits of N acting on G by left multiplication.) 

In terms of this definition, Proposition 2 shows that the fibers of a homomorphism
are the left cosets of the kernel (and also the right cosets of the kernel), i.e., the elements
of the quotient $G/K$ are the left cosets $gK$, $g \in G$. In the example of $\mathbb{Z}/n\mathbb{Z}$ the
multiplication in the quotient group could also be defined in terms of representatives
for the cosets. The following result shows the same result is true for $G/K$ in general
(provided we know that $K$ is the kernel of some homomorphism), namely that the
product of two left cosets $X$ and $Y$ in $G/K$ is computed by choosing any representative
$u$ of $X$, any representative $v$ of $Y$, multiplying $u$ and $v$ in $G$ and forming the coset
$(uv)K$.

Theorem 3. Let $G$ be a group and let $K$ be the kernel of some homomorphism from
$G$ to another group. Then the set whose elements are the left cosets of $K$ in $G$ with
operation defined by

$uK \circ vK = (uv)K$

forms a group, $G/K$. In particular, this operation is well defined in the sense that if $u_1$ is
any element in $uK$ and $v_1$ is any element in $vK$, then $u_1v_1 \in uvK$, i.e., $u_1v_1K = uvK$
so that the multiplication does not depend on the choice of representatives for the cosets.
The same statement is true with “right coset” in place of “left coset.”

Proof: Let $X, Y \in G/K$ and let $Z = XY$ in $G/K$, so that by Proposition 2(1) $X$,
$Y$ and $Z$ are (left) cosets of $K$. By assumption, $K$ is the kernel of some homomorphism
$\varphi : G \to H$ so $X = \varphi^{-1}(a)$ and $Y = \varphi^{-1}(b)$ for some $a, b \in H$. By definition of
the operation in $G/K$, $Z = \varphi^{-1}(ab)$. Let $u$ and $v$ be arbitrary representatives of $X$,
$Y$, respectively, so that $\varphi(u) = a$, $\varphi(v) = b$ and $X = uK$, $Y = vK$. We must show
$uv \in Z$. Now

$uv \in Z \iff uv \in \varphi^{-1}(ab)$

$\iff \varphi(uv) = ab$

$\iff \varphi(u)\varphi(v) = ab.$

Since the latter equality does hold, $uv \in Z$ hence $Z$ is the (left) coset $uvK$. (Exercise
2 below shows conversely that every $z \in Z$ can be written as $uv$, for some $u \in X$ and
$v \in Y$.) This proves that the product of $X$ with $Y$ is the coset $uvK$ for any choice
of representatives $u \in X$, $v \in Y$, completing the proof of the first statements of the
theorem. The last statement in the theorem follows immediately since, by Proposition
2, $uK = Ku$ and $vK = Kv$ for all $u$ and $v$ in $G$.

In terms of Figure 1, the multiplication in $G/K$ via representatives can be pictured
as in the following Figure 3.

[Fig. 3: multiplication in $G/K$ via representatives, drawn on the diagram of Figure 1]


We emphasize the fact that the multiplication is independent of the particular
representatives chosen. Namely, the product (or sum, if the group is written additively) of
two cosets $X$ and $Y$ is the coset $uvK$ containing the product $uv$ where $u$ and $v$ are any
representatives for the cosets $X$ and $Y$, respectively. This process of considering only
the coset containing an element, or “reducing mod $K$,” is the same as what we have been
doing, in particular, in $\mathbb{Z}/n\mathbb{Z}$. A useful notation for denoting the coset $uK$ containing
a representative $u$ is $\bar{u}$. With this notation (which we introduced in the Preliminaries in
dealing with $\mathbb{Z}/n\mathbb{Z}$), the quotient group $G/K$ is denoted $\bar{G}$ and the product of elements
$\bar{u}$ and $\bar{v}$ is simply the coset containing $uv$, i.e., $\overline{uv}$. This notation also reinforces the fact
that the cosets $uK$ in $G/K$ are elements $\bar{u}$ in $G/K$.

Examples 

(1) The first example in this chapter of the homomorphism $\varphi$ from $\mathbb{Z}$ to $Z_n$ has fibers the
left (and also the right) cosets $a + n\mathbb{Z}$ of the kernel $n\mathbb{Z}$. Theorem 3 proves that these
cosets form a group under addition of representatives, namely $\mathbb{Z}/n\mathbb{Z}$, which explains
the notation for this group. The group is naturally isomorphic to its image under $\varphi$, so
we recover the isomorphism $\mathbb{Z}/n\mathbb{Z} \cong Z_n$ of Chapter 2.

(2) If $\varphi : G \to H$ is an isomorphism, then $K = 1$, the fibers of $\varphi$ are the singleton
subsets of $G$ and so $G/1 \cong G$.

(3) Let $G$ be any group, let $H = 1$ be the group of order 1 and define $\varphi : G \to H$ by
$\varphi(g) = 1$, for all $g \in G$. It is immediate that $\varphi$ is a homomorphism. This map is called
the trivial homomorphism. Note that in this case $\ker\varphi = G$ and $G/G$ is a group with
the single element, $G$, i.e., $G/G \cong Z_1 = \{1\}$.

(4) Let $G = \mathbb{R}^2$ (operation vector addition), let $H = \mathbb{R}$ (operation addition) and define
$\varphi : \mathbb{R}^2 \to \mathbb{R}$ by $\varphi((x, y)) = x$. Thus $\varphi$ is projection onto the $x$-axis. We show $\varphi$ is a
homomorphism:

$\varphi((x_1, y_1) + (x_2, y_2)) = \varphi((x_1 + x_2, y_1 + y_2))$

$= x_1 + x_2 = \varphi((x_1, y_1)) + \varphi((x_2, y_2)).$

Now

$\ker\varphi = \{(x, y) \mid \varphi((x, y)) = 0\}$

$= \{(x, y) \mid x = 0\}$ = the $y$-axis.

Note that $\ker\varphi$ is indeed a subgroup of $\mathbb{R}^2$ and that the fiber of $\varphi$ over $a \in \mathbb{R}$ is the
translate of the $y$-axis by $a$, i.e., the line $x = a$. This is also the left (and the right) coset
of the kernel with representative $(a, 0)$ (or any other representative point projecting to
$a$):

$\overline{(a, 0)} = (a, 0) + y\text{-axis}.$

Hence Figure 1 in this example becomes

[Fig. 4: Figure 1 for the projection $\varphi : \mathbb{R}^2 \to \mathbb{R}$; the fibers are the vertical lines $x = a$]


The group operation (written additively here) can be described either by using the map
$\varphi$: the sum of the line $(x = a)$ and the line $(x = b)$ is the line $(x = a + b)$; or directly in
terms of coset representatives: the sum of the vertical line containing the point $(a, y_1)$
and the vertical line containing the point $(b, y_2)$ is the vertical line containing the point
$(a + b, y_1 + y_2)$. Note in particular that the choice of representatives of these vertical
lines is not important (i.e., the $y$-coordinates are not important).

(5) (An example where the group $G$ is non-abelian.) Let $G = Q_8$ and let $H = V_4$ be the
Klein 4-group (Section 2.5, Example 2). Define $\varphi : Q_8 \to V_4$ by

$\varphi(\pm 1) = 1, \quad \varphi(\pm i) = a, \quad \varphi(\pm j) = b, \quad \varphi(\pm k) = c.$

The check that $\varphi$ is a homomorphism is left as an exercise; relying on symmetry
minimizes the work in showing $\varphi(xy) = \varphi(x)\varphi(y)$ for all $x$ and $y$ in $Q_8$. It is clear
that $\varphi$ is surjective and that $\ker\varphi = \{\pm 1\}$. One might think of $\varphi$ as an “absolute value”
function on $Q_8$ so the fibers of $\varphi$ are the sets $E = \{\pm 1\}$, $A = \{\pm i\}$, $B = \{\pm j\}$ and
$C = \{\pm k\}$, which are collapsed to $1$, $a$, $b$, and $c$ respectively in $Q_8/\langle \pm 1 \rangle$, and these
are the left (and also the right) cosets of $\ker\varphi$ (for example, $A = i \cdot \ker\varphi = \{i, -i\} =
\ker\varphi \cdot i$).
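The homomorphism check left as an exercise in Example 5 is small enough to run exhaustively. The following is a minimal sketch under stated assumptions: elements of $Q_8$ are encoded as pairs (sign, unit), and $V_4$ is modelled as pairs of bits under componentwise XOR with $a = (1, 0)$, $b = (0, 1)$, $c = (1, 1)$. This encoding is an illustrative choice, not the text's notation.

```python
UNITS = "1ijk"

# TABLE[(a, b)] = (sign, unit) with a * b = sign * unit, for a, b in {1, i, j, k}.
TABLE = {}
for u in UNITS:
    TABLE[("1", u)] = (1, u)
    TABLE[(u, "1")] = (1, u)
for a, b, c in [("i", "j", "k"), ("j", "k", "i"), ("k", "i", "j")]:
    TABLE[(a, b)] = (1, c)      # ij = k, jk = i, ki = j
    TABLE[(b, a)] = (-1, c)     # ji = -k, kj = -i, ik = -j
    TABLE[(a, a)] = (-1, "1")   # i^2 = j^2 = k^2 = -1

def q_mul(x, y):
    """Multiply two elements of Q_8 encoded as (sign, unit)."""
    (s, a), (t, b) = x, y
    r, c = TABLE[(a, b)]
    return (s * t * r, c)

Q8 = [(s, u) for s in (1, -1) for u in UNITS]

# Assumed model of V_4: bit pairs under XOR; 1 -> (0,0), a -> (1,0), b -> (0,1), c -> (1,1).
V4_IMAGE = {"1": (0, 0), "i": (1, 0), "j": (0, 1), "k": (1, 1)}

def phi(x):
    """The map of Example 5: forget the sign."""
    return V4_IMAGE[x[1]]

def v_mul(p, q):
    """Operation in the bit-pair model of V_4."""
    return (p[0] ^ q[0], p[1] ^ q[1])

is_hom = all(phi(q_mul(x, y)) == v_mul(phi(x), phi(y)) for x in Q8 for y in Q8)
kernel = sorted(x for x in Q8 if phi(x) == (0, 0))
```

All 64 products satisfy $\varphi(xy) = \varphi(x)\varphi(y)$, and the kernel consists of the two elements $\pm 1$.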

By Theorem 3, if we are given a subgroup K of a group G which we know is the
kernel of some homomorphism, we may define the quotient G/K without recourse to 
the homomorphism by the multiplication uKvK = uvK. This raises the question of 
whether it is possible to define the quotient group G/N similarly for any subgroup N 
of G. The answer is no in general since this multiplication is not in general well defined 
(cf. Proposition 5 later). In fact we shall see that it is possible to define the structure 
of a group on the cosets of N if and only if N is the kernel of some homomorphism 
(Proposition 7). We shall also give a criterion to determine when a subgroup N is such 
a kernel — this is the notion of a normal subgroup and we shall consider non-normal 
subgroups in subsequent sections. 

We first show that the cosets of an arbitrary subgroup of G partition G (i.e., their
union is all of G and distinct cosets have empty intersection).

Proposition 4. Let $N$ be any subgroup of the group $G$. The set of left cosets of $N$ in $G$
form a partition of $G$. Furthermore, for all $u, v \in G$, $uN = vN$ if and only if $v^{-1}u \in N$
and in particular, $uN = vN$ if and only if $u$ and $v$ are representatives of the same coset.

Proof: First of all note that since $N$ is a subgroup of $G$, $1 \in N$. Thus $g = g \cdot 1 \in gN$
for all $g \in G$, i.e.,

$G = \bigcup_{g \in G} gN.$

To show that distinct left cosets have empty intersection, suppose $uN \cap vN \ne \emptyset$. We
show $uN = vN$. Let $x \in uN \cap vN$. Write

$x = un = vm$, for some $n, m \in N$.

In the latter equality multiply both sides on the right by $n^{-1}$ to get

$u = vmn^{-1} = vm_1$, where $m_1 = mn^{-1} \in N$.

Now for any element $ut$ of $uN$ ($t \in N$),

$ut = (vm_1)t = v(m_1t) \in vN.$

This proves $uN \subseteq vN$. By interchanging the roles of $u$ and $v$ one obtains similarly that
$vN \subseteq uN$. Thus two cosets with nonempty intersection coincide.

By the first part of the proposition, $uN = vN$ if and only if $u \in vN$ if and only
if $u = vn$, for some $n \in N$ if and only if $v^{-1}u \in N$, as claimed. Finally, $v \in uN$ is
equivalent to saying $v$ is a representative for $uN$, hence $uN = vN$ if and only if $u$ and
$v$ are representatives for the same coset (namely the coset $uN = vN$).

Proposition 5. Let $G$ be a group and let $N$ be a subgroup of $G$.

(1) The operation on the set of left cosets of $N$ in $G$ described by

$uN \cdot vN = (uv)N$

is well defined if and only if $gng^{-1} \in N$ for all $g \in G$ and all $n \in N$.

(2) If the above operation is well defined, then it makes the set of left cosets of $N$
in $G$ into a group. In particular the identity of this group is the coset $1N$ and
the inverse of $gN$ is the coset $g^{-1}N$, i.e., $(gN)^{-1} = g^{-1}N$.

Proof: (1) Assume first that this operation is well defined, that is, for all $u, v \in G$,

if $u, u_1 \in uN$ and $v, v_1 \in vN$ then $uvN = u_1v_1N$.

Let $g$ be an arbitrary element of $G$ and let $n$ be an arbitrary element of $N$. Letting
$u = 1$, $u_1 = n$ and $v = v_1 = g^{-1}$ and applying the assumption above we deduce that

$1g^{-1}N = ng^{-1}N$, i.e., $g^{-1}N = ng^{-1}N$.

Since $1 \in N$, $ng^{-1} \cdot 1 \in ng^{-1}N$. Thus $ng^{-1} \in g^{-1}N$, hence $ng^{-1} = g^{-1}n_1$, for some
$n_1 \in N$. Multiplying both sides on the left by $g$ gives $gng^{-1} = n_1 \in N$, as claimed.

Conversely, assume $gng^{-1} \in N$ for all $g \in G$ and all $n \in N$. To prove the operation
stated above is well defined let $u, u_1 \in uN$ and $v, v_1 \in vN$. We may write

$u_1 = un$ and $v_1 = vm$, for some $n, m \in N$.

We must prove that $u_1v_1 \in uvN$:

$u_1v_1 = (un)(vm) = u(vv^{-1})nvm$

$= (uv)(v^{-1}nv)m = (uv)(n_1m),$

where $n_1 = v^{-1}nv = (v^{-1})n(v^{-1})^{-1}$ is an element of $N$ by assumption. Now $N$ is
closed under products, so $n_1m \in N$. Thus

$u_1v_1 = (uv)n_2$, for some $n_2 \in N$.

Thus the left cosets $uvN$ and $u_1v_1N$ contain the common element $u_1v_1$. By the preceding
proposition they are equal. This proves that the operation is well defined.

(2) If the operation on cosets is well defined the group axioms are easy to check
and are induced by their validity in $G$. For example, the associative law holds because
for all $u, v, w \in G$,

$(uN)(vNwN) = uN(vwN)$

$= u(vw)N$

$= (uv)wN = (uNvN)(wN),$

since $u(vw) = (uv)w$ in $G$. The identity in $G/N$ is the coset $1N$ and the inverse of
$gN$ is $g^{-1}N$ as is immediate from the definition of the multiplication.
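Proposition 5(1) can be seen concretely in $S_3$. The sketch below is an illustration, not part of the proof: permutations are tuples giving the images of 0, 1, 2, the subgroup `H` generated by a transposition and the subgroup `A3` of even permutations are assumed test cases, and the function names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Product pq of permutations given as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

G = list(permutations(range(3)))   # S_3

def left_coset(u, N):
    return frozenset(compose(u, n) for n in N)

def product_well_defined(N):
    """True if (uv)N depends only on the cosets uN and vN, not on u and v."""
    return all(
        left_coset(compose(u1, v1), N) == left_coset(compose(u, v), N)
        for u in G for v in G
        for u1 in left_coset(u, N) for v1 in left_coset(v, N)
    )

def closed_under_conjugation(N):
    """True if g n g^-1 lies in N for all g in G and n in N."""
    return all(compose(compose(g, n), inverse(g)) in N for g in G for n in N)

H = {(0, 1, 2), (1, 0, 2)}                  # generated by one transposition
A3 = {(0, 1, 2), (1, 2, 0), (2, 0, 1)}      # identity and the two 3-cycles

results = {name: (product_well_defined(N), closed_under_conjugation(N))
           for name, N in [("H", H), ("A3", A3)]}
```

For `H` both tests fail and for `A3` both succeed, so on these two subgroups the two conditions of Proposition 5(1) agree, as the proposition predicts.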

As indicated before, the subgroups $N$ satisfying the condition in Proposition 5 for
which there is a natural group structure on the quotient $G/N$ are given a name:

Definition. The element $gng^{-1}$ is called the conjugate of $n \in N$ by $g$. The set
$gNg^{-1} = \{gng^{-1} \mid n \in N\}$ is called the conjugate of $N$ by $g$. The element $g$ is
said to normalize $N$ if $gNg^{-1} = N$. A subgroup $N$ of a group $G$ is called normal if
every element of $G$ normalizes $N$, i.e., if $gNg^{-1} = N$ for all $g \in G$. If $N$ is a normal
subgroup of $G$ we shall write $N \trianglelefteq G$.


Note that the structure of G is reflected in the structure of the quotient G/N when 
N is a normal subgroup (for example, the associativity of the multiplication in G/N is 
induced from the associativity in G and inverses in G/N are induced from inverses in 
G ). We shall see more of the relationship of G to its quotient G/N when we consider 
the Isomorphism Theorems later in Section 3. 

We summarize our results above as Theorem 6.

Theorem 6. Let $N$ be a subgroup of the group $G$. The following are equivalent:

(1) $N \trianglelefteq G$

(2) $N_G(N) = G$ (recall $N_G(N)$ is the normalizer in $G$ of $N$)

(3) $gN = Ng$ for all $g \in G$

(4) the operation on left cosets of $N$ in $G$ described in Proposition 5 makes the set
of left cosets into a group

(5) $gNg^{-1} \subseteq N$ for all $g \in G$.

Proof: We have already done the hard equivalences; the others are left as exercises. 

As a practical matter, one tries to minimize the computations necessary to determine
whether a given subgroup $N$ is normal in a group $G$. In particular, one tries to avoid as
much as possible the computation of all the conjugates $gng^{-1}$ for $n \in N$ and $g \in G$. For
example, the elements of $N$ itself normalize $N$ since $N$ is a subgroup. Also, if one has a
set of generators for $N$, it suffices to check that all conjugates of these generators lie in
$N$ to prove that $N$ is a normal subgroup (this is because the conjugate of a product is the
product of the conjugates and the conjugate of the inverse is the inverse of the conjugate);
this is Exercise 26 later. Similarly, if generators for $G$ are also known, then it suffices
to check that these generators for $G$ normalize $N$. In particular, if generators for both
$N$ and $G$ are known, this reduces the calculations to a small number of conjugations
to check. If $N$ is a finite group then it suffices to check that the conjugates of a set
of generators for $N$ by a set of generators for $G$ are again elements of $N$ (Exercise
29). Finally, it is often possible to prove directly that $N_G(N) = G$ without excessive
computations (some examples appear in the next section), again proving that $N$ is a
normal subgroup of $G$ without mindlessly computing all possible conjugates $gng^{-1}$.
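The generator shortcut for finite groups can be compared against the brute-force check. This is a minimal sketch under assumptions not in the text: $D_8$ is realized as permutations of the vertices 0, 1, 2, 3 of a square, with `r` the rotation $i \mapsto i + 1 \pmod 4$ and `s` the reflection fixing vertices 0 and 2; the function names are hypothetical.

```python
def compose(p, q):
    """Product pq of permutations given as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Subgroup generated by `gens`, found by closing under products.
    This suffices for finite groups, where inverses are positive powers."""
    identity = tuple(range(len(gens[0])))
    elems = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return elems

def normal_by_generators(gens_G, gens_N):
    """Conjugate only generators of N by generators of G (finite case)."""
    N = generate(gens_N)
    return all(compose(compose(g, n), inverse(g)) in N
               for g in gens_G for n in gens_N)

def normal_by_brute_force(gens_G, gens_N):
    """Conjugate every element of N by every element of G."""
    G, N = generate(gens_G), generate(gens_N)
    return all(compose(compose(g, n), inverse(g)) in N for g in G for n in N)

r = (1, 2, 3, 0)       # rotation of the square
s = (0, 3, 2, 1)       # reflection fixing vertices 0 and 2
r2 = compose(r, r)

center_case = (normal_by_generators([r, s], [r2]), normal_by_brute_force([r, s], [r2]))
reflection_case = (normal_by_generators([r, s], [s]), normal_by_brute_force([r, s], [s]))
```

For the subgroup generated by `r2` the shortcut needs two conjugations where the brute-force check needs sixteen, and the two methods agree on both subgroups tried here.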

We now prove that the normal subgroups are precisely the same as the kernels of
homomorphisms considered earlier.

Proposition 7. A subgroup N of the group G is normal if and only if it is the kernel of 
some homomorphism. 

Proof: If $N$ is the kernel of the homomorphism $\varphi$, then Proposition 2 shows that
the left cosets of $N$ are the same as the right cosets of $N$ (and both are the fibers of the
map $\varphi$). By (3) of Theorem 6, $N$ is then a normal subgroup. (Another direct proof of
this from the definition of normality for $N$ is given in the exercises.)

Conversely, if $N \trianglelefteq G$, let $H = G/N$ and define $\pi : G \to G/N$ by

$\pi(g) = gN$ for all $g \in G$.

By definition of the operation in $G/N$,

$\pi(g_1g_2) = (g_1g_2)N = g_1N \, g_2N = \pi(g_1)\pi(g_2).$

This proves $\pi$ is a homomorphism. Now

$\ker\pi = \{g \in G \mid \pi(g) = 1N\}$

$= \{g \in G \mid gN = 1N\}$

$= \{g \in G \mid g \in N\} = N.$

Thus $N$ is the kernel of the homomorphism $\pi$.
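The construction of $\pi$ in this proof can be carried out explicitly for a small normal subgroup. The sketch below is an illustration under assumptions not in the text: $G = S_3$ as tuples, $N$ the subgroup of even permutations, and cosets multiplied by choosing an arbitrary representative from each (here the smallest tuple).

```python
from itertools import permutations

def compose(p, q):
    """Product pq of permutations given as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

G = list(permutations(range(3)))                     # S_3
N = frozenset({(0, 1, 2), (1, 2, 0), (2, 0, 1)})     # the even permutations
IDENTITY = (0, 1, 2)

def pi(g):
    """Natural projection: send g to its left coset gN."""
    return frozenset(compose(g, n) for n in N)

def coset_product(X, Y):
    """Multiply cosets via representatives; min() just picks one from each."""
    u, v = min(X), min(Y)
    return pi(compose(u, v))

is_hom = all(pi(compose(g, h)) == coset_product(pi(g), pi(h)) for g in G for h in G)
kernel = frozenset(g for g in G if pi(g) == pi(IDENTITY))
```

The flag `is_hom` confirms $\pi(gh) = \pi(g)\pi(h)$ for all 36 pairs, and `kernel` comes out equal to `N`.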

The homomorphism $\pi$ constructed above demonstrating the normal subgroup $N$
as the kernel of a homomorphism is given a name:

Definition. Let $N \trianglelefteq G$. The homomorphism $\pi : G \to G/N$ defined by $\pi(g) = gN$
is called the natural projection (homomorphism)¹ of $G$ onto $G/N$. If $\bar{H} \le G/N$ is a
subgroup of $G/N$, the complete preimage of $\bar{H}$ in $G$ is the preimage of $\bar{H}$ under the
natural projection homomorphism.

The complete preimage of a subgroup of $G/N$ is a subgroup of $G$ (cf. Exercise 1)
which contains the subgroup $N$ since these are the elements which map to the identity
$\bar{1} \in \bar{H}$. We shall see in the Isomorphism Theorems in Section 3 that there is a natural
correspondence between the subgroups of $G$ that contain $N$ and the subgroups of the
quotient $G/N$.

We now have an “internal” criterion which determines precisely when a subgroup
$N$ of a given group $G$ is the kernel of some homomorphism, namely,

$N_G(N) = G.$

We may thus think of the normalizer of a subgroup $N$ of $G$ as being a measure of
“how close” $N$ is to being a normal subgroup (this explains the choice of name for this
subgroup). Keep in mind that the property of being normal is an embedding property,
that is, it depends on the relation of $N$ to $G$, not on the internal structure of $N$ itself
(the same group $N$ may be a normal subgroup of $G$ but not be normal in a larger group
containing $G$).

We began the discussion of quotient groups with the existence of a homomorphism
$\varphi$ of $G$ to $H$ and showed the kernel of this homomorphism is a normal subgroup $N$ of
$G$ and the quotient $G/N$ (defined in terms of fibers originally) is naturally isomorphic
to the image of $G$ under $\varphi$ in $H$. Conversely, if $N \trianglelefteq G$, we can find a group $H$
(namely, $G/N$) and a homomorphism $\pi : G \to H$ such that $\ker\pi = N$ (namely,
the natural projection). The study of homomorphic images of $G$ (i.e., the images of
homomorphisms from $G$ into other groups) is thus equivalent to the study of quotient
groups of $G$ and we shall use homomorphisms to produce normal subgroups and vice
versa.

¹The word “natural” has a precise mathematical meaning in the theory of categories; for our
purposes we use the term to indicate that the definition of this homomorphism is a “coordinate free”
projection, i.e., is described only in terms of the elements themselves, not in terms of generators for $G$
or $N$ (cf. Appendix II).

We developed the theory of quotient groups by way of homomorphisms rather than
simply defining the notion of a normal subgroup and its associated quotient group to
emphasize the fact that the elements of the quotient are subsets (the fibers or cosets of the
kernel $N$) of the original group $G$. The visualization in Figure 1 also emphasizes that $N$
(and its cosets) are projected (or collapsed) onto single elements in the quotient $G/N$.
Computations in the quotient group $G/N$ are performed by taking representatives from
the various cosets involved.

Some examples of normal subgroups and their associated quotients follow. 


Examples 

Let G be a group. 

(1) The subgroups $1$ and $G$ are always normal in $G$; $G/1 \cong G$ and $G/G \cong 1$.

(2) If $G$ is an abelian group, any subgroup $N$ of $G$ is normal because for all $g \in G$ and
all $n \in N$,

$gng^{-1} = gg^{-1}n = n \in N.$

Note that it is important that $G$ be abelian, not just that $N$ be abelian. The structure of
$G/N$ may vary as we take different subgroups $N$ of $G$. For instance, if $G = \mathbb{Z}$, then
every subgroup $N$ of $G$ is cyclic:

$N = \langle n \rangle = \langle -n \rangle = n\mathbb{Z}$, for some $n \in \mathbb{Z}$

and $G/N = \mathbb{Z}/n\mathbb{Z}$ is a cyclic group with generator $\bar{1} = 1 + n\mathbb{Z}$ (note that 1 is a
generator for $G$).

Suppose now that $G = Z_k$ is the cyclic group of order $k$. Let $x$ be a generator of
$G$ and let $N \le G$. By Proposition 2.6 $N = \langle x^d \rangle$, where $d$ is the smallest power of $x$
which lies in $N$. Now

$G/N = \{gN \mid g \in G\} = \{x^aN \mid a \in \mathbb{Z}\}$

and since $x^aN = (xN)^a$ (see Exercise 4 below), it follows that

$G/N = \langle xN \rangle$, i.e., $G/N$ is cyclic with $xN$ as a generator.

By Exercise 5 below, the order of $xN$ in $G/N$ equals $d$. By Proposition 2.5, $d = \frac{|G|}{|N|}$.

In summary,

quotient groups of a cyclic group are cyclic

and the image of a generator $g$ for $G$ is a generator $\bar{g}$ for the quotient. If in addition $G$
is a finite cyclic group and $N \le G$, then $|G/N| = \frac{|G|}{|N|}$ gives a formula for the order
of the quotient group.

(3) If $N \le Z(G)$, then $N \trianglelefteq G$ because for all $g \in G$ and all $n \in N$, $gng^{-1} = n \in N$,
generalizing the previous example (where the center $Z(G)$ is all of $G$). Thus, in
particular, $Z(G) \trianglelefteq G$. The subgroup $\langle -1 \rangle$ of $Q_8$ was previously seen to be the kernel
of a homomorphism but since $\langle -1 \rangle = Z(Q_8)$ we obtain normality of this subgroup
now in another fashion. We already saw that $Q_8/\langle -1 \rangle \cong V_4$. The discussion for $D_8$
in the next paragraph could be applied equally well to $Q_8$ to give an independent
identification of the isomorphism type of the quotient.

Let $G = D_8$ and let $Z = \langle r^2 \rangle = Z(D_8)$. Since $Z = \{1, r^2\}$, each coset, $gZ$,
consists of the two element set $\{g, gr^2\}$. Since these cosets partition the 8 elements of
$D_8$ into pairs, there must be 4 (disjoint) left cosets of $Z$ in $D_8$:

$\bar{1} = 1Z, \quad \bar{r} = rZ, \quad \bar{s} = sZ, \quad \text{and} \quad \overline{rs} = rsZ.$

Now by the classification of groups of order 4 (Exercise 10, Section 2.5) we know that
$D_8/Z(D_8) \cong Z_4$ or $V_4$. To determine which of these two is correct (i.e., determine
the isomorphism type of the quotient) simply observe that

$(\bar{r})^2 = r^2Z = 1Z = \bar{1}$

$(\bar{s})^2 = s^2Z = 1Z = \bar{1}$

$(\overline{rs})^2 = (rs)^2Z = 1Z = \bar{1}$

so every nonidentity element in $D_8/Z$ has order 2. In particular there is no element
of order 4 in the quotient, hence $D_8/Z$ is not cyclic so $D_8/Z(D_8) \cong V_4$.
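The identification of $D_8/Z(D_8)$ can be confirmed by direct enumeration. The following is a minimal sketch under assumptions not in the text: $D_8$ is realized as permutations of the vertices 0, 1, 2, 3 of a square, with `r` the rotation $i \mapsto i + 1 \pmod 4$ and `s` the reflection fixing vertices 0 and 2; the helper names are hypothetical.

```python
def compose(p, q):
    """Product pq of permutations given as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens):
    """Subgroup generated by `gens`, found by closing under products (finite case)."""
    identity = tuple(range(len(gens[0])))
    elems = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return elems

r = (1, 2, 3, 0)           # rotation of the square
s = (0, 3, 2, 1)           # reflection fixing vertices 0 and 2
identity = (0, 1, 2, 3)
D8 = generate([r, s])

# The center: elements commuting with everything in D8.
center = {z for z in D8 if all(compose(z, g) == compose(g, z) for g in D8)}

def coset(g):
    """The left coset gZ of the center."""
    return frozenset(compose(g, z) for z in center)

quotient = {coset(g) for g in D8}

# Every coset squares to the identity coset, so the quotient has no element of order 4.
squares_trivial = all(coset(compose(g, g)) == coset(identity) for g in D8)
```

The center is found to be $\{1, r^2\}$, there are four cosets, and every coset squares to $\bar{1}$, which matches the conclusion that the quotient is not cyclic.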


EXERCISES 


Let G and H be groups. 

1. Let $\varphi : G \to H$ be a homomorphism and let $E$ be a subgroup of $H$. Prove that $\varphi^{-1}(E) \le G$
(i.e., the preimage or pullback of a subgroup under a homomorphism is a subgroup). If
$E \trianglelefteq H$ prove that $\varphi^{-1}(E) \trianglelefteq G$. Deduce that $\ker\varphi \trianglelefteq G$.

2. Let $\varphi : G \to H$ be a homomorphism of groups with kernel $K$ and let $a, b \in \varphi(G)$.
Let $X \in G/K$ be the fiber above $a$ and let $Y$ be the fiber above $b$, i.e., $X = \varphi^{-1}(a)$,
$Y = \varphi^{-1}(b)$. Fix an element $u$ of $X$ (so $\varphi(u) = a$). Prove that if $XY = Z$ in the quotient
group $G/K$ and $w$ is any member of $Z$, then there is some $v \in Y$ such that $uv = w$. [Show
$u^{-1}w \in Y$.]

3. Let $A$ be an abelian group and let $B$ be a subgroup of $A$. Prove that $A/B$ is abelian. Give
an example of a non-abelian group $G$ containing a proper normal subgroup $N$ such that
$G/N$ is abelian.

4. Prove that in the quotient group $G/N$, $(gN)^a = g^aN$ for all $a \in \mathbb{Z}$.

5. Use the preceding exercise to prove that the order of the element $gN$ in $G/N$ is $n$, where
$n$ is the smallest positive integer such that $g^n \in N$ (and $gN$ has infinite order if no such
positive integer exists). Give an example to show that the order of $gN$ in $G/N$ may be
strictly smaller than the order of $g$ in $G$.

6. Define <p : -»■ (±1) by letting (p(x) be x divided by the absolute value of x . Describe 

the fibers of (p and prove that <p is a homomorphism. 

7. Define n : R 2 — ► R by tt( (jc, y) ) = x + y. Prove that n is a surjective homomorphism 
and describe the kernel and fibers of n geometrically. 

8. Let <p : R* ->■ R x be the map sending x to the absolute value of x. Prove that tp is a 
homomorphism and find the image of <p. Describe the kernel and the fibers of <p. 

9. Define tp : C x — > R* by <p(a + bi) = a 2 + b 2 . Prove that (p is a homomorphism and find 
the image of <p. Describe the kernel and the fibers of (p geometrically (as subsets of the 
plane). 
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10. Let (p : Z/8Z -*■ Z/4Z by <p(a) — a. Show that this is a well defined, suijective 
homomorphism and describe its fibers and kernel explicitly (showing that <p is well defined 
involves the fact that a has a different meaning in the domain and range of <p). 


11. Let $F$ be a field and let $G = \left\{ \begin{pmatrix} a & b \\ 0 & c \end{pmatrix} \;\middle|\; a, b, c \in F,\ ac \ne 0 \right\} \le GL_2(F)$.

(a) Prove that the map $\varphi : \begin{pmatrix} a & b \\ 0 & c \end{pmatrix} \mapsto a$ is a surjective homomorphism from $G$ onto
$F^{\times}$ (recall that $F^{\times}$ is the multiplicative group of nonzero elements in $F$). Describe
the fibers and kernel of $\varphi$.

(b) Prove that the map $\psi : \begin{pmatrix} a & b \\ 0 & c \end{pmatrix} \mapsto (a, c)$ is a surjective homomorphism from $G$
onto $F^{\times} \times F^{\times}$. Describe the fibers and kernel of $\psi$.

(c) Let $H = \left\{ \begin{pmatrix} 1 & b \\ 0 & 1 \end{pmatrix} \;\middle|\; b \in F \right\}$. Prove that $H$ is isomorphic to the additive group $F$.

12. Let $G$ be the additive group of real numbers, let $H$ be the multiplicative group of complex
numbers of absolute value 1 (the unit circle $S^1$ in the complex plane) and let $\varphi : G \to H$
be the homomorphism $\varphi : r \mapsto e^{2\pi i r}$. Draw the points on a real line which lie in the
kernel of $\varphi$. Describe similarly the elements in the fibers of $\varphi$ above the points $-1$, $i$, and
$e^{4\pi i/3}$ of $H$. (Figure 1 of the text for this homomorphism $\varphi$ is usually depicted using the
following diagram.)

[Diagram not reproduced.]

13. Repeat the preceding exercise with the map $\varphi$ replaced by the map $\varphi : r \mapsto e^{4\pi i r}$.

14. Consider the additive quotient group Q/Z. 

(a) Show that every coset of $\mathbb{Z}$ in $\mathbb{Q}$ contains exactly one representative $q \in \mathbb{Q}$ in the
range $0 \le q < 1$.

(b) Show that every element of $\mathbb{Q}/\mathbb{Z}$ has finite order but that there are elements of arbi-
trarily large order.

(c) Show that $\mathbb{Q}/\mathbb{Z}$ is the torsion subgroup of $\mathbb{R}/\mathbb{Z}$ (cf. Exercise 6, Section 2.1).

(d) Prove that $\mathbb{Q}/\mathbb{Z}$ is isomorphic to the multiplicative group of roots of unity in $\mathbb{C}^{\times}$.

15. Prove that a quotient of a divisible abelian group by any proper subgroup is also divisible.
Deduce that $\mathbb{Q}/\mathbb{Z}$ is divisible (cf. Exercise 19, Section 2.4).

16. Let $G$ be a group, let $N$ be a normal subgroup of $G$ and let $\bar{G} = G/N$. Prove that if
$G = \langle x, y \rangle$ then $\bar{G} = \langle \bar{x}, \bar{y} \rangle$. Prove more generally that if $G = \langle S \rangle$ for any subset $S$ of
$G$, then $\bar{G} = \langle \bar{S} \rangle$.

17. Let $G$ be the dihedral group of order 16 (whose lattice appears in Section 2.5):

$G = \langle r, s \mid r^8 = s^2 = 1,\ rs = sr^{-1} \rangle$

and let $\bar{G} = G/\langle r^4 \rangle$ be the quotient of $G$ by the subgroup generated by $r^4$ (this subgroup
is the center of $G$, hence is normal).

(a) Show that the order of $\bar{G}$ is 8.

(b) Exhibit each element of $\bar{G}$ in the form $\bar{s}^a \bar{r}^b$, for some integers $a$ and $b$.

(c) Find the order of each of the elements of $\bar{G}$ exhibited in (b).

(d) Write each of the following elements of $\bar{G}$ in the form $\bar{s}^a \bar{r}^b$, for some integers $a$ and
$b$ as in (b): $\overline{rs}$, $\overline{sr^{-2}s}$, $\overline{s^{-1}r^{-1}sr}$.

(e) Prove that $\bar{H} = \langle \bar{s}, \bar{r}^2 \rangle$ is a normal subgroup of $\bar{G}$ and $\bar{H}$ is isomorphic to the Klein
4-group. Describe the isomorphism type of the complete preimage of $\bar{H}$ in $G$.

(f) Find the center of $\bar{G}$ and describe the isomorphism type of $\bar{G}/Z(\bar{G})$.

18. Let $G$ be the quasidihedral group of order 16 (whose lattice was computed in Exercise 11
of Section 2.5):

$G = \langle \sigma, \tau \mid \sigma^8 = \tau^2 = 1,\ \sigma\tau = \tau\sigma^3 \rangle$

and let $\bar{G} = G/\langle \sigma^4 \rangle$ be the quotient of $G$ by the subgroup generated by $\sigma^4$ (this subgroup
is the center of $G$, hence is normal).

(a) Show that the order of $\bar{G}$ is 8.

(b) Exhibit each element of $\bar{G}$ in the form $\bar{\tau}^a \bar{\sigma}^b$, for some integers $a$ and $b$.

(c) Find the order of each of the elements of $\bar{G}$ exhibited in (b).

(d) Write each of the following elements of $\bar{G}$ in the form $\bar{\tau}^a \bar{\sigma}^b$, for some integers $a$ and
$b$ as in (b): $\overline{\sigma\tau}$, $\overline{\tau\sigma^{-2}\tau}$, $\overline{\tau^{-1}\sigma^{-1}\tau\sigma}$.

(e) Prove that $\bar{G} \cong D_8$.

19. Let $G$ be the modular group of order 16 (whose lattice was computed in Exercise 14 of
Section 2.5):

$G = \langle u, v \mid u^2 = v^8 = 1,\ vu = uv^5 \rangle$

and let $\bar{G} = G/\langle v^4 \rangle$ be the quotient of $G$ by the subgroup generated by $v^4$ (this subgroup
is contained in the center of $G$, hence is normal).

(a) Show that the order of $\bar{G}$ is 8.

(b) Exhibit each element of $\bar{G}$ in the form $\bar{u}^a \bar{v}^b$, for some integers $a$ and $b$.

(c) Find the order of each of the elements of $\bar{G}$ exhibited in (b).

(d) Write each of the following elements of $\bar{G}$ in the form $\bar{u}^a \bar{v}^b$, for some integers $a$ and
$b$ as in (b): $\overline{vu}$, $\overline{uv^{-2}u}$, $\overline{u^{-1}v^{-1}uv}$.

(e) Prove that $\bar{G}$ is abelian and is isomorphic to $Z_2 \times Z_4$.

20. Let $G = \mathbb{Z}/24\mathbb{Z}$ and let $\tilde{G} = G/\langle \overline{12} \rangle$, where for each integer $a$ we simplify notation by
writing $\tilde{\bar{a}}$ as $\tilde{a}$.

(a) Show that $\tilde{G} = \{\tilde{0}, \tilde{1}, \ldots, \widetilde{11}\}$.

(b) Find the order of each element of $\tilde{G}$.

(c) Prove that $\tilde{G} \cong \mathbb{Z}/12\mathbb{Z}$. (Thus $(\mathbb{Z}/24\mathbb{Z})/(12\mathbb{Z}/24\mathbb{Z}) \cong \mathbb{Z}/12\mathbb{Z}$, just as if we inverted
and cancelled the $24\mathbb{Z}$'s.)

21. Let $G = Z_4 \times Z_4$ be given in terms of the following generators and relations:

$G = \langle x, y \mid x^4 = y^4 = 1,\ xy = yx \rangle$.

Let $\bar{G} = G/\langle x^2y^2 \rangle$ (note that every subgroup of the abelian group $G$ is normal).

(a) Show that the order of $\bar{G}$ is 8.

(b) Exhibit each element of $\bar{G}$ in the form $\bar{x}^a \bar{y}^b$, for some integers $a$ and $b$.

(c) Find the order of each of the elements of $\bar{G}$ exhibited in (b).

(d) Prove that $\bar{G} \cong Z_4 \times Z_2$.

22. (a) Prove that if $H$ and $K$ are normal subgroups of a group $G$ then their intersection
$H \cap K$ is also a normal subgroup of $G$.

(b) Prove that the intersection of an arbitrary nonempty collection of normal subgroups
of a group is a normal subgroup (do not assume the collection is countable).

23. Prove that the join (cf. Section 2.5) of any nonempty collection of normal subgroups of a
group is a normal subgroup.

24. Prove that if $N \trianglelefteq G$ and $H$ is any subgroup of $G$ then $N \cap H \trianglelefteq H$.

25. (a) Prove that a subgroup $N$ of $G$ is normal if and only if $gNg^{-1} \subseteq N$ for all $g \in G$.

(b) Let $G = GL_2(\mathbb{Q})$, let $N$ be the subgroup of upper triangular matrices with integer
entries and 1's on the diagonal, and let $g$ be the diagonal matrix with entries 2, 1. Show
that $gNg^{-1} \subseteq N$ but $g$ does not normalize $N$.

26. Let a, b e G. 

(a) Prove that the conjugate of the product of a and b is the product of the conjugate of 
a and the conjugate of b. Prove that the order of a and the order of any conjugate of 
a are the same. 

(b) Prove that the conjugate of $a^{-1}$ is the inverse of the conjugate of $a$.

(c) Let $N = \langle S \rangle$ for some subset $S$ of $G$. Prove that $N \trianglelefteq G$ if $gSg^{-1} \subseteq N$ for all $g \in G$.

(d) Deduce that if $N$ is the cyclic group $\langle x \rangle$, then $N$ is normal in $G$ if and only if for each
$g \in G$, $gxg^{-1} = x^k$ for some $k \in \mathbb{Z}$.

(e) Let n be a positive integer. Prove that the subgroup N of G generated by all the 
elements of G of order n is a normal subgroup of G. 

27. Let $N$ be a finite subgroup of a group $G$. Show that $gNg^{-1} \subseteq N$ if and only if $gNg^{-1} = N$.
Deduce that $N_G(N) = \{g \in G \mid gNg^{-1} \subseteq N\}$.

28. Let $N$ be a finite subgroup of a group $G$ and assume $N = \langle S \rangle$ for some subset $S$ of $G$.
Prove that an element $g \in G$ normalizes $N$ if and only if $gSg^{-1} \subseteq N$.

29. Let $N$ be a finite subgroup of $G$ and suppose $G = \langle T \rangle$ and $N = \langle S \rangle$ for some subsets $S$
and $T$ of $G$. Prove that $N$ is normal in $G$ if and only if $tSt^{-1} \subseteq N$ for all $t \in T$.

30. Let $N \le G$ and let $g \in G$. Prove that $gN = Ng$ if and only if $g \in N_G(N)$.

31. Prove that if $H \le G$ and $N$ is a normal subgroup of $H$ then $H \le N_G(N)$. Deduce that
$N_G(N)$ is the largest subgroup of $G$ in which $N$ is normal (i.e., is the join of all subgroups
$H$ for which $N \trianglelefteq H$).

32. Prove that every subgroup of $Q_8$ is normal. For each subgroup find the isomorphism type
of its corresponding quotient. [You may use the lattice of subgroups for $Q_8$ in Section
2.5.]

33. Find all normal subgroups of $D_8$ and for each of these find the isomorphism type of its
corresponding quotient. [You may use the lattice of subgroups for $D_8$ in Section 2.5.]

34. Let $D_{2n} = \langle r, s \mid r^n = s^2 = 1,\ rs = sr^{-1} \rangle$ be the usual presentation of the dihedral
group of order $2n$ and let $k$ be a positive integer dividing $n$.

(a) Prove that $\langle r^k \rangle$ is a normal subgroup of $D_{2n}$.

(b) Prove that $D_{2n}/\langle r^k \rangle \cong D_{2k}$.

35. Prove that $SL_n(F) \trianglelefteq GL_n(F)$ and describe the isomorphism type of the quotient group
(cf. Exercise 9, Section 2.1).

36. Prove that if $G/Z(G)$ is cyclic then $G$ is abelian. [If $G/Z(G)$ is cyclic with generator
$xZ(G)$, show that every element of $G$ can be written in the form $x^a z$ for some integer
$a \in \mathbb{Z}$ and some element $z \in Z(G)$.]

37. Let $A$ and $B$ be groups. Show that $\{(a, 1) \mid a \in A\}$ is a normal subgroup of $A \times B$ and the
quotient of $A \times B$ by this subgroup is isomorphic to $B$.

38. Let $A$ be an abelian group and let $D$ be the (diagonal) subgroup $\{(a, a) \mid a \in A\}$ of $A \times A$.
Prove that $D$ is a normal subgroup of $A \times A$ and $(A \times A)/D \cong A$.

39. Suppose $A$ is the non-abelian group $S_3$ and $D$ is the diagonal subgroup
$\{(a, a) \mid a \in A\}$ of $A \times A$. Prove that $D$ is not normal in $A \times A$.

40. Let $G$ be a group, let $N$ be a normal subgroup of $G$ and let $\bar{G} = G/N$. Prove that $\bar{x}$ and
$\bar{y}$ commute in $\bar{G}$ if and only if $x^{-1}y^{-1}xy \in N$. (The element $x^{-1}y^{-1}xy$ is called the
commutator of $x$ and $y$ and is denoted by $[x, y]$.)

41. Let $G$ be a group. Prove that $N = \langle x^{-1}y^{-1}xy \mid x, y \in G \rangle$ is a normal subgroup of $G$ and
$G/N$ is abelian ($N$ is called the commutator subgroup of $G$).

42. Assume both $H$ and $K$ are normal subgroups of $G$ with $H \cap K = 1$. Prove that $xy = yx$
for all $x \in H$ and $y \in K$. [Show $x^{-1}y^{-1}xy \in H \cap K$.]

43. Assume $\mathcal{P} = \{A_i \mid i \in I\}$ is any partition of $G$ with the property that $\mathcal{P}$ is a group under
the “quotient operation” defined as follows: to compute the product of $A_i$ with $A_j$ take any
element $a_i$ of $A_i$ and any element $a_j$ of $A_j$ and let $A_iA_j$ be the element of $\mathcal{P}$ containing
$a_ia_j$ (this operation is assumed to be well defined). Prove that the element of $\mathcal{P}$ that contains
the identity of $G$ is a normal subgroup of $G$ and the elements of $\mathcal{P}$ are the cosets of this
subgroup (so $\mathcal{P}$ is just a quotient group of $G$ in the usual sense).


3.2 MORE ON COSETS AND LAGRANGE’S THEOREM 


In this section we continue the study of quotient groups. Since for finite groups one
of the most important invariants of a group is its order we first prove that the order of
a quotient group of a finite group can be readily computed: $|G/N| = \frac{|G|}{|N|}$. In fact
we derive this as a consequence of a more general result, Lagrange’s Theorem (see
Exercise 19, Section 1.7). This theorem is one of the most important combinatorial
results in finite group theory and will be used repeatedly. After indicating some easy
consequences of Lagrange’s Theorem we study more subtle questions concerning cosets
of non-normal subgroups.

The proof of Lagrange’s Theorem is straightforward and important. It is the same
line of reasoning we used in Example 3 of the preceding section to compute $|D_8/Z(D_8)|$.


Theorem 8. (Lagrange's Theorem) If $G$ is a finite group and $H$ is a subgroup of $G$,
then the order of $H$ divides the order of $G$ (i.e., $|H| \mid |G|$) and the number of left cosets
of $H$ in $G$ equals $\frac{|G|}{|H|}$.


Proof: Let $|H| = n$ and let the number of left cosets of $H$ in $G$ equal $k$. By
Proposition 4 the set of left cosets of $H$ in $G$ partition $G$. By definition of a left coset
the map

$H \to gH$ defined by $h \mapsto gh$

is a surjection from $H$ to the left coset $gH$. The left cancellation law implies this map
is injective since $gh_1 = gh_2$ implies $h_1 = h_2$. This proves that $H$ and $gH$ have the
same order:

$|gH| = |H| = n$.

Since $G$ is partitioned into $k$ disjoint subsets each of which has cardinality $n$, $|G| = kn$.
Thus $k = \frac{|G|}{n} = \frac{|G|}{|H|}$, completing the proof.
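
The counting argument in this proof can be illustrated on a small case. The sketch below is only an illustration under stated assumptions: it models $S_3$ as permutations of $\{0, 1, 2\}$, takes $H$ to be the subgroup generated by one transposition, and uses a hypothetical helper `compose`. It checks that the left cosets all have $|H|$ elements, are pairwise disjoint, and cover $G$, so that $|G| = kn$.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

G = set(permutations(range(3)))        # S_3 acting on {0, 1, 2}
H = {(0, 1, 2), (1, 0, 2)}             # subgroup generated by one transposition

# The map h -> gh is a bijection H -> gH, so every coset has |H| elements.
left_cosets = {frozenset(compose(g, h) for h in H) for g in G}

sizes = {len(c) for c in left_cosets}                 # all coset sizes
covered = set().union(*left_cosets)                   # union of the cosets
total = sum(len(c) for c in left_cosets)              # equals |G| iff disjoint

k, n = len(left_cosets), len(H)
print(k, n, k * n == len(G))
```

Here `k` is the number of cosets and `n` the common coset size; the final comparison is the identity $|G| = kn$ from the proof.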


Definition. If $G$ is a group (possibly infinite) and $H \le G$, the number of left cosets
of $H$ in $G$ is called the index of $H$ in $G$ and is denoted by $|G : H|$.


In the case of finite groups the index of $H$ in $G$ is $\frac{|G|}{|H|}$. For $G$ an infinite group
the quotient $\frac{|G|}{|H|}$ does not make sense. Infinite groups may have subgroups of finite
or infinite index (e.g., $\{0\}$ is of infinite index in $\mathbb{Z}$ and $\langle n \rangle$ is of index $n$ in $\mathbb{Z}$ for every
$n > 0$).

We now derive some easy consequences of Lagrange’s Theorem. 


Corollary 9. If $G$ is a finite group and $x \in G$, then the order of $x$ divides the order of
$G$. In particular $x^{|G|} = 1$ for all $x$ in $G$.


Proof: By Proposition 2.2, $|x| = |\langle x \rangle|$. The first part of the corollary follows from
Lagrange’s Theorem applied to $H = \langle x \rangle$. The second statement is clear since now $|G|$
is a multiple of the order of $x$.


Corollary 10. If $G$ is a group of prime order $p$, then $G$ is cyclic, hence $G \cong Z_p$.

Proof: Let $x \in G$, $x \ne 1$. Thus $|\langle x \rangle| > 1$ and $|\langle x \rangle|$ divides $|G|$. Since $|G|$
is prime we must have $|\langle x \rangle| = |G|$, hence $G = \langle x \rangle$ is cyclic (with any nonidentity
element $x$ as generator). Theorem 2.4 completes the proof.

With Lagrange’s Theorem in hand we examine some additional examples of normal 
subgroups. 

Examples 

(1) Let $H = \langle (1\,2\,3) \rangle \le S_3$ and let $G = S_3$. We show $H \trianglelefteq S_3$. As noted in Section 2.2,

$H \le N_G(H) \le G$.

By Lagrange’s Theorem, the order of $H$ divides the order of $N_G(H)$ and the order
of $N_G(H)$ divides the order of $G$. Since $G$ has order 6 and $H$ has order 3, the only
possibilities for $N_G(H)$ are $H$ or $G$. A direct computation gives

$(1\,2)(1\,2\,3)(1\,2) = (1\,3\,2) = (1\,2\,3)^{-1}$.

Since $(1\,2) = (1\,2)^{-1}$, this calculation shows that $(1\,2)$ conjugates a generator of $H$ to
another generator of $H$. By Exercise 24 of Section 2.3 this is sufficient to prove that
$(1\,2) \in N_G(H)$. Thus $N_G(H) \ne H$ so $N_G(H) = G$, i.e., $H \trianglelefteq S_3$, as claimed. This
argument illustrates that checking normality of a subgroup can often be reduced to a
small number of calculations. A generalization of this example is given in the next
example.

(2) Let $G$ be any group containing a subgroup $H$ of index 2. We prove $H \trianglelefteq G$. Let
$g \in G - H$ so, by hypothesis, the two left cosets of $H$ in $G$ are $1H$ and $gH$. Since
$1H = H$ and the cosets partition $G$, we must have $gH = G - H$. Now the two right
cosets of $H$ in $G$ are $H1$ and $Hg$. Since $H1 = H$, we again must have $Hg = G - H$.
Combining these gives $gH = Hg$, so every left coset of $H$ in $G$ is a right coset. By
Theorem 6, $H \trianglelefteq G$. By definition of index, $|G/H| = 2$, so that $G/H \cong Z_2$. One
must be careful to appreciate that the reason $H$ is normal in this case is not because we
can choose the same coset representatives 1 and $g$ for both the left and right cosets of
$H$ but that there is a type of pigeon-hole principle at work: since $1H = H = H1$ for
any subgroup $H$ of any group $G$, the index assumption forces the remaining elements
to comprise the remaining coset (either left or right). We shall see that this result is
itself a special case of a result we shall prove in the next chapter.

Note that this result proves that $\langle i \rangle$, $\langle j \rangle$ and $\langle k \rangle$ are normal subgroups of $Q_8$
and that $\langle s, r^2 \rangle$, $\langle r \rangle$ and $\langle sr, r^2 \rangle$ are normal subgroups of $D_8$.

(3) The property “is a normal subgroup of” is not transitive. For example,

$\langle s \rangle \trianglelefteq \langle s, r^2 \rangle \trianglelefteq D_8$

(each subgroup is of index 2 in the next), however, $\langle s \rangle$ is not normal in $D_8$ because
$rsr^{-1} = sr^2 \notin \langle s \rangle$.
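
The failure of transitivity in Example 3 can be confirmed by brute force. The following sketch is an illustration under assumptions that are not part of the text: $D_8$ is modelled as permutations of the vertices 0, 1, 2, 3 of a square, and `compose`, `inverse`, `generate` and `is_normal` are hypothetical helper names.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Return the subgroup generated by gens, as a set of tuples."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

def is_normal(N, G):
    """True if g n g^{-1} lies in N for every g in G and n in N."""
    return all(compose(compose(g, n), inverse(g)) in N for g in G for n in N)

# Assumed model of D8: r is a rotation, s a reflection, with rs = sr^{-1}.
r = (1, 2, 3, 0)
s = (0, 3, 2, 1)
r2 = compose(r, r)

D8 = generate([r, s])
A = generate([s])          # <s>, order 2
B = generate([s, r2])      # <s, r^2>, order 4

print(is_normal(A, B), is_normal(B, D8), is_normal(A, D8))
```

In this model the first two checks succeed and the third fails, which matches the chain $\langle s \rangle \trianglelefteq \langle s, r^2 \rangle \trianglelefteq D_8$ with $\langle s \rangle$ not normal in $D_8$.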

We now examine some examples of non-normal subgroups. Although in abelian 
groups every subgroup is normal, this is not the case in non-abelian groups (in some 
sense $Q_8$ is the unique exception to this). In fact, there are groups $G$ in which the
only normal subgroups are the trivial ones: 1 and G. Such groups are called simple 
groups (simple does not mean easy, however). Simple groups play an important role 
in the study of general groups and this role will be described in Section 4. For now 
we emphasize that not every subgroup of a group G is normal in G; indeed, normal 
subgroups may be quite rare in G. The search for normal subgroups of a given group 
is in general a highly nontrivial problem. 

Examples 

(1) Let $H = \langle (1\,2) \rangle \le S_3$. Since $H$ is of prime index 3 in $S_3$, by Lagrange’s Theorem
the only possibilities for $N_{S_3}(H)$ are $H$ or $S_3$. Direct computation shows

$(1\,3)(1\,2)(1\,3)^{-1} = (1\,3)(1\,2)(1\,3) = (2\,3) \notin H$

so $N_{S_3}(H) \ne S_3$, that is, $H$ is not a normal subgroup of $S_3$. One can also see this by
considering the left and right cosets of $H$; for instance

$(1\,3)H = \{(1\,3), (1\,2\,3)\}$ and $H(1\,3) = \{(1\,3), (1\,3\,2)\}$.

Since the left coset $(1\,3)H$ is the unique left coset of $H$ containing $(1\,3)$, the right
coset $H(1\,3)$ cannot be a left coset (see also Exercise 6). Note also that the “group
operation” on the left cosets of $H$ in $S_3$ defined by multiplying representatives is not
even well defined. For example, consider the product of the two left cosets $1H$ and $(1\,3)H$.
The elements 1 and $(1\,2)$ are both representatives for the coset $1H$, yet $1 \cdot (1\,3) = (1\,3)$
and $(1\,2) \cdot (1\,3) = (1\,3\,2)$ are not both elements of the same left coset as they should be if
the product of these cosets were independent of the particular representatives chosen. This
is an example of Theorem 6 which states that the cosets of a subgroup form a group only
when the subgroup is a normal subgroup.

(2) Let $G = S_n$ for some $n \in \mathbb{Z}^+$ and fix some $i \in \{1, 2, \ldots, n\}$. As in Section 2.2 let

$G_i = \{\sigma \in G \mid \sigma(i) = i\}$

be the stabilizer of the point $i$. Suppose $\tau \in G$ and $\tau(i) = j$. It follows directly
from the definition of $G_i$ that for all $\sigma \in G_i$, $\tau\sigma(i) = j$. Furthermore, if $\mu \in G$ and
$\mu(i) = j$, then $\tau^{-1}\mu(i) = i$, that is, $\tau^{-1}\mu \in G_i$, so $\mu \in \tau G_i$. This proves that

$\tau G_i = \{\mu \in G \mid \mu(i) = j\}$,

i.e., the left coset $\tau G_i$ consists of the permutations in $S_n$ which take $i$ to $j$. We can
clearly see that distinct left cosets have empty intersection and that the number of
distinct left cosets equals the number of distinct images of the integer $i$ under the
action of $G$, namely there are $n$ distinct left cosets. Thus $|G : G_i| = n$. Using the
same notation let $k = \tau^{-1}(i)$, so that $\tau(k) = i$. By similar reasoning we see that

$G_i\tau = \{\lambda \in G \mid \lambda(k) = i\}$,

i.e., the right coset $G_i\tau$ consists of the permutations in $S_n$ which take $k$ to $i$. If $n > 2$, for
some nonidentity element $\tau$ we have $\tau G_i \ne G_i\tau$ since there are certainly permutations
which take $i$ to $j$ but do not take $k$ to $i$. Thus $G_i$ is not a normal subgroup. In fact
$N_G(G_i) = G_i$ by Exercise 30 of Section 1, so $G_i$ is in some sense far from being
normal in $S_n$. This example generalizes the preceding one.

(3) In $D_8$ the only subgroup of order 2 which is normal is the center $\langle r^2 \rangle$.
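
Examples 1 and 2 above lend themselves to a direct check. The sketch below is an illustration only, with assumptions that are not in the text: points are labelled from 0 rather than 1 (so the book's $(1\,2)$ and $(1\,3)$ become the swaps of 0, 1 and of 0, 2), the second part fixes $n = 4$ and $i = 0$, and all function names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

# Example 1: H = <(1 2)> in S_3 and the cosets of t = (1 3), 0-indexed.
H = {(0, 1, 2), (1, 0, 2)}
t = (2, 1, 0)
left = {compose(t, h) for h in H}      # the left coset tH
right = {compose(h, t) for h in H}     # the right coset Ht
print(left == right)

# Example 2: the stabilizer G_i of the point i in S_n (here n = 4, i = 0).
n, i = 4, 0
G = set(permutations(range(n)))
G_i = {sigma for sigma in G if sigma[i] == i}

def left_coset(tau):
    return frozenset(compose(tau, sigma) for sigma in G_i)

def right_coset(tau):
    return frozenset(compose(sigma, tau) for sigma in G_i)

# tau G_i should be exactly the permutations sending i to tau(i).
left_ok = all(left_coset(tau) == {mu for mu in G if mu[i] == tau[i]}
              for tau in G)
# G_i tau should be exactly the permutations sending k = tau^{-1}(i) to i.
right_ok = all(right_coset(tau) == {lam for lam in G if lam[inverse(tau)[i]] == i}
               for tau in G)
num_left = len({left_coset(tau) for tau in G})
# By Exercise 30 of Section 1, g normalizes G_i exactly when g G_i = G_i g.
normalizer = {g for g in G if left_coset(g) == right_coset(g)}
print(left_ok, right_ok, num_left, normalizer == G_i)
```

The first part exhibits a left coset of $H$ that differs from the right coset with the same representative; the second part checks the descriptions of $\tau G_i$ and $G_i\tau$, the index $|G : G_i| = n$, and $N_G(G_i) = G_i$ in the chosen case.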


We shall see many more examples of non-normal subgroups as we develop the
theory.


The full converse to Lagrange's Theorem is not true: namely, if $G$ is a finite group
and $n$ divides $|G|$, then $G$ need not have a subgroup of order $n$. For example, let $A$ be the
group of symmetries of a regular tetrahedron. By Exercise 9 of Section 1.2, $|A| = 12$.
Suppose $A$ had a subgroup $H$ of order 6. Since $\frac{|A|}{|H|} = 2$, $H$ would be of index 2 in
$A$, hence $H \trianglelefteq A$ and $A/H \cong Z_2$. Since the quotient group has order 2, the square of
every element in the quotient is the identity, so for all $g \in A$, $(gH)^2 = 1H$, that is, for
all $g \in A$, $g^2 \in H$. If $g$ is an element of $A$ of order 3, we obtain $g = (g^2)^2 \in H$, that
is, $H$ must contain all elements of $A$ of order 3. This is a contradiction since $|H| = 6$
but one can easily exhibit 8 rotations of a tetrahedron of order 3.
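
The counterexample can also be verified exhaustively. The sketch below rests on an assumption not made in the text: the rotation group of the tetrahedron is modelled as the group of even permutations of its 4 vertices. The helper names are hypothetical. It tests every subset containing the identity for closure and records the orders of the subgroups found.

```python
from itertools import combinations, permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def is_even(p):
    """True if p has an even number of inversions."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return inversions % 2 == 0

# Assumed model: the 12 even permutations of the 4 vertices.
A = [p for p in permutations(range(4)) if is_even(p)]
identity = tuple(range(4))
others = [p for p in A if p != identity]

def is_closed(subset):
    """A finite subset containing 1 is a subgroup iff it is closed under products."""
    return all(compose(a, b) in subset for a in subset for b in subset)

subgroup_orders = set()
for size in range(len(others) + 1):
    for rest in combinations(others, size):
        candidate = set(rest) | {identity}
        if is_closed(candidate):
            subgroup_orders.add(len(candidate))

order_three = [p for p in others if compose(p, compose(p, p)) == identity]
print(sorted(subgroup_orders), len(order_three))
```

In this model the search finds no subgroup of order 6 although 6 divides 12, and it counts 8 elements of order 3, the figure used in the contradiction above.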

There are some partial converses to Lagrange’s Theorem. For finite abelian groups
the full converse of Lagrange is true, namely a finite abelian group $G$ has a subgroup of order
$n$ for each divisor $n$ of $|G|$ (in fact, this holds under weaker assumptions than “abelian”;
we shall see this in Chapter 6). A partial converse which holds for arbitrary finite groups
is the following result:

Theorem 11. (Cauchy's Theorem) If $G$ is a finite group and $p$ is a prime dividing $|G|$,
then $G$ has an element of order $p$.

Proof: We shall give a proof of this in the next chapter and another elegant proof 
is outlined in Exercise 9. 

The strongest converse to Lagrange's Theorem which applies to arbitrary finite 
groups is the following: 

Theorem 12. (Sylow) If $G$ is a finite group of order $p^{\alpha}m$, where $p$ is a prime and $p$
does not divide $m$, then $G$ has a subgroup of order $p^{\alpha}$.

We shall prove this theorem in the next chapter and derive more information on the
number of subgroups of order $p^{\alpha}$.

We conclude this section with some useful results involving cosets. 

Definition. Let $H$ and $K$ be subgroups of a group and define

$HK = \{hk \mid h \in H,\ k \in K\}$.


Proposition 13. If $H$ and $K$ are finite subgroups of a group then

$|HK| = \frac{|H|\,|K|}{|H \cap K|}$.


Proof: Notice that $HK$ is a union of left cosets of $K$, namely,

$HK = \bigcup_{h \in H} hK$.

Since each coset of $K$ has $|K|$ elements it suffices to find the number of distinct left
cosets of the form $hK$, $h \in H$. But $h_1K = h_2K$ for $h_1, h_2 \in H$ if and only if
$h_2^{-1}h_1 \in K$. Thus

$h_1K = h_2K \iff h_2^{-1}h_1 \in H \cap K \iff h_1(H \cap K) = h_2(H \cap K)$.

Thus the number of distinct cosets of the form $hK$, for $h \in H$ is the number of distinct
cosets $h(H \cap K)$, for $h \in H$. The latter number, by Lagrange’s Theorem, equals
$\frac{|H|}{|H \cap K|}$. Thus $HK$ consists of $\frac{|H|}{|H \cap K|}$ distinct cosets of $K$ (each of which has $|K|$
elements) which gives the formula above.


Notice that there was no assumption that $HK$ be a subgroup in Proposition 13.
For example, if $G = S_3$, $H = \langle (1\,2) \rangle$ and $K = \langle (2\,3) \rangle$, then $|H| = |K| = 2$ and
$|H \cap K| = 1$, so $|HK| = 4$. By Lagrange’s Theorem $HK$ cannot be a subgroup. As a
consequence, we must have $S_3 = \langle (1\,2), (2\,3) \rangle$.
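
The example above is small enough to compute directly. The following sketch is an illustration only; it labels points from 0 (so the book's $(1\,2)$ and $(2\,3)$ become the swaps of 0, 1 and of 1, 2), and `compose` and `generate` are hypothetical helper names.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens):
    """Return the subgroup generated by gens, as a set of tuples."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

a = (1, 0, 2)              # the transposition (1 2), 0-indexed
b = (0, 2, 1)              # the transposition (2 3), 0-indexed
H, K = generate([a]), generate([b])

HK = {compose(h, k) for h in H for k in K}
formula = len(H) * len(K) // len(H & K)        # |H||K| / |H ∩ K|
closed = all(compose(x, y) in HK for x in HK for y in HK)

print(len(HK), formula, closed, len(generate([a, b])))
```

The set $HK$ has the 4 elements predicted by Proposition 13 but is not closed under multiplication, while the subgroup generated by the two transpositions has order 6.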
Proposition 14. If $H$ and $K$ are subgroups of a group, $HK$ is a subgroup if and only
if $HK = KH$.


Proof: Assume first that $HK = KH$ and let $a, b \in HK$. We prove $ab^{-1} \in HK$
so $HK$ is a subgroup by the subgroup criterion. Let

$a = h_1k_1$ and $b = h_2k_2$

for some $h_1, h_2 \in H$ and $k_1, k_2 \in K$. Thus $b^{-1} = k_2^{-1}h_2^{-1}$, so $ab^{-1} = h_1k_1k_2^{-1}h_2^{-1}$.
Let $k_3 = k_1k_2^{-1} \in K$ and $h_3 = h_2^{-1}$. Thus $ab^{-1} = h_1k_3h_3$. Since $HK = KH$,

$k_3h_3 = h_4k_4$, for some $h_4 \in H$, $k_4 \in K$.

Thus $ab^{-1} = h_1h_4k_4$, and since $h_1h_4 \in H$, $k_4 \in K$, we obtain $ab^{-1} \in HK$, as desired.

Conversely, assume that $HK$ is a subgroup of $G$. Since $K \le HK$ and $H \le HK$,
by the closure property of subgroups, $KH \subseteq HK$. To show the reverse containment
let $hk \in HK$. Since $HK$ is assumed to be a subgroup, write $hk = a^{-1}$, for some
$a \in HK$. If $a = h_1k_1$, then

$hk = (h_1k_1)^{-1} = k_1^{-1}h_1^{-1} \in KH$,

completing the proof.

Note that $HK = KH$ does not imply that the elements of $H$ commute with those
of $K$ (contrary to what the notation may suggest) but rather that every product $hk$ is of
the form $k'h'$ ($h$ need not be $h'$ nor $k$ be $k'$) and conversely. For example, if $G = D_{2n}$,
$H = \langle r \rangle$ and $K = \langle s \rangle$, then $G = HK = KH$ so that $HK$ is a subgroup and
$rs = sr^{-1}$ so the elements of $H$ do not commute with the elements of $K$. This is an
example of the following sufficient condition for $HK$ to be a subgroup:

Corollary 15. If $H$ and $K$ are subgroups of $G$ and $H \le N_G(K)$, then $HK$ is a subgroup
of $G$. In particular, if $K \trianglelefteq G$ then $HK \le G$ for any $H \le G$.

Proof: We prove $HK = KH$. Let $h \in H$, $k \in K$. By assumption, $hkh^{-1} \in K$,
hence

$hk = (hkh^{-1})h \in KH$.

This proves $HK \subseteq KH$. Similarly, $kh = h(h^{-1}kh) \in HK$, proving the reverse
containment. The corollary follows now from the preceding proposition.

Definition. If $A$ is any subset of $N_G(K)$ (or $C_G(K)$), we shall say $A$ normalizes $K$
(centralizes $K$, respectively).

With this terminology, Corollary 15 states that $HK$ is a subgroup if $H$ normalizes
$K$ (similarly, $HK$ is a subgroup if $K$ normalizes $H$).

In some instances one can prove that a finite group is a product of two of its
subgroups by simply using the order formula in Proposition 13. For example, let
$G = S_4$, $H = D_8$ and let $K = \langle (1\,2\,3) \rangle$, where we consider $D_8$ as a subgroup of
$S_4$ by identifying each symmetry with its permutation on the 4 vertices of a square
(under some fixed labelling). By Lagrange’s Theorem, $H \cap K = 1$ (see Exercise 8).
Proposition 13 then shows $|HK| = 24$ hence we must have $HK = S_4$. Since $HK$ is
a group, $HK = KH$. We leave as an exercise the verification that neither $H$ nor $K$
normalizes the other (so Corollary 15 could not have been used to give $HK = KH$).
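
The claims in this example can be checked numerically. The sketch below is an illustration under assumptions not fixed by the text: the square's vertices are labelled 0, 1, 2, 3 in cyclic order, $K$ is generated by the 3-cycle on the points 0, 1, 2, and the helper names are hypothetical.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Return the subgroup generated by gens, as a set of tuples."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

def conjugate(g, X):
    """Return the set g X g^{-1}."""
    return {compose(compose(g, x), inverse(g)) for x in X}

def normalizes(A, X):
    """True if every element of A conjugates X onto itself."""
    return all(conjugate(a, X) == X for a in A)

H = generate([(1, 2, 3, 0), (0, 3, 2, 1)])     # assumed copy of D8 in S4
K = generate([(1, 2, 0, 3)])                   # generated by a 3-cycle

HK = {compose(h, k) for h in H for k in K}
KH = {compose(k, h) for k in K for h in H}

print(len(H & K), len(HK), HK == KH, normalizes(H, K), normalizes(K, H))
```

With this labelling the intersection is trivial, $HK$ has all 24 elements of $S_4$ and equals $KH$, yet neither subgroup normalizes the other, so the equality $HK = KH$ here does not come from Corollary 15.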

Finally, throughout this chapter we have worked with left cosets of a subgroup. 
The same combinatorial results could equally well have been proved using right cosets. 
For normal subgroups this is trivial since left and right cosets are the same, but for non- 
normal subgroups some left cosets are not right cosets (for any choice of representative) 
so some (simple) verifications are necessary. For example, Lagrange’s Theorem gives
that in a finite group $G$

the number of right cosets of the subgroup $H$ is $\frac{|G|}{|H|}$.

Thus in a finite group the number of left cosets of H in G equals the number of right 
cosets even though the left cosets are not right cosets in general. This is also true for 
infinite groups as Exercise 12 below shows. Thus for purely combinatorial purposes 
one may use either left or right cosets (but not a mixture when a partition of G is 
needed). Our consistent use of left cosets is somewhat arbitrary although it will have 
some benefits when we discuss actions on cosets in the next chapter. Readers may 
encounter in some works the notation H\G to denote the set of right cosets of H in G. 

In some papers one may also see the notation G/H used to denote the set of left 
cosets of H in G even when H is not normal in G (in which case G/H is called the 
coset space of left cosets of H in G). We shall not use this notation. 

EXERCISES 


Let G be a group. 

1. Which of the following are permissible orders for subgroups of a group of order 120: 1, 
2, 5, 7, 9, 15, 60, 240? For each permissible order give the corresponding index. 

2. Prove that the lattice of subgroups of $S_3$ in Section 2.5 is correct (i.e., prove that it contains
all subgroups of $S_3$ and that their pairwise joins and intersections are correctly drawn).

3. Prove that the lattice of subgroups of $Q_8$ in Section 2.5 is correct.

4. Show that if $|G| = pq$ for some primes $p$ and $q$ (not necessarily distinct) then either $G$ is
abelian or $Z(G) = 1$. [See Exercise 36 in Section 1.]

5. Let $H$ be a subgroup of $G$ and fix some element $g \in G$.

(a) Prove that $gHg^{-1}$ is a subgroup of $G$ of the same order as $H$.

(b) Deduce that if $n \in \mathbb{Z}^+$ and $H$ is the unique subgroup of $G$ of order $n$ then $H \trianglelefteq G$.

6. Let $H \le G$ and let $g \in G$. Prove that if the right coset $Hg$ equals some left coset of $H$ in
$G$ then it equals the left coset $gH$ and $g$ must be in $N_G(H)$.

7. Let $H \le G$ and define a relation $\sim$ on $G$ by $a \sim b$ if and only if $b^{-1}a \in H$. Prove
that $\sim$ is an equivalence relation and describe the equivalence class of each $a \in G$. Use
this to prove Proposition 4.

8. Prove that if $H$ and $K$ are finite subgroups of $G$ whose orders are relatively prime then
$H \cap K = 1$.

9. This exercise outlines a proof of Cauchy’s Theorem due to James McKay (Another proof
of Cauchy's group theorem, Amer. Math. Monthly, 66(1959), p. 119). Let $G$ be a finite
group and let $p$ be a prime dividing $|G|$. Let $S$ denote the set of $p$-tuples of elements of
$G$ the product of whose coordinates is 1:

$S = \{(x_1, x_2, \ldots, x_p) \mid x_i \in G \text{ and } x_1x_2 \cdots x_p = 1\}$.

(a) Show that $S$ has $|G|^{p-1}$ elements, hence has order divisible by $p$.

Define the relation $\sim$ on $S$ by letting $\alpha \sim \beta$ if $\beta$ is a cyclic permutation of $\alpha$.

(b) Show that a cyclic permutation of an element of $S$ is again an element of $S$.

(c) Prove that $\sim$ is an equivalence relation on $S$.

(d) Prove that an equivalence class contains a single element if and only if it is of the
form $(x, x, \ldots, x)$ with $x^p = 1$.

(e) Prove that every equivalence class has order 1 or $p$ (this uses the fact that $p$ is a prime).
Deduce that $|G|^{p-1} = k + pd$, where $k$ is the number of classes of size 1 and $d$ is the
number of classes of size $p$.

(f) Since $\{(1, 1, \ldots, 1)\}$ is an equivalence class of size 1, conclude from (e) that there
must be a nonidentity element $x$ in $G$ with $x^p = 1$, i.e., $G$ contains an element of
order $p$. [Show $p \mid k$ and so $k > 1$.]

10. Suppose $H$ and $K$ are subgroups of finite index in the (possibly infinite) group $G$ with
$|G : H| = m$ and $|G : K| = n$. Prove that $\text{l.c.m.}(m, n) \le |G : H \cap K| \le mn$. Deduce
that if $m$ and $n$ are relatively prime then $|G : H \cap K| = |G : H| \cdot |G : K|$.

11. Let $H \le K \le G$. Prove that $|G : H| = |G : K| \cdot |K : H|$ (do not assume $G$ is finite).

12. Let $H \le G$. Prove that the map $x \mapsto x^{-1}$ sends each left coset of $H$ in $G$ onto a right
coset of $H$ and gives a bijection between the set of left cosets and the set of right cosets of
$H$ in $G$ (hence the number of left cosets of $H$ in $G$ equals the number of right cosets).

13. Fix any labelling of the vertices of a square and use this to identify $D_8$ as a subgroup of
$S_4$. Prove that the elements of $D_8$ and $\langle (1\,2\,3) \rangle$ do not commute in $S_4$.

14. Prove that $S_4$ does not have a normal subgroup of order 8 or a normal subgroup of order 3.

15. Let $G = S_n$ and for fixed $i \in \{1, 2, \ldots, n\}$ let $G_i$ be the stabilizer of $i$. Prove that $G_i \cong S_{n-1}$.

16. Use Lagrange’s Theorem in the multiplicative group $(\mathbb{Z}/p\mathbb{Z})^{\times}$ to prove Fermat’s Little
Theorem: if $p$ is a prime then $a^p \equiv a \pmod{p}$ for all $a \in \mathbb{Z}$.

17. Let $p$ be a prime and let $n$ be a positive integer. Find the order of $\bar{p}$ in $(\mathbb{Z}/(p^n - 1)\mathbb{Z})^{\times}$ and
deduce that $n \mid \varphi(p^n - 1)$ (here $\varphi$ is Euler’s function).

18. Let $G$ be a finite group, let $H$ be a subgroup of $G$ and let $N \trianglelefteq G$. Prove that if $|H|$ and
$|G : N|$ are relatively prime then $H \le N$.

19. Prove that if $N$ is a normal subgroup of the finite group $G$ and $(|N|, |G : N|) = 1$ then $N$
is the unique subgroup of $G$ of order $|N|$.

20. If $A$ is an abelian group with $A \trianglelefteq G$ and $B$ is any subgroup of $G$ prove that $A \cap B \trianglelefteq AB$.

21. Prove that $\mathbb{Q}$ has no proper subgroups of finite index. Deduce that $\mathbb{Q}/\mathbb{Z}$ has no proper
subgroups of finite index. [Recall Exercise 21, Section 1.6 and Exercise 15, Section 1.]

22. Use Lagrange’s Theorem in the multiplicative group $(\mathbb{Z}/n\mathbb{Z})^{\times}$ to prove Euler’s Theorem:
$a^{\varphi(n)} \equiv 1 \bmod n$ for every integer $a$ relatively prime to $n$, where $\varphi$ denotes Euler’s $\varphi$-
function.

23. Determine the last two digits of $3^{3^{100}}$. [Determine $3^{100} \bmod \varphi(100)$ and use the previous
exercise.]

3.3 THE ISOMORPHISM THEOREMS


In this section we derive some straightforward consequences of the relations between
quotient groups and homomorphisms which were discussed in Section 1. In particular
we consider the relation between the lattice of subgroups of a quotient group, $G/N$,
and the lattice of subgroups of the group $G$. The first result restates our observations in
Section 1 on the relation of the image of a homomorphism to the quotient by the kernel
(sometimes called the Fundamental Theorem of Homomorphisms):

Theorem 16. (The First Isomorphism Theorem) If $\varphi : G \to H$ is a homomorphism of
groups, then $\ker \varphi \trianglelefteq G$ and $G/\ker \varphi \cong \varphi(G)$.

Corollary 17. Let $\varphi : G \to H$ be a homomorphism of groups.

(1) $\varphi$ is injective if and only if $\ker \varphi = 1$.

(2) $|G : \ker \varphi| = |\varphi(G)|$.

Proof: Exercise.

When we consider abstract vector spaces we shall see that Corollary 17(2) gives
a formula possibly already familiar from the theory of linear transformations: if
$\varphi : V \to W$ is a linear transformation of vector spaces, then $\dim V = \operatorname{rank} \varphi + \operatorname{nullity} \varphi$.

Theorem 18. (The Second or Diamond Isomorphism Theorem) Let G be a group, let
A and B be subgroups of G and assume $A \le N_G(B)$. Then AB is a subgroup of G,
$B \trianglelefteq AB$, $A \cap B \trianglelefteq A$ and $AB/B \cong A/A \cap B$.

Proof: By Corollary 15, AB is a subgroup of G. Since $A \le N_G(B)$ by assumption
and $B \le N_G(B)$ trivially, it follows that $AB \le N_G(B)$, i.e., B is a normal subgroup of
the subgroup AB.

Since B is normal in AB, the quotient group AB/B is well defined. Define the map
$\varphi : A \to AB/B$ by $\varphi(a) = aB$. Since the group operation in AB/B is well defined it
is easy to see that $\varphi$ is a homomorphism:

$$\varphi(a_1 a_2) = (a_1 a_2)B = a_1 B \cdot a_2 B = \varphi(a_1)\varphi(a_2).$$

Alternatively, the map $\varphi$ is just the restriction to the subgroup A of the natural projection
homomorphism $\pi : AB \to AB/B$, so is also a homomorphism. It is clear from the
definition of AB that $\varphi$ is surjective. The identity in AB/B is the coset 1B, so the kernel
of $\varphi$ consists of the elements $a \in A$ with $aB = 1B$, which by Proposition 4 are the
elements $a \in B$, i.e., $\ker\varphi = A \cap B$. By the First Isomorphism Theorem, $A \cap B \trianglelefteq A$
and $A/A \cap B \cong AB/B$, completing the proof.
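
The counting consequences of Theorem 18 can be checked by machine in a small case. The sketch below assumes permutations of {0, 1, 2, 3} stored as tuples, with A the cyclic group generated by a 4-cycle and B the Klein 4-group inside the symmetric group on 4 letters; the helpers `compose` and `generate` are hypothetical names introduced only for this illustration.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p) on {0,...,n-1}."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens):
    """Subgroup generated by gens, found by closing up under multiplication."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

A = generate([(1, 2, 3, 0)])                    # cyclic group generated by a 4-cycle
B = generate([(1, 0, 3, 2), (2, 3, 0, 1)])      # Klein 4-group, normal in S_4
AB = {compose(a, b) for a in A for b in B}
A_cap_B = A & B

# The cosets aB with a in A exhaust AB/B, as in the proof of surjectivity.
cosets_of_B = {frozenset(compose(a, b) for b in B) for a in A}
print(len(AB), len(A_cap_B), len(cosets_of_B))
```

The number of cosets of B in AB agrees with the index of the intersection in A, which is the order statement contained in the isomorphism of the theorem. This only compares orders; it does not construct the isomorphism itself.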

Note that this gives a new proof of the order formula in Proposition 13 in the special
case that $A \le N_G(B)$. This theorem is called the Diamond Isomorphism
because of the portion of the lattice of subgroups of G involved (see Figure 6). The
markings in the lattice lines indicate which quotients are isomorphic. The “quotient”
AB/A need not be a group (i.e., A need not be normal in AB), however we still have
$|AB : A| = |B : A \cap B|$.

[Fig. 6: the portion of the lattice of subgroups of G formed by $AB$, $A$, $B$ and $A \cap B$.]



The Third Isomorphism Theorem considers the question of taking quotient groups
of quotient groups.

Theorem 19. (The Third Isomorphism Theorem) Let G be a group and let H and K be
normal subgroups of G with $H \le K$. Then $K/H \trianglelefteq G/H$ and

$$(G/H)/(K/H) \cong G/K.$$

If we denote the quotient by H with a bar, this can be written

$$\overline{G}/\overline{K} \cong G/K.$$

Proof: We leave as an easy exercise the verification that $K/H \trianglelefteq G/H$. Define

$$\varphi : G/H \to G/K, \qquad gH \mapsto gK.$$

To show $\varphi$ is well defined suppose $g_1 H = g_2 H$. Then $g_1 = g_2 h$, for some $h \in H$.
Because $H \le K$, the element h is also an element of K, hence $g_1 K = g_2 K$ i.e.,
$\varphi(g_1 H) = \varphi(g_2 H)$, which shows $\varphi$ is well defined. Since g may be chosen arbitrarily
in G, $\varphi$ is a surjective homomorphism. Finally,

$$\begin{aligned}
\ker\varphi &= \{gH \in G/H \mid \varphi(gH) = 1K\} \\
&= \{gH \in G/H \mid gK = 1K\} \\
&= \{gH \in G/H \mid g \in K\} = K/H.
\end{aligned}$$

By the First Isomorphism Theorem, $(G/H)/(K/H) \cong G/K$.

An easy aid for remembering the Third Isomorphism Theorem is: “invert and 
cancel” (as one would for fractions). This theorem shows that we gain no new structural 
information from taking quotients of a quotient group. 
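
The proof of the Third Isomorphism Theorem can be traced in a small abelian example. The following sketch assumes G = Z/12Z written additively, with H generated by 6 and K generated by 3; these choices and the helper `coset` are made here for illustration only.

```python
n = 12
G = range(n)
H = {0, 6}                 # subgroup generated by 6
K = {0, 3, 6, 9}           # subgroup generated by 3, so H <= K

def coset(g, S):
    """The coset g + S in Z/12Z, as a frozenset."""
    return frozenset((g + s) % n for s in S)

# phi : G/H -> G/K, gH |-> gK; collect every value phi takes on each coset of H
phi = {}
for g in G:
    phi.setdefault(coset(g, H), set()).add(coset(g, K))

# Well defined means: every representative of a given coset of H gives the same coset of K.
well_defined = all(len(values) == 1 for values in phi.values())
kernel = {gH for gH, values in phi.items() if values == {coset(0, K)}}
K_mod_H = {coset(k, H) for k in K}
print(well_defined, kernel == K_mod_H, len(phi) // len(kernel), n // len(K))
```

In this example the kernel of the map is exactly K/H, and the quotient of G/H by it has the same order as G/K.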

The final isomorphism theorem describes the relation between the lattice of sub- 
groups of the quotient group G/N and the lattice of subgroups of G. The lattice for 
G/N can be read immediately from the lattice for G by collapsing the group N to the 
identity. More precisely, there is a one-to-one correspondence between the subgroups 
of G containing N and the subgroups of G/N , so that the lattice for G/N (or rather, 
an isomorphic copy) appears in the lattice for G as the collection of subgroups of G 
between N and G. In particular, the lattice for G/N appears at the “top” of the lattice 
for G, a result we mentioned at the beginning of the chapter.

Theorem 20. (The Fourth or Lattice Isomorphism Theorem) Let G be a group and let
N be a normal subgroup of G. Then there is a bijection from the set of subgroups A of
G which contain N onto the set of subgroups $\overline{A} = A/N$ of G/N. In particular, every
subgroup of $\overline{G}$ is of the form A/N for some subgroup A of G containing N (namely,
its preimage in G under the natural projection homomorphism from G to G/N). This
bijection has the following properties: for all $A, B \le G$ with $N \le A$ and $N \le B$,

(1) $A \le B$ if and only if $\overline{A} \le \overline{B}$,

(2) if $A \le B$, then $|B : A| = |\overline{B} : \overline{A}|$,

(3) $\overline{\langle A, B \rangle} = \langle \overline{A}, \overline{B} \rangle$,

(4) $\overline{A \cap B} = \overline{A} \cap \overline{B}$, and

(5) $A \trianglelefteq G$ if and only if $\overline{A} \trianglelefteq \overline{G}$.


Proof: The complete preimage of a subgroup in G/N is a subgroup of G by
Exercise 1 of Section 1. The numerous details of the theorem to check are all completely
straightforward. We therefore leave the proof of this theorem to the exercises.


Examples 

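(1) Let $G = Q_8$ and let N be the normal subgroup $\langle -1 \rangle$. The (isomorphic copy of the)
lattice of G/N consists of the double lines in the lattice of G below. Note that we
previously proved that $Q_8/\langle -1 \rangle \cong V_4$ and the two lattices do indeed coincide (see
Section 2.5 for the lattices of $Q_8$ and $V_4$).

[Figure: the lattice of subgroups of $Q_8$, with $Q_8$ at the top, then $\langle i \rangle$, $\langle j \rangle$, $\langle k \rangle$, then
$\langle -1 \rangle$ and 1; the lines above $\langle -1 \rangle$ are doubled.]

(2) The same process gives us the lattice of $D_8/\langle r^2 \rangle$ (the double lines) in the lattice of
$D_8$:

[Figure: the lattice of subgroups of $D_8$, with the lines above $\langle r^2 \rangle$ doubled.]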



Note that in the second example above there are subgroups of G which do not
directly correspond to subgroups in the quotient group G/N, namely the subgroups
of G which do not contain the normal subgroup N. This is because the subgroup
N projects to a point in G/N and so several subgroups of G can project to the same
subgroup in the quotient. The image of the subgroup H of G under the natural projection
homomorphism from G to G/N is the same as the image of the subgroup HN of G,
and the subgroup HN of G contains N. Conversely, the preimage of a subgroup $\overline{H}$ of
G/N contains N and is the unique subgroup of G containing N whose image in G/N
is $\overline{H}$. It is the subgroups of G containing N which appear explicitly in the lattice for
G/N.
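
The correspondence, and the remark that H and HN have the same image, can be checked in a cyclic example. This is a sketch under the assumption that G = Z/12Z (written additively, so HN becomes H + N) and N is generated by 6; the helper names `subgroups` and `image` are hypothetical.

```python
n = 12
N = frozenset({0, 6})                       # normal subgroup generated by 6

def subgroups(n):
    """All subgroups of Z/nZ: one cyclic subgroup for each divisor d of n."""
    return [frozenset(range(0, n, d)) for d in range(1, n + 1) if n % d == 0]

def image(A):
    """Image of the subgroup A in G/N, as a set of cosets of N."""
    return frozenset(frozenset((a + x) % n for x in N) for a in A)

all_subgroups = subgroups(n)
containing_N = [A for A in all_subgroups if N <= A]

images_above = {image(A) for A in containing_N}   # images of subgroups containing N
images_all = {image(A) for A in all_subgroups}    # images of all subgroups

H = frozenset(range(0, n, 4))               # H = {0, 4, 8} does not contain N
HN = frozenset((h + x) % n for h in H for x in N)
print(len(all_subgroups), len(containing_N), image(H) == image(HN))
```

Distinct subgroups containing N have distinct images, while every subgroup of G has the same image as some subgroup containing N, so the quotient has exactly as many subgroups as there are subgroups of G above N.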

The two lattices of groups of order 8 above emphasize the fact that the isomorphism
type of a group cannot in general be determined from the knowledge of the isomorphism
types of G/N and N, since $Q_8/\langle -1 \rangle \cong D_8/\langle r^2 \rangle$ and $\langle -1 \rangle \cong \langle r^2 \rangle$ yet $Q_8$ and $D_8$
are not isomorphic. We shall discuss this question further in the next section.

We shall often indicate the index of one subgroup in another in the lattice of subgroups,
as follows:

        A
        | n
        B

where the integer n equals $|A : B|$. For example, all the unbroken edges in the lattices
of $Q_8$ and $D_8$ would be labelled with 2. Thus the order of any subgroup, A, is the
product of all integers which label any path upward from the identity to A. Also, by
Theorem 20(2) these indices remain unchanged in quotients of G by normal subgroups
of G contained in B, i.e., the portion of the lattice for G corresponding to the lattice of
the quotient group has the correct indices for the quotient as well.

Finally we include a remark concerning the definition of homomorphisms on quotient
groups. We have, in the course of the proof of the isomorphism theorems, encountered
situations where a homomorphism $\varphi$ on the quotient group G/N is specified by
giving the value of $\varphi$ on the coset gN in terms of the representative g alone. In each
instance we then had to prove $\varphi$ was well defined, i.e., was independent of the choice
of g. In effect we are defining a homomorphism, $\Phi$, on G itself by specifying the value
of $\varphi$ at g. Then independence of g is equivalent to requiring that $\Phi$ be trivial on N, so
that

$\varphi$ is well defined on G/N if and only if $N \le \ker\Phi$.

This gives a simple criterion for defining homomorphisms on quotients (namely, define
a homomorphism on G and check that N is contained in its kernel). In this situation we
shall say the homomorphism $\Phi$ factors through N and $\varphi$ is the induced homomorphism
on G/N. This can be denoted pictorially as in Figure 7, where the diagram indicates
that $\Phi = \varphi \circ \pi$, i.e., the image in H of an element in G does not depend on which path
one takes in the diagram. If this is the case, then the diagram is said to commute.

[Fig. 7: commutative triangle with $\pi : G \to G/N$, $\Phi : G \to H$ and $\varphi : G/N \to H$.]
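
The criterion for a map on cosets to be well defined is easy to test mechanically. The following is a minimal sketch, assuming G = Z/12Z, H = Z/4Z and the homomorphism given by reduction mod 4; the names `Phi` and `induced_map` and the two choices of N are illustrative assumptions, not taken from the text.

```python
n = 12

def Phi(g):
    """A homomorphism Z/12Z -> Z/4Z (reduction mod 4); chosen only for illustration."""
    return g % 4

def induced_map(N):
    """Return the map gN -> Phi(g) as a dict if it is well defined, else None."""
    phi = {}
    for g in range(n):
        gN = frozenset((g + x) % n for x in N)
        if phi.setdefault(gN, Phi(g)) != Phi(g):
            return None                      # two representatives of gN disagree
    return phi

ker_Phi = {g for g in range(n) if Phi(g) == 0}
N1 = {0, 4, 8}                               # contained in ker Phi
N2 = {0, 6}                                  # not contained in ker Phi
print(N1 <= ker_Phi, induced_map(N1) is not None)
print(N2 <= ker_Phi, induced_map(N2) is not None)
```

For the first subgroup the map on cosets exists, and for the second it does not, matching the containment of N in the kernel in each case.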

At this point we have developed all the background material so that Section 6.3 on
free groups and presentations may now be read.

EXERCISES


Let G be a group. 

1. Let F be a finite field of order q and let $n \in \mathbb{Z}^+$. Prove that $|GL_n(F) : SL_n(F)| = q - 1$.
[See Exercise 35, Section 1.]

2. Prove all parts of the Lattice Isomorphism Theorem. 

3. Prove that if H is a normal subgroup of G of prime index p then for all $K \le G$ either

(i) $K \le H$ or

(ii) $G = HK$ and $|K : K \cap H| = p$.

4. Let C be a normal subgroup of the group A and let D be a normal subgroup of the group
B. Prove that $(C \times D) \trianglelefteq (A \times B)$ and $(A \times B)/(C \times D) \cong (A/C) \times (B/D)$.

5. Let $QD_{16} = \langle \sigma, \tau \rangle$ be the quasidihedral group described in Exercise 11 of Section 2.5.
Prove that $\langle \sigma^4 \rangle$ is normal in $QD_{16}$ and use the Lattice Isomorphism Theorem to draw the
lattice of subgroups of $QD_{16}/\langle \sigma^4 \rangle$. Which group of order 8 has the same lattice as this
quotient? Use generators and relations for $QD_{16}/\langle \sigma^4 \rangle$ to decide the isomorphism type
of this group.

6. Let $M = \langle v, u \rangle$ be the modular group of order 16 described in Exercise 14 of Section
2.5. Prove that $\langle v^4 \rangle$ is normal in M and use the Lattice Isomorphism Theorem to draw
the lattice of subgroups of $M/\langle v^4 \rangle$. Which group of order 8 has the same lattice as this
quotient? Use generators and relations for $M/\langle v^4 \rangle$ to decide the isomorphism type of this
group.

7. Let M and N be normal subgroups of G such that G = MN. Prove that
$G/(M \cap N) \cong (G/M) \times (G/N)$. [Draw the lattice.]

8. Let p be a prime and let G be the group of p-power roots of 1 in $\mathbb{C}$ (cf. Exercise 18,
Section 2.4). Prove that the map $z \mapsto z^p$ is a surjective homomorphism. Deduce that G
is isomorphic to a proper quotient of itself.

9. Let p be a prime and let G be a group of order $p^a m$, where p does not divide m. Assume
P is a subgroup of G of order $p^a$ and N is a normal subgroup of G of order $p^b n$, where
p does not divide n. Prove that $|P \cap N| = p^b$ and $|PN/N| = p^{a-b}$. (The subgroup P
of G is called a Sylow p-subgroup of G. This exercise shows that the intersection of any
Sylow p-subgroup of G with a normal subgroup N is a Sylow p-subgroup of N.)

10. Generalize the preceding exercise as follows. A subgroup H of a finite group G is called
a Hall subgroup of G if its index in G is relatively prime to its order: $(|G : H|, |H|) = 1$.
Prove that if H is a Hall subgroup of G and $N \trianglelefteq G$, then $H \cap N$ is a Hall subgroup of N
and HN/N is a Hall subgroup of G/N.

3.4 COMPOSITION SERIES AND THE HÖLDER PROGRAM

The remarks in the preceding section on lattices leave us with the intuitive picture that
a quotient group G/N is the group whose structure (e.g., lattice) describes the structure
of G “above” the normal subgroup N. Although this is somewhat vague, it gives at least
some notion of the driving force behind one of the most powerful techniques in finite
group theory (and even some branches of infinite group theory): the use of induction. In
many instances the application of an inductive procedure follows a pattern similar to the
following proof of a special case of Cauchy’s Theorem. Although Cauchy’s Theorem is
valid for arbitrary groups (cf. Exercise 9 of Section 2), the following is a good example
of the use of information on a normal subgroup N and on the quotient G/N to determine
information about G, and we shall need this particular result in Chapter 4.

Proposition 21. If G is a finite abelian group and p is a prime dividing $|G|$, then G
contains an element of order p.

Proof: The proof proceeds by induction on $|G|$, namely, we assume the result
is valid for every group whose order is strictly smaller than the order of G and then
prove the result valid for G (this is sometimes referred to as complete induction). Since
$|G| > 1$, there is an element $x \in G$ with $x \ne 1$. If $|G| = p$ then x has order p by
Lagrange’s Theorem and we are done. We may therefore assume $|G| > p$.

Suppose p divides $|x|$ and write $|x| = pn$. By Proposition 2.5(3), $|x^n| = p$, and
again we have an element of order p. We may therefore assume p does not divide $|x|$.

Let $N = \langle x \rangle$. Since G is abelian, $N \trianglelefteq G$. By Lagrange’s Theorem, $|G/N| = |G|/|N|$
and since $N \ne 1$, $|G/N| < |G|$. Since p does not divide $|N|$, we must have $p \mid |G/N|$.
We can now apply the induction assumption to the smaller group G/N to conclude it
contains an element, $\bar{y} = yN$, of order p. Since $y \notin N$ ($\bar{y} \ne \bar{1}$) but $y^p \in N$ ($\bar{y}^p = \bar{1}$),
we must have $\langle y^p \rangle \ne \langle y \rangle$, that is, $|y^p| < |y|$. Proposition 2.5(2) implies $p \mid |y|$. We
are now in the situation described in the preceding paragraph, so that argument again
produces an element of order p. The induction is complete.
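
The key step of the proof (if p divides $|x|$ and $|x| = pn$, then $x^n$ has order p) can be illustrated computationally. The sketch below does not reproduce the induction: it assumes G is given as a product of cyclic groups written additively and simply searches for an x whose order is divisible by p. The group Z/4Z x Z/6Z and the helper names are illustrative assumptions.

```python
from itertools import product
from math import gcd

def order(x, moduli):
    """Order of x in Z/m1 x ... x Z/mk: the lcm of the orders of its coordinates."""
    result = 1
    for xi, m in zip(x, moduli):
        coord_order = m // gcd(xi, m)
        result = result * coord_order // gcd(result, coord_order)
    return result

def element_of_order_p(moduli, p):
    """Find x with p dividing |x| = p*n and return n*x (the additive form of x^n)."""
    for x in product(*(range(m) for m in moduli)):
        k = order(x, moduli)
        if k % p == 0:
            n = k // p
            return tuple((n * xi) % m for xi, m in zip(x, moduli))
    return None                               # no element has order divisible by p

moduli = (4, 6)            # G = Z/4Z x Z/6Z, an illustrative abelian group of order 24
for p in (2, 3):
    y = element_of_order_p(moduli, p)
    print(p, y, order(y, moduli))
```

For each prime dividing 24 the search returns an element of exactly that order, while for a prime not dividing the group order the function returns None.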

The philosophy behind this method of proof is that if we have a sufficient amount of 
information about some normal subgroup, N, of a group G and sufficient information 
on G/N , then somehow we can piece this information together to force G itself to have 
some desired property. The induction comes into play because both N and G/N have 
smaller order than G. In general, just how much data are required is a delicate matter 
since, as we have already seen, the full isomorphism type of G cannot be determined 
from the isomorphism types of N and G/N alone. 

Clearly a basic obstruction to this approach is the necessity of producing a normal
subgroup, N, of G with $N \ne 1$ or G. In the preceding argument this was easy since
G was abelian. Groups with no nontrivial proper normal subgroups are fundamental
obstructions to this method of proof.

Definition. A (finite or infinite) group G is called simple if $|G| > 1$ and the only
normal subgroups of G are 1 and G.

By Lagrange’s Theorem if $|G|$ is a prime, its only subgroups (let alone normal ones)
are 1 and G, so G is simple. In fact, every abelian simple group is isomorphic to $Z_p$,
for some prime p (cf. Exercise 1). There are non-abelian simple groups (of both finite
and infinite order), the smallest of which has order 60 (we shall introduce this group as
a member of an infinite family of simple groups in the next section).

Simple groups, by definition, cannot be “factored” into pieces like N and G/N and 
as a result they play a role analogous to that of the primes in the arithmetic of Z. This 
analogy is supported by a “unique factorization theorem” (for finite groups) which we 
now describe.

Definition. In a group G a sequence of subgroups

$$1 = N_0 \le N_1 \le N_2 \le \cdots \le N_{k-1} \le N_k = G$$

is called a composition series if $N_i \trianglelefteq N_{i+1}$ and $N_{i+1}/N_i$ a simple group, $0 \le i \le k - 1$.
If the above sequence is a composition series, the quotient groups $N_{i+1}/N_i$ are called
composition factors of G.

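Keep in mind that it is not assumed that each $N_i \trianglelefteq G$, only that $N_i \trianglelefteq N_{i+1}$. Thus

$$1 \trianglelefteq \langle s \rangle \trianglelefteq \langle s, r^2 \rangle \trianglelefteq D_8 \quad\text{and}\quad 1 \trianglelefteq \langle r^2 \rangle \trianglelefteq \langle r \rangle \trianglelefteq D_8$$

are two composition series for $D_8$ and in each series there are 3 composition factors,
each of which is isomorphic to (the simple group) $Z_2$.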
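
Both series can be verified by direct computation. The sketch below assumes $D_8$ is realized as permutations of the vertices 0, 1, 2, 3 of a square, with r a rotation and s a reflection; this model and the helper functions are assumptions made for the illustration. It also confirms that $\langle s \rangle$ is not normal in $D_8$, which is why normality is only required in the next term of the series.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Inverse permutation of p."""
    inv = [0] * len(p)
    for i, img in enumerate(p):
        inv[img] = i
    return tuple(inv)

def generate(gens):
    """Subgroup generated by gens, found by closing up under multiplication."""
    identity = tuple(range(len(gens[0])))
    elems, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for x in gens:
            h = compose(x, g)
            if h not in elems:
                elems.add(h)
                frontier.append(h)
    return elems

def is_normal(H, G):
    """True if g h g^{-1} lies in H for all g in G and h in H."""
    return all(compose(compose(g, h), inverse(g)) in H for g in G for h in H)

def check_series(series):
    """True if each term is normal of index 2 in the next (so every factor is Z_2)."""
    chain = [{tuple(range(4))}] + series
    return all(lo <= hi and is_normal(lo, hi) and len(hi) == 2 * len(lo)
               for lo, hi in zip(chain, chain[1:]))

r = (1, 2, 3, 0)           # rotation of the square with vertices 0, 1, 2, 3
s = (0, 3, 2, 1)           # reflection fixing vertices 0 and 2
r2 = compose(r, r)

D8 = generate([r, s])
series_1 = [generate([s]), generate([s, r2]), D8]
series_2 = [generate([r2]), generate([r]), D8]
print(check_series(series_1), check_series(series_2), is_normal(generate([s]), D8))
```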

Theorem 22. (Jordan-Hölder) Let G be a finite group with $G \ne 1$. Then

(1) G has a composition series and

(2) The composition factors in a composition series are unique, namely, if
$1 = N_0 \le N_1 \le \cdots \le N_r = G$ and $1 = M_0 \le M_1 \le \cdots \le M_s = G$ are
two composition series for G, then r = s and there is some permutation, $\pi$, of
{1, 2, . . . , r} such that

$$M_{\pi(i)}/M_{\pi(i)-1} \cong N_i/N_{i-1}, \qquad 1 \le i \le r.$$


Proof: This is fairly straightforward. Since we shall not explicitly use this theorem 
to prove others in the text we outline the proof in a series of exercises at the end of this 
section. 

Thus every finite group has a “factorization” (i.e., composition series) and although
the series itself need not be unique (as $D_8$ shows) the number of composition factors and
their isomorphism types are uniquely determined. Furthermore, nonisomorphic groups
may have the same (up to isomorphism) list of composition factors (see Exercise 2).
This motivates a two-part program for classifying all finite groups up to isomorphism:

The Hölder Program

(1) Classify all finite simple groups. 

(2) Find all ways of “putting simple groups together” to form other groups. 

These two problems form part of an underlying motivation for much of the development 
of group theory. Analogues of these problems may also be found as recurring themes 
throughout mathematics. We include a few more comments on the current status of 
progress on these problems. 

The classification of finite simple groups (part (1) of the Hölder Program) was
completed in 1980, about 100 years after the formulation of the Hölder Program. Efforts
by over 100 mathematicians covering between 5,000 and 10,000 journal pages (spread
over some 300 to 500 individual papers) have resulted in the proof of the following
result:

Theorem. There is a list consisting of 18 (infinite) families of simple groups and 26
simple groups not belonging to these families (the sporadic simple groups) such that
every finite simple group is isomorphic to one of the groups in this list.

One example of a family of simple groups is $\{Z_p \mid p \text{ a prime}\}$. A second infinite
family in the list of finite simple groups is:

$$\{SL_n(F)/Z(SL_n(F)) \mid n \in \mathbb{Z}^+,\ n \ge 2 \text{ and } F \text{ a finite field}\}.$$

These groups are all simple except for $SL_2(\mathbb{F}_2)$ and $SL_2(\mathbb{F}_3)$ where $\mathbb{F}_2$ is the finite field
with 2 elements and $\mathbb{F}_3$ is the finite field with 3 elements. This is a 2-parameter family
(n and F being independent parameters). We shall not prove these groups are simple
(although it is not technically beyond the scope of the text) but rather refer the reader to 
the book Finite Group Theory (by M. Aschbacher, Cambridge University Press, 1986) 
for proofs and an extensive discussion of the simple group problem. A third family of 
finite simple groups, the alternating groups, is discussed in the next section; we shall 
prove these groups are simple in the next chapter. 

To gain some idea of the complexity of the classification of finite simple groups the 
reader may wish to peruse the proof of one of the cornerstones of the entire classification: 

Theorem. (Feit-Thompson) If G is a simple group of odd order, then $G \cong Z_p$ for some
prime p.

This proof takes 255 pages of hard mathematics.²

Part (2) of the Hölder Program, sometimes called the extension problem, was rather
vaguely formulated. A more precise description of “putting two groups together” is:
given groups A and B, describe how to obtain all groups G containing a normal subgroup
N such that $N \cong B$ and $G/N \cong A$. For instance, if $A = B = Z_2$, there are precisely
two possibilities for G, namely, $Z_4$ and $V_4$ (see Exercise 10 of Section 2.5) and the
Hölder program seeks to describe how the two groups of order 4 could have been built
from two $Z_2$’s without a priori knowledge of the existence of the groups of order 4. This
part of the Hölder Program is extremely difficult, even when the subgroups involved
are of small order. For example, all composition factors of a group G have order 2
if and only if $|G| = 2^n$, for some n (one implication is easy and we shall prove both
implications in Chapter 6). It is known, however, that the number of nonisomorphic
groups of order $2^n$ grows (exponentially) as a function of $2^n$, so the number of ways
of putting groups of 2-power order together is not bounded. Nonetheless, there are a
wealth of interesting and powerful techniques in this subtle area which serve to unravel
the structure of large classes of groups. We shall discuss only a couple of ways of
building larger groups from smaller ones (in the sense above) but even from this limited
excursion into the area of group extensions we shall construct numerous new examples
of groups and prove some classification theorems.

² Solvability of groups of odd order, Pacific Journal of Mathematics, 13(1963), pp. 775-1029.

One class of groups which figures prominently in the theory of polynomial equations
is the class of solvable groups:

Definition. A group G is solvable if there is a chain of subgroups

$$1 = G_0 \trianglelefteq G_1 \trianglelefteq G_2 \trianglelefteq \cdots \trianglelefteq G_s = G$$

such that $G_{i+1}/G_i$ is abelian for i = 0, 1, . . . , s - 1.


The terminology comes from the correspondence in Galois Theory between these 
groups and polynomials which can be solved by radicals (which essentially means there 
is an algebraic formula for the roots). Exercise 8 shows that finite solvable groups are 
precisely those groups whose composition factors are all of prime order. 

One remarkable property of finite solvable groups is the following generalization
of Sylow’s Theorem due to Philip Hall (cf. Theorem 6.11 and Theorem 19.8).

Theorem. The finite group G is solvable if and only if for every divisor n of $|G|$ such
that $(n, |G|/n) = 1$, G has a subgroup of order n.

As another illustration of how properties of a group G can be deduced from com- 
bined information from a normal subgroup N and the quotient group G/N we prove 

if N and G/N are solvable, then G is solvable. 

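To see this let $\overline{G} = G/N$, let $1 = N_0 \trianglelefteq N_1 \trianglelefteq \cdots \trianglelefteq N_n = N$ be a chain of subgroups
of N such that $N_{i+1}/N_i$ is abelian, $0 \le i < n$ and let $\overline{1} = \overline{G_0} \trianglelefteq \overline{G_1} \trianglelefteq \cdots \trianglelefteq \overline{G_m} = \overline{G}$
be a chain of subgroups of $\overline{G}$ such that $\overline{G_{i+1}}/\overline{G_i}$ is abelian, $0 \le i < m$. By the Lattice
Isomorphism Theorem there are subgroups $G_i$ of G with $N \le G_i$ such that $G_i/N = \overline{G_i}$
and $G_i \trianglelefteq G_{i+1}$, $0 \le i < m$. By the Third Isomorphism Theorem

$$\overline{G_{i+1}}/\overline{G_i} = (G_{i+1}/N)/(G_i/N) \cong G_{i+1}/G_i.$$

Thus

$$1 = N_0 \trianglelefteq N_1 \trianglelefteq \cdots \trianglelefteq N_n = N = G_0 \trianglelefteq G_1 \trianglelefteq \cdots \trianglelefteq G_m = G$$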

is a chain of subgroups of G all of whose successive quotient groups are abelian. This 
proves G is solvable. 

It is inaccurate to say that finite group theory is concerned only with the Hölder
Program. It is accurate to say that the Hölder Program suggests a large number of
problems and motivates a number of algebraic techniques. For example, in the study
of the extension problem where we are given groups A and B and wish to find G and
$N \trianglelefteq G$ with $N \cong B$ and $G/N \cong A$, we shall see that (under certain conditions) we
are led to an action of the group A on the set B. Such actions form the crux of the next
chapter (and will result in information both about simple and non-simple groups) and 
this notion is a powerful one in mathematics not restricted to the theory of groups. 

The final section of this chapter introduces another family of groups and although in 
line with our interest in simple groups, it will be of independent importance throughout 
the text, particularly in our study later of determinants and the solvability of polynomial 
equations.

EXERCISES


1. Prove that if G is an abelian simple group then $G \cong Z_p$ for some prime p (do not assume
G is a finite group).

2. Exhibit all 3 composition series for $Q_8$ and all 7 composition series for $D_8$. List the
composition factors in each case.

3. Find a composition series for the quasidihedral group of order 16 (cf. Exercise 11, Section
2.5). Deduce that $QD_{16}$ is solvable.

4. Use Cauchy’s Theorem and induction to show that a finite abelian group has a subgroup 
of order n for each positive divisor n of its order. 

5. Prove that subgroups and quotient groups of a solvable group are solvable. 

6. Prove part (1) of the Jordan-Hölder Theorem by induction on $|G|$.

7. If G is a finite group and $H \trianglelefteq G$ prove that there is a composition series of G, one of
whose terms is H.

8. Let G be a finite group. Prove that the following are equivalent:

(i) G is solvable

(ii) G has a chain of subgroups: $1 = H_0 \trianglelefteq H_1 \trianglelefteq H_2 \trianglelefteq \cdots \trianglelefteq H_s = G$ such that $H_{i+1}/H_i$
is cyclic, $0 \le i \le s - 1$

(iii) all composition factors of G are of prime order

(iv) G has a chain of subgroups: $1 = N_0 \trianglelefteq N_1 \trianglelefteq N_2 \trianglelefteq \cdots \trianglelefteq N_t = G$ such that each $N_i$
is a normal subgroup of G and $N_{i+1}/N_i$ is abelian, $0 \le i \le t - 1$.

[For (iv), prove that a minimal nontrivial normal subgroup M of G is necessarily abelian
and then use induction. To see that M is abelian, let $N \trianglelefteq M$ be of prime index (by (iii)) and
show that $x^{-1}y^{-1}xy \in N$ for all $x, y \in M$ (cf. Exercise 40, Section 1). Apply the same
argument to $gNg^{-1}$ to show that $x^{-1}y^{-1}xy$ lies in the intersection of all G-conjugates of
N, and use the minimality of M to conclude that $x^{-1}y^{-1}xy = 1$.]

9. Prove the following special case of part (2) of the Jordan-Hölder Theorem: assume the
finite group G has two composition series

$$1 = N_0 \trianglelefteq N_1 \trianglelefteq \cdots \trianglelefteq N_r = G \quad\text{and}\quad 1 = M_0 \trianglelefteq M_1 \trianglelefteq M_2 = G.$$

Show that r = 2 and that the list of composition factors is the same. [Use the Second
Isomorphism Theorem.]

10. Prove part (2) of the Jordan-Hölder Theorem by induction on min{r, s}. [Apply the
inductive hypothesis to $H = N_{r-1} \cap M_{s-1}$ and use the preceding exercises.]

11. Prove that if H is a nontrivial normal subgroup of the solvable group G then there is a
nontrivial subgroup A of H with $A \trianglelefteq G$ and A abelian.

12. Prove (without using the Feit-Thompson Theorem) that the following are equivalent:

(i) every group of odd order is solvable

(ii) the only simple groups of odd order are those of prime order.


3.5 TRANSPOSITIONS AND THE ALTERNATING GROUP 
Transpositions and Generation of $S_n$

As we saw in Section 1.3 (and will prove in the next chapter) every element of $S_n$ can
be written as a product of disjoint cycles in an essentially unique fashion. In contrast,
every element of $S_n$ can be written in many different ways as a (nondisjoint) product of
cycles. For example, even in $S_3$ the element $\sigma = (1\ 2\ 3)$ may be written

$$\sigma = (1\ 2\ 3) = (1\ 3)(1\ 2) = (1\ 2)(1\ 3)(1\ 2)(1\ 3) = (1\ 2)(2\ 3)$$

and, in fact, there are an infinite number of different ways to write $\sigma$. Not requiring the
cycles to be disjoint totally destroys the uniqueness of a representation of a permutation
as a product of cycles. We can, however, obtain a sort of “parity check” from writing
permutations (nonuniquely) as products of 2-cycles.

Definition. A 2-cycle is called a transposition . 

Intuitively, every permutation of {1, 2, . . . , n} can be realized by a succession of
transpositions or simple interchanges of pairs of elements (try this on a small deck of
cards sometime!). We illustrate how this may be done. First observe that

$$(a_1\ a_2\ \ldots\ a_m) = (a_1\ a_m)(a_1\ a_{m-1})(a_1\ a_{m-2}) \cdots (a_1\ a_2)$$

for any m-cycle. Now any permutation in $S_n$ may be written as a product of cycles (for
instance, its cycle decomposition). Writing each of these cycles in turn as a product of
transpositions by the above procedure we see that

every element of $S_n$ may be written as a product of transpositions

or, equivalently,

$$S_n = \langle T \rangle \quad\text{where}\quad T = \{(i\ j) \mid 1 \le i < j \le n\}.$$

For example, the permutation $\sigma$ in Section 1.3 may be written

$$\begin{aligned}
\sigma &= (1\ 12\ 8\ 10\ 4)(2\ 13)(5\ 11\ 7)(6\ 9) \\
&= (1\ 4)(1\ 10)(1\ 8)(1\ 12)(2\ 13)(5\ 7)(5\ 11)(6\ 9).
\end{aligned}$$
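
The procedure just described is mechanical, and the following sketch carries it out for the example above. Cycles are assumed to be stored as tuples and products are composed right to left, as in the text; the function names are hypothetical and introduced only for this illustration.

```python
def cycle_to_transpositions(cycle):
    """(a1 a2 ... am) = (a1 am)(a1 a_{m-1}) ... (a1 a2), as a list read left to right."""
    first = cycle[0]
    return [(first, a) for a in reversed(cycle[1:])]

def apply_product(cycles, x):
    """Apply a product of cycles to x, composing right to left as in the text."""
    for cycle in reversed(cycles):
        if x in cycle:
            x = cycle[(cycle.index(x) + 1) % len(cycle)]
    return x

sigma = [(1, 12, 8, 10, 4), (2, 13), (5, 11, 7), (6, 9)]
transpositions = [t for cycle in sigma for t in cycle_to_transpositions(cycle)]
print(transpositions)

# Both products define the same permutation of {1, ..., 13}.
same = all(apply_product(sigma, x) == apply_product(transpositions, x)
           for x in range(1, 14))
print(same)
```

The list produced is the product of eight transpositions displayed above, and the final check confirms that it agrees with the cycle decomposition on every point.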

The Alternating Group 

Again we emphasize that for any $\sigma \in S_n$ there may be many ways of writing $\sigma$ as a
product of transpositions. For fixed $\sigma$ we now show that the parity (i.e., an odd or even
number of terms) is the same for any product of transpositions equaling $\sigma$.

Let $x_1, \ldots, x_n$ be independent variables and let $\Delta$ be the polynomial

$$\Delta = \prod_{1 \le i < j \le n} (x_i - x_j),$$

i.e., the product of all the terms $x_i - x_j$ for $i < j$. For example, when n = 4,

$$\Delta = (x_1 - x_2)(x_1 - x_3)(x_1 - x_4)(x_2 - x_3)(x_2 - x_4)(x_3 - x_4).$$

For each $\sigma \in S_n$ let $\sigma$ act on $\Delta$ by permuting the variables in the same way it permutes
their indices:

$$\sigma(\Delta) = \prod_{1 \le i < j \le n} (x_{\sigma(i)} - x_{\sigma(j)}).$$

For example, if n = 4 and $\sigma = (1\ 2\ 3\ 4)$ then

$$\sigma(\Delta) = (x_2 - x_3)(x_2 - x_4)(x_2 - x_1)(x_3 - x_4)(x_3 - x_1)(x_4 - x_1)$$

(we have written the factors in the same order as above and applied $\sigma$ to each factor to
get $\sigma(\Delta)$). Note (in general) that $\Delta$ contains one factor $x_i - x_j$ for all $i < j$, and since
$\sigma$ is a bijection of the indices, $\sigma(\Delta)$ must contain either $x_i - x_j$ or $x_j - x_i$, but not both
(and certainly no $x_i - x_i$ terms), for all $i < j$. If $\sigma(\Delta)$ has a factor $x_j - x_i$ where $j > i$,
write this term as $-(x_i - x_j)$. Collecting all the changes in sign together we see that $\Delta$
and $\sigma(\Delta)$ have the same factors up to a product of $-1$’s, i.e.,

$$\sigma(\Delta) = \pm\Delta, \quad\text{for all } \sigma \in S_n.$$

For each $\sigma \in S_n$ let

$$\epsilon(\sigma) = \begin{cases} +1, & \text{if } \sigma(\Delta) = \Delta \\ -1, & \text{if } \sigma(\Delta) = -\Delta. \end{cases}$$

In the example above with n = 4 and $\sigma = (1\ 2\ 3\ 4)$, there are exactly 3 factors of the
form $x_j - x_i$ where $j > i$ in $\sigma(\Delta)$, each of which contributes a factor of $-1$. Hence

$$(1\ 2\ 3\ 4)(\Delta) = (-1)^3(\Delta) = -\Delta,$$

so

$$\epsilon((1\ 2\ 3\ 4)) = -1.$$

Definition.

(1) $\epsilon(\sigma)$ is called the sign of $\sigma$.

(2) $\sigma$ is called an even permutation if $\epsilon(\sigma) = 1$ and an odd permutation if $\epsilon(\sigma) = -1$.
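
The definition translates directly into a computation: a factor of $\sigma(\Delta)$ must be flipped exactly when $i < j$ but $\sigma(i) > \sigma(j)$, so it suffices to count such pairs. The sketch below assumes a permutation of {1, ..., n} is stored as a tuple whose entry in position i is the image of i; this encoding and the name `sign` are choices made for the illustration.

```python
from itertools import combinations

def sign(sigma):
    """Sign of sigma, where sigma[i-1] is the image of i under the permutation.

    Each pair i < j with sigma(i) > sigma(j) gives one factor of sigma(Delta)
    that must be flipped, contributing a factor of -1.
    """
    flips = sum(1 for i, j in combinations(range(len(sigma)), 2)
                if sigma[i] > sigma[j])
    return (-1) ** flips

four_cycle = (2, 3, 4, 1)        # the 4-cycle (1 2 3 4): 1->2, 2->3, 3->4, 4->1
print(sign(four_cycle))          # three flipped factors, so the sign is -1
```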

The next result shows that the sign of a permutation defines a homomorphism. 

Proposition 23. The map $\epsilon : S_n \to \{\pm 1\}$ is a homomorphism (where $\{\pm 1\}$ is a
multiplicative version of the cyclic group of order 2).

Proof: By definition,

$$(\tau\sigma)(\Delta) = \prod_{1 \le i < j \le n} (x_{\tau\sigma(i)} - x_{\tau\sigma(j)}).$$

Suppose that $\sigma(\Delta)$ has exactly k factors of the form $x_j - x_i$ with $j > i$, that is
$\epsilon(\sigma) = (-1)^k$. When calculating $(\tau\sigma)(\Delta)$, after first applying $\sigma$ to the indices we see
that $(\tau\sigma)(\Delta)$ has exactly k factors of the form $x_{\tau(j)} - x_{\tau(i)}$ with $j > i$. Interchanging
the order of the terms in these k factors introduces the sign change $(-1)^k = \epsilon(\sigma)$, and
now all factors of $(\tau\sigma)(\Delta)$ are of the form $x_{\tau(p)} - x_{\tau(q)}$, with $p < q$. Thus

$$(\tau\sigma)(\Delta) = \epsilon(\sigma) \prod_{1 \le p < q \le n} (x_{\tau(p)} - x_{\tau(q)}).$$

Since by definition of $\epsilon$

$$\prod_{1 \le p < q \le n} (x_{\tau(p)} - x_{\tau(q)}) = \epsilon(\tau)\Delta$$

we have $(\tau\sigma)(\Delta) = \epsilon(\sigma)\epsilon(\tau)\Delta$. Thus $\epsilon(\tau\sigma) = \epsilon(\sigma)\epsilon(\tau) = \epsilon(\tau)\epsilon(\sigma)$, as claimed.

To see the proof in action, let n = 4, $\sigma = (1\ 2\ 3\ 4)$, $\tau = (4\ 2\ 3)$ so $\tau\sigma = (1\ 3\ 2\ 4)$.
By definition (using the explicit $\Delta$ in this case),

$$\begin{aligned}
(\tau\sigma)(\Delta) &= (1\ 3\ 2\ 4)(\Delta) \\
&= (x_3 - x_4)(x_3 - x_2)(x_3 - x_1)(x_4 - x_2)(x_4 - x_1)(x_2 - x_1) \\
&= (-1)^5 \Delta
\end{aligned}$$

where all factors except the first one are flipped to recover $\Delta$. This shows $\epsilon(\tau\sigma) = -1$.
On the other hand, since we already computed $\sigma(\Delta)$

$$\begin{aligned}
(\tau\sigma)(\Delta) &= \tau(\sigma(\Delta)) \\
&= (x_{\tau(2)} - x_{\tau(3)})(x_{\tau(2)} - x_{\tau(4)})(x_{\tau(2)} - x_{\tau(1)})(x_{\tau(3)} - x_{\tau(4)}) \\
&\qquad \times (x_{\tau(3)} - x_{\tau(1)})(x_{\tau(4)} - x_{\tau(1)}) \\
&= (-1)^3 \prod_{1 \le p < q \le 4} (x_{\tau(p)} - x_{\tau(q)}) = (-1)^3 \tau(\Delta)
\end{aligned}$$

where here the third, fifth, and sixth factors need to have their terms interchanged in
order to put all factors in the form $x_{\tau(p)} - x_{\tau(q)}$ with $p < q$. We already calculated that
$\epsilon(\sigma) = (-1)^3 = -1$ and, by the same method, it is easy to see that $\epsilon(\tau) = (-1)^2 = 1$
so $\epsilon(\tau\sigma) = -1 = \epsilon(\tau)\epsilon(\sigma)$.
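
Proposition 23 can also be confirmed exhaustively for a small symmetric group. The sketch below assumes permutations of {0, 1, 2, 3} stored as tuples (so every point in the worked example is shifted down by 1) and checks the homomorphism property for all 576 pairs, then repeats the example with $\sigma$ and $\tau$ as above; the helper names are illustrative.

```python
from itertools import combinations, permutations

def sign(p):
    """Sign of p, counting the pairs i < j whose images appear in reversed order."""
    flips = sum(1 for i, j in combinations(range(len(p)), 2) if p[i] > p[j])
    return (-1) ** flips

def compose(tau, sigma):
    """Return tau∘sigma (apply sigma first, then tau)."""
    return tuple(tau[sigma[i]] for i in range(len(sigma)))

S4 = list(permutations(range(4)))
is_homomorphism = all(sign(compose(t, s)) == sign(t) * sign(s)
                      for t in S4 for s in S4)

sigma = (1, 2, 3, 0)     # (1 2 3 4) written on {0, 1, 2, 3}
tau = (0, 2, 3, 1)       # (4 2 3) written on {0, 1, 2, 3}
product = compose(tau, sigma)
print(is_homomorphism, product, sign(product), sign(tau), sign(sigma))
```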

The next step is to compute $\epsilon((i\ j))$, for any transposition $(i\ j)$. Rather than
compute this directly for arbitrary i and j we do it first for i = 1 and j = 2 and reduce
the general case to this. It is clear that applying (1 2) to $\Delta$ (regardless of what n is) will
flip exactly one factor, namely $x_1 - x_2$; thus $\epsilon((1\ 2)) = -1$. Now for any transposition
$(i\ j)$ let $\lambda$ be the permutation which interchanges 1 and i, interchanges 2 and j, and
leaves all other numbers fixed (if i = 1 or j = 2, $\lambda$ fixes i or j, respectively). Then
it is easy to see that $(i\ j) = \lambda(1\ 2)\lambda$ (compute what the right hand side does to any
$k \in \{1, 2, \ldots, n\}$). Since $\epsilon$ is a homomorphism we obtain

$$\begin{aligned}
\epsilon((i\ j)) &= \epsilon(\lambda(1\ 2)\lambda) \\
&= \epsilon(\lambda)\epsilon((1\ 2))\epsilon(\lambda) \\
&= (-1)\epsilon(\lambda)^2 \\
&= -1.
\end{aligned}$$

This proves 

Proposition 24. Transpositions are all odd permutations and e is a surjective homo- 
morphism. 

Definition. The alternating group of degree n, denoted by A„, is the kernel of the 
homomorphism e (i.e., the set of even permutations). 

Note that by the First Isomorphism Theorem S n /A„ = e(S n ) — {±1}, so that the 
order of A n is easily determined: \A n \ — — | [ = -(n!). Also, S„ — A„ is the coset of 
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A„ which is not the identity coset and this is the set of all odd permutations. The signs 
of permutations obey the usual Z/2Z laws: 

( even )( even) = (odd)(odd) = even 
( even )( odd) = ( odd )( even) = odd. 

Moreover, since $\epsilon$ is a homomorphism and every $\sigma \in S_n$ is a product of transpositions,
say $\sigma = \tau_1\tau_2 \cdots \tau_k$, then $\epsilon(\sigma) = \epsilon(\tau_1) \cdots \epsilon(\tau_k)$; since $\epsilon(\tau_i) = -1$ for $i = 1, 2, \ldots, k$,
$\epsilon(\sigma) = (-1)^k$. Thus the class of $k$ (mod 2), i.e., the parity of the number of transpositions
in the product, is the same no matter how we write $\sigma$ as a product of transpositions:

$$\epsilon(\sigma) = \begin{cases} +1, & \text{if } \sigma \text{ is a product of an even number of transpositions} \\ -1, & \text{if } \sigma \text{ is a product of an odd number of transpositions.} \end{cases}$$

Finally we give a quick way of computing $\epsilon(\sigma)$ from the cycle decomposition of $\sigma$.
Recall that an $m$-cycle may be written as a product of $m - 1$ transpositions. Thus

an $m$-cycle is an odd permutation if and only if $m$ is even.

For any permutation $\sigma$ let $\alpha_1\alpha_2 \cdots \alpha_k$ be its cycle decomposition. Then $\epsilon(\sigma)$ is
given by $\epsilon(\alpha_1) \cdots \epsilon(\alpha_k)$ and $\epsilon(\alpha_i) = -1$ if and only if the length of $\alpha_i$ is even. It
follows that for $\epsilon(\sigma)$ to be $-1$ the product of the $\epsilon(\alpha_i)$'s must contain an odd number
of factors of $(-1)$. We summarize this in the following proposition:

Proposition 25. The permutation $\sigma$ is odd if and only if the number of cycles of even
length in its cycle decomposition is odd.

For example, $\sigma = (1\ 2\ 3\ 4\ 5\ 6)(7\ 8\ 9)(10\ 11)(12\ 13\ 14\ 15)(16\ 17\ 18)$ has 3 cycles
of even length, so $\epsilon(\sigma) = -1$. On the other hand, $\tau = (1\ 12\ 8\ 10\ 4)(2\ 13)(5\ 11\ 7)(6\ 9)$
has exactly 2 cycles of even length, hence $\epsilon(\tau) = 1$.
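Proposition 25 translates directly into a short computation. The sketch below is illustrative only: it assumes a permutation is given as a list of disjoint cycles (tuples of integers), and the function names are hypothetical. It computes the sign once by counting even-length cycles and once by writing each $m$-cycle as $m - 1$ transpositions, then checks that the two agree on the two permutations just discussed.

```python
def sign_from_cycles(cycles):
    """Sign via Proposition 25: odd iff the number of even-length cycles is odd."""
    even_length_cycles = sum(1 for c in cycles if len(c) % 2 == 0)
    return -1 if even_length_cycles % 2 else 1

def sign_from_transpositions(cycles):
    """Sign via (-1)^k, where an m-cycle contributes m - 1 transpositions to k."""
    k = sum(len(c) - 1 for c in cycles)
    return -1 if k % 2 else 1

sigma = [(1, 2, 3, 4, 5, 6), (7, 8, 9), (10, 11), (12, 13, 14, 15), (16, 17, 18)]
tau = [(1, 12, 8, 10, 4), (2, 13), (5, 11, 7), (6, 9)]

print(sign_from_cycles(sigma), sign_from_transpositions(sigma))
print(sign_from_cycles(tau), sign_from_transpositions(tau))
```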

Be careful not to confuse the terms "odd" and "even" for a permutation $\sigma$ with the
parity of the order of $\sigma$. In fact, if $\sigma$ is of odd order, all cycles in the cycle decomposition
of $\sigma$ have odd length so $\sigma$ has an even (in this case 0) number of cycles of even length
and hence is an even permutation. If $|\sigma|$ is even, $\sigma$ may be either an even or an odd
permutation; e.g., $(1\ 2)$ is odd, $(1\ 2)(3\ 4)$ is even but both have order 2.

As we mentioned in the preceding section, the alternating groups $A_n$ will be important
in the study of solvability of polynomials. In the next chapter we shall prove:

$A_n$ is a non-abelian simple group for all $n \ge 5$.

For small values of $n$, $A_n$ is already familiar to us: $A_1$ and $A_2$ are both the trivial
group and $|A_3| = 3$ (so $A_3 = \langle (1\ 2\ 3) \rangle \cong Z_3$). The group $A_4$ has order 12. Exercise 7
shows $A_4$ is isomorphic to the group of symmetries of a regular tetrahedron. The lattice
of subgroups of $A_4$ appears in Figure 8 (Exercise 8 asserts that this is its complete
lattice of subgroups). One of the nicer aspects of this lattice is that (unlike "virtually
all groups") it is a planar graph (there are no crossing lines except at the vertices; see
the lattice of $D_{16}$ for a nonplanar lattice).
EXERCISES

1. In Exercises 1 and 2 of Section 1.3 you were asked to find the cycle decomposition of some
permutations. Write each of these permutations as a product of transpositions. Determine
which of these is an even permutation and which is an odd permutation.

2. Prove that $\sigma^2$ is an even permutation for every permutation $\sigma$.

3. Prove that $S_n$ is generated by $\{(i\ \ i+1) \mid 1 \le i \le n - 1\}$. [Consider conjugates, viz.
$(2\ 3)(1\ 2)(2\ 3)^{-1}$.]

4. Show that $S_n = \langle (1\ 2), (1\ 2\ 3 \ldots n) \rangle$ for all $n \ge 2$.

5. Show that if $p$ is prime, $S_p = \langle \sigma, \tau \rangle$ where $\sigma$ is any transposition and $\tau$ is any $p$-cycle.

6. Show that $\langle (1\ 3), (1\ 2\ 3\ 4) \rangle$ is a proper subgroup of $S_4$. What is the isomorphism type of
this subgroup?

7. Prove that the group of rigid motions of a tetrahedron is isomorphic to $A_4$. [Recall Exercise
20 in Section 1.7.]

8. Prove the lattice of subgroups of $A_4$ given in the text is correct. [By the preceding exercise
and the comments following Lagrange's Theorem, $A_4$ has no subgroup of order 6.]

9. Prove that the (unique) subgroup of order 4 in $A_4$ is normal and is isomorphic to $V_4$.

10. Find a composition series for $A_4$. Deduce that $A_4$ is solvable.

11. Prove that $S_4$ has no subgroup isomorphic to $Q_8$.

12. Prove that $A_n$ contains a subgroup isomorphic to $S_{n-2}$ for each $n \ge 3$.

13. Prove that every element of order 2 in $A_n$ is the square of an element of order 4 in $S_n$. [An
element of order 2 in $A_n$ is a product of $2k$ commuting transpositions.]

14. Prove that the subgroup of $A_4$ generated by any element of order 2 and any element of
order 3 is all of $A_4$.

15. Prove that if $x$ and $y$ are distinct 3-cycles in $S_4$ with $x \ne y^{-1}$, then the subgroup of $S_4$
generated by $x$ and $y$ is $A_4$.

16. Let $x$ and $y$ be distinct 3-cycles in $S_5$ with $x \ne y^{-1}$.

(a) Prove that if $x$ and $y$ fix a common element of $\{1, \ldots, 5\}$, then $\langle x, y \rangle \cong A_4$.

(b) Prove that if $x$ and $y$ do not fix a common element of $\{1, \ldots, 5\}$, then $\langle x, y \rangle = A_5$.

17. If $x$ and $y$ are 3-cycles in $S_n$, prove that $\langle x, y \rangle$ is isomorphic to $Z_3$, $A_4$, $A_5$ or $Z_3 \times Z_3$.
CHAPTER 4


Group Actions 


In this chapter we consider some of the consequences of a group acting on a set. It is 
an important and recurring idea in mathematics that when one object acts on another 
then much information can be obtained on both. As more structure is added to the 
set on which the group acts (for example, groups acting on groups or groups acting 
on vector spaces (considered in Chapter 18)), more information on the structure of the 
group becomes available. This study of group actions culminates here in the proof of 
Sylow’s Theorem and the examples and classifications which accrue from it. 

The concept of an action will recur as we study modules, vector spaces, canonical
forms for matrices and Galois Theory, and is one of the fundamental unifying themes
in the text.

4.1 GROUP ACTIONS AND PERMUTATION REPRESENTATIONS 

In this section we give the basic theory of group actions and then apply this theory to
subgroups of $S_n$ acting on $\{1, 2, \ldots, n\}$ to prove that every element of $S_n$ has a unique
cycle decomposition. In Sections 2 and 3 we apply the general theory to two other
specific group actions to derive some important results.

Let $G$ be a group acting on a nonempty set $A$. Recall from Section 1.7 that for each
$g \in G$ the map

$$\sigma_g : A \to A \quad \text{defined by} \quad \sigma_g : a \mapsto g \cdot a$$

is a permutation of $A$. We also saw in Section 1.7 that there is a homomorphism
associated to an action of $G$ on $A$:

$$\varphi : G \to S_A \quad \text{defined by} \quad \varphi(g) = \sigma_g,$$

called the permutation representation associated to the given action. Recall some
additional terminology associated to group actions introduced in Sections 1.7 and 2.2.

Definition.

(1) The kernel of the action is the set of elements of $G$ that act trivially on every
element of $A$: $\{g \in G \mid g \cdot a = a \text{ for all } a \in A\}$.

(2) For each $a \in A$ the stabilizer of $a$ in $G$ is the set of elements of $G$ that fix the
element $a$: $\{g \in G \mid g \cdot a = a\}$ and is denoted by $G_a$.

(3) An action is faithful if its kernel is the identity.

Note that the kernel of an action is precisely the same as the kernel of the associated
permutation representation; in particular, the kernel is a normal subgroup of $G$. Two
group elements induce the same permutation on $A$ if and only if they are in the same coset
of the kernel (if and only if they are in the same fiber of the permutation representation
$\varphi$). In particular an action of $G$ on $A$ may also be viewed as a faithful action of the
quotient group $G/\ker \varphi$ on $A$. Recall from Section 2.2 that the stabilizer in $G$ of an
element $a$ of $A$ is a subgroup of $G$. If $a$ is a fixed element of $A$, then the kernel of
the action is contained in the stabilizer $G_a$ since the kernel of the action is the set of
elements of $G$ that stabilize every point, namely $\bigcap_{a \in A} G_a$.

Examples 

(1) Let $n$ be a positive integer. The group $G = S_n$ acts on the set $A = \{1, 2, \ldots, n\}$
by $\sigma \cdot i = \sigma(i)$ for all $i \in \{1, \ldots, n\}$. The permutation representation associated
to this action is the identity map $\varphi : S_n \to S_n$. This action is faithful and for each
$i \in \{1, \ldots, n\}$ the stabilizer $G_i$ (the subgroup of all permutations fixing $i$) is isomorphic
to $S_{n-1}$ (cf. Exercise 15, Section 3.2).

(2) Let $G = D_8$ act on the set $A$ consisting of the four vertices of a square. Label these
vertices 1, 2, 3, 4 in a clockwise fashion as in Figure 2 of Section 1.2. Let $r$ be the
rotation of the square clockwise by $\pi/2$ radians and let $s$ be the reflection in the line
which passes through vertices 1 and 3. Then the permutations of the vertices given by
$r$ and $s$ are

$$\sigma_r = (1\ 2\ 3\ 4) \quad \text{and} \quad \sigma_s = (2\ 4).$$

Note that since the permutation representation is a homomorphism, the permutation
of the four vertices corresponding to $sr$ is $\sigma_{sr} = \sigma_s\sigma_r = (1\ 4)(2\ 3)$. The action of $D_8$
on the four vertices of a square is faithful since only the identity symmetry fixes all
four vertices. The stabilizer of any vertex $a$ is the subgroup of $D_8$ of order 2 generated
by the reflection about the line passing through $a$ and the center of the square (so, for
example, the stabilizer of vertex 1 is $\langle s \rangle$).

(3) Label the four vertices of a square as in the preceding example and now let $A$ be the set
whose elements consist of unordered pairs of opposite vertices: $A = \{\{1, 3\}, \{2, 4\}\}$.
Then $D_8$ also acts on this set $A$ since each symmetry of the square sends a pair of
opposite vertices to a pair of opposite vertices. The rotation $r$ interchanges the pairs
$\{1, 3\}$ and $\{2, 4\}$; the reflection $s$ fixes both unordered pairs of opposite vertices. Thus
if we label the pairs $\{1, 3\}$ and $\{2, 4\}$ as 1 and 2, respectively, then the permutations of
$A$ given by $r$ and $s$ are

$$\sigma_r = (1\ 2) \quad \text{and} \quad \sigma_s = \text{the identity permutation}.$$

This action of $D_8$ is not faithful: its kernel is $\langle s, r^2 \rangle$. Moreover, for each $a \in A$ the
stabilizer in $D_8$ of $a$ is the same as the kernel of the action.

(4) Label the four vertices of a square as in Example 2 and now let $A$ be the following set
of unordered pairs of vertices: $\{\{1, 2\}, \{3, 4\}\}$. The group $D_8$ does not act on this set
$A$ because $\{1, 2\} \in A$ but $r \cdot \{1, 2\} = \{2, 3\} \notin A$.
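The claims in Examples 2 to 4 can be verified by brute force. The following Python sketch is illustrative and rests on stated assumptions: $D_8$ is realized as the group of vertex permutations generated by $(1\ 2\ 3\ 4)$ and $(2\ 4)$, a permutation is a tuple whose entry $i-1$ is the image of vertex $i$, and products are composed right to left as in the text. The helper names (`compose`, `generate`, `act_on_set`) are hypothetical.

```python
def compose(p, q):
    """Return p*q (q acts first); p[i-1] is the image of vertex i."""
    return tuple(p[x - 1] for x in q)

def generate(gens):
    """Closure of the generators under left multiplication (enough in a finite group)."""
    identity = tuple(range(1, len(gens[0]) + 1))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

r = (2, 3, 4, 1)  # sigma_r = (1 2 3 4)
s = (1, 4, 3, 2)  # sigma_s = (2 4)
identity = (1, 2, 3, 4)
D8 = generate([r, s])

def act_on_set(g, subset):
    """Image of a set of vertices under g."""
    return frozenset(g[v - 1] for v in subset)

# Example 2: action on the four vertices.
kernel_vertices = {g for g in D8 if all(g[v - 1] == v for v in (1, 2, 3, 4))}
stabilizer_of_1 = {g for g in D8 if g[0] == 1}

# Example 3: action on the two pairs of opposite vertices.
pairs = [frozenset({1, 3}), frozenset({2, 4})]
kernel_pairs = {g for g in D8 if all(act_on_set(g, B) == B for B in pairs)}

# Example 4: the set {{1, 2}, {3, 4}} is not preserved by r.
image_of_12 = act_on_set(r, {1, 2})

print(len(D8), compose(s, r), kernel_vertices, stabilizer_of_1)
print(sorted(kernel_pairs), sorted(image_of_12))
```

Here `compose(s, r)` is the tuple form of $(1\ 4)(2\ 3)$, and `kernel_pairs` consists of the four elements of $\langle s, r^2 \rangle$.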

The relation between actions and homomorphisms into symmetric groups may be
reversed. Namely, given any nonempty set $A$ and any homomorphism $\varphi$ of the group
$G$ into $S_A$ we obtain an action of $G$ on $A$ by defining

$$g \cdot a = \varphi(g)(a)$$

for all $g \in G$ and all $a \in A$. The kernel of this action is the same as $\ker \varphi$. The permutation
representation associated to this action is precisely the given homomorphism $\varphi$.
This proves the following result.

Proposition 1. For any group $G$ and any nonempty set $A$ there is a bijection between
the actions of $G$ on $A$ and the homomorphisms of $G$ into $S_A$.

In view of Proposition 1 the definition of a permutation representation may be
rephrased.

Definition. If $G$ is a group, a permutation representation of $G$ is any homomorphism
of $G$ into the symmetric group $S_A$ for some nonempty set $A$. We shall say a given action
of $G$ on $A$ affords or induces the associated permutation representation of $G$.

We can think of a permutation representation as an analogue of the matrix representation
of a linear transformation. In the case where $A$ is a finite set of $n$ elements we
have $S_A \cong S_n$ (cf. Section 1.6), so by fixing a labelling of the elements of $A$ we may
consider our permutations as elements of the group $S_n$ (which is exactly what we did in
Examples 2 and 3 above), in the same way that fixing a basis for a vector space allows
us to view a linear transformation as a matrix.

We now prove a combinatorial result about group actions which will have important 
consequences when we apply it to specific actions in subsequent sections. 

Proposition 2. Let $G$ be a group acting on the nonempty set $A$. The relation on $A$
defined by

$$a \sim b \quad \text{if and only if} \quad a = g \cdot b \text{ for some } g \in G$$

is an equivalence relation. For each $a \in A$, the number of elements in the equivalence
class containing $a$ is $|G : G_a|$, the index of the stabilizer of $a$.

Proof: We first prove $\sim$ is an equivalence relation. By axiom 2 of an action, $a = 1 \cdot a$
for all $a \in A$, i.e., $a \sim a$ and the relation is reflexive. If $a \sim b$, then $a = g \cdot b$ for some
$g \in G$ so that

$$g^{-1} \cdot a = g^{-1} \cdot (g \cdot b) = (g^{-1}g) \cdot b = 1 \cdot b = b,$$

that is, $b \sim a$ and the relation is symmetric. Finally, if $a \sim b$ and $b \sim c$, then $a = g \cdot b$
and $b = h \cdot c$, for some $g, h \in G$ so

$$a = g \cdot b = g \cdot (h \cdot c) = (gh) \cdot c,$$

hence $a \sim c$, and the relation is transitive.

To prove the last statement of the proposition we exhibit a bijection between the
left cosets of $G_a$ in $G$ and the elements of the equivalence class of $a$. Let $C_a$ be the class
of $a$, so

$$C_a = \{g \cdot a \mid g \in G\}.$$

Suppose $b = g \cdot a \in C_a$. Then $gG_a$ is a left coset of $G_a$ in $G$. The map

$$b = g \cdot a \mapsto gG_a$$


is a map from $C_a$ to the set of left cosets of $G_a$ in $G$. This map is surjective since for
any $g \in G$ the element $g \cdot a$ is an element of $C_a$. Since $g \cdot a = h \cdot a$ if and only if
$h^{-1}g \in G_a$ if and only if $gG_a = hG_a$, the map is also injective, hence is a bijection.
This completes the proof.

By Proposition 2 a group $G$ acting on the set $A$ partitions $A$ into disjoint equivalence
classes under the action of $G$. These classes are given a name:

Definition. Let $G$ be a group acting on the nonempty set $A$.

(1) The equivalence class $\{g \cdot a \mid g \in G\}$ is called the orbit of $G$ containing $a$.

(2) The action of $G$ on $A$ is called transitive if there is only one orbit, i.e., given
any two elements $a, b \in A$ there is some $g \in G$ such that $a = g \cdot b$.

Examples

Let $G$ be a group acting on the set $A$.

(1) If $G$ acts trivially on $A$ then $G_a = G$ for all $a \in A$ and the orbits are the elements of
$A$. This action is transitive if and only if $|A| = 1$.

(2) The symmetric group $G = S_n$ acts transitively in its usual action as permutations on
$A = \{1, 2, \ldots, n\}$. Note that the stabilizer in $G$ of any point $i$ has index $n = |A|$ in $S_n$.

(3) When the group $G$ acts on the set $A$, any subgroup of $G$ also acts on $A$. If $G$ is
transitive on $A$ a subgroup of $G$ need not be transitive on $A$. For example, if $G =
\langle (1\ 2), (3\ 4) \rangle \le S_4$ then the orbits of $G$ on $\{1, 2, 3, 4\}$ are $\{1, 2\}$ and $\{3, 4\}$ and there
is no element of $G$ that sends 2 to 3. The discussion below on cycle decompositions
shows that when $\langle \sigma \rangle$ is any cyclic subgroup of $S_n$ then the orbits of $\langle \sigma \rangle$ consist of
the sets of numbers that appear in the individual cycles in the cycle decomposition of
$\sigma$ (for example, the orbits of $\langle (1\ 2)(3\ 4\ 5) \rangle$ are $\{1, 2\}$ and $\{3, 4, 5\}$).

(4) The group $D_8$ acts transitively on the four vertices of the square and the stabilizer of
any vertex is the subgroup of order 2 (and index 4) generated by the reflection about
the line of symmetry passing through that point.

(5) The group $D_8$ also acts transitively on the set of two pairs of opposite vertices. In this
action the stabilizer of any point is $\langle s, r^2 \rangle$ (which is of index 2).
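The orbit and index computations in these examples can be checked numerically. The sketch below is a minimal illustration under assumptions not in the text: permutations are tuples whose entry $i-1$ is the image of $i$, subgroups are built by closing generators under multiplication, and the function names are hypothetical. It confirms the orbits in Example 3 and the equality (orbit size) $\cdot$ (stabilizer order) $= |G|$, which is the counting statement of Proposition 2 for finite groups.

```python
def compose(p, q):
    """Return p*q (q acts first); p[i-1] is the image of i."""
    return tuple(p[x - 1] for x in q)

def generate(gens):
    """Closure of the generators under left multiplication (enough in a finite group)."""
    identity = tuple(range(1, len(gens[0]) + 1))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def orbit(group, a):
    """The set {g . a | g in G}."""
    return {g[a - 1] for g in group}

def stabilizer(group, a):
    """The subgroup G_a of elements fixing a."""
    return {g for g in group if g[a - 1] == a}

def orbits(group, n):
    """The partition of {1, ..., n} into orbits."""
    return {frozenset(orbit(group, a)) for a in range(1, n + 1)}

G = generate([(2, 1, 3, 4), (1, 2, 4, 3)])   # <(1 2), (3 4)> in S_4
C = generate([(2, 1, 4, 5, 3)])              # <(1 2)(3 4 5)> in S_5
S4 = generate([(2, 1, 3, 4), (2, 3, 4, 1)])  # S_4 = <(1 2), (1 2 3 4)>

for name, group, n in (("G", G, 4), ("C", C, 5), ("S4", S4, 4)):
    for a in range(1, n + 1):
        # |orbit of a| = |G : G_a|
        assert len(orbit(group, a)) * len(stabilizer(group, a)) == len(group)
    print(name, len(group), sorted(sorted(o) for o in orbits(group, n)))
```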

Cycle Decompositions 

We now prove that every element of the symmetric group $S_n$ has the unique cycle
decomposition described in Section 1.3. Let $A = \{1, 2, \ldots, n\}$, let $\sigma$ be an element
of $S_n$ and let $G = \langle \sigma \rangle$. Then $\langle \sigma \rangle$ acts on $A$ and so, by Proposition 2, it partitions
$\{1, 2, \ldots, n\}$ into a unique set of (disjoint) orbits. Let $O$ be one of these orbits and let
$x \in O$. By (the proof of) Proposition 2 applied to $A = O$ we see that there is a bijection
between the left cosets of $G_x$ in $G$ and the elements of $O$, given explicitly by

$$\sigma^i x \mapsto \sigma^i G_x.$$

Since $G$ is a cyclic group, $G_x \trianglelefteq G$ and $G/G_x$ is cyclic of order $d$, where $d$ is the
smallest positive integer for which $\sigma^d \in G_x$ (cf. Example 2 following Proposition 7 in
Section 3.1). Also, $d = |G : G_x| = |O|$. Thus the distinct cosets of $G_x$ in $G$ are

$$1G_x,\ \sigma G_x,\ \sigma^2 G_x,\ \ldots,\ \sigma^{d-1} G_x.$$

This shows that the distinct elements of $O$ are

$$x,\ \sigma(x),\ \sigma^2(x),\ \ldots,\ \sigma^{d-1}(x).$$

Ordering the elements of $O$ in this manner shows that $\sigma$ cycles the elements of $O$,
that is, on an orbit of size $d$, $\sigma$ acts as a $d$-cycle. This proves the existence of a cycle
decomposition for each $\sigma \in S_n$.

The orbits of $\langle \sigma \rangle$ are uniquely determined by $\sigma$. The only latitude is in which
order the orbits are listed. Within each orbit, $O$, we may begin with any element as a
representative. Choosing $\sigma^i(x)$ instead of $x$ as the initial representative simply produces
the elements of $O$ in the order

$$\sigma^i(x),\ \sigma^{i+1}(x),\ \ldots,\ \sigma^{d-1}(x),\ x,\ \sigma(x),\ \ldots,\ \sigma^{i-1}(x),$$

which is a cyclic permutation (forward $i - 1$ terms) of the original list. It follows that
the cycle decomposition above is unique up to a rearrangement of the cycles and up to
a cyclic permutation of the integers within each cycle.

Subgroups of symmetric groups are called permutation groups. For any subgroup
$G$ of $S_n$ the orbits of $G$ will refer to its orbits on $\{1, 2, \ldots, n\}$. The orbits of an element
$\sigma$ in $S_n$ will mean the orbits of the group $\langle \sigma \rangle$ (namely the sets of integers comprising
the cycles in its cycle decomposition).
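The existence argument is constructive: starting from a point $x$ and repeatedly applying $\sigma$ lists the orbit of $x$ under $\langle \sigma \rangle$ in cyclic order. A minimal Python sketch of this procedure follows, assuming a permutation is stored as a tuple whose entry $i-1$ is the image of $i$; the function name is hypothetical, and 1-cycles are kept in the output so that every orbit is visible.

```python
def cycle_decomposition(perm):
    """List the orbits of <perm> on {1, ..., n}, each in the order x, perm(x), perm^2(x), ...

    Orbits are listed by increasing smallest element; fixed points appear as 1-cycles.
    """
    seen = set()
    cycles = []
    for x in range(1, len(perm) + 1):
        if x in seen:
            continue
        cycle = [x]
        seen.add(x)
        y = perm[x - 1]
        while y != x:          # stops after d steps, d = size of the orbit of x
            cycle.append(y)
            seen.add(y)
            y = perm[y - 1]
        cycles.append(tuple(cycle))
    return cycles

print(cycle_decomposition((2, 1, 4, 5, 3)))  # the permutation (1 2)(3 4 5)
print(cycle_decomposition((3, 4, 2, 1)))     # the permutation (1 3 2 4)
```

Starting each cycle at the smallest unused element is only one of the choices permitted by the uniqueness statement above; any other starting point yields a cyclic rotation of the same cycle.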

The exercises below further illustrate how group theoretic information can be ob- 
tained from permutation representations. 

EXERCISES 

Let G be a group and let A be a nonempty set. 

1. Let $G$ act on the set $A$. Prove that if $a, b \in A$ and $b = g \cdot a$ for some $g \in G$, then
$G_b = gG_ag^{-1}$ ($G_a$ is the stabilizer of $a$). Deduce that if $G$ acts transitively on $A$ then the
kernel of the action is $\bigcap_{g \in G} gG_ag^{-1}$.

2. Let $G$ be a permutation group on the set $A$ (i.e., $G \le S_A$), let $\sigma \in G$ and let $a \in A$. Prove
that $\sigma G_a \sigma^{-1} = G_{\sigma(a)}$. Deduce that if $G$ acts transitively on $A$ then

$$\bigcap_{\sigma \in G} \sigma G_a \sigma^{-1} = 1.$$

3. Assume that $G$ is an abelian, transitive subgroup of $S_A$. Show that $\sigma(a) \ne a$ for all
$\sigma \in G - \{1\}$ and all $a \in A$. Deduce that $|G| = |A|$. [Use the preceding exercise.]

4. Let $S_3$ act on the set $\Omega$ of ordered pairs: $\{(i, j) \mid 1 \le i, j \le 3\}$ by $\sigma((i, j)) = (\sigma(i), \sigma(j))$.
Find the orbits of $S_3$ on $\Omega$. For each $\sigma \in S_3$ find the cycle decomposition of $\sigma$ under this
action (i.e., find its cycle decomposition when $\sigma$ is considered as an element of $S_9$; first
fix a labelling of these nine ordered pairs). For each orbit $O$ of $S_3$ acting on these nine
points pick some $a \in O$ and find the stabilizer of $a$ in $S_3$.

5. For each of parts (a) and (b) repeat the preceding exercise but with $S_3$ acting on the specified
set:

(a) the set of 27 triples $\{(i, j, k) \mid 1 \le i, j, k \le 3\}$

(b) the set $\mathcal{P}(\{1, 2, 3\}) - \{\emptyset\}$ of all 7 nonempty subsets of $\{1, 2, 3\}$.

6. As in Exercise 12 of Section 2.2 let $R$ be the set of all polynomials with integer coefficients
in the independent variables $x_1, x_2, x_3, x_4$ and let $S_4$ act on $R$ by permuting the indices of
the four variables:

$$\sigma \cdot p(x_1, x_2, x_3, x_4) = p(x_{\sigma(1)}, x_{\sigma(2)}, x_{\sigma(3)}, x_{\sigma(4)})$$

for all $\sigma \in S_4$.

(a) Find the polynomials in the orbit of $S_4$ on $R$ containing $x_1 + x_2$ (recall from Exercise
12 in Section 2.2 that the stabilizer of this polynomial has order 4).

(b) Find the polynomials in the orbit of $S_4$ on $R$ containing $x_1x_2 + x_3x_4$ (recall from
Exercise 12 in Section 2.2 that the stabilizer of this polynomial has order 8).

(c) Find the polynomials in the orbit of $S_4$ on $R$ containing $(x_1 + x_2)(x_3 + x_4)$.

7. Let $G$ be a transitive permutation group on the finite set $A$. A block is a nonempty subset
$B$ of $A$ such that for all $\sigma \in G$ either $\sigma(B) = B$ or $\sigma(B) \cap B = \emptyset$ (here $\sigma(B)$ is the set
$\{\sigma(b) \mid b \in B\}$).

(a) Prove that if $B$ is a block containing the element $a$ of $A$, then the set $G_B$ defined by
$G_B = \{\sigma \in G \mid \sigma(B) = B\}$ is a subgroup of $G$ containing $G_a$.

(b) Show that if $B$ is a block and $\sigma_1(B), \sigma_2(B), \ldots, \sigma_n(B)$ are all the distinct images of
$B$ under the elements of $G$, then these form a partition of $A$.

(c) A (transitive) group $G$ on a set $A$ is said to be primitive if the only blocks in $A$
are the trivial ones: the sets of size 1 and $A$ itself. Show that $S_4$ is primitive on
$A = \{1, 2, 3, 4\}$. Show that $D_8$ is not primitive as a permutation group on the four
vertices of a square.

(d) Prove that the transitive group $G$ is primitive on $A$ if and only if for each $a \in A$, the
only subgroups of $G$ containing $G_a$ are $G_a$ and $G$ (i.e., $G_a$ is a maximal subgroup of
$G$, cf. Exercise 16, Section 2.4). [Use part (a).]

8. A transitive permutation group $G$ on a set $A$ is called doubly transitive if for any (hence
all) $a \in A$ the subgroup $G_a$ is transitive on the set $A - \{a\}$.

(a) Prove that $S_n$ is doubly transitive on $\{1, 2, \ldots, n\}$ for all $n \ge 2$.

(b) Prove that a doubly transitive group is primitive. Deduce that $D_8$ is not doubly
transitive in its action on the 4 vertices of a square.

9. Assume $G$ acts transitively on the finite set $A$ and let $H$ be a normal subgroup of $G$. Let
$O_1, O_2, \ldots, O_r$ be the distinct orbits of $H$ on $A$.

(a) Prove that $G$ permutes the sets $O_1, O_2, \ldots, O_r$ in the sense that for each $g \in G$ and
each $i \in \{1, \ldots, r\}$ there is a $j$ such that $gO_i = O_j$, where $gO = \{g \cdot a \mid a \in O\}$ (i.e.,
in the notation of Exercise 7 the sets $O_1, \ldots, O_r$ are blocks). Prove that $G$ is transitive
on $\{O_1, \ldots, O_r\}$. Deduce that all orbits of $H$ on $A$ have the same cardinality.

(b) Prove that if $a \in O_1$ then $|O_1| = |H : H \cap G_a|$ and prove that $r = |G : HG_a|$.
[Draw the sublattice describing the Second Isomorphism Theorem for the subgroups
$H$ and $G_a$ of $G$. Note that $H \cap G_a = H_a$.]

10. Let $H$ and $K$ be subgroups of the group $G$. For each $x \in G$ define the $HK$ double coset
of $x$ in $G$ to be the set

$$HxK = \{hxk \mid h \in H,\ k \in K\}.$$

(a) Prove that $HxK$ is the union of the left cosets $x_1K, \ldots, x_nK$ where $\{x_1K, \ldots, x_nK\}$
is the orbit containing $xK$ of $H$ acting by left multiplication on the set of left cosets
of $K$.

(b) Prove that $HxK$ is a union of right cosets of $H$.

(c) Show that $HxK$ and $HyK$ are either the same set or are disjoint for all $x, y \in G$.
Show that the set of $HK$ double cosets partitions $G$.
(d) Prove that $|HxK| = |K| \cdot |H : H \cap xKx^{-1}|$.

(e) Prove that $|HxK| = |H| \cdot |K : K \cap x^{-1}Hx|$.

4.2 GROUPS ACTING ON THEMSELVES BY LEFT MULTIPLICATION
— CAYLEY’S THEOREM 

In this section $G$ is any group and we first consider $G$ acting on itself (i.e., $A = G$) by
left multiplication:

$$g \cdot a = ga \quad \text{for all } g \in G,\ a \in G$$

where $ga$ denotes the product of the two group elements $g$ and $a$ in $G$ (if $G$ is written
additively, the action will be written $g \cdot a = g + a$ and called left translation). We saw
in Section 1.7 that this satisfies the two axioms of a group action.

When $G$ is a finite group of order $n$ it is convenient to label the elements of $G$ with
the integers $1, 2, \ldots, n$ in order to describe the permutation representation afforded by
this action. In this way the elements of $G$ are listed as $g_1, g_2, \ldots, g_n$ and for each
$g \in G$ the permutation $\sigma_g$ may be described as a permutation of the indices $1, 2, \ldots, n$
as follows:

$$\sigma_g(i) = j \quad \text{if and only if} \quad gg_i = g_j.$$

A different labelling of the group elements will give a different description of $\sigma_g$ as a
permutation of $\{1, 2, \ldots, n\}$ (cf. the exercises).

Example 

Let $G = \{1, a, b, c\}$ be the Klein 4-group whose group table is written out in Section
2.5. Label the group elements $1, a, b, c$ with the integers 1, 2, 3, 4, respectively. Under this
labelling we compute the permutation $\sigma_a$ induced by the action of left multiplication by
the group element $a$:

$a \cdot 1 = a1 = a$ and so $\sigma_a(1) = 2$
$a \cdot a = aa = 1$ and so $\sigma_a(2) = 1$
$a \cdot b = ab = c$ and so $\sigma_a(3) = 4$ and
$a \cdot c = ac = b$ and so $\sigma_a(4) = 3$.

With this labelling of the elements of $G$ we see that $\sigma_a = (1\ 2)(3\ 4)$. In the permutation
representation associated to the action of the Klein 4-group on itself by left multiplication
one similarly computes that

$$a \mapsto \sigma_a = (1\ 2)(3\ 4) \qquad b \mapsto \sigma_b = (1\ 3)(2\ 4) \qquad c \mapsto \sigma_c = (1\ 4)(2\ 3),$$

which explicitly gives the permutation representation $G \to S_4$ associated to this action
under this labelling.
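This computation is easy to automate. The sketch below is illustrative only and makes one modelling assumption that is not in the text: since the group table of Section 2.5 is not reproduced here, the Klein 4-group is modelled as $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$ with $a = (1, 0)$, $b = (0, 1)$, $c = (1, 1)$, which satisfies $a^2 = b^2 = c^2 = 1$ and $ab = c$. The names `ELEMENTS`, `multiply` and `left_regular` are hypothetical.

```python
# Assumption: model the Klein 4-group as Z/2 x Z/2 with componentwise addition.
ELEMENTS = {"1": (0, 0), "a": (1, 0), "b": (0, 1), "c": (1, 1)}

def multiply(g, h):
    """Group operation in Z/2 x Z/2."""
    return ((g[0] + h[0]) % 2, (g[1] + h[1]) % 2)

def left_regular(name, labelling):
    """sigma_g as a tuple: entry i-1 is j, where g * g_i = g_j in the given labelling."""
    label = {ELEMENTS[x]: k for k, x in enumerate(labelling, start=1)}
    g = ELEMENTS[name]
    return tuple(label[multiply(g, ELEMENTS[x])] for x in labelling)

labelling = ["1", "a", "b", "c"]  # labelled 1, 2, 3, 4 respectively
images = {name: left_regular(name, labelling) for name in labelling}

for name, perm in images.items():
    print(name, perm)
```

In tuple form, `(2, 1, 4, 3)` is $(1\ 2)(3\ 4)$, `(3, 4, 1, 2)` is $(1\ 3)(2\ 4)$ and `(4, 3, 2, 1)` is $(1\ 4)(2\ 3)$. Passing a different `labelling` list changes the permutations obtained, in line with the remark on relabelling above.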

It is easy to see (and we shall prove this shortly in a more general setting) that the 
action of a group on itself by left multiplication is always transitive and faithful, and 
that the stabilizer of any point is the identity subgroup (these facts can be checked by 
inspection for the above example). 

We now consider a generalization of the action of a group by left multiplication on 
the set of its elements. Let H be any subgroup of G and let A be the set of all left cosets 
of H in G. Define an action of G on A by 

$$g \cdot aH = gaH \quad \text{for all } g \in G,\ aH \in A$$

where $gaH$ is the left coset with representative $ga$. One easily checks that this satisfies
the two axioms for a group action, i.e., that $G$ does act on the set of left cosets of $H$
by left multiplication. In the special case when $H$ is the identity subgroup of $G$ the
coset $aH$ is just $\{a\}$ and if we identify the element $a$ with the set $\{a\}$, this action by left
multiplication on left cosets of the identity subgroup is the same as the action of $G$ on
itself by left multiplication.

When $H$ is of finite index $m$ in $G$ it is convenient to label the left cosets of $H$ with the
integers $1, 2, \ldots, m$ in order to describe the permutation representation afforded by this
action. In this way the distinct left cosets of $H$ in $G$ are listed as $a_1H, a_2H, \ldots, a_mH$
and for each $g \in G$ the permutation $\sigma_g$ may be described as a permutation of the indices
$1, 2, \ldots, m$ as follows:

$$\sigma_g(i) = j \quad \text{if and only if} \quad ga_iH = a_jH.$$

A different labelling of the group elements will give a different description of $\sigma_g$ as a
permutation of $\{1, 2, \ldots, m\}$ (cf. the exercises).

Example 

Let $G = D_8$ and let $H = \langle s \rangle$. Label the distinct left cosets $1H, rH, r^2H, r^3H$ with the
integers 1, 2, 3, 4 respectively. Under this labelling we compute the permutation $\sigma_s$ induced
by the action of left multiplication by the group element $s$ on the left cosets of $H$:

$s \cdot 1H = sH = 1H$ and so $\sigma_s(1) = 1$
$s \cdot rH = srH = r^3H$ and so $\sigma_s(2) = 4$
$s \cdot r^2H = sr^2H = r^2H$ and so $\sigma_s(3) = 3$
$s \cdot r^3H = sr^3H = rH$ and so $\sigma_s(4) = 2$.

With this labelling of the left cosets of $H$ we obtain $\sigma_s = (2\ 4)$. In the permutation
representation associated to the action of $D_8$ on the left cosets of $\langle s \rangle$ by left multiplication
one similarly computes that $\sigma_r = (1\ 2\ 3\ 4)$. Note that the permutation representation is a
homomorphism, so once its value has been determined on generators for $D_8$ its value on
any other element can be determined (e.g., $\sigma_{sr^2} = \sigma_s\sigma_r^2$).
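The coset action in this example can be reproduced with a few lines of Python. This is a sketch under stated assumptions: $D_8$ is realized concretely as the vertex permutations $r = (1\ 2\ 3\ 4)$ and $s = (2\ 4)$ of a square, permutations are tuples whose entry $i-1$ is the image of $i$, and the helper names (`left_coset`, `coset_permutation`, `power`) are hypothetical.

```python
def compose(p, q):
    """Return p*q (q acts first); p[i-1] is the image of i."""
    return tuple(p[x - 1] for x in q)

identity = (1, 2, 3, 4)
r = (2, 3, 4, 1)  # (1 2 3 4)
s = (1, 4, 3, 2)  # (2 4)

def power(p, k):
    """Return p^k for k >= 0 (p must permute four points)."""
    result = identity
    for _ in range(k):
        result = compose(p, result)
    return result

H = {identity, s}

def left_coset(a):
    """The left coset aH as a frozenset of group elements."""
    return frozenset(compose(a, h) for h in H)

# The cosets 1H, rH, r^2 H, r^3 H, labelled 1, 2, 3, 4 in this order.
cosets = [left_coset(power(r, k)) for k in range(4)]

def coset_permutation(g):
    """sigma_g as a tuple: entry i-1 is j, where g * a_i H = a_j H."""
    images = []
    for coset in cosets:
        moved = frozenset(compose(g, x) for x in coset)
        images.append(cosets.index(moved) + 1)
    return tuple(images)

print(coset_permutation(s))  # tuple form of (2 4)
print(coset_permutation(r))  # tuple form of (1 2 3 4)
```

Because the representation is a homomorphism, `coset_permutation(compose(s, power(r, 2)))` agrees with the product of the images of `s` and `r` squared, which is the relation $\sigma_{sr^2} = \sigma_s\sigma_r^2$ noted above.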

Theorem 3. Let $G$ be a group, let $H$ be a subgroup of $G$ and let $G$ act by left multiplication
on the set $A$ of left cosets of $H$ in $G$. Let $\pi_H$ be the associated permutation
representation afforded by this action. Then

(1) $G$ acts transitively on $A$

(2) the stabilizer in $G$ of the point $1H \in A$ is the subgroup $H$

(3) the kernel of the action (i.e., the kernel of $\pi_H$) is $\bigcap_{x \in G} xHx^{-1}$, and $\ker \pi_H$ is
the largest normal subgroup of $G$ contained in $H$.

Proof: To see that $G$ acts transitively on $A$, let $aH$ and $bH$ be any two elements
of $A$, and let $g = ba^{-1}$. Then $g \cdot aH = (ba^{-1})aH = bH$, and so the two arbitrary
elements $aH$ and $bH$ of $A$ lie in the same orbit, which proves (1). For (2), the stabilizer
of the point $1H$ is, by definition, $\{g \in G \mid g \cdot 1H = 1H\}$, i.e., $\{g \in G \mid gH = H\} = H$.
By definition of $\pi_H$ we have

$$\begin{aligned}
\ker \pi_H &= \{g \in G \mid gxH = xH \text{ for all } x \in G\} \\
&= \{g \in G \mid (x^{-1}gx)H = H \text{ for all } x \in G\} \\
&= \{g \in G \mid x^{-1}gx \in H \text{ for all } x \in G\} \\
&= \{g \in G \mid g \in xHx^{-1} \text{ for all } x \in G\} = \bigcap_{x \in G} xHx^{-1},
\end{aligned}$$

which proves the first assertion of (3). The second assertion of (3) comes from observing
first that $\ker \pi_H \trianglelefteq G$ and $\ker \pi_H \le H$. If now $N$ is any normal subgroup of $G$ contained
in $H$ then we have $N = xNx^{-1} \le xHx^{-1}$ for all $x \in G$ so that

$$N \le \bigcap_{x \in G} xHx^{-1} = \ker \pi_H.$$

This shows that $\ker \pi_H$ is the largest normal subgroup of $G$ contained in $H$.
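Part (3) of the theorem can be tested on small cases. The following Python sketch is illustrative and uses assumptions not in the text: $D_8$ is realized as the vertex permutations generated by $r = (1\ 2\ 3\ 4)$ and $s = (2\ 4)$, and the names `core` and `kernel_on_cosets` are hypothetical. It computes the kernel of the coset action directly from the definition and compares it with the intersection of the conjugates of $H$.

```python
def compose(p, q):
    """Return p*q (q acts first); p[i-1] is the image of i."""
    return tuple(p[x - 1] for x in q)

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p, start=1):
        inv[image - 1] = i
    return tuple(inv)

def generate(gens):
    """Closure of the generators under left multiplication (enough in a finite group)."""
    identity = tuple(range(1, len(gens[0]) + 1))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def core(G, H):
    """Intersection of the conjugates x H x^{-1} over all x in G."""
    result = set(G)
    for x in G:
        result &= {compose(compose(x, h), inverse(x)) for h in H}
    return result

def kernel_on_cosets(G, H):
    """Elements g with g * xH = xH for every left coset xH, straight from the definition."""
    def coset(a):
        return frozenset(compose(a, h) for h in H)
    return {g for g in G if all(coset(compose(g, x)) == coset(x) for x in G)}

identity = (1, 2, 3, 4)
r, s = (2, 3, 4, 1), (1, 4, 3, 2)
r2 = compose(r, r)
D8 = generate([r, s])

subgroups = {"<s>": {identity, s}, "<r^2>": {identity, r2}, "<s, r^2>": generate([s, r2])}
for name, H in subgroups.items():
    print(name, len(core(D8, H)), kernel_on_cosets(D8, H) == core(D8, H))
```

For $H = \langle s \rangle$ the kernel is trivial, so the action on the four cosets of $\langle s \rangle$ is faithful; for the normal subgroups $\langle r^2 \rangle$ and $\langle s, r^2 \rangle$ the kernel is $H$ itself.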

Corollary 4. (Cayley's Theorem) Every group is isomorphic to a subgroup of some
symmetric group. If $G$ is a group of order $n$, then $G$ is isomorphic to a subgroup of $S_n$.

Proof: Let $H = 1$ and apply the preceding theorem to obtain a homomorphism of
$G$ into $S_G$ (here we are identifying the cosets of the identity subgroup with the elements
of $G$). Since the kernel of this homomorphism is contained in $H = 1$, $G$ is isomorphic
to its image in $S_G$.

Note that $G$ is isomorphic to a subgroup of a symmetric group, not to the full symmetric
group itself. For example, we exhibited an isomorphism of the Klein 4-group
with the subgroup $\langle (1\ 2)(3\ 4), (1\ 3)(2\ 4) \rangle$ of $S_4$. Recall that subgroups of symmetric
groups are called permutation groups so Cayley's Theorem states that every group is
isomorphic to a permutation group. The permutation representation afforded by left
multiplication on the elements of $G$ (cosets of $H = 1$) is called the left regular representation
of $G$. One might think that we could study all groups more effectively by
simply studying subgroups of symmetric groups (and all finite groups by studying subgroups
of $S_n$, for all $n$). This approach alone is neither computationally nor theoretically
practical, since to study groups of order $n$ we would have to work in the much larger
group $S_n$ (cf. Exercise 7, for example).

Historically, finite groups were first studied not in an axiomatic setting as we have
developed but as subgroups of $S_n$. Thus Cayley's Theorem proves that the historical
notion of a group and the modern (axiomatic) one are equivalent. One advantage of
the modern approach is that we are not, in our study of a given group, restricted to
considering that group as a subgroup of some particular symmetric group (so in some
sense our groups are "coordinate free").

The next result generalizes our result on the normality of subgroups of index 2. 

Corollary 5. If $G$ is a finite group of order $n$ and $p$ is the smallest prime dividing $|G|$,
then any subgroup of index $p$ is normal.

Remark: In general, a group of order $n$ need not have a subgroup of index $p$ (for
example, $A_4$ has no subgroup of index 2).

Proof: Suppose $H \le G$ and $|G : H| = p$. Let $\pi_H$ be the permutation representation
afforded by multiplication on the set of left cosets of $H$ in $G$, let $K = \ker \pi_H$
and let $|H : K| = k$. Then $|G : K| = |G : H||H : K| = pk$. Since $H$ has $p$
left cosets, $G/K$ is isomorphic to a subgroup of $S_p$ (namely, the image of $G$ under $\pi_H$)
by the First Isomorphism Theorem. By Lagrange's Theorem, $pk = |G/K|$ divides $p!$.
Thus $k$ divides $\frac{p!}{p} = (p - 1)!$. But all prime divisors of $(p - 1)!$ are less than $p$ and by
the minimality of $p$, every prime divisor of $k$ is greater than or equal to $p$. This forces
$k = 1$, so $H = K \trianglelefteq G$, completing the proof.
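The arithmetic at the end of this proof can be sanity-checked numerically. The sketch below is only an illustration of the final step, not of the group theory: for each $n$ it takes the smallest prime $p$ dividing $n$ and lists every $k$ that divides both $|H| = n/p$ and $(p - 1)!$, which are the two divisibility constraints on $k = |H : K|$ used in the proof. The function names are hypothetical.

```python
from math import factorial

def smallest_prime_factor(n):
    """Smallest prime dividing n (n >= 2), by trial division."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return d
        d += 1
    return n

def candidate_k_values(n):
    """All k dividing both |H| = n/p and (p-1)!, where p is the smallest prime dividing n."""
    p = smallest_prime_factor(n)
    order_of_h = n // p
    return [k for k in range(1, order_of_h + 1)
            if order_of_h % k == 0 and factorial(p - 1) % k == 0]

# In every case the only surviving value is k = 1, as the proof asserts.
results = {n: candidate_k_values(n) for n in range(2, 300)}
print(all(values == [1] for values in results.values()))
```

The check succeeds for the reason given in the proof: every prime divisor of $n/p$ is at least $p$, while every prime divisor of $(p - 1)!$ is smaller than $p$.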


EXERCISES 

Let G be a group and let H be a subgroup of G. 

1. Let G = {1, a, b, c\ be the Klein 4-group whose group table is written out in Section 2.5. 

(a) Label 1, a, b, c with the integers 1, 2,4,3. respectively, and prove that under the left 
regular representation of G into £4 the nonidentity elements are mapped as follows: 

a i-» (1 2) (3 4) b i-» (1 4) (2 3) c !-*• (1 3)(2 4). 

(b) Relabel 1 , a, b, c as 1,4,2,3, respectively, and compute the image of each element of 
G under the left regular representation of G into £4. Show that the image of G in £4 
under this labelling is the same subgroup as the image of G in part (a) (even though 
the nonidentity elements individually map to different permutations under the two 
different labellings). 

2 . List the elements of £3 as 1, (1 2), (2 3), (1 3), (1 2 3), (1 3 2) and label these with the 
integers 1,2,3,4,5,6 respectively. Exhibit the image of each element of £3 under the left 
regular representation of £3 into £ 6- 

3. Let r and s be the usual generators for the dihedral group of order 8.

(a) List the elements of $D_8$ as $1, r, r^2, r^3, s, sr, sr^2, sr^3$ and label these with the integers
1, 2, ..., 8 respectively. Exhibit the image of each element of $D_8$ under the left regular
representation of $D_8$ into $S_8$.

(b) Relabel this same list of elements of $D_8$ with the integers 1, 3, 5, 7, 2, 4, 6, 8
respectively and recompute the image of each element of $D_8$ under the left regular
representation with respect to this new labelling. Show that the two subgroups of $S_8$
obtained in parts (a) and (b) are different.

4. Use the left regular representation of $Q_8$ to produce two elements of $S_8$ which generate a
subgroup of $S_8$ isomorphic to the quaternion group $Q_8$.

5. Let r and s be the usual generators for the dihedral group of order 8 and let $H = \langle s \rangle$. List
the left cosets of H in $D_8$ as $1H$, $rH$, $r^2H$ and $r^3H$.

(a) Label these cosets with the integers 1, 2, 3, 4, respectively. Exhibit the image of each
element of $D_8$ under the representation $\pi_H$ of $D_8$ into $S_4$ obtained from the action
of $D_8$ by left multiplication on the set of 4 left cosets of H in $D_8$. Deduce that this
representation is faithful (i.e., the elements of $S_4$ obtained form a subgroup isomorphic
to $D_8$).

(b) Repeat part (a) with the list of cosets relabelled by the integers 1, 3, 2, 4, respectively.
Show that the permutations obtained from this labelling form a subgroup of $S_4$ that
is different from the subgroup obtained in part (a).

(c) Let $K = \langle sr \rangle$, list the cosets of K in $D_8$ as $1K$, $rK$, $r^2K$ and $r^3K$, and label these
with the integers 1, 2, 3, 4. Prove that, with respect to this labelling, the image of $D_8$
under the representation $\pi_K$ obtained from left multiplication on the cosets of K is
the same subgroup of $S_4$ as in part (a) (even though the subgroups H and K are
different and some of the elements of $D_8$ map to different permutations under the two
homomorphisms).
6. Let r and s be the usual generators for the dihedral group of order 8 and let $N = \langle r^2 \rangle$. List
the left cosets of N in $D_8$ as $1N$, $rN$, $sN$ and $srN$. Label these cosets with the integers
1, 2, 3, 4 respectively. Exhibit the image of each element of $D_8$ under the representation
$\pi_N$ of $D_8$ into $S_4$ obtained from the action of $D_8$ by left multiplication on the set of 4 left
cosets of N in $D_8$. Deduce that this representation is not faithful and prove that $\pi_N(D_8)$
is isomorphic to the Klein 4-group.


7. Let $Q_8$ be the quaternion group of order 8.

(a) Prove that $Q_8$ is isomorphic to a subgroup of $S_8$.

(b) Prove that $Q_8$ is not isomorphic to a subgroup of $S_n$ for any $n \le 7$. [If $Q_8$ acts on
any set A of order $\le 7$ show that the stabilizer of any point $a \in A$ must contain the
subgroup $\langle -1 \rangle$.]

8. Prove that if H has finite index n then there is a normal subgroup K of G with $K \le H$
and $|G : K| \le n!$.

9. Prove that if p is a prime and G is a group of order $p^\alpha$ for some $\alpha \in \mathbb{Z}^+$, then every
subgroup of index p is normal in G. Deduce that every group of order $p^2$ has a normal
subgroup of order p.


10. Prove that every non-abelian group of order 6 has a nonnormal subgroup of order 2. Use
this to classify groups of order 6. [Produce an injective homomorphism into $S_3$.]


11. Let G be a finite group and let $\pi : G \to S_G$ be the left regular representation. Prove that
if x is an element of G of order n and $|G| = mn$, then $\pi(x)$ is a product of m n-cycles.
Deduce that $\pi(x)$ is an odd permutation if and only if $|x|$ is even and $|G|/|x|$ is odd.


12. Let G and $\pi$ be as in the preceding exercise. Prove that if $\pi(G)$ contains an odd permutation
then G has a subgroup of index 2. [Use Exercise 3 in Section 3.3.]


13. Prove that if |G| = 2k where k is odd then G has a subgroup of index 2. [Use Cauchy’s 
Theorem to produce an element of order 2 and then use the preceding two exercises.] 

14. Let G be a finite group of composite order n with the property that G has a subgroup of 
order k for each positive integer k dividing n. Prove that G is not simple. 


4.3 GROUPS ACTING ON THEMSELVES BY CONJUGATION 
—THE CLASS EQUATION 

In this section G is any group and we first consider G acting on itself (i.e., A = G) by
conjugation:

$g \cdot a = gag^{-1}$ for all $g \in G$, $a \in G$

where $gag^{-1}$ is computed in the group G as usual. This definition satisfies the two
axioms for a group action because

$g_1 \cdot (g_2 \cdot a) = g_1 \cdot (g_2 a g_2^{-1}) = g_1(g_2 a g_2^{-1})g_1^{-1} = (g_1g_2)a(g_1g_2)^{-1} = (g_1g_2) \cdot a$

and

$1 \cdot a = 1a1^{-1} = a$

for all $g_1, g_2 \in G$ and all $a \in G$.
Definition. Two elements a and b of G are said to be conjugate in G if there is some
$g \in G$ such that $b = gag^{-1}$ (i.e., if and only if they are in the same orbit of G acting
on itself by conjugation). The orbits of G acting on itself by conjugation are called the
conjugacy classes of G.

Examples 

(1) If G is an abelian group then the action of G on itself by conjugation is the trivial
action: $g \cdot a = a$, for all $g, a \in G$, and for each $a \in G$ the conjugacy class of a is $\{a\}$.

(2) If $|G| > 1$ then, unlike the action by left multiplication, G does not act transitively
on itself by conjugation because $\{1\}$ is always a conjugacy class (i.e., an orbit for this
action). More generally, the one element subset $\{a\}$ is a conjugacy class if and only if
$gag^{-1} = a$ for all $g \in G$ if and only if a is in the center of G.

(3) In $S_3$ one can compute directly that the conjugacy classes are {1}, {(1 2), (1 3), (2 3)}
and {(1 2 3), (1 3 2)}. We shall shortly develop techniques for computing conjugacy
classes more easily, particularly in symmetric groups.
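
The direct computation mentioned in Example 3 can be sketched by brute force. The snippet below is only an illustration under stated assumptions: permutations are stored as tuples acting on the labels 0, 1, 2 (rather than 1, 2, 3), and the helper names `compose`, `inverse` and `conjugacy_classes` are hypothetical, not notation from the text.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first); permutations are tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugacy_classes(group):
    """Split a finite permutation group into its orbits under g·a = g a g^{-1}."""
    remaining = set(group)
    classes = []
    while remaining:
        a = next(iter(remaining))
        orbit = {compose(compose(g, a), inverse(g)) for g in group}
        classes.append(orbit)
        remaining -= orbit
    return classes

S3 = list(permutations(range(3)))  # all 6 elements of S_3
class_sizes = sorted(len(c) for c in conjugacy_classes(S3))
print(class_sizes)  # [1, 2, 3]: the identity, the two 3-cycles, the three transpositions
```

The sizes 1, 2 and 3 match the three classes listed in Example 3.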

As in the case of a group acting on itself by left multiplication, the action by
conjugation can be generalized. If S is any subset of G, define

$gSg^{-1} = \{gsg^{-1} \mid s \in S\}.$

A group G acts on the set $\mathcal{P}(G)$ of all subsets of itself by defining $g \cdot S = gSg^{-1}$ for
any $g \in G$ and $S \in \mathcal{P}(G)$. As above, this defines a group action of G on $\mathcal{P}(G)$. Note
that if S is the one element set $\{s\}$ then $g \cdot S$ is the one element set $\{gsg^{-1}\}$ and so this
action of G on all subsets of G may be considered as an extension of the action of G
on itself by conjugation.

Definition. Two subsets S and T of G are said to be conjugate in G if there is some
$g \in G$ such that $T = gSg^{-1}$ (i.e., if and only if they are in the same orbit of G acting
on its subsets by conjugation).

We now apply Proposition 2 to the action of G by conjugation. Proposition 2 proves
that if S is a subset of G, then the number of conjugates of S equals the index $|G : G_S|$
of the stabilizer $G_S$ of S. For action by conjugation

$G_S = \{g \in G \mid gSg^{-1} = S\} = N_G(S)$

is the normalizer of S in G. We summarize this as

Proposition 6. The number of conjugates of a subset S in a group G is the index of the
normalizer of S, $|G : N_G(S)|$. In particular, the number of conjugates of an element s
of G is the index of the centralizer of s, $|G : C_G(s)|$.

Proof: The second assertion of the proposition follows from the observation that
$N_G(\{s\}) = C_G(s)$.
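
As an informal check of Proposition 6, the following sketch counts, for every element s of $S_4$, its conjugates and its centralizer, and confirms that the two numbers multiply to $|S_4| = 24$. The tuple representation on labels 0..3 and the function names are assumptions made for this illustration only.

```python
from itertools import permutations

def compose(p, q):
    """Return p∘q (apply q first) for permutations stored as tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugates(s, group):
    """The conjugacy class of s: all g s g^{-1} with g in the group."""
    return {compose(compose(g, s), inverse(g)) for g in group}

def centralizer(s, group):
    """All g in the group commuting with s."""
    return [g for g in group if compose(g, s) == compose(s, g)]

S4 = list(permutations(range(4)))

# Proposition 6: (number of conjugates of s) = |G : C_G(s)| = |G| / |C_G(s)|.
checks = [len(conjugates(s, S4)) * len(centralizer(s, S4)) == len(S4) for s in S4]
print(all(checks))  # True

transposition = (1, 0, 2, 3)  # swaps the labels 0 and 1
print(len(conjugates(transposition, S4)), len(centralizer(transposition, S4)))  # 6 4
```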

The action of G on itself by conjugation partitions G into the conjugacy classes 
of G, whose orders can be computed by Proposition 6. Since the sum of the orders of 
these conjugacy classes is the order of G, we obtain the following important relation 
among these orders. 
Theorem 7. (The Class Equation) Let G be a finite group and let $g_1, g_2, \ldots, g_r$ be
representatives of the distinct conjugacy classes of G not contained in the center Z(G)
of G. Then

$|G| = |Z(G)| + \sum_{i=1}^{r} |G : C_G(g_i)|.$


Proof: As noted in Example 2 above the set $\{x\}$ is a conjugacy class of size 1 if
and only if $x \in Z(G)$, since then $gxg^{-1} = x$ for all $g \in G$. Let $Z(G) = \{1, z_2, \ldots, z_m\}$,
let $\mathcal{K}_1, \mathcal{K}_2, \ldots, \mathcal{K}_r$ be the conjugacy classes of G not contained in the center, and let $g_i$
be a representative of $\mathcal{K}_i$ for each i. Then the full set of conjugacy classes of G is given
by

$\{1\}, \{z_2\}, \ldots, \{z_m\}, \mathcal{K}_1, \mathcal{K}_2, \ldots, \mathcal{K}_r.$

Since these partition G we have

$|G| = \sum_{i=1}^{m} 1 + \sum_{i=1}^{r} |\mathcal{K}_i| = |Z(G)| + \sum_{i=1}^{r} |G : C_G(g_i)|,$

where $|\mathcal{K}_i|$ is given by Proposition 6. This proves the class equation.

Note in particular that all the summands on the right hand side of the class equation
are divisors of the group order since they are indices of subgroups of G. This restricts 
their possible values (cf. Exercise 6, for example). 

Examples 

(1) The class equation gives no information in an abelian group since conjugation is the 
trivial action and all conjugacy classes have size 1. 

(2) In any group G we have $\langle g \rangle \le C_G(g)$; this observation helps to minimize
computations of conjugacy classes. For example, in the quaternion group $Q_8$ we see
that $\langle i \rangle \le C_{Q_8}(i) \le Q_8$. Since $i \notin Z(Q_8)$ and $|Q_8 : \langle i \rangle| = 2$, we must have
$C_{Q_8}(i) = \langle i \rangle$. Thus i has precisely 2 conjugates in $Q_8$, namely $i$ and $-i = kik^{-1}$.
The other conjugacy classes in $Q_8$ are determined similarly and are

$\{1\}, \{-1\}, \{\pm i\}, \{\pm j\}, \{\pm k\}.$

The first two classes form $Z(Q_8)$ and the class equation for this group is

$|Q_8| = 2 + 2 + 2 + 2.$

(3) In $D_8$ we may also use the fact that the three subgroups of index 2 are abelian to
quickly see that if $x \notin Z(D_8)$, then $|C_{D_8}(x)| = 4$. The conjugacy classes of $D_8$ are

$\{1\}, \{r^2\}, \{r, r^3\}, \{s, sr^2\}, \{sr, sr^3\}.$

The first two classes form $Z(D_8)$ and the class equation for this group is

$|D_8| = 2 + 2 + 2 + 2.$
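
The class equation for $D_8$ in Example 3 can be checked numerically. The sketch below assumes a particular (hypothetical) realization of $D_8$ as permutations of the vertices 0, 1, 2, 3 of a square, with r the rotation and s a reflection; the helper `generate` is an illustrative name, not part of the text.

```python
def compose(p, q):
    """Return p∘q (apply q first) for permutations stored as tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Close a set of generators of a finite permutation group under multiplication."""
    identity = tuple(range(len(gens[0])))
    elements = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(g, x)
            if y not in elements:
                elements.add(y)
                frontier.append(y)
    return elements

r = (1, 2, 3, 0)  # rotation: vertex i goes to i+1 mod 4
s = (0, 3, 2, 1)  # reflection fixing vertices 0 and 2
D8 = generate([r, s])

# Collect the sizes of the conjugacy classes.
remaining = set(D8)
class_sizes = []
while remaining:
    a = next(iter(remaining))
    orbit = {compose(compose(g, a), inverse(g)) for g in D8}
    class_sizes.append(len(orbit))
    remaining -= orbit

center_order = class_sizes.count(1)  # classes of size 1 make up Z(G)
noncentral = sorted(size for size in class_sizes if size > 1)
print(len(D8), center_order, noncentral)  # 8 2 [2, 2, 2]
```

The output corresponds to $|D_8| = 2 + 2 + 2 + 2$, and every summand divides 8, as the remark after the class equation predicts.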
Before discussing more examples of conjugacy we give two important consequences
of the class equation. The first application of the class equation is to show
that groups of prime power order have nontrivial centers, which is the starting point for 
the study of groups of prime power order (to which we return in Chapter 6). 

Theorem 8. If p is a prime and P is a group of prime power order $p^\alpha$ for some $\alpha \ge 1$,
then P has a nontrivial center: $Z(P) \ne 1$.

Proof: By the class equation

$|P| = |Z(P)| + \sum_{i=1}^{r} |P : C_P(g_i)|$

where $g_1, \ldots, g_r$ are representatives of the distinct non-central conjugacy classes. By
definition, $C_P(g_i) \ne P$ for $i = 1, 2, \ldots, r$ so p divides $|P : C_P(g_i)|$. Since p also
divides $|P|$ it follows that p divides $|Z(P)|$, hence the center must be nontrivial.

Corollary 9. If $|P| = p^2$ for some prime p, then P is abelian. More precisely, P is
isomorphic to either $Z_{p^2}$ or $Z_p \times Z_p$.

Proof: Since $Z(P) \ne 1$ by the theorem, the quotient $P/Z(P)$ has order 1 or p and
so is cyclic. By Exercise 36, Section 3.1, P is abelian. If P has an element of order $p^2$, then P is
cyclic. Assume therefore that every nonidentity element of P has order p. Let x be
any nonidentity element of P and let $y \in P - \langle x \rangle$. Since $|\langle x, y \rangle| > |\langle x \rangle| = p$, we
must have that $P = \langle x, y \rangle$. Both x and y have order p so $\langle x \rangle \times \langle y \rangle \cong Z_p \times Z_p$. It
now follows directly that the map $(x^a, y^b) \mapsto x^a y^b$ is an isomorphism from $\langle x \rangle \times \langle y \rangle$
onto P. This completes the proof.

Conjugacy in S„ 

We next consider conjugation in symmetric groups. Readers familiar with linear algebra
will recognize that in the matrix group $GL_n(F)$, conjugation is the same as “change of
basis”: $A \mapsto PAP^{-1}$. The situation in $S_n$ is analogous:

Proposition 10. Let $\sigma, \tau$ be elements of the symmetric group $S_n$ and suppose $\sigma$ has
cycle decomposition

$(a_1\ a_2\ \ldots\ a_{k_1})\,(b_1\ b_2\ \ldots\ b_{k_2}) \ldots.$

Then $\tau\sigma\tau^{-1}$ has cycle decomposition

$(\tau(a_1)\ \tau(a_2)\ \ldots\ \tau(a_{k_1}))\,(\tau(b_1)\ \tau(b_2)\ \ldots\ \tau(b_{k_2})) \ldots,$

that is, $\tau\sigma\tau^{-1}$ is obtained from $\sigma$ by replacing each entry i in the cycle decomposition
for $\sigma$ by the entry $\tau(i)$.

Proof: Observe that if $\sigma(i) = j$, then

$\tau\sigma\tau^{-1}(\tau(i)) = \tau(j).$

Thus, if the ordered pair i, j appears in the cycle decomposition of $\sigma$, then the ordered
pair $\tau(i), \tau(j)$ appears in the cycle decomposition of $\tau\sigma\tau^{-1}$. This completes the proof.
Example

Let $\sigma = (1\ 2)(3\ 4\ 5)(6\ 7\ 8\ 9)$ and let $\tau = (1\ 3\ 5\ 7)(2\ 4\ 6\ 8)$. Then

$\tau\sigma\tau^{-1} = (3\ 4)(5\ 6\ 7)(8\ 1\ 2\ 9).$
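
The relabelling rule of Proposition 10 can be compared against a direct computation of $\tau\sigma\tau^{-1}$ for the example above. This is a minimal sketch: permutations are stored as dictionaries on 1, ..., 9, and the function names are hypothetical.

```python
def perm_from_cycles(cycles, n):
    """Build the permutation of {1..n} with the given disjoint cycles, as a dict."""
    p = {i: i for i in range(1, n + 1)}
    for cyc in cycles:
        for k, a in enumerate(cyc):
            p[a] = cyc[(k + 1) % len(cyc)]
    return p

def conjugate(tau, sigma):
    """Compute tau sigma tau^{-1} directly, as a dict."""
    tau_inv = {image: x for x, image in tau.items()}
    return {x: tau[sigma[tau_inv[x]]] for x in tau}

def relabel(tau, cycles):
    """Proposition 10: replace each entry i of each cycle by tau(i)."""
    return [tuple(tau[a] for a in cyc) for cyc in cycles]

sigma_cycles = [(1, 2), (3, 4, 5), (6, 7, 8, 9)]
sigma = perm_from_cycles(sigma_cycles, 9)
tau = perm_from_cycles([(1, 3, 5, 7), (2, 4, 6, 8)], 9)

relabelled = relabel(tau, sigma_cycles)
direct = conjugate(tau, sigma)
print(relabelled)  # [(3, 4), (5, 6, 7), (8, 1, 2, 9)]
print(perm_from_cycles(relabelled, 9) == direct)  # True
```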


Definition.

(1) If $\sigma \in S_n$ is the product of disjoint cycles of lengths $n_1, n_2, \ldots, n_r$ with
$n_1 \le n_2 \le \cdots \le n_r$ (including its 1-cycles) then the integers $n_1, n_2, \ldots, n_r$ are
called the cycle type of $\sigma$.

(2) If $n \in \mathbb{Z}^+$, a partition of n is any nondecreasing sequence of positive integers
whose sum is n.

Note that by the results of the preceding section the cycle type of a permutation is
unique. For example, the cycle type of an m-cycle in $S_n$ is $1, 1, \ldots, 1, m$, where the m
is preceded by $n - m$ ones.

Proposition 11. Two elements of $S_n$ are conjugate in $S_n$ if and only if they have the
same cycle type. The number of conjugacy classes of $S_n$ equals the number of partitions
of n.

Proof: By Proposition 10, conjugate permutations have the same cycle type.
Conversely, suppose the permutations $\sigma_1$ and $\sigma_2$ have the same cycle type. Order the cycles
in nondecreasing length, including 1-cycles (if several cycles of $\sigma_1$ and $\sigma_2$ have the
same length then there are several ways of doing this). Ignoring parentheses, each
cycle decomposition is a list in which all the integers from 1 to n appear exactly once.
Define $\tau$ to be the function which maps the $i$th integer in the list for $\sigma_1$ to the $i$th integer
in the list for $\sigma_2$. Thus $\tau$ is a permutation and since the parentheses which delineate the
cycle decompositions appear at the same positions in each list, Proposition 10 ensures
that $\tau\sigma_1\tau^{-1} = \sigma_2$, so that $\sigma_1$ and $\sigma_2$ are conjugate.

Since there is a bijection between the conjugacy classes of $S_n$ and the permissible
cycle types and each cycle type for a permutation in $S_n$ is a partition of n, the second
assertion of the proposition follows, completing the proof.
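
The construction of $\tau$ in the proof can be sketched in a few lines. The input format is an assumption of this illustration: each permutation is given as a list of cycles (1-cycles included) already ordered by nondecreasing length, and `conjugator` is a hypothetical helper name. The sample data are $\sigma_1 = (1)(3\ 5)(8\ 9)(2\ 4\ 7\ 6)$ and $\sigma_2 = (3)(4\ 7)(8\ 1)(5\ 2\ 6\ 9)$.

```python
def perm_from_cycles(cycles):
    """Build a permutation (as a dict) from a full list of disjoint cycles."""
    p = {}
    for cyc in cycles:
        for k, a in enumerate(cyc):
            p[a] = cyc[(k + 1) % len(cyc)]
    return p

def conjugate(tau, sigma):
    """Compute tau sigma tau^{-1} directly, as a dict."""
    tau_inv = {image: x for x, image in tau.items()}
    return {x: tau[sigma[tau_inv[x]]] for x in tau}

def conjugator(cycles1, cycles2):
    """Map the i-th entry of the first list to the i-th entry of the second list."""
    if [len(c) for c in cycles1] != [len(c) for c in cycles2]:
        raise ValueError("the two permutations must have the same cycle type")
    flat1 = [a for cyc in cycles1 for a in cyc]
    flat2 = [a for cyc in cycles2 for a in cyc]
    return dict(zip(flat1, flat2))

c1 = [(1,), (3, 5), (8, 9), (2, 4, 7, 6)]
c2 = [(3,), (4, 7), (8, 1), (5, 2, 6, 9)]
tau = conjugator(c1, c2)
print(conjugate(tau, perm_from_cycles(c1)) == perm_from_cycles(c2))  # True

# Listing the two 2-cycles of the second permutation in the other order gives a different tau.
c2_reordered = [(3,), (8, 1), (4, 7), (5, 2, 6, 9)]
tau2 = conjugator(c1, c2_reordered)
print(tau2 != tau, conjugate(tau2, perm_from_cycles(c1)) == perm_from_cycles(c2))  # True True
```

The second call illustrates the remark in the proof that the ordering of cycles of equal length is not unique, so several different permutations conjugate $\sigma_1$ into $\sigma_2$.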

Examples 

(1) Let $\sigma_1 = (1)(3\ 5)(8\ 9)(2\ 4\ 7\ 6)$ and let $\sigma_2 = (3)(4\ 7)(8\ 1)(5\ 2\ 6\ 9)$. Then define $\tau$ by
$\tau(1) = 3$, $\tau(3) = 4$, $\tau(5) = 7$, $\tau(8) = 8$, etc. Then

$\tau = (1\ 3\ 4\ 2\ 5\ 7\ 6\ 9)(8)$

and $\tau\sigma_1\tau^{-1} = \sigma_2$.

(2) If in the previous example we had reordered $\sigma_2$ as $\sigma_2 = (3)(8\ 1)(4\ 7)(5\ 2\ 6\ 9)$ by
interchanging the two cycles of length 2, then the corresponding $\tau$ described above is
defined by $\tau(1) = 3$, $\tau(3) = 8$, $\tau(5) = 1$, $\tau(8) = 4$, etc., which gives the permutation

$\tau = (1\ 3\ 8\ 4\ 2\ 5)(6\ 9\ 7)$

again with $\tau\sigma_1\tau^{-1} = \sigma_2$, which shows that there are many elements conjugating $\sigma_1$
into $\sigma_2$.
(3) If n = 5, the partitions of 5 and corresponding representatives of the conjugacy classes
(with 1-cycles not written) are as given in the following table:

Partition of 5        Representative of Conjugacy Class
1, 1, 1, 1, 1         1
1, 1, 1, 2            (1 2)
1, 1, 3               (1 2 3)
1, 4                  (1 2 3 4)
5                     (1 2 3 4 5)
1, 2, 2               (1 2)(3 4)
2, 3                  (1 2)(3 4 5)
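
The table can be cross-checked by enumerating the partitions of 5 and comparing them with the cycle types that actually occur in $S_5$. This is a sketch under the assumption that permutations act on the labels 0..4; the generator `partitions` and the function `cycle_type` are illustrative names.

```python
from itertools import permutations

def partitions(n, smallest=1):
    """Yield the partitions of n as nondecreasing tuples with all parts >= smallest."""
    if n == 0:
        yield ()
        return
    for first in range(smallest, n + 1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def cycle_type(p):
    """Return the cycle type of p (a tuple on 0..n-1) as a nondecreasing tuple of lengths."""
    seen = set()
    lengths = []
    for start in range(len(p)):
        if start not in seen:
            length = 0
            x = start
            while x not in seen:
                seen.add(x)
                x = p[x]
                length += 1
            lengths.append(length)
    return tuple(sorted(lengths))

parts = list(partitions(5))
types = {cycle_type(p) for p in permutations(range(5))}
print(len(parts))           # 7
print(set(parts) == types)  # True
```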


Proposition 11 and Proposition 6 can be used to exhibit the centralizers of some
elements in $S_n$. For example, if $\sigma$ is an m-cycle in $S_n$, then the number of conjugates
of $\sigma$ (i.e., the number of m-cycles) is

$\dfrac{n \cdot (n-1) \cdots (n-m+1)}{m}.$

By Proposition 6 this is the index of the centralizer of $\sigma$: $\dfrac{|S_n|}{|C_{S_n}(\sigma)|}$. Since $|S_n| = n!$
we obtain

$|C_{S_n}(\sigma)| = m \cdot (n-m)!.$

The element $\sigma$ certainly commutes with $1, \sigma, \sigma^2, \ldots, \sigma^{m-1}$. It also commutes with any
permutation in $S_n$ whose cycles are disjoint from $\sigma$ and there are $(n-m)!$ permutations
of this type (the full symmetric group on the numbers not appearing in $\sigma$). The product
of elements of these two types already accounts for $m \cdot (n-m)!$ elements commuting
with $\sigma$. By the order computation above, this is the full centralizer of $\sigma$ in $S_n$. Explicitly,

if $\sigma$ is an m-cycle in $S_n$, then $C_{S_n}(\sigma) = \{\sigma^i\tau \mid 0 \le i \le m-1,\ \tau \in S_{n-m}\}$

where $S_{n-m}$ denotes the subgroup of $S_n$ which fixes all integers appearing in the m-cycle
$\sigma$ (and is the identity subgroup if $m = n$ or $m = n-1$).

For example, the centralizer of $\sigma = (1\ 3\ 5)$ in $S_7$ is the subgroup

$\{(1\ 3\ 5)^i\tau \mid i = 0, 1 \text{ or } 2, \text{ and } \tau \text{ fixes } 1, 3 \text{ and } 5\}.$

Note that $\tau \in S_A$ where $A = \{2, 4, 6, 7\}$, so there are 4! choices for $\tau$ and the centralizer
has order $3 \cdot 4! = 72$.
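
The order 72 can be confirmed by brute force over all 5040 elements of $S_7$. In this sketch the labels are shifted down by one (an assumption of the illustration), so the 3-cycle (1 3 5) of the text becomes the cycle 0 → 2 → 4 → 0.

```python
from itertools import permutations
from math import factorial

def compose(p, q):
    """Return p∘q (apply q first) for permutations stored as tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

n, m = 7, 3
sigma = (2, 1, 4, 3, 0, 5, 6)  # the text's (1 3 5), written on the labels 0..6

Sn = list(permutations(range(n)))
centralizer = [t for t in Sn if compose(t, sigma) == compose(sigma, t)]
centralizer_order = len(centralizer)

# By Proposition 6 the number of conjugates (here: of 3-cycles) is the index of the centralizer.
num_m_cycles = factorial(n) // centralizer_order

print(centralizer_order, m * factorial(n - m))  # 72 72
print(num_m_cycles, (7 * 6 * 5) // 3)           # 70 70
```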

We shall discuss centralizers of other elements of $S_n$ in the next exercises and in
Chapter 5.

We can use this discussion of the conjugacy classes in $S_n$ to give a combinatorial
proof of the simplicity of $A_5$. We first observe that normal subgroups of a group G are
the union of conjugacy classes of G, i.e.,

if $H \trianglelefteq G$, then for every conjugacy class $\mathcal{K}$ of G either $\mathcal{K} \subseteq H$ or $\mathcal{K} \cap H = \emptyset$.

This is because if $x \in \mathcal{K} \cap H$, then $gxg^{-1} \in gHg^{-1}$ for all $g \in G$. Since H is normal,
$gHg^{-1} = H$, so that H contains all the conjugates of x, i.e., $\mathcal{K} \subseteq H$.
Theorem 12. $A_5$ is a simple group.

Proof: We first work out the conjugacy classes of $A_5$ and their orders. Proposition
11 does not apply directly since two elements of the same cycle type (which are conjugate
in $S_5$) need not be conjugate in $A_5$. Exercises 19 to 22 analyze the relation of classes
in $S_n$ to classes in $A_n$ in detail.

We have already seen that representatives of the cycle types of even permutations
can be taken to be

1, (1 2 3), (1 2 3 4 5) and (1 2)(3 4).

The centralizers of 3-cycles and 5-cycles in $S_5$ were determined above, and checking
which of these elements are contained in $A_5$ we see that

$C_{A_5}((1\ 2\ 3)) = \langle (1\ 2\ 3) \rangle$ and $C_{A_5}((1\ 2\ 3\ 4\ 5)) = \langle (1\ 2\ 3\ 4\ 5) \rangle.$

These groups have orders 3 and 5 (index 20 and 12), respectively, so there are 20 distinct
conjugates of (1 2 3) and 12 distinct conjugates of (1 2 3 4 5) in $A_5$. Since there are a
total of twenty 3-cycles in $S_5$ (Exercise 16, Section 1.3) and all of these lie in $A_5$, we
see that

all twenty 3-cycles are conjugate in $A_5$.

There are a total of twenty-four 5-cycles in $A_5$ but only 12 distinct conjugates of the
5-cycle (1 2 3 4 5). Thus some 5-cycle, $\sigma$, is not conjugate to (1 2 3 4 5) in $A_5$ (in fact,
(1 3 5 2 4) is not conjugate in $A_5$ to (1 2 3 4 5) since the method of proof in Proposition
11 shows that any element of $S_5$ conjugating (1 2 3 4 5) into (1 3 5 2 4) must be an odd
permutation). As above we see that $\sigma$ also has 12 distinct conjugates in $A_5$, hence

the 5-cycles lie in two conjugacy classes in $A_5$, each of which has 12 elements.

Since the 3-cycles and 5-cycles account for all the nonidentity elements of odd order,
the 15 remaining nonidentity elements of $A_5$ must have order 2 and therefore have
cycle type (2,2). It is easy to see that (1 2)(3 4) commutes with (1 3)(2 4) but does not
commute with any element of odd order in $A_5$. It follows that $|C_{A_5}((1\ 2)(3\ 4))| = 4$.
Thus (1 2)(3 4) has 15 distinct conjugates in $A_5$, hence

all 15 elements of order 2 in $A_5$ are conjugate to (1 2)(3 4).

In summary, the conjugacy classes of $A_5$ have orders 1, 15, 20, 12 and 12.

Now, suppose H were a normal subgroup of $A_5$. Then as we observed above, H
would be the union of conjugacy classes of $A_5$. Then the order of H would be both
a divisor of 60 (the order of $A_5$) and be the sum of some collection of the integers
{1, 12, 12, 15, 20} (the sizes of the conjugacy classes in $A_5$), a collection which must
include 1 because H contains the identity. A quick check shows the
only possibilities are $|H| = 1$ or $|H| = 60$, so that $A_5$ has no proper, nontrivial normal
subgroups.
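
Both the class sizes and the final “quick check” can be reproduced by machine. The sketch below assumes $A_5$ is realized as the even permutations of the labels 0..4 and uses hypothetical helper names; it is an illustration of the counting argument, not a replacement for the proof.

```python
from itertools import combinations, permutations

def compose(p, q):
    """Return p∘q (apply q first) for permutations stored as tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def is_even(p):
    """A permutation is even exactly when it has an even number of inversions."""
    inversions = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return inversions % 2 == 0

A5 = [p for p in permutations(range(5)) if is_even(p)]

# Sizes of the conjugacy classes of A_5 (conjugating only by elements of A_5).
remaining = set(A5)
sizes = []
while remaining:
    a = next(iter(remaining))
    orbit = {compose(compose(g, a), inverse(g)) for g in A5}
    sizes.append(len(orbit))
    remaining -= orbit
sizes.sort()
print(sizes)  # [1, 12, 12, 15, 20]

# A normal subgroup is a union of classes containing {1}, and its order divides 60.
nontrivial = sizes[1:]  # every class other than the identity class
candidate_orders = sorted({
    1 + sum(choice)
    for k in range(len(nontrivial) + 1)
    for choice in combinations(nontrivial, k)
    if len(A5) % (1 + sum(choice)) == 0
})
print(candidate_orders)  # [1, 60]
```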

Right Group Actions 

As noted in Section 1.7, in the definition of an action the group elements appear to the
left of the set elements and so our notion of an action might more precisely be termed a
left group action. One can analogously define the notion of a right group action of the
group G on the nonempty set A as a map from $A \times G$ to A, denoted by $a \cdot g$ for $a \in A$
and $g \in G$, that satisfies the axioms:

(1) $(a \cdot g_1) \cdot g_2 = a \cdot (g_1g_2)$ for all $a \in A$, and $g_1, g_2 \in G$, and

(2) $a \cdot 1 = a$ for all $a \in A$.

In much of the literature on group theory, conjugation is written as a right group
action using the following notation:

$a^g = g^{-1}ag$ for all $g, a \in G$.

Similarly, for subsets S of G one defines $S^g = g^{-1}Sg$. In this notation the two axioms
for a right action are verified as follows:

$(a^{g_1})^{g_2} = g_2^{-1}(g_1^{-1}ag_1)g_2 = (g_1g_2)^{-1}a(g_1g_2) = a^{g_1g_2}$

and

$a^1 = 1^{-1}a1 = a$

for all $g_1, g_2, a \in G$. Thus the two axioms for this right action of a group on itself take
the form of the familiar “laws of exponentiation.” (Note that the integer power $a^n$ of
a group element a is easily distinguished from the conjugate $a^g$ of a by the nature of
the exponent: $n \in \mathbb{Z}$ but $g \in G$.) Because conjugation is so ubiquitous in the theory of
groups, this notation is a useful and efficient shorthand (as opposed to always writing
$gag^{-1}$ or $g \cdot a$ for action on the left by conjugation).

For arbitrary group actions it is an easy exercise to check that if we are given a left
group action of G on A then the map $A \times G \to A$ defined by $a \cdot g = g^{-1} \cdot a$ is a right
group action. Conversely, given a right group action of G on A we can form a left group
action by $g \cdot a = a \cdot g^{-1}$. Call these pairs corresponding group actions. Put another
way, for corresponding group actions, g acts on the left in the same way that $g^{-1}$ acts on
the right. This is particularly transparent for the action of conjugation because the “left
conjugate of a by g,” namely $gag^{-1}$, is the same group element as the “right conjugate
of a by $g^{-1}$,” namely $a^{g^{-1}}$. Thus two elements or subsets of a group are “left conjugate”
if and only if they are “right conjugate,” and so the relation “conjugacy” is the same for
the left and right corresponding actions. More generally, it is also an exercise (Exercise
1) to see that for any corresponding left and right actions the orbits are the same.
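
As a small sanity check of the exponential notation, the following sketch verifies the two right action axioms for $a^g = g^{-1}ag$ in $S_3$, together with the correspondence between left conjugation by g and right conjugation by $g^{-1}$. The tuple representation on labels 0..2 and the names `right_conj` and `left_conj` are assumptions of this illustration.

```python
from itertools import permutations

def compose(p, q):
    """Return p∘q (apply q first) for permutations stored as tuples on 0..n-1."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def right_conj(a, g):
    """The right conjugate a^g = g^{-1} a g."""
    return compose(compose(inverse(g), a), g)

def left_conj(g, a):
    """The left conjugate g·a = g a g^{-1}."""
    return compose(compose(g, a), inverse(g))

S3 = list(permutations(range(3)))
identity = tuple(range(3))

# Axiom (1): (a^g)^h = a^(gh); axiom (2): a^1 = a.
axiom1 = all(right_conj(right_conj(a, g), h) == right_conj(a, compose(g, h))
             for a in S3 for g in S3 for h in S3)
axiom2 = all(right_conj(a, identity) == a for a in S3)

# Corresponding actions: g acts on the left as g^{-1} acts on the right.
corresponding = all(left_conj(g, a) == right_conj(a, inverse(g)) for a in S3 for g in S3)
print(axiom1, axiom2, corresponding)  # True True True
```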

We have consistently used left actions since they are compatible with the notation of
applying functions on the left (i.e., with the notation $\varphi(g)$); in this way left multiplication
on the left cosets of a subgroup is a left action. Similarly, right multiplication on the
right cosets of a subgroup is a right action and the associated permutation representation
$\varphi$ is a homomorphism provided the function $\varphi : G \to S_A$ is written on the right as
$(g_1g_2)\varphi$ (and also provided permutations in $S_A$ are written on the right as functions
from A to itself). There are instances where a set admits two actions by a group G: one
naturally on the left and the other on the right, so that it is useful to be comfortable with
both types of actions.
EXERCISES


Let G be a group. 

1. Suppose G has a left action on a set A, denoted by $g \cdot a$ for all $g \in G$ and $a \in A$. Denote
the corresponding right action on A by $a \cdot g$. Prove that the (equivalence) relations $\sim$ and
$\sim'$ defined by

$a \sim b$ if and only if $a = g \cdot b$ for some $g \in G$

and

$a \sim' b$ if and only if $a = b \cdot g$ for some $g \in G$

are the same relation (i.e., $a \sim b$ if and only if $a \sim' b$).

2. Find all conjugacy classes and their sizes in the following groups:

(a) $D_8$ (b) $Q_8$ (c) $A_4$.

3. Find all the conjugacy classes and their sizes in the following groups:

(a) $Z_2 \times S_3$ (b) $S_3 \times S_3$ (c) $Z_3 \times A_4$.

4. Prove that if $S \subseteq G$ and $g \in G$ then $gN_G(S)g^{-1} = N_G(gSg^{-1})$ and $gC_G(S)g^{-1} =
C_G(gSg^{-1})$.

5. If the center of G is of index n, prove that every conjugacy class has at most n elements.

6. Assume G is a non-abelian group of order 15. Prove that Z(G) = 1. Use the fact that
$\langle g \rangle \le C_G(g)$ for all $g \in G$ to show that there is at most one possible class equation for
G. [Use Exercise 36, Section 3.1.]

7. For n = 3, 4, 6 and 7 make lists of the partitions of n and give representatives for the
corresponding conjugacy classes of $S_n$.

8. Prove that $Z(S_n) = 1$ for all $n \ge 3$.

9. Show that $|C_{S_n}((1\ 2)(3\ 4))| = 8 \cdot (n-4)!$ for all $n \ge 4$. Determine the elements in this
centralizer explicitly.

10. Let $\sigma$ be the 5-cycle (1 2 3 4 5) in $S_5$. In each of (a) to (c) find an explicit element $\tau \in S_5$
which accomplishes the specified conjugation:

(a) $\tau\sigma\tau^{-1} = \sigma^2$

(b) $\tau\sigma\tau^{-1} = \sigma^{-1}$

(c) $\tau\sigma\tau^{-1} = \sigma^{-2}$.

11. In each of (a) - (d) determine whether $\sigma_1$ and $\sigma_2$ are conjugate. If they are, give an explicit
permutation $\tau$ such that $\tau\sigma_1\tau^{-1} = \sigma_2$.

(a) $\sigma_1 = (1\ 2)(3\ 4\ 5)$ and $\sigma_2 = (1\ 2\ 3)(4\ 5)$

(b) $\sigma_1 = (1\ 5)(3\ 7\ 2)(10\ 6\ 8\ 11)$ and $\sigma_2 = (3\ 7\ 5\ 10)(4\ 9)(13\ 11\ 2)$

(c) $\sigma_1 = (1\ 5)(3\ 7\ 2)(10\ 6\ 8\ 11)$ and $\sigma_2 = \sigma_1^3$

(d) $\sigma_1 = (1\ 3)(2\ 4\ 6)$ and $\sigma_2 = (3\ 5)(2\ 4)(5\ 6)$.

12. Find a representative for each conjugacy class of elements of order 4 in and in $S_{12}$.

13. Find all finite groups which have exactly two conjugacy classes.

14. In Exercise 1 of Section 2 two labellings of the elements {1, a, b, c} of the Klein 4-group
V were chosen to give two versions of the left regular representation of V into $S_4$. Let
$\pi_1$ be the version of regular representation obtained in part (a) of that exercise and let
$\pi_2$ be the version obtained via the labelling in part (b). Let $\tau = (2\ 4)$. Show that
$\tau \circ \pi_1(g) \circ \tau^{-1} = \pi_2(g)$ for each $g \in V$ (i.e., conjugation by $\tau$ sends the image of $\pi_1$ to
the image of $\pi_2$ elementwise).
15. Find an element of $S_8$ which conjugates the subgroup of $S_8$ obtained in part (a) of Exercise
3, Section 2 to the subgroup of $S_8$ obtained in part (b) of that same exercise (both of these
subgroups are isomorphic to $D_8$).

16. Find an element of $S_4$ which conjugates the subgroup of $S_4$ obtained in part (a) of Exercise
5, Section 2 to the subgroup of $S_4$ obtained in part (b) of that same exercise (both of these
subgroups are isomorphic to $D_8$).

17. Let A be a nonempty set and let X be any subset of $S_A$. Let

$F(X) = \{a \in A \mid \sigma(a) = a \text{ for all } \sigma \in X\}$ (the fixed set of X).

Let $M(X) = A - F(X)$ be the elements which are moved by some element of X. Let
$D = \{\sigma \in S_A \mid |M(\sigma)| < \infty\}$. Prove that D is a normal subgroup of $S_A$.

18. Let A be a set, let H be a subgroup of $S_A$ and let F(H) be the fixed points of H on A as
defined in the preceding exercise. Prove that if $\tau \in N_{S_A}(H)$ then $\tau$ stabilizes the set F(H)
and its complement $A - F(H)$.

19. Assume H is a normal subgroup of G, $\mathcal{K}$ is a conjugacy class of G contained in H
and $x \in \mathcal{K}$. Prove that $\mathcal{K}$ is a union of k conjugacy classes of equal size in H, where
$k = |G : HC_G(x)|$. Deduce that a conjugacy class in $S_n$ which consists of even
permutations is either a single conjugacy class under the action of $A_n$ or is a union of two classes
of the same size in $A_n$. [Let $A = C_G(x)$ and $B = H$ so $A \cap B = C_H(x)$. Draw the
lattice diagram associated to the Second Isomorphism Theorem and interpret the appropriate
indices. See also Exercise 9, Section 1.]

20. Let $\sigma \in A_n$. Show that all elements in the conjugacy class of $\sigma$ in $S_n$ (i.e., all elements
of the same cycle type as $\sigma$) are conjugate in $A_n$ if and only if $\sigma$ commutes with an odd
permutation. [Use the preceding exercise.]

21. Let $\mathcal{K}$ be a conjugacy class in $S_n$ and assume that $\mathcal{K} \subseteq A_n$. Show $\sigma \in S_n$ does not
commute with any odd permutation if and only if the cycle type of $\sigma$ consists of distinct
odd integers. Deduce that $\mathcal{K}$ consists of two conjugacy classes in $A_n$ if and only if the cycle
type of an element of $\mathcal{K}$ consists of distinct odd integers. [Assume first that $\sigma \in \mathcal{K}$ does
not commute with any odd permutation. Observe that $\sigma$ commutes with each individual
cycle in its cycle decomposition; use this to show that all its cycles must be of odd
length. If two cycles have the same odd length, k, find a product of k transpositions which
interchanges them and commutes with $\sigma$. Conversely, if the cycle type of $\sigma$ consists of
distinct integers, prove that $\sigma$ commutes only with the group generated by the cycles in its
cycle decomposition.]

22. Show that if n is odd then the set of all n-cycles consists of two conjugacy classes of equal
size in $A_n$.

23. Recall (cf. Exercise 16, Section 2.4) that a proper subgroup M of G is called maximal if
whenever $M \le H \le G$, either $H = M$ or $H = G$. Prove that if M is a maximal subgroup
of G then either $N_G(M) = M$ or $N_G(M) = G$. Deduce that if M is a maximal subgroup of
G that is not normal in G then the number of nonidentity elements of G that are contained
in conjugates of M is at most $(|M| - 1)|G : M|$.

24. Assume H is a proper subgroup of the finite group G. Prove $G \ne \bigcup_{g \in G} gHg^{-1}$, i.e., G is
not the union of the conjugates of any proper subgroup. [Put H in some maximal subgroup
and use the preceding exercise.]

25. Let $G = GL_2(\mathbb{C})$ and let $H = \left\{ \begin{pmatrix} a & b \\ 0 & c \end{pmatrix} \;\middle|\; a, b, c \in \mathbb{C},\ ac \ne 0 \right\}$. Prove that every element
of G is conjugate to some element of the subgroup H and deduce that G is the union of
conjugates of H. [Show that every element of $GL_2(\mathbb{C})$ has an eigenvector.]

26. Let G be a transitive permutation group on the finite set A with $|A| > 1$. Show that there
is some $\sigma \in G$ such that $\sigma(a) \ne a$ for all $a \in A$ (such an element $\sigma$ is called fixed point
free).

27. Let $g_1, g_2, \ldots, g_r$ be representatives of the conjugacy classes of the finite group G and
assume these elements pairwise commute. Prove that G is abelian.

28. Let p and q be primes with $p < q$. Prove that a non-abelian group G of order pq has a
nonnormal subgroup of index q, so that there exists an injective homomorphism into $S_q$.
Deduce that G is isomorphic to a subgroup of the normalizer in $S_q$ of the cyclic group
generated by the q-cycle $(1\ 2\ \ldots\ q)$.

29. Let p be a prime and let G be a group of order $p^\alpha$. Prove that G has a subgroup of order
$p^\beta$, for every $\beta$ with $0 \le \beta \le \alpha$. [Use Theorem 8 and induction on $\alpha$.]

30. If G is a group of odd order, prove for any nonidentity element $x \in G$ that $x$ and $x^{-1}$ are
not conjugate in G.

31. Using the usual generators and relations for the dihedral group $D_{2n}$ (cf. Section 1.2) show
that for $n = 2k$ an even integer the conjugacy classes in $D_{2n}$ are the following: $\{1\}$, $\{r^k\}$,
$\{r^{\pm 1}\}$, $\{r^{\pm 2}\}$, ..., $\{r^{\pm(k-1)}\}$, $\{sr^{2b} \mid b = 1, \ldots, k\}$ and $\{sr^{2b-1} \mid b = 1, \ldots, k\}$. Give
the class equation for $D_{2n}$.

32. For $n = 2k + 1$ an odd integer show that the conjugacy classes in $D_{2n}$ are $\{1\}$, $\{r^{\pm 1}\}$,
$\{r^{\pm 2}\}$, ..., $\{r^{\pm k}\}$, $\{sr^b \mid b = 1, \ldots, n\}$. Give the class equation for $D_{2n}$.

33. This exercise gives a formula for the size of each conjugacy class in $S_n$. Let $\sigma$ be a
permutation in $S_n$ and let $m_1, m_2, \ldots, m_s$ be the distinct integers which appear in the
cycle type of $\sigma$ (including 1-cycles). For each $i \in \{1, 2, \ldots, s\}$ assume $\sigma$ has $k_i$ cycles of
length $m_i$ (so that $\sum_{i=1}^{s} k_i m_i = n$). Prove that the number of conjugates of $\sigma$ is

$\dfrac{n!}{(k_1!\, m_1^{k_1})(k_2!\, m_2^{k_2}) \cdots (k_s!\, m_s^{k_s})}.$

[See Exercises 6 and 7 in Section 1.3 where this formula was given in some special cases.]

34. Prove that if p is a prime and P is a subgroup of $S_p$ of order p, then $|N_{S_p}(P)| = p(p-1)$.
[Argue that every conjugate of P contains exactly $p - 1$ p-cycles and use the formula for
the number of p-cycles to compute the index of $N_{S_p}(P)$ in $S_p$.]

35. Let p be a prime. Find a formula for the number of conjugacy classes of elements of order
p in $S_n$ (using the greatest integer function).

36. Let $\pi : G \to S_G$ be the left regular representation afforded by the action of G on itself by
left multiplication. For each $g \in G$ denote the permutation $\pi(g)$ by $\sigma_g$, so that $\sigma_g(x) = gx$
for all $x \in G$. Let $\lambda : G \to S_G$ be the permutation representation afforded by the
corresponding right action of G on itself, and for each $h \in G$ denote the permutation $\lambda(h)$
by $\tau_h$. Thus $\tau_h(x) = xh^{-1}$ for all $x \in G$ ($\lambda$ is called the right regular representation of
G).

(a) Prove that $\sigma_g$ and $\tau_h$ commute for all $g, h \in G$. (Thus the centralizer in $S_G$ of $\pi(G)$
contains the subgroup $\lambda(G)$, which is isomorphic to G).

(b) Prove that $\sigma_g = \tau_g$ if and only if g is an element of order 1 or 2 in the center of G.

(c) Prove that $\sigma_g = \tau_h$ if and only if g and h lie in the center of G. Deduce that
$\pi(G) \cap \lambda(G) = \pi(Z(G)) = \lambda(Z(G))$.
4.4 AUTOMORPHISMS


Definition. Let G be a group. An isomorphism from G onto itself is called an 
automorphism of G. The set of all automorphisms of G is denoted by Aut(G). 

We leave as an exercise the simple verification that Aut(G) is a group under compo- 
sition of automorphisms, the automorphism group of G (composition of automorphisms 
is defined since the domain and range of each automorphism is the same). Notice that 
automorphisms of a group G are, in particular, permutations of the set G so Aut(G) is 
a subgroup of So- 

One of the most important examples of an automorphism of a group G is provided 
by conjugation by a fixed element in G. The next result discusses this in a slightly more 
general context.

Proposition 13. Let H be a normal subgroup of the group G. Then G acts by conjugation
on H as automorphisms of H. More specifically, the action of G on H by
conjugation is defined for each $g \in G$ by

$$h \mapsto ghg^{-1} \quad \text{for each } h \in H.$$

For each $g \in G$, conjugation by g is an automorphism of H. The permutation representation
afforded by this action is a homomorphism of G into Aut(H) with kernel
$C_G(H)$. In particular, $G/C_G(H)$ is isomorphic to a subgroup of Aut(H).

Proof: (cf. Exercise 17, Section 1.7) Let $\varphi_g$ be conjugation by g. Note that because
g normalizes H, $\varphi_g$ maps H to itself. Since we have already seen that conjugation
defines an action, it follows that $\varphi_1 = 1$ (the identity map on H) and $\varphi_a \circ \varphi_b = \varphi_{ab}$
for all $a, b \in G$. Thus each $\varphi_g$ gives a bijection from H to itself since it has a 2-sided
inverse $\varphi_{g^{-1}}$. Each $\varphi_g$ is a homomorphism from H to H because

$$\varphi_g(hk) = g(hk)g^{-1} = gh(g^{-1}g)kg^{-1} = (ghg^{-1})(gkg^{-1}) = \varphi_g(h)\varphi_g(k)$$

for all $h, k \in H$. This proves that conjugation by any fixed element of G defines an
automorphism of H.

By the preceding remark, the permutation representation $\psi : G \to S_H$ defined by
$\psi(g) = \varphi_g$ (which we have already proved is a homomorphism) has image contained
in the subgroup Aut(H) of $S_H$. Finally,

$$\ker \psi = \{g \in G \mid \varphi_g = \mathrm{id}\} = \{g \in G \mid ghg^{-1} = h \text{ for all } h \in H\} = C_G(H).$$

The First Isomorphism Theorem implies the final statement of the proposition. 
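
The proposition can be checked by machine on a small case. The following Python sketch is only an illustration and not part of the proof: it assumes G = S_3 and H = A_3, represents permutations of {0, 1, 2} as tuples, and the helper names (compose, inverse, conjugate, is_even) are our own. It verifies that each conjugation map sends H to H and respects products, and it computes the kernel of the action alongside the centralizer of H.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p); permutations are tuples."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugate(g, h):
    """Return g h g^{-1}."""
    return compose(compose(g, h), inverse(g))

def is_even(p):
    """A permutation is even exactly when it has an even number of inversions."""
    n = len(p)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
    return inversions % 2 == 0

G = list(permutations(range(3)))   # S_3 (assumed example group)
H = [h for h in G if is_even(h)]   # A_3, a normal subgroup of S_3

# Each conjugation map phi_g sends H into H and respects products in H.
for g in G:
    assert all(conjugate(g, h) in H for h in H)
    assert all(
        conjugate(g, compose(h, k)) == compose(conjugate(g, h), conjugate(g, k))
        for h in H for k in H
    )

# Kernel of g -> phi_g: the g that fix every h under conjugation.
kernel = [g for g in G if all(conjugate(g, h) == h for h in H)]
# Centralizer of H computed directly from the commuting condition.
centralizer = [g for g in G if all(compose(g, h) == compose(h, g) for h in H)]
```

In this case the kernel is A_3 itself, so G/C_G(H) has order 2, in agreement with the fact that the cyclic group of order 3 has exactly two automorphisms.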

Proposition 13 shows that a group acts by conjugation on a normal subgroup as 
structure preserving permutations, i.e., as automorphisms. In particular, this action 
must send subgroups to subgroups, elements of order n to elements of order n, etc. Two 
specific applications of this proposition are described in the next two corollaries. 

Corollary 14. If K is any subgroup of the group G and $g \in G$, then $K \cong gKg^{-1}$.
Conjugate elements and conjugate subgroups have the same order.

Proof: Letting G = H in the proposition shows that conjugation by $g \in G$ is an
automorphism of G, from which the corollary follows. 

Corollary 15. For any subgroup H of a group G, the quotient group $N_G(H)/C_G(H)$
is isomorphic to a subgroup of Aut(H). In particular, G/Z(G) is isomorphic to a
subgroup of Aut(G).

Proof: Since H is a normal subgroup of the group $N_G(H)$, Proposition 13 (applied
with $N_G(H)$ playing the role of G) implies the first assertion. The second assertion is
the special case when H = G, in which case $N_G(G) = G$ and $C_G(G) = Z(G)$.

Definition. Let G be a group and let $g \in G$. Conjugation by g is called an inner
automorphism of G and the subgroup of Aut(G) consisting of all inner automorphisms 
is denoted by Inn(G). 

Note that the collection of inner automorphisms of G is in fact a subgroup of Aut(G) 
and that by Corollary 15, $\operatorname{Inn}(G) \cong G/Z(G)$. Note also that if H is a normal subgroup
of G, conjugation by an element of G when restricted to H is an automorphism of H 
but need not be an inner automorphism of H (as we shall see). 

Examples 

(1) A group G is abelian if and only if every inner automorphism is trivial. If H is an
abelian normal subgroup of G and H is not contained in Z(G), then there is some
$g \in G$ such that conjugation by g restricted to H is not an inner automorphism of
H. An explicit example of this is $G = A_4$, H is the Klein 4-group in G and g is any
3-cycle.

(2) Since $Z(Q_8) = \langle -1 \rangle$ we have $\operatorname{Inn}(Q_8) \cong V_4$.

(3) Since $Z(D_8) = \langle r^2 \rangle$ we have $\operatorname{Inn}(D_8) \cong V_4$.

(4) Since for all $n \ge 3$, $Z(S_n) = 1$ we have $\operatorname{Inn}(S_n) \cong S_n$.

Corollary 15 shows that any information we have about the automorphism group
of a subgroup H of a group G translates into information about $N_G(H)/C_G(H)$. For
example, if $H \cong Z_2$, then since H has unique elements of orders 1 and 2, Corollary 14
forces Aut(H) = 1. Thus if $H \cong Z_2$, $N_G(H) = C_G(H)$; if in addition H is a normal
subgroup of G, then $H \le Z(G)$ (cf. Exercise 10, Section 2.2).

Although the preceding example was fairly trivial, it illustrates that the action of 
G by conjugation on a normal subgroup H can be restricted by knowledge of the 
automorphism group of H . This in turn can be used to investigate the structure of G 
and will lead to some classification theorems when we consider semidirect products in 
Section 5.5.

A notion which will be used in later sections most naturally warrants introduction 
here: 

Definition. A subgroup H of a group G is called characteristic in G, denoted H char G,
if every automorphism of G maps H to itself, i.e., $\sigma(H) = H$ for all $\sigma \in \operatorname{Aut}(G)$.

Results concerning characteristic subgroups which we shall use later (and whose 
proofs are relegated to the exercises) are 

(1) characteristic subgroups are normal,

(2) if H is the unique subgroup of G of a given order, then H is characteristic in G,
and

(3) if K char H and $H \trianglelefteq G$, then $K \trianglelefteq G$ (so although “normality” is not a transitive
property (i.e., a normal subgroup of a normal subgroup need not be normal), a
characteristic subgroup of a normal subgroup is normal).

Thus we may think of characteristic subgroups as “strongly normal” subgroups. For 
example, property (2) and Theorem 2.7 imply that every subgroup of a cyclic group is 
characteristic. 

We close this section with some results on automorphism groups of specific groups. 

Proposition 16. The automorphism group of the cyclic group of order n is isomorphic
to $(\mathbb{Z}/n\mathbb{Z})^\times$, an abelian group of order $\varphi(n)$ (where $\varphi$ is Euler’s function).

Proof: Let x be a generator of the cyclic group $Z_n$. If $\psi \in \operatorname{Aut}(Z_n)$, then $\psi(x) = x^a$
for some $a \in \mathbb{Z}$ and the integer a uniquely determines $\psi$. Denote this automorphism
by $\psi_a$. As usual, since $|x| = n$, the integer a is only defined mod n. Since $\psi_a$ is an
automorphism, x and $x^a$ must have the same order, hence (a, n) = 1. Furthermore, for
every a relatively prime to n, the map $x \mapsto x^a$ is an automorphism of $Z_n$. Hence we
have a surjective map

$$\Psi : \operatorname{Aut}(Z_n) \to (\mathbb{Z}/n\mathbb{Z})^\times, \qquad \psi_a \mapsto a \pmod{n}.$$

The map $\Psi$ is a homomorphism because

$$\psi_a \circ \psi_b(x) = \psi_a(x^b) = (x^b)^a = x^{ab} = \psi_{ab}(x)$$

for all $\psi_a, \psi_b \in \operatorname{Aut}(Z_n)$, so that

$$\Psi(\psi_a \circ \psi_b) = \Psi(\psi_{ab}) = ab \pmod{n} = \Psi(\psi_a)\Psi(\psi_b).$$

Finally, $\Psi$ is clearly injective, hence is an isomorphism.

A complete description of the isomorphism type of $\operatorname{Aut}(Z_n)$ is given at the end of
Section 9.5. 
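
Proposition 16 is easy to test numerically. The sketch below is an illustration under our own conventions: it writes the cyclic group additively as Z/nZ (so the map x -> x^a of the text becomes x -> a*x), and the function names are hypothetical. It finds by brute force the multipliers a for which x -> a*x is a bijection and compares their number with Euler's function.

```python
from math import gcd

def automorphisms_of_cyclic(n):
    """Return the multipliers a in {0, ..., n-1} for which x -> a*x is a bijection of Z/nZ.

    Every such map is additive, so bijectivity is the only condition to test.
    """
    units = []
    for a in range(n):
        images = {(a * x) % n for x in range(n)}
        if len(images) == n:
            units.append(a)
    return units

def euler_phi(n):
    """Euler's function: the number of integers in {1, ..., n} coprime to n."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)
```

For example, `automorphisms_of_cyclic(8)` returns the four odd residues mod 8, matching the order of $(\mathbb{Z}/8\mathbb{Z})^\times$.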

Example 

Assume G is a group of order pq, where p and q are primes (not necessarily distinct) with
$p \le q$. If $p \nmid q - 1$, we prove G is abelian.

If $Z(G) \ne 1$, Lagrange’s Theorem forces G/Z(G) to be cyclic, hence G is abelian by
Exercise 36, Section 3.1. Hence we may assume Z(G) = 1.

If every nonidentity element of G has order p, then the centralizer of every nonidentity
element has index q, so the class equation for G reads

$$pq = 1 + kq.$$

This is impossible since q divides pq and kq but not 1. Thus G contains an element, x, of
order q.

Let $H = \langle x \rangle$. Since H has index p and p is the smallest prime dividing |G|, the
subgroup H is normal in G by Corollary 5. Since Z(G) = 1, we must have $C_G(H) = H$.
Thus $G/H = N_G(H)/C_G(H)$ is a group of order p isomorphic to a subgroup of Aut(H)
by Corollary 15. But by Proposition 16, Aut(H) has order $\varphi(q) = q - 1$, which by
Lagrange’s Theorem would imply $p \mid q - 1$, contrary to assumption. This shows that G
must be abelian.

One can check that every group of order pq, where p and q are distinct primes
with $p < q$ and $p \nmid q - 1$, is cyclic (see the exercises). This is the first instance where
there is a unique isomorphism type of group whose order is composite. For instance, 
every group of order 15 is cyclic. 
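
The arithmetic hypothesis in this example is simple to tabulate. The following sketch is purely illustrative (the function names are ours): it tests the condition that p does not divide q - 1 and lists the orders pq up to a given bound, with p < q distinct primes, to which the remark above applies.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    return n >= 2 and all(n % d for d in range(2, int(n ** 0.5) + 1))

def pq_forces_abelian(p, q):
    """For primes p <= q, return True when p does not divide q - 1."""
    if not (is_prime(p) and is_prime(q) and p <= q):
        raise ValueError("expected primes p <= q")
    return (q - 1) % p != 0

def cyclic_only_orders(limit):
    """Orders pq <= limit with p < q distinct primes and p not dividing q - 1."""
    primes = [n for n in range(2, limit + 1) if is_prime(n)]
    return sorted(
        p * q
        for p in primes
        for q in primes
        if p < q and p * q <= limit and pq_forces_abelian(p, q)
    )
```

Note that no even order appears in the output, since 2 divides q - 1 for every odd prime q.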

The next proposition summarizes some results on automorphism groups of known 
groups and will be proved later. Part 3 of this proposition illustrates how the theory of 
vector spaces comes into play in group theory. 

Proposition 17. 

(1) If p is an odd prime and $n \in \mathbb{Z}^+$, then the automorphism group of the cyclic
group of order p is cyclic of order $p - 1$. More generally, the automorphism
group of the cyclic group of order $p^n$ is cyclic of order $p^{n-1}(p - 1)$ (cf. Corollary
20, Section 9.5).

(2) For all $n \ge 3$ the automorphism group of the cyclic group of order $2^n$ is isomorphic
to $Z_2 \times Z_{2^{n-2}}$, and in particular is not cyclic but has a cyclic subgroup
of index 2 (cf. Corollary 20, Section 9.5).

(3) Let p be a prime and let V be an abelian group (written additively) with the
property that $pv = 0$ for all $v \in V$. If $|V| = p^n$, then V is an n-dimensional
vector space over the field $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$. The automorphisms of V are precisely
the nonsingular linear transformations from V to itself, that is

$$\operatorname{Aut}(V) \cong GL(V) \cong GL_n(\mathbb{F}_p).$$

In particular, the order of Aut(V) is as given in Section 1.4 (cf. the examples in
Sections 10.2 and 11.1).

(4) For all $n \ne 6$ we have $\operatorname{Aut}(S_n) = \operatorname{Inn}(S_n) \cong S_n$ (cf. Exercise 18). For $n = 6$ we
have $|\operatorname{Aut}(S_6) : \operatorname{Inn}(S_6)| = 2$ (cf. the following Exercise 19 and also Exercise
10 in Section 6.3).

(5) $\operatorname{Aut}(D_8) \cong D_8$ and $\operatorname{Aut}(Q_8) \cong S_4$ (cf. the following Exercises 4 and 5 and also
Exercise 9 in Section 6.3).

The group V described in Part 3 of the proposition is called the elementary abelian
group of order $p^n$ (we shall see in Chapter 5 that it is uniquely determined up to
isomorphism by p and n). The Klein 4-group, $V_4$, is the elementary abelian group of
order 4. This proposition asserts that

$$\operatorname{Aut}(V_4) \cong GL_2(\mathbb{F}_2).$$

By the exercises in Section 1.4, the latter group has order 6. But $\operatorname{Aut}(V_4)$ permutes
the 3 nonidentity elements of $V_4$, and this action of $\operatorname{Aut}(V_4)$ on $V_4 - \{1\}$ gives an
injective permutation representation of $\operatorname{Aut}(V_4)$ into $S_3$. By order considerations, the
homomorphism is onto, so

$$\operatorname{Aut}(V_4) \cong GL_2(\mathbb{F}_2) \cong S_3.$$

Note that $V_4$ is abelian, so $\operatorname{Inn}(V_4) = 1$.
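
The count of automorphisms of the Klein 4-group can be confirmed by exhaustive search. In the sketch below, which is only an illustration, the group is modeled additively as pairs of bits (an assumption of ours), and the function name automorphisms is hypothetical. Every bijection of the four elements is tested for compatibility with the group operation.

```python
from itertools import permutations, product

V4 = list(product(range(2), repeat=2))   # elements of Z_2 x Z_2 as pairs of bits

def add(u, v):
    """Componentwise addition mod 2."""
    return ((u[0] + v[0]) % 2, (u[1] + v[1]) % 2)

def automorphisms(elements, op):
    """Brute force: all bijections f of `elements` with f(op(u, v)) == op(f(u), f(v))."""
    found = []
    for images in permutations(elements):
        f = dict(zip(elements, images))
        if all(f[op(u, v)] == op(f[u], f[v]) for u in elements for v in elements):
            found.append(f)
    return found

auts = automorphisms(V4, add)

# Record how each automorphism permutes the three nonidentity elements.
nonidentity = [v for v in V4 if v != (0, 0)]
induced = {tuple(f[v] for v in nonidentity) for f in auts}
```

The search finds six automorphisms, and they induce all six permutations of the nonidentity elements, which is the surjectivity onto $S_3$ used in the text.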

For any prime p, the elementary abelian group of order $p^2$ is $Z_p \times Z_p$. Its automorphism
group, $GL_2(\mathbb{F}_p)$, has order $p(p - 1)^2(p + 1)$. Thus Corollary 9 implies that
for p a prime

$$\text{if } |P| = p^2, \quad |\operatorname{Aut}(P)| = p(p - 1) \text{ or } p(p - 1)^2(p + 1)$$

according to whether P is cyclic or elementary abelian, respectively.
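
The order formula for $GL_2(\mathbb{F}_p)$ can be checked directly for small primes. The following sketch is an illustration only (the function name order_gl2 is ours): it counts the 2 x 2 matrices with entries in Z/pZ whose determinant is nonzero mod p.

```python
from itertools import product

def order_gl2(p):
    """Count matrices [[a, b], [c, d]] over Z/pZ with a*d - b*c nonzero mod p."""
    return sum(
        1
        for a, b, c, d in product(range(p), repeat=4)
        if (a * d - b * c) % p != 0
    )
```

For p = 3 this gives 48, the value used for the elementary abelian group of order 9 in the example that follows.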

Example 

Suppose G is a group of order $45 = 3^2 \cdot 5$ with a normal subgroup P of order $3^2$. We show
that G is necessarily abelian.

The quotient $G/C_G(P)$ is isomorphic to a subgroup of Aut(P) by Corollary 15, and
Aut(P) has order 6 or 48 (according to whether P is cyclic or elementary abelian, respectively)
by the preceding paragraph. On the other hand, since the order of P is the square
of a prime, P is an abelian group, hence $P \le C_G(P)$. It follows that $|C_G(P)|$ is divisible
by 9, which implies $|G/C_G(P)|$ is 1 or 5. Together these imply $|G/C_G(P)| = 1$, i.e.,
$C_G(P) = G$ and $P \le Z(G)$. Since then G/Z(G) is cyclic, G must be an abelian group.
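
The divisibility bookkeeping in this example can be spelled out mechanically. The sketch below is an illustration with names of our own choosing: the possible values of $|G/C_G(P)|$ are the divisors of 45/9 = 5, and only those that also divide the order of Aut(P) survive.

```python
def divisors(n):
    """All positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]

order_G, order_P = 45, 9

# Since P <= C_G(P), the index |G : C_G(P)| divides |G| / |P| = 5.
index_candidates = divisors(order_G // order_P)

def surviving_indices(order_aut):
    """Candidate indices that also divide |Aut(P)|, as Lagrange's Theorem requires."""
    return [d for d in index_candidates if order_aut % d == 0]
```

Both possible orders of Aut(P), namely 6 and 48, leave only the index 1, which is the conclusion $C_G(P) = G$.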


EXERCISES 


Let G be a group. 

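1. If $\sigma \in \operatorname{Aut}(G)$ and $\varphi_g$ is conjugation by g prove $\sigma \varphi_g \sigma^{-1} = \varphi_{\sigma(g)}$. Deduce that $\operatorname{Inn}(G) \trianglelefteq \operatorname{Aut}(G)$.
(The group Aut(G)/Inn(G) is called the outer automorphism group of G.)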

2. Prove that if G is an abelian group of order pq, where p and q are distinct primes, then G 
is cyclic. [Use Cauchy’s Theorem to produce elements of order p and q and consider the 
order of their product.]

3. Prove that under any automorphism of $D_8$, r has at most 2 possible images and s has at
most 4 possible images (r and s are the usual generators, cf. Section 1.2). Deduce that
$|\operatorname{Aut}(D_8)| \le 8$.

4. Use arguments similar to those in the preceding exercise to show $|\operatorname{Aut}(Q_8)| \le 24$.

5. Use the fact that $D_8 \trianglelefteq D_{16}$ to prove that $\operatorname{Aut}(D_8) \cong D_8$.

6. Prove that characteristic subgroups are normal. Give an example of a normal subgroup
that is not characteristic.

7. If H is the unique subgroup of a given order in a group G prove H is characteristic in G.

8. Let G be a group with subgroups H and K with $H \le K$.

(a) Prove that if H is characteristic in K and K is normal in G then H is normal in G. 

(b) Prove that if H is characteristic in K and K is characteristic in G then H is characteristic
in G. Use this to prove that the Klein 4-group $V_4$ is characteristic in $S_4$.

(c) Give an example to show that if H is normal in K and K is characteristic in G then 
H need not be normal in G. 

9. If r, s are the usual generators for the dihedral group $D_{2n}$, use the preceding two exercises
to deduce that every subgroup of $\langle r \rangle$ is normal in $D_{2n}$.

10. Let G be a group, let A be an abelian normal subgroup of G, and write $\bar{G} = G/A$. Show
that $\bar{G}$ acts (on the left) by conjugation on A by $\bar{g} \cdot a = gag^{-1}$, where g is any representative
of the coset $\bar{g}$ (in particular, show that this action is well defined). Give an explicit example
to show that this action is not well defined if A is non-abelian.

11. If p is a prime and P is a subgroup of $S_p$ of order p, prove $N_{S_p}(P)/C_{S_p}(P) \cong \operatorname{Aut}(P)$.
[Use Exercise 34, Section 3.] 

12. Let G be a group of order 3825. Prove that if H is a normal subgroup of order 17 in G
then $H \le Z(G)$.

13. Let G be a group of order 203. Prove that if H is a normal subgroup of order 7 in G then
$H \le Z(G)$. Deduce that G is abelian in this case.

14. Let G be a group of order 1575. Prove that if H is a normal subgroup of order 9 in G then
$H \le Z(G)$.

15. Prove that each of the following (multiplicative) groups is cyclic: $(\mathbb{Z}/5\mathbb{Z})^\times$, $(\mathbb{Z}/9\mathbb{Z})^\times$ and
$(\mathbb{Z}/18\mathbb{Z})^\times$.

16. Prove that $(\mathbb{Z}/24\mathbb{Z})^\times$ is an elementary abelian group of order 8. (We shall see later that
$(\mathbb{Z}/n\mathbb{Z})^\times$ is an elementary abelian group if and only if $n \mid 24$.)

17. Let $G = \langle x \rangle$ be a cyclic group of order n. For n = 2, 3, 4, 5, 6 write out the elements
of Aut(G) explicitly (by Proposition 16 above we know $\operatorname{Aut}(G) \cong (\mathbb{Z}/n\mathbb{Z})^\times$, so for each
element $a \in (\mathbb{Z}/n\mathbb{Z})^\times$, write out explicitly what the automorphism $\psi_a$ does to the elements
$\{1, x, x^2, \ldots, x^{n-1}\}$ of G).

18. This exercise shows that for $n \ne 6$ every automorphism of $S_n$ is inner. Fix an integer $n \ge 2$
with $n \ne 6$.

(a) Prove that the automorphism group of a group G permutes the conjugacy classes of
G, i.e., for each $\sigma \in \operatorname{Aut}(G)$ and each conjugacy class $\mathcal{K}$ of G the set $\sigma(\mathcal{K})$ is also a
conjugacy class of G.

(b) Let $\mathcal{K}$ be the conjugacy class of transpositions in $S_n$ and let $\mathcal{K}'$ be the conjugacy class
of any element of order 2 in $S_n$ that is not a transposition. Prove that $|\mathcal{K}| \ne |\mathcal{K}'|$.
Deduce that any automorphism of $S_n$ sends transpositions to transpositions. [See
Exercise 33 in Section 3.]

(c) Prove that for each $\sigma \in \operatorname{Aut}(S_n)$

$$\sigma : (1\ 2) \mapsto (a\ b_2), \quad \sigma : (1\ 3) \mapsto (a\ b_3), \quad \ldots, \quad \sigma : (1\ n) \mapsto (a\ b_n)$$

for some distinct integers $a, b_2, b_3, \ldots, b_n \in \{1, 2, \ldots, n\}$.

(d) Show that $(1\ 2), (1\ 3), \ldots, (1\ n)$ generate $S_n$ and deduce that any automorphism
of $S_n$ is uniquely determined by its action on these elements. Use (c) to show that $S_n$
has at most n! automorphisms and conclude that $\operatorname{Aut}(S_n) = \operatorname{Inn}(S_n)$ for $n \ne 6$.

19. This exercise shows that $|\operatorname{Aut}(S_6) : \operatorname{Inn}(S_6)| \le 2$ (Exercise 10 in Section 6.3 shows that
equality holds by exhibiting an automorphism of $S_6$ that is not inner).

(a) Let $\mathcal{K}$ be the conjugacy class of transpositions in $S_6$ and let $\mathcal{K}'$ be the conjugacy class
of any element of order 2 in $S_6$ that is not a transposition. Prove that $|\mathcal{K}| \ne |\mathcal{K}'|$ unless
$\mathcal{K}'$ is the conjugacy class of products of three disjoint transpositions. Deduce that
$\operatorname{Aut}(S_6)$ has a subgroup of index at most 2 which sends transpositions to transpositions.

(b) Prove that $|\operatorname{Aut}(S_6) : \operatorname{Inn}(S_6)| \le 2$. [Follow the same steps as in (c) and (d) of
the preceding exercise to show that any automorphism that sends transpositions to
transpositions is inner.]

The next exercise introduces a subgroup, J(P), which (like the center of P) is defined for an
arbitrary finite group P (although in most applications P is a group whose order is a power of 
a prime). This subgroup was defined by J. Thompson in 1964 and it now plays a pivotal role 
in the study of finite groups, in particular, in the classification of finite simple groups. 

20. For any finite group P let d(P) be the minimum number of generators of P (so, for
example, d(P) = 1 if and only if P is a nontrivial cyclic group and $d(Q_8) = 2$). Let m(P)
be the maximum of the integers d(A) as A runs over all abelian subgroups of P (so, for
example, $m(Q_8) = 1$ and $m(D_8) = 2$). Define

$$J(P) = \langle A \mid A \text{ is an abelian subgroup of } P \text{ with } d(A) = m(P) \rangle.$$

(J(P) is called the Thompson subgroup of P.)

(a) Prove that J(P) is a characteristic subgroup of P.

(b) For each of the following groups P list all abelian subgroups A of P that satisfy
d(A) = m(P): $Q_8$, $D_8$, $D_{16}$ and $QD_{16}$ (where $QD_{16}$ is the quasidihedral group
of order 16 defined in Exercise 11 of Section 2.5). [Use the lattices of subgroups for
these groups in Section 2.5.]

(c) Show that $J(Q_8) = Q_8$, $J(D_8) = D_8$, $J(D_{16}) = D_{16}$ and $J(QD_{16})$ is a dihedral
subgroup of order 8 in $QD_{16}$.

(d) Prove that if $Q \le P$ and J(P) is a subgroup of Q, then J(P) = J(Q). Deduce that if
P is a subgroup (not necessarily normal) of the finite group G and J(P) is contained
in some subgroup Q of P such that $Q \trianglelefteq G$, then $J(P) \trianglelefteq G$.


4.5 SYLOW’S THEOREM

In this section we prove a partial converse to Lagrange’s Theorem and derive numerous 
consequences, some of which will lead to classification theorems in the next chapter. 

Definition. Let G be a group and let p be a prime. 

(1) A group of order $p^\alpha$ for some $\alpha \ge 1$ is called a p-group. Subgroups of G which
are p-groups are called p-subgroups.

(2) If G is a group of order $p^\alpha m$, where $p \nmid m$, then a subgroup of order $p^\alpha$ is called
a Sylow p-subgroup of G.

(3) The set of Sylow p-subgroups of G will be denoted by $\operatorname{Syl}_p(G)$ and the number
of Sylow p-subgroups of G will be denoted by $n_p(G)$ (or just $n_p$ when G is
clear from the context).

Theorem 18. (Sylow’s Theorem) Let G be a group of order $p^\alpha m$, where p is a prime
not dividing m.

(1) Sylow p-subgroups of G exist, i.e., $\operatorname{Syl}_p(G) \ne \emptyset$.

(2) If P is a Sylow p-subgroup of G and Q is any p-subgroup of G, then there
exists $g \in G$ such that $Q \le gPg^{-1}$, i.e., Q is contained in some conjugate of
P. In particular, any two Sylow p-subgroups of G are conjugate in G.

(3) The number of Sylow p-subgroups of G is of the form $1 + kp$, i.e.,

$$n_p \equiv 1 \pmod{p}.$$

Further, $n_p$ is the index in G of the normalizer $N_G(P)$ for any Sylow p-subgroup
P, hence $n_p$ divides m.
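
Part (3) of the theorem translates into a short computation. The sketch below is an illustration, not part of the text: the function name is ours, and it assumes p is a prime. It strips the full power of p from the group order and returns the divisors of the remaining factor m that are congruent to 1 mod p, which are the only values of $n_p$ the theorem allows.

```python
def sylow_count_candidates(order, p):
    """Values of n_p permitted by Sylow's Theorem for a group of the given order.

    Writes order = p^a * m with p not dividing m (p is assumed prime) and
    returns the divisors d of m with d congruent to 1 mod p.
    """
    m = order
    while m % p == 0:
        m //= p
    return [d for d in range(1, m + 1) if m % d == 0 and d % p == 1]
```

For a group of order 30 the candidates are 1 or 6 for p = 5 and 1 or 10 for p = 3. The list only restricts $n_p$; deciding which candidate actually occurs requires further argument.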

We first prove the following lemma:

Lemma 19. Let $P \in \operatorname{Syl}_p(G)$. If Q is any p-subgroup of G, then $Q \cap N_G(P) = Q \cap P$.


Proof: Let $H = N_G(P) \cap Q$. Since $P \le N_G(P)$ it is clear that $P \cap Q \le H$, so
we must prove the reverse inclusion. Since by definition $H \le Q$, this is equivalent to
showing $H \le P$. We do this by demonstrating that PH is a p-subgroup of G containing
both P and H; but P is a p-subgroup of G of largest possible order, so we must have
PH = P, i.e., $H \le P$.

Since $H \le N_G(P)$, by Corollary 15 in Section 3.2, PH is a subgroup. By Proposition 13
in the same section

$$|PH| = \frac{|P|\,|H|}{|P \cap H|}.$$

All the numbers in the above quotient are powers of p, so PH is a p-group. Moreover,
P is a subgroup of PH so the order of PH is divisible by $p^\alpha$, the largest power of
p which divides |G|. These two facts force $|PH| = p^\alpha = |P|$. This in turn implies
P = PH and $H \le P$. This establishes the lemma.


Proof of Sylow’s Theorem: (1) Proceed by induction on |G|. If |G| = 1, there is nothing
to prove. Assume inductively the existence of Sylow p-subgroups for all groups of
order less than |G|.

If p divides |Z(G)|, then by Cauchy’s Theorem for abelian groups (Proposition 21,
Section 3.4) Z(G) has a subgroup, N, of order p. Let $\bar{G} = G/N$, so that $|\bar{G}| = p^{\alpha-1}m$.
By induction, $\bar{G}$ has a subgroup $\bar{P}$ of order $p^{\alpha-1}$. If we let P be the subgroup of G
containing N such that $P/N = \bar{P}$ then $|P| = |P/N| \cdot |N| = p^\alpha$ and P is a Sylow
p-subgroup of G. We are reduced to the case when p does not divide |Z(G)|.

Let $g_1, g_2, \ldots, g_r$ be representatives of the distinct non-central conjugacy classes
of G. The class equation for G is

$$|G| = |Z(G)| + \sum_{i=1}^{r} |G : C_G(g_i)|.$$

If $p \mid |G : C_G(g_i)|$ for all i, then since $p \mid |G|$, we would also have $p \mid |Z(G)|$,
a contradiction. Thus for some i, p does not divide $|G : C_G(g_i)|$. For this i let
$H = C_G(g_i)$ so that

$$|H| = p^\alpha k, \quad \text{where } p \nmid k.$$

Since $g_i \notin Z(G)$, $|H| < |G|$. By induction, H has a Sylow p-subgroup, P, which of
course is also a subgroup of G. Since $|P| = p^\alpha$, P is a Sylow p-subgroup of G. This
completes the induction and establishes (1).

Before proving (2) and (3) we make some calculations. By (1) there exists a Sylow
p-subgroup, P, of G. Let

$$\{P_1, P_2, \ldots, P_r\} = \mathcal{S}$$

be the set of all conjugates of P (i.e., $\mathcal{S} = \{gPg^{-1} \mid g \in G\}$) and let Q be any
p-subgroup of G. By definition of $\mathcal{S}$, G, hence also Q, acts by conjugation on $\mathcal{S}$. Write
$\mathcal{S}$ as a disjoint union of orbits under this action by Q:

$$\mathcal{S} = \mathcal{O}_1 \cup \mathcal{O}_2 \cup \cdots \cup \mathcal{O}_s$$

where $r = |\mathcal{O}_1| + \cdots + |\mathcal{O}_s|$. Keep in mind that r does not depend on Q but the number
of Q-orbits s does (note that by definition, G has only one orbit on $\mathcal{S}$ but a subgroup Q of
G may have more than one orbit). Renumber the elements of $\mathcal{S}$ if necessary so that the
first s elements of $\mathcal{S}$ are representatives of the Q-orbits: $P_i \in \mathcal{O}_i$, $1 \le i \le s$. It follows
from Proposition 2 that $|\mathcal{O}_i| = |Q : N_Q(P_i)|$. By definition, $N_Q(P_i) = N_G(P_i) \cap Q$
and by Lemma 19, $N_G(P_i) \cap Q = P_i \cap Q$. Combining these two facts gives

$$|\mathcal{O}_i| = |Q : P_i \cap Q|, \quad 1 \le i \le s. \tag{4.1}$$

We are now in a position to prove that $r \equiv 1 \pmod{p}$. Since Q was arbitrary we
may take $Q = P_1$ above, so that (4.1) gives

$$|\mathcal{O}_1| = 1.$$

Now, for all $i > 1$, $P_1 \ne P_i$, so $P_1 \cap P_i < P_1$. By (4.1)

$$|\mathcal{O}_i| = |P_1 : P_1 \cap P_i| > 1, \quad 2 \le i \le s.$$

Since $P_1$ is a p-group, $|P_1 : P_1 \cap P_i|$ must be a power of p, so that

$$p \mid |\mathcal{O}_i|, \quad 2 \le i \le s.$$

Thus

$$r = |\mathcal{O}_1| + (|\mathcal{O}_2| + \cdots + |\mathcal{O}_s|) \equiv 1 \pmod{p}.$$


We now prove parts (2) and (3). Let Q be any p-subgroup of G. Suppose Q is
not contained in $P_i$ for any $i \in \{1, 2, \ldots, r\}$ (i.e., $Q \not\le gPg^{-1}$ for any $g \in G$). In this
situation, $Q \cap P_i < Q$ for all i, so by (4.1)

$$|\mathcal{O}_i| = |Q : Q \cap P_i| > 1, \quad 1 \le i \le s.$$

Thus $p \mid |\mathcal{O}_i|$ for all i, so p divides $|\mathcal{O}_1| + \cdots + |\mathcal{O}_s| = r$. This contradicts the fact that
$r \equiv 1 \pmod{p}$ (remember, r does not depend on the choice of Q). This contradiction
proves $Q \le gPg^{-1}$ for some $g \in G$.

To see that all Sylow p-subgroups of G are conjugate, let Q be any Sylow p-subgroup
of G. By the preceding argument, $Q \le gPg^{-1}$ for some $g \in G$. Since
$|gPg^{-1}| = |Q| = p^\alpha$, we must have $gPg^{-1} = Q$. This establishes part (2) of the
theorem. In particular, $\mathcal{S} = \operatorname{Syl}_p(G)$ since every Sylow p-subgroup of G is conjugate
to P, and so $n_p = r \equiv 1 \pmod{p}$, which is the first part of (3).

Finally, since all Sylow p-subgroups are conjugate, Proposition 6 shows that

$$n_p = |G : N_G(P)| \quad \text{for any } P \in \operatorname{Syl}_p(G),$$

completing the proof of Sylow’s Theorem.


Note that the conjugacy part of Sylow’s Theorem together with Corollary 14 shows 
that any two Sylow p-subgroups of a group (for the same prime p) are isomorphic.

Corollary 20. Let P be a Sylow p-subgroup of G. Then the following are equivalent:

(1) P is the unique Sylow p-subgroup of G, i.e., $n_p = 1$

(2) P is normal in G

(3) P is characteristic in G

(4) All subgroups generated by elements of p-power order are p-groups, i.e., if X
is any subset of G such that |x| is a power of p for all $x \in X$, then $\langle X \rangle$ is a
p-group.


Proof: If (1) holds, then $gPg^{-1} = P$ for all $g \in G$ since $gPg^{-1} \in \operatorname{Syl}_p(G)$, i.e., P
is normal in G. Hence (1) implies (2). Conversely, if $P \trianglelefteq G$ and $Q \in \operatorname{Syl}_p(G)$, then by
Sylow’s Theorem there exists $g \in G$ such that $Q = gPg^{-1} = P$. Thus $\operatorname{Syl}_p(G) = \{P\}$
and (2) implies (1).

Since characteristic subgroups are normal, (3) implies (2). Conversely, if $P \trianglelefteq G$,
we just proved P is the unique subgroup of G of order $p^\alpha$, hence P char G. Thus (2)
and (3) are equivalent.

Finally, assume (1) holds and suppose X is a subset of G such that |x| is a power
of p for all $x \in X$. By the conjugacy part of Sylow’s Theorem, for each $x \in X$ there
is some $g \in G$ such that $x \in gPg^{-1} = P$. Thus $X \subseteq P$, and so $\langle X \rangle \le P$, and $\langle X \rangle$
is a p-group. Conversely, if (4) holds, let X be the union of all Sylow p-subgroups of
G. If P is any Sylow p-subgroup, P is a subgroup of the p-group $\langle X \rangle$. Since P is a
p-subgroup of G of maximal order, we must have $P = \langle X \rangle$, so (1) holds.

Examples 

Let G be a finite group and let p be a prime. 

(1) If p does not divide the order of G, the Sylow p-subgroup of G is the trivial group
(and all parts of Sylow’s Theorem hold trivially). If $|G| = p^\alpha$, G is the unique Sylow
p-subgroup of G.

(2) A finite abelian group has a unique Sylow p-subgroup for each prime p. This subgroup
consists of all elements x whose order is a power of p. This is sometimes called the
p-primary component of the abelian group.

(3) $S_3$ has three Sylow 2-subgroups: $\langle (1\ 2) \rangle$, $\langle (2\ 3) \rangle$ and $\langle (1\ 3) \rangle$. It has a unique (hence
normal) Sylow 3-subgroup: $\langle (1\ 2\ 3) \rangle = A_3$. Note that $3 \equiv 1 \pmod{2}$.

(4) $A_4$ has a unique Sylow 2-subgroup: $\langle (1\ 2)(3\ 4), (1\ 3)(2\ 4) \rangle \cong V_4$. It has four Sylow
3-subgroups: $\langle (1\ 2\ 3) \rangle$, $\langle (1\ 2\ 4) \rangle$, $\langle (1\ 3\ 4) \rangle$ and $\langle (2\ 3\ 4) \rangle$. Note that $4 \equiv 1 \pmod{3}$.

(5) $S_4$ has $n_2 = 3$ and $n_3 = 4$. Since $S_4$ contains a subgroup isomorphic to $D_8$, every
Sylow 2-subgroup of $S_4$ is isomorphic to $D_8$.
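
The counts in Examples (3) to (5) can be reproduced by brute force. The following Python sketch is an illustration only, with helper names of our own. It relies on one assumption worth stating: every Sylow subgroup in these small groups can be generated by at most two elements, so it suffices to examine subgroups generated by pairs of elements and keep those of the right order.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p); permutations are tuples."""
    return tuple(p[q[i]] for i in range(len(q)))

def is_even(p):
    """A permutation is even exactly when it has an even number of inversions."""
    n = len(p)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
    return inversions % 2 == 0

def generated_subgroup(generators, n):
    """Closure of the generators under composition; in a finite group this is the
    subgroup they generate."""
    identity = tuple(range(n))
    elements = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in generators:
            y = compose(g, x)
            if y not in elements:
                elements.add(y)
                frontier.append(y)
    return frozenset(elements)

def subgroups_of_order(group, n, order):
    """Distinct subgroups of the given order that are generated by at most two elements."""
    found = set()
    for a in group:
        for b in group:
            H = generated_subgroup([a, b], n)
            if len(H) == order:
                found.add(H)
    return found

S3 = list(permutations(range(3)))
S4 = list(permutations(range(4)))
A4 = [p for p in S4 if is_even(p)]
```

For instance, `len(subgroups_of_order(S4, 4, 8))` counts the Sylow 2-subgroups of $S_4$ and `len(subgroups_of_order(A4, 4, 3))` counts the Sylow 3-subgroups of $A_4$.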

Applications of Sylow’s Theorem 

We now give some applications of Sylow’s Theorem. Most of the examples use Sylow’s 
Theorem to prove that a group of a particular order is not simple. After discussing 
methods of constructing larger groups from smaller ones (for example, the formation 
of semidirect products) we shall be able to use these results to classify groups of some 
specific orders n (as we already did for n = 15). 

Since Sylow’s Theorem ensures the existence of p-subgroups of a finite group, it 
is worthwhile to study groups of prime power order more closely. This will be done in 
Chapter 6 and many more applications of Sylow’s Theorem will be discussed there. 

For groups of small order, the congruence condition of Sylow’s Theorem alone
is often sufficient to force the existence of a normal subgroup. The first step in any
numerical application of Sylow’s Theorem is to factor the group order into prime powers.
The largest prime divisors of the group order tend to give the fewest possible values for
$n_p$ (for example, the congruence condition on $n_2$ gives no restriction whatsoever), which
limits the structure of the group G. In the following examples we shall see situations
where Sylow’s Theorem alone does not force the existence of a normal subgroup,
however some additional argument (often involving studying the elements of order p
for a number of different primes p) proves the existence of a normal Sylow subgroup.

Example: (Groups of order pq, p and q primes with p < q)

Suppose $|G| = pq$ for primes p and q with $p < q$. Let $P \in \operatorname{Syl}_p(G)$ and let $Q \in \operatorname{Syl}_q(G)$.
We show that Q is normal in G and if P is also normal in G, then G is cyclic.

Now the three conditions: $n_q = 1 + kq$ for some $k \ge 0$, $n_q$ divides p and $p < q$,
together force $k = 0$. Since $n_q = 1$, $Q \trianglelefteq G$.

Since $n_p$ divides the prime q, the only possibilities are $n_p = 1$ or q. In particular, if
$p \nmid q - 1$ (that is, if $q \not\equiv 1 \pmod{p}$), then $n_p$ cannot equal q, so $P \trianglelefteq G$.

Let $P = \langle x \rangle$ and $Q = \langle y \rangle$. If $P \trianglelefteq G$, then since $G/C_G(P)$ is isomorphic to a
subgroup of $\operatorname{Aut}(Z_p)$ and the latter group has order $p - 1$, Lagrange’s Theorem together
with the observation that neither p nor q can divide $p - 1$ implies that $G = C_G(P)$. In
this case $x \in P \le Z(G)$ so x and y commute. (Alternatively, this follows immediately
from Exercise 42 of Section 3.1.) This means $|xy| = pq$ (cf. the exercises in Section 2.3),
hence in this case G is cyclic: $G \cong Z_{pq}$.

If $p \mid q - 1$, we shall see in Chapter 5 that there is a unique non-abelian group of order
pq (in which, necessarily, $n_p = q$). We can prove the existence of this group now. Let Q be
a Sylow q-subgroup of the symmetric group of degree q, $S_q$. By Exercise 34 in Section 3,
$|N_{S_q}(Q)| = q(q - 1)$. By assumption, $p \mid q - 1$ so by Cauchy’s Theorem $N_{S_q}(Q)$ has a
subgroup, P, of order p. By Corollary 15 in Section 3.2, PQ is a group of order pq. Since
$C_{S_q}(Q) = Q$ (Example 2, Section 3), PQ is a non-abelian group. The essential ingredient
in the uniqueness proof of PQ is Proposition 17 on the cyclicity of $\operatorname{Aut}(Z_q)$.


Example: (Groups of order 30) 

Let G be a group of order 30. We show that G has a normal subgroup isomorphic to
$Z_{15}$. We shall use this information to classify groups of order 30 in the next chapter. Note
that any subgroup of order 15 is necessarily normal (since it is of index 2) and cyclic 
(by the preceding result) so it is only necessary to show there exists a subgroup of order 
15. The quickest way of doing this is to quote Exercise 13 in Section 2. We give an 
alternate argument which illustrates how Sylow’s Theorem can be used in conjunction 
with a counting of elements of prime order to produce a normal subgroup. 

Let $P \in \operatorname{Syl}_5(G)$ and let $Q \in \operatorname{Syl}_3(G)$. If either P or Q is normal in G, by Corollary
15, Chapter 3, PQ is a group of order 15. Note also that if either P or Q is normal, then
both P and Q are characteristic subgroups of PQ, and since $PQ \trianglelefteq G$, both P and Q are
normal in G (Exercise 8(a), Section 4). Assume therefore that neither Sylow subgroup is
normal. The only possibilities by Part 3 of Sylow’s Theorem are $n_5 = 6$ and $n_3 = 10$.
Each element of order 5 lies in a Sylow 5-subgroup, each Sylow 5-subgroup contains 4
nonidentity elements and, by Lagrange’s Theorem, distinct Sylow 5-subgroups intersect
in the identity. Thus the number of elements of order 5 in G is the number of nonidentity
elements in one Sylow 5-subgroup times the number of Sylow 5-subgroups. This would
be $4 \cdot 6 = 24$ elements of order 5. By similar reasoning, the number of elements of order 3
would be $2 \cdot 10 = 20$. This is absurd since a group of order 30 cannot contain $24 + 20 = 44$
distinct elements. One of P or Q (hence both) must be normal in G.
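
The counting step can be written out as a short calculation. The sketch below is illustrative and uses a function name of our own. It assumes, as in this example, that the Sylow p-subgroups have order exactly p, so that distinct ones meet only in the identity.

```python
def count_elements_of_order_p(p, n_p):
    """Number of elements of order p when the Sylow p-subgroups have order exactly p.

    Distinct subgroups of prime order p intersect trivially, so each of the n_p
    subgroups contributes p - 1 elements of order p that lie in no other.
    """
    return n_p * (p - 1)

order = 30
# Suppose neither Sylow subgroup is normal, so n_5 = 6 and n_3 = 10.
total = count_elements_of_order_p(5, 6) + count_elements_of_order_p(3, 10)
too_many = total > order
```

The total exceeds the group order, which is the contradiction obtained above.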

This sort of counting technique is frequently useful (cf. also Section 6.2) and works 
particularly well when the Sylow p-subgroups have order p (as in this example), since then 
the intersection of two distinct Sylow p-subgroups must be the identity. If the order of the 
Sylow p-subgroup is $p^\alpha$ with $\alpha \ge 2$, greater care is required in counting elements, since
in this case distinct Sylow p-subgroups may have many more elements in common, i.e.,
the intersection may be nontrivial.

Example: (Groups of order 12) 

Let G be a group of order 12. We show that either G has a normal Sylow 3-subgroup or
$G \cong A_4$ (in the latter case G has a normal Sylow 2-subgroup). We shall use this information
to classify groups of order 12 in the next chapter. 

Suppose n3 ^ 1 and let P € Syl-$(G). Since ni | 4 and = l(mod 3), it follows that 
«3 = 4. Since distinct Sylow 3-subgroups intersect in the identity and each contains two 
elements of order 3, G contains 2-4 = 8 elements of order 3. Since \G : Nc(P)\ = ni = 4, 
Nc(P) = P. Now G acts by conjugation on its four Sylow 3-subgroups, so this action 
affords a permutation representation 


<p ; G — ► S 4 

(note that we could also act by left multiplication ontheleft cosets of P and use Theorem 3). 
The kernel K of this action is the subgroup of G which normalizes all Sylow 3-subgroups 
of G. In particular, K < Nc(P) = P- Since P is not normal in G by assumption, K = 1, 
i.e., <p is injective and 

G = <p(G) < £4. 

Since G contains 8 elements of order 3 and there are precisely 8 elements of order 3 in 
$S_4$, all contained in $A_4$, it follows that $\varphi(G)$ intersects $A_4$ in a subgroup of order at least 8. 
Since both groups have order 12 it follows that $\varphi(G) = A_4$, so that $G \cong A_4$. 

Note that $A_4$ does indeed have 4 Sylow 3-subgroups (see Example 4 following Corollary 20), 
so that such a group G does exist. Also, let V be a Sylow 2-subgroup of $A_4$. 
Since $|V| = 4$, it contains all of the remaining elements of $A_4$. In particular, there cannot 
be another Sylow 2-subgroup. Thus $n_2(A_4) = 1$, i.e., $V \trianglelefteq A_4$ (which one can also see 
directly because V is the identity together with the three elements of $S_4$ which are products 
of two disjoint transpositions, that is, V is a union of conjugacy classes). 
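
The facts about $A_4$ used in this example are small enough to confirm by brute force. The sketch below is an illustration under our own conventions (permutations of {0, 1, 2, 3} stored as tuples, and helper names `compose`, `inverse`, `order` that are not from the text); it counts the elements of order 3, the Sylow 3-subgroups, and checks that V is normal.

```python
from itertools import permutations

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

def inverse(s):
    inv = [0] * len(s)
    for i, image in enumerate(s):
        inv[image] = i
    return tuple(inv)

def is_even(s):
    inversions = sum(s[i] > s[j] for i in range(len(s)) for j in range(i + 1, len(s)))
    return inversions % 2 == 0

def order(s):
    identity = tuple(range(len(s)))
    k, power = 1, s
    while power != identity:
        power = compose(power, s)
        k += 1
    return k

identity = (0, 1, 2, 3)
A4 = [s for s in permutations(range(4)) if is_even(s)]
order3 = [s for s in A4 if order(s) == 3]
# Each Sylow 3-subgroup is {1, s, s^2} for an element s of order 3.
sylow3 = {frozenset({identity, s, compose(s, s)}) for s in order3}
# V consists of the identity and the products of two disjoint transpositions.
V = [s for s in A4 if order(s) <= 2]
V_is_normal = all(compose(compose(g, v), inverse(g)) in V for g in A4 for v in V)

print(len(A4), len(order3), len(sylow3), len(V), V_is_normal)   # 12 8 4 4 True
```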

Example: (Groups of order $p^2 q$, p and q distinct primes) 

Let G be a group of order $p^2 q$. We show that G has a normal Sylow subgroup (for either 
p or q). We shall use this information to classify some groups of this order in the next 
chapter (cf. Exercises 8 to 12 of Section 5.5). Let $P \in Syl_p(G)$ and let $Q \in Syl_q(G)$. 

Consider first the case $p > q$. Since $n_p \mid q$ and $n_p = 1 + kp$, we must have $n_p = 1$. 
Thus $P \trianglelefteq G$. 

Consider now the case $p < q$. If $n_q = 1$, Q is normal in G. Assume therefore that 
$n_q > 1$, i.e., $n_q = 1 + tq$, for some $t > 0$. Now $n_q$ divides $p^2$ so $n_q = p$ or $p^2$. Since 
$q > p$ we cannot have $n_q = p$, hence $n_q = p^2$. Thus 

$$tq = p^2 - 1 = (p - 1)(p + 1).$$

Since q is prime, either $q \mid p - 1$ or $q \mid p + 1$. The former is impossible since $q > p$ so 
the latter holds. Since $q > p$ but $q \mid p + 1$, we must have $q = p + 1$. This forces $p = 2$, 
$q = 3$ and $|G| = 12$. The result now follows from the preceding example. 
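
As a numerical sanity check of this case analysis (not a substitute for the proof), the following Python sketch searches small primes. The helpers `sylow_counts` and `primes_below` and the bound 60 are our own choices; the search uses only the divisibility and congruence conditions of Sylow's Theorem.

```python
def sylow_counts(order, p):
    """Values of n_p permitted by Sylow's Theorem for a group of the given order."""
    m = order
    while m % p == 0:
        m //= p
    return [d for d in range(1, m + 1) if m % d == 0 and d % p == 1]

def primes_below(bound):
    return [n for n in range(2, bound)
            if all(n % d for d in range(2, int(n ** 0.5) + 1))]

primes = primes_below(60)

# Case p > q: the numerical conditions alone force n_p = 1.
p_forced_normal = all(sylow_counts(p * p * q, p) == [1]
                      for p in primes for q in primes if p > q)

# Case p < q: list the pairs for which n_q = 1 is NOT forced numerically.
exceptions = [(p, q) for p in primes for q in primes
              if p < q and sylow_counts(p * p * q, q) != [1]]

print(p_forced_normal, exceptions)   # True [(2, 3)]
```

The only pair that survives is (p, q) = (2, 3), i.e., order 12, which is exactly the case handled by the preceding example.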

Groups of Order 60 

We illustrate how Sylow’s Theorems can be used to unravel the structure of groups of 
a given order even if some groups of that order may be simple. Note the technique of 
changing from one prime to another and the inductive process where we use results on 
groups of order < 60 to study groups of order 60. 

Proposition 21. If $|G| = 60$ and G has more than one Sylow 5-subgroup, then G is 
simple. 

Proof: Suppose by way of contradiction that $|G| = 60$ and $n_5 > 1$ but that there 
exists H a normal subgroup of G with $H \ne 1$ or G. By Sylow’s Theorem the only 
possibility for $n_5$ is 6. Let $P \in Syl_5(G)$, so that $|N_G(P)| = 10$ since its index is $n_5$. 

If $5 \mid |H|$ then H contains a Sylow 5-subgroup of G and since H is normal, it 
contains all 6 conjugates of this subgroup. In particular, $|H| \ge 1 + 6 \cdot 4 = 25$, and the 
only possibility is $|H| = 30$. This leads to a contradiction since a previous example 
proved that any group of order 30 has a normal (hence unique) Sylow 5-subgroup. This 
argument shows 5 does not divide $|H|$ for any proper normal subgroup H of G. 

If $|H| = 6$ or 12, H has a normal, hence characteristic, Sylow subgroup, which is 
therefore also normal in G. Replacing H by this subgroup if necessary, we may assume 
$|H| = 2$, 3 or 4. Let $\bar{G} = G/H$, so $|\bar{G}| = 30$, 20 or 15. In each case, $\bar{G}$ has a normal 
subgroup $\bar{P}$ of order 5 by previous results. If we let $H_1$ be the complete preimage of 
$\bar{P}$ in G, then $H_1 \trianglelefteq G$, $H_1 \ne G$ and $5 \mid |H_1|$. This contradicts the preceding paragraph 
and so completes the proof. 

Corollary 22. $A_5$ is simple. 

Proof: The subgroups $\langle (1\ 2\ 3\ 4\ 5) \rangle$ and $\langle (1\ 3\ 2\ 4\ 5) \rangle$ are distinct Sylow 5-subgroups 
of $A_5$ so the result follows immediately from the proposition. 
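
The hypothesis of Proposition 21 can be confirmed for $A_5$ by direct enumeration. The sketch below is illustrative only; it stores permutations of {0, ..., 4} as tuples (so the cycles named in the proof are shifted down by one), and the helper names are our own.

```python
from itertools import permutations

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

def is_even(s):
    inversions = sum(s[i] > s[j] for i in range(len(s)) for j in range(i + 1, len(s)))
    return inversions % 2 == 0

identity = (0, 1, 2, 3, 4)

def cyclic_subgroup(s):
    """The set of powers of s."""
    elements, power = {identity}, s
    while power != identity:
        elements.add(power)
        power = compose(power, s)
    return frozenset(elements)

A5 = [s for s in permutations(range(5)) if is_even(s)]
order5 = [s for s in A5 if len(cyclic_subgroup(s)) == 5]
sylow5 = {cyclic_subgroup(s) for s in order5}

# The two 5-cycles from the proof, written 0-indexed: (0 1 2 3 4) and (0 2 1 3 4).
s = (1, 2, 3, 4, 0)
t = (2, 3, 1, 4, 0)
distinct = cyclic_subgroup(s) != cyclic_subgroup(t)

print(len(A5), len(order5), len(sylow5), distinct)   # 60 24 6 True
```

The count of 24 elements of order 5 in 6 subgroups matches $n_5 = 6$.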

The next proposition shows that there is a unique simple group of order 60. 

Proposition 23. If G is a simple group of order 60, then $G \cong A_5$. 

Proof: Let G be a simple group of order 60, so $n_2 = 3$, 5 or 15. Let $P \in Syl_2(G)$ 
and let $N = N_G(P)$, so $|G : N| = n_2$. 

First observe that G has no proper subgroup H of index less than 5, as follows: if 
H were a subgroup of G of index 4, 3 or 2, then, by Theorem 3, G would have a normal 
subgroup K contained in H with G/K isomorphic to a subgroup of $S_4$, $S_3$ or $S_2$. Since 
$K \ne G$, simplicity forces $K = 1$. This is impossible since 60 (= |G|) does not divide 
4!. This argument shows, in particular, that $n_2 \ne 3$. 

If $n_2 = 5$, then N has index 5 in G so the action of G by left multiplication on 
the set of left cosets of N gives a permutation representation of G into $S_5$. Since (as 
above) the kernel of this representation is a proper normal subgroup and G is simple, 
the kernel is 1 and G is isomorphic to a subgroup of $S_5$. Identify G with this isomorphic 
copy so that we may assume $G \le S_5$. If G is not contained in $A_5$, then $S_5 = G A_5$ 
and, by the Second Isomorphism Theorem, $A_5 \cap G$ is of index 2 in G. Since G has no 
(normal) subgroup of index 2, this is a contradiction. This argument proves $G \le A_5$. 
Since $|G| = |A_5|$, the isomorphic copy of G in $S_5$ coincides with $A_5$, as desired. 

Finally, assume $n_2 = 15$. If for every pair of distinct Sylow 2-subgroups P and Q 
of G, $P \cap Q = 1$, then the number of nonidentity elements in Sylow 2-subgroups of G 
would be $(4 - 1) \cdot 15 = 45$. But $n_5 = 6$ so the number of elements of order 5 in G is 
$(5 - 1) \cdot 6 = 24$, accounting for 69 elements. This contradiction proves that there exist 
distinct Sylow 2-subgroups P and Q with $|P \cap Q| = 2$. Let $M = N_G(P \cap Q)$. Since 
P and Q are abelian (being groups of order 4), P and Q are subgroups of M and since 
G is simple, $M \ne G$. Thus 4 divides $|M|$ and $|M| > 4$ (otherwise, P = M = Q). The 
only possibility is $|M| = 12$, i.e., M has index 5 in G (recall M cannot have index 3 
or 1). But now the argument of the preceding paragraph applied to M in place of N 
gives $G \cong A_5$. This leads to a contradiction in this case because $n_2(A_5) = 5$ (cf. the 
exercises). The proof is complete. 
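
The value $n_2(A_5) = 5$ quoted at the end of the proof is left to the exercises; it can at least be confirmed computationally. The following sketch is an illustration with our own helper names. It relies on the fact that a subgroup of order 4 all of whose nonidentity elements have order 2 is generated by two commuting involutions, and $A_5$ has no elements of order 4.

```python
from itertools import permutations

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

def is_even(s):
    inversions = sum(s[i] > s[j] for i in range(len(s)) for j in range(i + 1, len(s)))
    return inversions % 2 == 0

identity = (0, 1, 2, 3, 4)
A5 = [s for s in permutations(range(5)) if is_even(s)]
involutions = [s for s in A5 if s != identity and compose(s, s) == identity]
has_order_4 = any(compose(s, s) != identity and
                  compose(compose(s, s), compose(s, s)) == identity for s in A5)

# Every pair of distinct commuting involutions a, b spans {1, a, b, ab}.
sylow2 = set()
for a in involutions:
    for b in involutions:
        if a != b and compose(a, b) == compose(b, a):
            sylow2.add(frozenset({identity, a, b, compose(a, b)}))

print(len(involutions), has_order_4, len(sylow2))   # 15 False 5
```

The 15 involutions fall into 5 subgroups of order 4 with 3 involutions each, so these subgroups intersect pairwise in the identity.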


EXERCISES 

Let G be a finite group and let p be a prime. 

1. Prove that if $P \in Syl_p(G)$ and H is a subgroup of G containing P then $P \in Syl_p(H)$. 
Give an example to show that, in general, a Sylow p-subgroup of a subgroup of G need 
not be a Sylow p-subgroup of G. 

2. Prove that if H is a subgroup of G and $Q \in Syl_p(H)$ then $gQg^{-1} \in Syl_p(gHg^{-1})$ for all 
$g \in G$. 

3. Use Sylow’s Theorem to prove Cauchy’s Theorem. (Note that we only used Cauchy’s 
Theorem for abelian groups — Proposition 3.21 — in the proof of Sylow’s Theorem so 
this line of reasoning is not circular.) 

4. Exhibit all Sylow 2-subgroups and Sylow 3-subgroups of $D_{12}$ and $S_3 \times S_3$. 

5. Show that a Sylow p-subgroup of $D_{2n}$ is cyclic and normal for every odd prime p. 

6. Exhibit all Sylow 3-subgroups of $A_4$ and all Sylow 3-subgroups of $S_4$. 

7. Exhibit all Sylow 2-subgroups of $S_4$ and find elements of $S_4$ which conjugate one of these 
into each of the others. 

8. Exhibit two distinct Sylow 2-subgroups of $S_5$ and an element of $S_5$ that conjugates one 
into the other. 

9. Exhibit all Sylow 3-subgroups of $SL_2(\mathbb{F}_3)$ (cf. Exercise 9, Section 2.1). 

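10. Prove that the subgroup of $SL_2(\mathbb{F}_3)$ generated by $\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ and $\begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}$ is the unique 
Sylow 2-subgroup of $SL_2(\mathbb{F}_3)$ (cf. Exercise 10, Section 2.4). 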

11. Show that the center of $SL_2(\mathbb{F}_3)$ is the group of order 2 consisting of $\pm I$, where I is the 
identity matrix. Prove that $SL_2(\mathbb{F}_3)/Z(SL_2(\mathbb{F}_3)) \cong A_4$. [Use facts about groups of order 
12.] 

12. Let $2n = 2^a k$ where k is odd. Prove that the number of Sylow 2-subgroups of $D_{2n}$ is k. 
[Prove that if $P \in Syl_2(D_{2n})$ then $N_{D_{2n}}(P) = P$.] 

13. Prove that a group of order 56 has a normal Sylow p-subgroup for some prime p dividing 
its order. 

14. Prove that a group of order 312 has a normal Sylow p-subgroup for some prime p dividing 
its order. 

15. Prove that a group of order 351 has a normal Sylow p-subgroup for some prime p dividing 
its order. 

16. Let $|G| = pqr$, where p, q and r are primes with $p < q < r$. Prove that G has a normal 
Sylow subgroup for either p, q or r. 

17. Prove that if $|G| = 105$ then G has a normal Sylow 5-subgroup and a normal Sylow 
7-subgroup. 

18. Prove that a group of order 200 has a normal Sylow 5-subgroup. 

19. Prove that if $|G| = 6545$ then G is not simple. 

20. Prove that if $|G| = 1365$ then G is not simple. 

21. Prove that if $|G| = 2907$ then G is not simple. 

22. Prove that if $|G| = 132$ then G is not simple. 

23. Prove that if $|G| = 462$ then G is not simple. 

24. Prove that if G is a group of order 231 then Z(G) contains a Sylow 11-subgroup of G and 
a Sylow 7-subgroup is normal in G. 

25. Prove that if G is a group of order 385 then Z(G) contains a Sylow 7-subgroup of G and 
a Sylow 11-subgroup is normal in G. 

26. Let G be a group of order 105. Prove that if a Sylow 3-subgroup of G is normal then G is 
abelian. 

27. Let G be a group of order 315 which has a normal Sylow 3-subgroup. Prove that Z(G) 
contains a Sylow 3-subgroup of G and deduce that G is abelian. 

28. Let G be a group of order 1575. Prove that if a Sylow 3-subgroup of G is normal then a 
Sylow 5-subgroup and a Sylow 7-subgroup are normal. In this situation prove that G is 
abelian. 

29. If G is a non-abelian simple group of order < 100, prove that $G \cong A_5$. [Eliminate all 
orders but 60.] 

30. How many elements of order 7 must there be in a simple group of order 168? 

31. For p = 2, 3 and 5 find $n_p(A_5)$ and $n_p(S_5)$. [Note that $A_4 \le A_5$.] 

32. Let P be a Sylow p-subgroup of H and let H be a subgroup of K. If $P \trianglelefteq H$ and 
$H \trianglelefteq K$, prove that P is normal in K. Deduce that if $P \in Syl_p(G)$ and $H = N_G(P)$, then 
$N_G(H) = H$ (in words: normalizers of Sylow p-subgroups are self-normalizing). 

33. Let P be a normal Sylow p-subgroup of G and let H be any subgroup of G. Prove that 
$P \cap H$ is the unique Sylow p-subgroup of H. 

34. Let $P \in Syl_p(G)$ and assume $N \trianglelefteq G$. Use the conjugacy part of Sylow’s Theorem to 
prove that $P \cap N$ is a Sylow p-subgroup of N. Deduce that PN/N is a Sylow p-subgroup 
of G/N (note that this may also be done by the Second Isomorphism Theorem; cf. 
Exercise 9, Section 3.3). 

35. Let $P \in Syl_p(G)$ and let $H \le G$. Prove that $gPg^{-1} \cap H$ is a Sylow p-subgroup of H 
for some $g \in G$. Give an explicit example showing that $hPh^{-1} \cap H$ is not necessarily a 
Sylow p-subgroup of H for any $h \in H$ (in particular, we cannot always take g = 1 in the 
first part of this problem, as we could when H was normal in G). 

36. Prove that if N is a normal subgroup of G then $n_p(G/N) \le n_p(G)$. 

37. Let R be a normal p-subgroup of G (not necessarily a Sylow subgroup). 

(a) Prove that R is contained in every Sylow p-subgroup of G. 

(b) If S is another normal p-subgroup of G, prove that RS is also a normal p-subgroup 
of G. 

(c) The subgroup $O_p(G)$ is defined to be the group generated by all normal p-subgroups 
of G. Prove that $O_p(G)$ is the unique largest normal p-subgroup of G and $O_p(G)$ 
equals the intersection of all Sylow p-subgroups of G. 

(d) Let $\bar{G} = G/O_p(G)$. Prove that $O_p(\bar{G}) = 1$ (i.e., $\bar{G}$ has no nontrivial normal 
p-subgroup). 

38. Use the method of proof in Sylow’s Theorem to show that if $n_p$ is not congruent to 
$1 \pmod{p^2}$ then there are distinct Sylow p-subgroups P and Q of G such that 

$$|P : P \cap Q| = |Q : P \cap Q| = p.$$

39. Show that the subgroup of strictly upper triangular matrices in $GL_n(\mathbb{F}_p)$ (cf. Exercise 17, 
Section 2.1) is a Sylow p-subgroup of this finite group. [Use the order formula in Section 
1.4 to find the order of a Sylow p-subgroup of $GL_n(\mathbb{F}_p)$.] 

40. Prove that the number of Sylow p-subgroups of $GL_2(\mathbb{F}_p)$ is $p + 1$. [Exhibit two distinct 
Sylow p-subgroups.] 

41. Prove that $SL_2(\mathbb{F}_4) \cong A_5$ (cf. Exercise 9, Section 2.1 for the definition of $SL_2(\mathbb{F}_4)$). 

42. Prove that the group of rigid motions in $\mathbb{R}^3$ of an icosahedron is isomorphic to $A_5$. [Recall 
that the order of this group is 60: Exercise 13, Section 1.2.] 

43. Prove that the group of rigid motions in $\mathbb{R}^3$ of a dodecahedron is isomorphic to $A_5$. (As 
with the cube and the tetrahedron, the icosahedron and the dodecahedron are dual solids.) 
[Recall that the order of this group is 60: Exercise 12, Section 1.2.] 

44. Let p be the smallest prime dividing the order of the finite group G. If $P \in Syl_p(G)$ and 
P is cyclic prove that $N_G(P) = C_G(P)$. 

45. Find generators for a Sylow p-subgroup of $S_{2p}$, where p is an odd prime. Show that this 
is an abelian group of order $p^2$. 

46. Find generators for a Sylow p-subgroup of $S_{p^2}$, where p is a prime. Show that this is a 
non-abelian group of order $p^{p+1}$. 

47. Write and execute a computer program which 

(i) gives each odd number n < 10,000 that is not a power of a prime and that has some 
prime divisor p such that $n_p$ is not forced to be 1 for all groups of order n by the 
congruence condition of Sylow’s Theorem, and 

(ii) gives for each n in (i) the factorization of n into prime powers and gives the list of all 
permissible values of $n_p$ for all primes p dividing n (i.e., those values not ruled out 
by Part 3 of Sylow’s Theorem). 

48. Carry out the same process as in the preceding exercise for all even numbers less than 
1000. Explain the relative lengths of the lists versus the number of integers tested. 

49. Prove that if $|G| = 2^n m$ where m is odd and G has a cyclic Sylow 2-subgroup then G has 
a normal subgroup of order m. [Use induction and Exercises 11 and 12 in Section 2.] 

50. Prove that if U and W are normal subsets of a Sylow p-subgroup P of G then U is conjugate 
to W in G if and only if U is conjugate to W in $N_G(P)$. Deduce that two elements in the 
center of P are conjugate in G if and only if they are conjugate in $N_G(P)$. (A subset U of 
P is normal in P if $N_P(U) = P$.) 


51. Let P be a Sylow p-subgroup of G and let M be any subgroup of G which contains $N_G(P)$. 
Prove that $|G : M| \equiv 1 \pmod p$. 

The following sequence of exercises leads to the classification of all numbers n with the property 
that every group of order n is cyclic (for example, n = 15 is such an integer). These arguments 
are a vastly simplified prototype for the proof that every group of odd order is solvable in the 
sense that they use the structure (commutativity) of the proper subgroups and their embedding 
in the whole group (we shall see that distinct maximal subgroups intersect in the identity) to 
obtain a contradiction by counting arguments. In the proof that groups of odd order are solvable 
one uses induction to reduce to the situation in which a minimal counterexample is a simple 
group — but here every proper subgroup is solvable (not abelian as in our situation). The 
analysis of the structure and embedding of the maximal subgroups in this situation is much 
more complicated and the counting arguments are (roughly speaking) replaced by character 
theory arguments (as will be discussed in Part VI). 

52. Suppose G is a finite simple group in which every proper subgroup is abelian. If M and 
N are distinct maximal subgroups of G prove $M \cap N = 1$. [See Exercise 23 in Section 3.] 

53. Use the preceding exercise to prove that if G is any non-abelian group in which every proper 
subgroup is abelian then G is not simple. [Let G be a counterexample to this assertion and 
use Exercise 24 in Section 3 to show that G has more than one conjugacy class of maximal 
subgroups. Use the method of Exercise 23 in Section 3 to count the elements which lie in 
all conjugates of M and N, where M and N are nonconjugate maximal subgroups of G; 
show that this gives more than $|G|$ elements.] 

54. Prove the following classification: if G is a finite group of order $p_1 p_2 \cdots p_r$ where the 
$p_i$’s are distinct primes such that $p_i$ does not divide $p_j - 1$ for all i and j, then G is 
cyclic. [By induction, every proper subgroup of G is cyclic, so G is not simple by the 
preceding exercise. If N is a nontrivial proper normal subgroup, N is cyclic and G/N acts 
as automorphisms of N. Use Proposition 16 to show that $N \le Z(G)$ and use induction to 
show G/Z(G) is cyclic, hence G is abelian by Exercise 36 of Section 3.1.] 

55. Prove the converse to the preceding exercise: if $n \ge 2$ is an integer such that every group 
of order n is cyclic, then $n = p_1 p_2 \cdots p_r$ is a product of distinct primes and $p_i$ does not 
divide $p_j - 1$ for all i, j. [If n is not of this form, construct noncyclic groups of order n 
using direct products of noncyclic groups of order $p^2$ and $pq$, where $p \mid q - 1$.] 

56. If G is a finite group in which every proper subgroup is abelian, show that G is solvable. 


4.6 THE SIMPLICITY OF $A_n$ 

There are a number of proofs of the simplicity of $A_n$, $n \ge 5$. The most elementary 
involves showing $A_n$ is generated by 3-cycles. Then one shows that a normal subgroup 
must contain one 3-cycle hence must contain all the 3-cycles so cannot be a proper 
subgroup. We include a less computational approach. 

Note that $A_3$ is an abelian simple group and that $A_4$ is not simple ($n_2(A_4) = 1$). 

Theorem 24. $A_n$ is simple for all $n \ge 5$. 

Proof: By induction on n. The result has already been established for n = 5, 
so assume $n \ge 6$ and let $G = A_n$. Assume there exists $H \trianglelefteq G$ with $H \ne 1$ or G. 
For each $i \in \{1, 2, \dots, n\}$ let $G_i$ be the stabilizer of i in the natural action of G on 
$\{1, 2, \dots, n\}$. Thus $G_i \le G$ and $G_i \cong A_{n-1}$. By induction, $G_i$ is simple for 
$1 \le i \le n$. 

Suppose first that there is some $\tau \in H$ with $\tau \ne 1$ but $\tau(i) = i$ for some 
$i \in \{1, 2, \dots, n\}$. Since $\tau \in H \cap G_i$ and $H \cap G_i \trianglelefteq G_i$, by the simplicity of $G_i$ 
we must have $H \cap G_i = G_i$, that is 

$$G_i \le H.$$

By Exercise 2 of Section 1, $\sigma G_i \sigma^{-1} = G_{\sigma(i)}$, so for all i, $\sigma G_i \sigma^{-1} \le \sigma H \sigma^{-1} = H$. 
Thus 

$$G_j \le H, \quad \text{for all } j \in \{1, 2, \dots, n\}.$$

Any $\lambda \in A_n$ may be written as a product of an even number, 2t, of transpositions, so 

$$\lambda = \lambda_1 \lambda_2 \cdots \lambda_t$$

where $\lambda_k$ is a product of two transpositions. Since $n > 4$ each $\lambda_k \in G_j$, for some j, 
hence 

$$G = \langle G_1, G_2, \dots, G_n \rangle \le H,$$

which is a contradiction. Therefore if $\tau \ne 1$ is an element of H then $\tau(i) \ne i$ for all 
$i \in \{1, 2, \dots, n\}$, i.e., no nonidentity element of H fixes any element of $\{1, 2, \dots, n\}$. 
It follows that if $\tau_1$, $\tau_2$ are elements of H with 

$$\tau_1(i) = \tau_2(i) \text{ for some } i, \text{ then } \tau_1 = \tau_2 \qquad (4.2)$$

since then $\tau_2^{-1} \tau_1(i) = i$. 

Suppose there exists a $\tau \in H$ such that the cycle decomposition of $\tau$ contains a 
cycle of length $\ge 3$, say 

$$\tau = (a_1\ a_2\ a_3\ \dots)(b_1\ b_2\ \dots) \cdots.$$

Let $\sigma \in G$ be an element with $\sigma(a_1) = a_1$, $\sigma(a_2) = a_2$ but $\sigma(a_3) \ne a_3$ (note that such 
a $\sigma$ exists in $A_n$ since $n \ge 5$). By Proposition 10 

$$\tau_1 = \sigma \tau \sigma^{-1} = (a_1\ a_2\ \sigma(a_3)\ \dots)(\sigma(b_1)\ \sigma(b_2)\ \dots) \cdots$$

so $\tau$ and $\tau_1$ are distinct elements of H with $\tau(a_1) = \tau_1(a_1) = a_2$, contrary to (4.2). This 
proves that only 2-cycles can appear in the cycle decomposition of nonidentity elements 
of H. 

Let $\tau \in H$ with $\tau \ne 1$, so that 

$$\tau = (a_1\ a_2)(a_3\ a_4)(a_5\ a_6) \cdots$$

(note that $n \ge 6$ is used here). Let $\sigma = (a_1\ a_2)(a_3\ a_5) \in G$. Then 

$$\tau_1 = \sigma \tau \sigma^{-1} = (a_1\ a_2)(a_5\ a_4)(a_3\ a_6) \cdots,$$

hence $\tau$ and $\tau_1$ are distinct elements of H with $\tau(a_1) = \tau_1(a_1) = a_2$, again contrary to 
(4.2). This completes the proof of the simplicity of $A_n$. 
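
For small n the statement of Theorem 24 can be tested by brute force, which also shows why the hypothesis $n \ge 5$ is needed. The sketch below is an illustration and is not the method of the proof: it uses the equivalent criterion that a nontrivial group is simple exactly when the normal closure of every nonidentity element is the whole group. All function names are our own.

```python
from itertools import permutations

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

def inverse(s):
    inv = [0] * len(s)
    for i, image in enumerate(s):
        inv[image] = i
    return tuple(inv)

def is_even(s):
    inversions = sum(s[i] > s[j] for i in range(len(s)) for j in range(i + 1, len(s)))
    return inversions % 2 == 0

def alternating_group(n):
    return [s for s in permutations(range(n)) if is_even(s)]

def generated_subgroup(generators, identity):
    """Closure of the generators under multiplication (sufficient in a finite group)."""
    subgroup, frontier = {identity}, [identity]
    while frontier:
        h = frontier.pop()
        for g in generators:
            product = compose(h, g)
            if product not in subgroup:
                subgroup.add(product)
                frontier.append(product)
    return subgroup

def is_simple(group):
    """True if the normal closure of every nonidentity element is the whole group."""
    identity = tuple(range(len(group[0])))
    seen = {identity}
    for x in group:
        if x in seen:
            continue
        conjugacy_class = {compose(compose(g, x), inverse(g)) for g in group}
        seen |= conjugacy_class            # one test per conjugacy class suffices
        if len(generated_subgroup(conjugacy_class, identity)) != len(group):
            return False
    return True

results = {n: is_simple(alternating_group(n)) for n in (3, 4, 5, 6)}
print(results)   # {3: True, 4: False, 5: True, 6: True}
```

The failure at n = 4 comes from the normal subgroup of order 4 noted before the theorem.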
EXERCISES 


Let G be a group and let $\Omega$ be an infinite set. 

1. Prove that $A_n$ does not have a proper subgroup of index < n for all $n \ge 5$. 

2. Find all normal subgroups of $S_n$ for all $n \ge 5$. 

3. Prove that $A_n$ is the only proper subgroup of index < n in $S_n$ for all $n \ge 5$. 

4. Prove that $A_n$ is generated by the set of all 3-cycles for each $n \ge 3$. 

5. Prove that if there exists a chain of subgroups $G_1 \le G_2 \le \cdots \le G$ such that $G = \bigcup_{i=1}^{\infty} G_i$ 
and each $G_i$ is simple then G is simple. 

6. Let D be the subgroup of $S_\Omega$ consisting of permutations which move only a finite number 
of elements of $\Omega$ (described in Exercise 17 in Section 3) and let A be the set of all elements 
$\sigma \in D$ such that $\sigma$ acts as an even permutation on the (finite) set of points it moves. Prove 
that A is an infinite simple group. [Show that every pair of elements of D lie in a finite 
simple subgroup of D.] 

7. Under the notation of the preceding exercise prove that if $H \trianglelefteq S_\Omega$ and $H \ne 1$ then 
$A \le H$, i.e., A is the unique (nontrivial) minimal normal subgroup of $S_\Omega$. 

8. Under the notation of the preceding two exercises prove that $|D| = |A| = |\Omega|$. Deduce 
that 

if $S_\Omega \cong S_\Delta$ then $|\Omega| = |\Delta|$. 

[Use the fact that D is generated by transpositions. You may assume that countable unions 
and finite direct products of sets of cardinality $|\Omega|$ also have cardinality $|\Omega|$.] 
CHAPTER 5 


Direct and Semidirect Products 
and Abelian Groups 


In this chapter we consider two of the easier methods for constructing larger groups 
from smaller ones, namely the notions of direct and semidirect products. This allows 
us to state the Fundamental Theorem on Finitely Generated Abelian Groups, which in 
particular completely classifies all finite abelian groups. 

5.1 DIRECT PRODUCTS 

We begin with the definition of the direct product of a finite and of a countable number 
of groups (the direct product of an arbitrary collection of groups is considered in the 
exercises). 


Definition. 

(1) The direct product $G_1 \times G_2 \times \cdots \times G_n$ of the groups $G_1, G_2, \dots, G_n$ with 
operations $\star_1, \star_2, \dots, \star_n$, respectively, is the set of n-tuples $(g_1, g_2, \dots, g_n)$ 
where $g_i \in G_i$ with operation defined componentwise: 

$$(g_1, g_2, \dots, g_n) \star (h_1, h_2, \dots, h_n) = (g_1 \star_1 h_1, g_2 \star_2 h_2, \dots, g_n \star_n h_n).$$

(2) Similarly, the direct product $G_1 \times G_2 \times \cdots$ of the groups $G_1, G_2, \dots$ with 
operations $\star_1, \star_2, \dots$, respectively, is the set of sequences $(g_1, g_2, \dots)$ where 
$g_i \in G_i$ with operation defined componentwise: 

$$(g_1, g_2, \dots) \star (h_1, h_2, \dots) = (g_1 \star_1 h_1, g_2 \star_2 h_2, \dots).$$


Although the operations may be different in each of the factors of a direct product, 
we shall, as usual, write all abstract groups multiplicatively, so that the operation in (1) 
above, for example, becomes simply 

$$(g_1, g_2, \dots, g_n)(h_1, h_2, \dots, h_n) = (g_1 h_1, g_2 h_2, \dots, g_n h_n).$$
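
The componentwise definition translates directly into code. The following Python sketch is an illustration under our own representation, not part of the text: a finite group is modelled as a pair (list of elements, binary operation), and the function name `direct_product` is hypothetical.

```python
from itertools import product

def direct_product(*groups):
    """Each group is a pair (elements, operation). Returns the same kind of pair
    for the direct product, with the operation applied componentwise."""
    elements = list(product(*(elems for elems, _ in groups)))
    operations = [op for _, op in groups]

    def componentwise(g, h):
        return tuple(op(a, b) for op, a, b in zip(operations, g, h))

    return elements, componentwise

# Three factors with different operations.
Z2 = ([0, 1], lambda a, b: (a + b) % 2)          # integers mod 2 under addition
Z3 = ([0, 1, 2], lambda a, b: (a + b) % 3)       # integers mod 3 under addition
U5 = ([1, 2, 3, 4], lambda a, b: (a * b) % 5)    # nonzero residues mod 5 under multiplication

elements, multiply = direct_product(Z2, Z3, U5)
print(len(elements))                      # 24
print(multiply((1, 2, 3), (1, 2, 4)))     # (0, 1, 2)
```

Each coordinate is combined using the operation of its own factor, which is all the definition requires.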


Examples 

(1) Suppose $G_i = \mathbb{R}$ (operation addition) for $i = 1, 2, \dots, n$. Then $\mathbb{R} \times \mathbb{R} \times \cdots \times \mathbb{R}$ 
(n-factors) is the familiar Euclidean n-space $\mathbb{R}^n$ with usual vector addition: 

$$(a_1, a_2, \dots, a_n) + (b_1, b_2, \dots, b_n) = (a_1 + b_1, a_2 + b_2, \dots, a_n + b_n).$$


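(2) To illustrate that groups forming the direct product (and corresponding operations) 
may be completely general, let $G_1 = \mathbb{Z}$, let $G_2 = S_3$ and let $G_3 = GL_2(\mathbb{R})$, where the 
group operations are addition, composition, and matrix multiplication, respectively. 
Then the operation in $G_1 \times G_2 \times G_3$ is defined by 

$$\left(n, \sigma, \begin{pmatrix} a & b \\ c & d \end{pmatrix}\right)\left(m, \tau, \begin{pmatrix} p & q \\ r & s \end{pmatrix}\right) = \left(n + m,\ \sigma \circ \tau,\ \begin{pmatrix} ap + br & aq + bs \\ cp + dr & cq + ds \end{pmatrix}\right).$$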

Proposition 1. If $G_1, \dots, G_n$ are groups, their direct product is a group of order 
$|G_1| |G_2| \cdots |G_n|$ (if any $G_i$ is infinite, so is the direct product). 

Proof: Let $G = G_1 \times G_2 \times \cdots \times G_n$. The proof that the group axioms hold 
for G is straightforward since each axiom is a consequence of the fact that the same 
axiom holds in each factor, $G_i$, and the operation on G is defined componentwise. For 
example, the associative law is verified as follows: 

Let $(a_1, a_2, \dots, a_n)$, $(b_1, b_2, \dots, b_n)$, and $(c_1, c_2, \dots, c_n) \in G$. Then 

$$\begin{aligned}
(a_1, a_2, \dots, a_n)[(b_1, b_2, \dots, b_n)(c_1, c_2, \dots, c_n)]
&= (a_1, a_2, \dots, a_n)(b_1 c_1, b_2 c_2, \dots, b_n c_n) \\
&= (a_1(b_1 c_1), a_2(b_2 c_2), \dots, a_n(b_n c_n)) \\
&= ((a_1 b_1)c_1, (a_2 b_2)c_2, \dots, (a_n b_n)c_n) \\
&= [(a_1, a_2, \dots, a_n)(b_1, b_2, \dots, b_n)](c_1, c_2, \dots, c_n),
\end{aligned}$$

where in the third step we have used the associative law in each component. The 
remaining verification that the direct product is a group is similar: the identity of 
G is the n-tuple $(1_1, 1_2, \dots, 1_n)$, where $1_i$ is the identity of $G_i$, and the inverse of 
$(g_1, g_2, \dots, g_n)$ is $(g_1^{-1}, g_2^{-1}, \dots, g_n^{-1})$, where $g_i^{-1}$ is the inverse of $g_i$ in $G_i$. 

The formula for the order of G is clear. 
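
For a concrete finite case the axioms in Proposition 1 can be verified exhaustively. The sketch below is an illustration on one example of our own choosing, $S_3 \times Z_4$ (permutations as tuples, $Z_4$ as integers mod 4 under addition); it checks associativity, identity, inverses and the order formula by brute force.

```python
from itertools import permutations, product

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

S3 = list(permutations(range(3)))     # operation: composition
Z4 = list(range(4))                   # operation: addition mod 4
G = list(product(S3, Z4))

def op(g, h):
    """Componentwise operation on S3 x Z4."""
    return (compose(g[0], h[0]), (g[1] + h[1]) % 4)

identity = ((0, 1, 2), 0)             # (identity of S3, identity of Z4)

associative = all(op(a, op(b, c)) == op(op(a, b), c) for a in G for b in G for c in G)
has_identity = all(op(identity, g) == g == op(g, identity) for g in G)
has_inverses = all(any(op(g, h) == identity == op(h, g) for h in G) for g in G)

print(len(G) == len(S3) * len(Z4), associative, has_identity, has_inverses)
# True True True True
```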

If the factors of the direct product are rearranged, the resulting direct product is 
isomorphic to the original one (cf. Exercise 7 ). 

The next proposition shows that a direct product, $G_1 \times G_2 \times \cdots \times G_n$, contains an 
isomorphic copy of each $G_i$. One can think of these specific copies as the “coordinate 
axes” of the direct product since, in the case of $\mathbb{R} \times \mathbb{R}$, they coincide with the x and y 
axes. One should be careful, however, not to think of these “coordinate axes” as the only 
copies of the groups $G_i$ in the direct product. For example in $\mathbb{R} \times \mathbb{R}$ any line through 
the origin is a subgroup of $\mathbb{R} \times \mathbb{R}$ isomorphic to $\mathbb{R}$ (and $\mathbb{R} \times \mathbb{R}$ has infinitely many pairs 
of lines which are coordinate axes, viz. any rotation of a given coordinate system). The 
second part of the proposition shows that there are projection homomorphisms onto 
each of the components. 

Proposition 2. Let $G_1, G_2, \dots, G_n$ be groups and let $G = G_1 \times \cdots \times G_n$ be their 
direct product. 

(1) For each fixed i the set of elements of G which have the identity of $G_j$ in the jth 
position for all $j \ne i$ and arbitrary elements of $G_i$ in position i is a subgroup 
of G isomorphic to $G_i$: 

$$G_i \cong \{(1, 1, \dots, 1, g_i, 1, \dots, 1) \mid g_i \in G_i\},$$

(here $g_i$ appears in the ith position). If we identify $G_i$ with this subgroup, then 
$G_i \trianglelefteq G$ and 

$$G/G_i \cong G_1 \times \cdots \times G_{i-1} \times G_{i+1} \times \cdots \times G_n.$$

(2) For each fixed i define $\pi_i : G \to G_i$ by 

$$\pi_i((g_1, g_2, \dots, g_n)) = g_i.$$

Then $\pi_i$ is a surjective homomorphism with 

$$\begin{aligned}
\ker \pi_i &= \{(g_1, \dots, g_{i-1}, 1, g_{i+1}, \dots, g_n) \mid g_j \in G_j \text{ for all } j \ne i\} \\
&\cong G_1 \times \cdots \times G_{i-1} \times G_{i+1} \times \cdots \times G_n
\end{aligned}$$

(here the 1 appears in position i). 

(3) Under the identifications in part (1), if $x \in G_i$ and $y \in G_j$ for some $i \ne j$, then 
$xy = yx$. 

Proof: (1) Since the operation in G is defined componentwise, it follows easily 
from the subgroup criterion that $\{(1, 1, \dots, 1, g_i, 1, \dots, 1) \mid g_i \in G_i\}$ is a subgroup of 
G. Furthermore, the map $g_i \mapsto (1, 1, \dots, 1, g_i, 1, \dots, 1)$ is seen to be an isomorphism 
of $G_i$ with this subgroup. Identify $G_i$ with this isomorphic copy in G. 

To prove the remaining parts of (1) consider the map 

$$\varphi : G \to G_1 \times \cdots \times G_{i-1} \times G_{i+1} \times \cdots \times G_n$$

defined by 

$$\varphi(g_1, g_2, \dots, g_n) = (g_1, \dots, g_{i-1}, g_{i+1}, \dots, g_n)$$

(i.e., $\varphi$ erases the ith component of G). The map $\varphi$ is a homomorphism since 

$$\begin{aligned}
\varphi((g_1, \dots, g_n)(h_1, \dots, h_n)) &= \varphi((g_1 h_1, \dots, g_n h_n)) \\
&= (g_1 h_1, \dots, g_{i-1} h_{i-1}, g_{i+1} h_{i+1}, \dots, g_n h_n) \\
&= (g_1, \dots, g_{i-1}, g_{i+1}, \dots, g_n)(h_1, \dots, h_{i-1}, h_{i+1}, \dots, h_n) \\
&= \varphi((g_1, \dots, g_n))\varphi((h_1, \dots, h_n)).
\end{aligned}$$

Since the entries in position j are arbitrary elements of $G_j$ for all j, $\varphi$ is surjective. 
Furthermore, 

$$\ker \varphi = \{(g_1, \dots, g_n) \mid g_j = 1 \text{ for all } j \ne i\} = G_i.$$

This proves that $G_i$ is a normal subgroup of G (in particular, it again proves this copy 
of $G_i$ is a subgroup) and the First Isomorphism Theorem gives the final assertion of 
part (1). 

In (2) the argument that $\pi_i$ is a surjective homomorphism and the kernel is the 
subgroup described is very similar to that in part (1), so the details are left to the reader. 

In part (3) if $x = (1, \dots, 1, g_i, 1, \dots, 1)$ and $y = (1, \dots, 1, g_j, 1, \dots, 1)$, where 
the indicated entries appear in positions i, j respectively, then 

$$xy = (1, \dots, 1, g_i, 1, \dots, 1, g_j, 1, \dots, 1) = yx$$

(where the notation is chosen so that $i < j$). This completes the proof. 
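
The three parts of Proposition 2 can be observed in a small example. The sketch below is illustrative only and uses $G = S_3 \times Z_4$, a choice of our own; it checks that the two "coordinate axis" subgroups commute elementwise although $S_3$ itself is non-abelian, that projection onto the first factor is a homomorphism, and that its kernel is the copy of the second factor.

```python
from itertools import permutations, product

def compose(s, t):
    """Return the permutation s∘t (apply t first); permutations are tuples."""
    return tuple(s[t[i]] for i in range(len(t)))

S3 = list(permutations(range(3)))
Z4 = list(range(4))
G = list(product(S3, Z4))
e = (0, 1, 2)                                   # identity of S3

def op(g, h):
    return (compose(g[0], h[0]), (g[1] + h[1]) % 4)

G1 = [(s, 0) for s in S3]                       # copy of S3: identity of Z4 in position 2
G2 = [(e, k) for k in Z4]                       # copy of Z4: identity of S3 in position 1

def pi1(g):
    """Projection onto the first factor."""
    return g[0]

axes_commute = all(op(x, y) == op(y, x) for x in G1 for y in G2)
pi1_is_hom = all(pi1(op(g, h)) == compose(pi1(g), pi1(h)) for g in G for h in G)
kernel = [g for g in G if pi1(g) == e]
S3_is_abelian = all(compose(s, t) == compose(t, s) for s in S3 for t in S3)

print(axes_commute, pi1_is_hom, set(kernel) == set(G2), S3_is_abelian)
# True True True False
```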

A generalization of this proposition appears as Exercise 2. 

We shall continue to identify the “coordinate axis” subgroups described in part (1) 
of the proposition with their isomorphic copies, the $G_i$’s. The ith such subgroup is often 
called the ith component or ith factor of G. For instance, when we wish to calculate in 
$Z_n \times Z_m$ we can let x be a generator of the first factor, let y be a generator of the second 
factor and write the elements of $Z_n \times Z_m$ in the form $x^a y^b$. This replaces the formal 
ordered pairs (x, 1) and (1, y) with x and y (so $x^a y^b$ replaces $(x^a, y^b)$). 

Examples 

(1) Under the notation of Proposition 2 it follows from part (3) that if $x_i \in G_i$, $1 \le i \le n$, 
then for all $k \in \mathbb{Z}$ 

$$(x_1 x_2 \cdots x_n)^k = x_1^k x_2^k \cdots x_n^k.$$

Since the order of $x_1 x_2 \cdots x_n$ is the smallest positive integer k such that $x_i^k = 1$ for all 
i, we see that 

$$|x_1 x_2 \cdots x_n| = \text{l.c.m.}(|x_1|, |x_2|, \dots, |x_n|)$$

(where this order is infinite if and only if one of the $x_i$’s has infinite order). 

(2) Let p be a prime and for $n \in \mathbb{Z}^+$ consider 

$$E_{p^n} = Z_p \times Z_p \times \cdots \times Z_p \quad (n \text{ factors}).$$

Then $E_{p^n}$ is an abelian group of order $p^n$ with the property that $x^p = 1$ for all $x \in E_{p^n}$. 
This group is the elementary abelian group of order $p^n$ described in Section 4.4. 

(3) For p a prime, we show that the elementary abelian group of order $p^2$ has exactly $p + 1$ 
subgroups of order p (in particular, there are more than the two obvious ones). Let 
$E = E_{p^2}$. Since each nonidentity element of E has order p, each of these generates a 
cyclic subgroup of E of order p. By Lagrange’s Theorem distinct subgroups of order 
p intersect trivially. Thus the $p^2 - 1$ nonidentity elements of E are partitioned into 
subsets of size $p - 1$ (i.e., each of these subsets consists of the nonidentity elements 
of some subgroup of order p). There must therefore be 

$$\frac{p^2 - 1}{p - 1} = p + 1$$

subgroups of order p. When p = 2, E is the Klein 4-group which we have already 
seen has 3 subgroups of order 2 (cf. also Exercises 10 and 11). 
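
Examples (1) and (3) lend themselves to a quick numerical check. The sketch below is an illustration with our own helper names; it writes $Z_m$ additively as the integers mod m, compares the order of every element of $Z_4 \times Z_6$ with the least common multiple of the orders of its components, and counts the subgroups of order p in $Z_p \times Z_p$ for a few small primes.

```python
from itertools import product
from math import gcd

def element_order(x, moduli):
    """Order of x in Z_{m1} x ... x Z_{mk}, found by repeated addition."""
    zero = tuple(0 for _ in moduli)
    k, current = 1, x
    while current != zero:
        current = tuple((c + xi) % m for c, xi, m in zip(current, x, moduli))
        k += 1
    return k

def lcm(a, b):
    return a * b // gcd(a, b)

# Example (1): the order of (x1, x2) is the l.c.m. of the orders of x1 and x2.
moduli = (4, 6)
lcm_rule_holds = all(
    element_order(x, moduli) == lcm(4 // gcd(x[0], 4), 6 // gcd(x[1], 6))
    for x in product(range(4), range(6))
)

# Example (3): every nonidentity element of Z_p x Z_p generates a subgroup of order p.
def subgroups_of_order_p(p):
    return {frozenset(((k * a) % p, (k * b) % p) for k in range(p))
            for a, b in product(range(p), repeat=2) if (a, b) != (0, 0)}

counts = {p: len(subgroups_of_order_p(p)) for p in (2, 3, 5, 7)}
print(lcm_rule_holds, counts)   # True {2: 3, 3: 4, 5: 6, 7: 8}
```

In each case the count is $p + 1$, in agreement with the formula derived in Example (3).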
EXERCISES 


1. Show that the center of a direct product is the direct product of the centers: 

$$Z(G_1 \times G_2 \times \cdots \times G_n) = Z(G_1) \times Z(G_2) \times \cdots \times Z(G_n).$$

Deduce that a direct product of groups is abelian if and only if each of the factors is abelian. 

2. Let $G_1, G_2, \dots, G_n$ be groups and let $G = G_1 \times \cdots \times G_n$. Let I be a proper, nonempty 
subset of $\{1, \dots, n\}$ and let $J = \{1, \dots, n\} - I$. Define $G_I$ to be the set of elements of G 
that have the identity of $G_j$ in position j for all $j \in J$. 

(a) Prove that $G_I$ is isomorphic to the direct product of the groups $G_i$, $i \in I$. 

(b) Prove that $G_I$ is a normal subgroup of G and $G/G_I \cong G_J$. 

(c) Prove that $G \cong G_I \times G_J$. 

3. Under the notation of the preceding exercise let I and K be any disjoint nonempty subsets 
of $\{1, 2, \dots, n\}$ and let $G_I$ and $G_K$ be the subgroups of G defined above. Prove that 
$xy = yx$ for all $x \in G_I$ and all $y \in G_K$. 

4. Let A and B be finite groups and let p be a prime. Prove that any Sylow p-subgroup 
of $A \times B$ is of the form $P \times Q$, where $P \in Syl_p(A)$ and $Q \in Syl_p(B)$. Prove that 
$n_p(A \times B) = n_p(A) n_p(B)$. Generalize both of these results to a direct product of any 
finite number of finite groups (so that the number of Sylow p-subgroups of a direct product 
is the product of the numbers of Sylow p-subgroups of the factors). 

5. Exhibit a nonnormal subgroup of $Q_8 \times Z_4$ (note that every subgroup of each factor is 
normal). 

6. Show that all subgroups of $Q_8 \times E_{2^n}$ are normal. 

7. Let $G_1, G_2, \dots, G_n$ be groups and let $\pi$ be a fixed element of $S_n$. Prove that the map 

$$\varphi_\pi : G_1 \times G_2 \times \cdots \times G_n \to G_{\pi^{-1}(1)} \times G_{\pi^{-1}(2)} \times \cdots \times G_{\pi^{-1}(n)}$$

defined by 

$$\varphi_\pi(g_1, g_2, \dots, g_n) = (g_{\pi^{-1}(1)}, g_{\pi^{-1}(2)}, \dots, g_{\pi^{-1}(n)})$$

is an isomorphism (so that changing the order of the factors in a direct product does not 
change the isomorphism type). 

8. Let $G_1 = G_2 = \cdots = G_n$ and let $G = G_1 \times \cdots \times G_n$. Under the notation of the 
preceding exercise show that $\varphi_\pi \in Aut(G)$. Show also that the map $\pi \mapsto \varphi_\pi$ is an 
injective homomorphism of $S_n$ into Aut(G). (In particular, $\varphi_{\pi_1} \circ \varphi_{\pi_2} = \varphi_{\pi_1 \pi_2}$. It is at this 
point that the $\pi^{-1}$’s in the definition of $\varphi_\pi$ are needed. The underlying reason for this is 
because if $e_i$ is the n-tuple with 1 in position i and zeros elsewhere, $1 \le i \le n$, then $S_n$ 
acts on $\{e_1, \dots, e_n\}$ by $\pi \cdot e_i = e_{\pi(i)}$; this is a left group action. If the n-tuple $(g_1, \dots, g_n)$ 
is represented by $g_1 e_1 + \cdots + g_n e_n$, then this left group action on $\{e_1, \dots, e_n\}$ extends to 
a left group action on sums by 

$$\pi \cdot (g_1 e_1 + g_2 e_2 + \cdots + g_n e_n) = g_1 e_{\pi(1)} + g_2 e_{\pi(2)} + \cdots + g_n e_{\pi(n)}.$$

The coefficient of $e_{\pi(i)}$ on the right hand side is $g_i$, so the coefficient of $e_i$ is $g_{\pi^{-1}(i)}$. Thus 
the right hand side may be rewritten as $g_{\pi^{-1}(1)} e_1 + g_{\pi^{-1}(2)} e_2 + \cdots + g_{\pi^{-1}(n)} e_n$, which is 
precisely the sum attached to the n-tuple $(g_{\pi^{-1}(1)}, g_{\pi^{-1}(2)}, \dots, g_{\pi^{-1}(n)})$. In other words, 
any permutation of the “position vectors” $e_1, \dots, e_n$ (which fixes their coefficients) is the 
same as the inverse permutation on the coefficients (fixing the $e_i$’s). If one uses $\pi$’s in place 
of $\pi^{-1}$’s in the definition of $\varphi_\pi$ then the map $\pi \mapsto \varphi_\pi$ is not necessarily a homomorphism; 
it corresponds to a right group action.) 

9. Let $G_i$ be a field F for all i and use the preceding exercise to show that the set of $n \times n$ 
matrices with one 1 in each row and each column is a subgroup of $GL_n(F)$ isomorphic to 
$S_n$ (these matrices are called permutation matrices since they simply permute the standard 
basis $e_1, \dots, e_n$ (as above) of the n-dimensional vector space $F \times F \times \cdots \times F$). 

10. Let $p$ be a prime. Let $A$ and $B$ be two cyclic groups of order $p$ with generators $x$ and $y$,
respectively. Set $E = A \times B$ so that $E$ is the elementary abelian group of order $p^2$: $E_{p^2}$.
Prove that the distinct subgroups of $E$ of order $p$ are

$$\langle x \rangle,\ \langle xy \rangle,\ \langle xy^2 \rangle,\ \ldots,\ \langle xy^{p-1} \rangle,\ \langle y \rangle$$

(note that there are $p + 1$ of them).

11. Let $p$ be a prime and let $n \in \mathbb{Z}^+$. Find a formula for the number of subgroups of order $p$
in the elementary abelian group $E_{p^n}$.

12. Let $A$ and $B$ be groups. Assume $Z(A)$ contains a subgroup $Z_1$ and $Z(B)$ contains a
subgroup $Z_2$ with $Z_1 \cong Z_2$. Let this isomorphism be given by the map $x_i \mapsto y_i$ for all
$x_i \in Z_1$. A central product of $A$ and $B$ is a quotient

$$(A \times B)/Z \quad \text{where } Z = \{ (x_i, y_i^{-1}) \mid x_i \in Z_1 \}$$

and is denoted by $A * B$; it is not unique since it depends on $Z_1$, $Z_2$ and the isomorphism
between them. (Think of $A * B$ as the direct product of $A$ and $B$ "collapsed" by identifying
each element $x_i \in Z_1$ with its corresponding element $y_i \in Z_2$.)

(a) Prove that the images of $A$ and $B$ in the quotient group $A * B$ are isomorphic to $A$
and $B$, respectively, and that these images intersect in a central subgroup isomorphic
to $Z_1$. Find $|A * B|$.

(b) Let $Z_4 = \langle x \rangle$. Let $D_8 = \langle r, s \rangle$ and $Q_8 = \langle i, j \rangle$ be given by their usual generators
and relations. Let $Z_4 * D_8$ be the central product of $Z_4$ and $D_8$ which identifies
$x^2$ and $r^2$ (i.e., $Z_1 = \langle x^2 \rangle$, $Z_2 = \langle r^2 \rangle$ and the isomorphism is $x^2 \mapsto r^2$) and let
$Z_4 * Q_8$ be the central product of $Z_4$ and $Q_8$ which identifies $x^2$ and $-1$. Prove that
$Z_4 * D_8 \cong Z_4 * Q_8$.

13. Give presentations for the groups $Z_4 * D_8$ and $Z_4 * Q_8$ constructed in the preceding exercise.

14. Let $G = A_1 \times A_2 \times \cdots \times A_n$ and for each $i$ let $B_i$ be a normal subgroup of $A_i$. Prove that
$B_1 \times B_2 \times \cdots \times B_n \trianglelefteq G$ and that

$$(A_1 \times A_2 \times \cdots \times A_n)/(B_1 \times B_2 \times \cdots \times B_n) \cong (A_1/B_1) \times (A_2/B_2) \times \cdots \times (A_n/B_n).$$

The following exercise describes the direct product of an arbitrary collection of groups. The 
terminology for the Cartesian product of an arbitrary collection of sets may be found in the 
Appendix. 

15. Let $I$ be any nonempty index set and let $(G_i, \star_i)$ be a group for each $i \in I$. The direct
product of the groups $G_i$, $i \in I$ is the set $G = \prod_{i \in I} G_i$ (the Cartesian product of the $G_i$'s)
with a binary operation defined as follows: if $\prod a_i$ and $\prod b_i$ are elements of $G$, then their
product in $G$ is given by

$$\Bigl( \prod_{i \in I} a_i \Bigr) \Bigl( \prod_{i \in I} b_i \Bigr) = \prod_{i \in I} (a_i \star_i b_i)$$

(i.e., the group operation in the direct product is defined componentwise).

(a) Show that this binary operation is well defined and associative.

(b) Show that the element $\prod 1_i$ satisfies the axiom for the identity of $G$, where $1_i$ is the
identity of $G_i$ for all $i$.

(c) Show that the element $\prod a_i^{-1}$ is the inverse of $\prod a_i$, where the inverse of each
component element $a_i$ is taken in the group $G_i$.

Conclude that the direct product is a group.

(Note that if $I = \{1, 2, \ldots, n\}$, this definition of the direct product is the same as the
$n$-tuple definition in the text.)

16. State and prove the generalization of Proposition 2 to arbitrary direct products. 

17. Let $I$ be any nonempty index set and let $G_i$ be a group for each $i \in I$. The restricted
direct product or direct sum of the groups $G_i$ is the set of elements of the direct product
which are the identity in all but finitely many components, that is, the set of all elements
$\prod a_i \in \prod_{i \in I} G_i$ such that $a_i = 1_i$ for all but a finite number of $i \in I$.

(a) Prove that the restricted direct product is a subgroup of the direct product.

(b) Prove that the restricted direct product is normal in the direct product.

(c) Let $I = \mathbb{Z}^+$ and let $p_i$ be the $i$th integer prime. Show that if $G_i = \mathbb{Z}/p_i\mathbb{Z}$ for all
$i \in \mathbb{Z}^+$, then every element of the restricted direct product of the $G_i$'s has finite order
but $\prod_{i \in \mathbb{Z}^+} G_i$ has elements of infinite order. Show that in this example the restricted
direct product is the torsion subgroup of the direct product (cf. Exercise 6, Section
2.1).

18. In each of (a) to (e) give an example of a group with the specified properties: 

(a) an infinite group in which every element has order 1 or 2 

(b) an infinite group in which every element has finite order but for each positive integer 
n there is an element of order n 

(c) a group with an element of infinite order and an element of order 2 

(d) a group G such that every finite group is isomorphic to some subgroup of G 

(e) a nontrivial group $G$ such that $G \cong G \times G$.


5.2 THE FUNDAMENTAL THEOREM OF FINITELY GENERATED 
ABELIAN GROUPS 


Definition.

(1) A group $G$ is finitely generated if there is a finite subset $A$ of $G$ such that
$G = \langle A \rangle$.

(2) For each $r \in \mathbb{Z}$ with $r \ge 0$, let $\mathbb{Z}^r = \mathbb{Z} \times \mathbb{Z} \times \cdots \times \mathbb{Z}$ be the direct product of
$r$ copies of the group $\mathbb{Z}$, where $\mathbb{Z}^0 = 1$. The group $\mathbb{Z}^r$ is called the free abelian
group of rank $r$.

Note that any finite group $G$ is, a fortiori, finitely generated: simply take $A = G$
as a set of generators. Also, $\mathbb{Z}^r$ is finitely generated by $e_1, e_2, \ldots, e_r$, where $e_i$ is the
$r$-tuple with 1 in position $i$ and zeros elsewhere. We can now state the fundamental
classification theorem for (finitely generated) abelian groups.

Theorem 3. (Fundamental Theorem of Finitely Generated Abelian Groups) Let $G$ be
a finitely generated abelian group. Then

(1)
$$G \cong \mathbb{Z}^r \times Z_{n_1} \times Z_{n_2} \times \cdots \times Z_{n_s},$$
for some integers $r, n_1, n_2, \ldots, n_s$ satisfying the following conditions:

(a) $r \ge 0$ and $n_j \ge 2$ for all $j$, and

(b) $n_{i+1} \mid n_i$ for $1 \le i \le s - 1$

(2) the expression in (1) is unique: if $G \cong \mathbb{Z}^t \times Z_{m_1} \times Z_{m_2} \times \cdots \times Z_{m_u}$, where $t$ and
$m_1, m_2, \ldots, m_u$ satisfy (a) and (b) (i.e., $t \ge 0$, $m_j \ge 2$ for all $j$ and $m_{i+1} \mid m_i$
for $1 \le i \le u - 1$), then $t = r$, $u = s$ and $m_i = n_i$ for all $i$.

Proof: We shall derive this theorem in Section 12.1 as a consequence of a more
general classification theorem. For finite groups we shall give an alternate proof at the
end of Section 6.1.

Definition. The integer $r$ in Theorem 3 is called the free rank or Betti number of $G$
and the integers $n_1, n_2, \ldots, n_s$ are called the invariant factors of $G$. The description of
$G$ in Theorem 3(1) is called the invariant factor decomposition of $G$.

Theorem 3 asserts that the free rank and (ordered) list of invariant factors of an 
abelian group are uniquely determined, so that two finitely generated abelian groups 
are isomorphic if and only if they have the same free rank and the same list of invariant 
factors. Observe that a finitely generated abelian group is a finite group if and only if 
its free rank is zero. 

The order of a finite abelian group is just the product of its invariant factors (by
Proposition 1). If $G$ is a finite abelian group with invariant factors $n_1, n_2, \ldots, n_s$, where
$n_{i+1} \mid n_i$, $1 \le i \le s - 1$, then $G$ is said to be of type $(n_1, n_2, \ldots, n_s)$.

Theorem 3 gives an effective way of listing all finite abelian groups of a given
order. Namely, to find (up to isomorphism) all abelian groups of a given order $n$ one
must find all finite sequences of integers $n_1, n_2, \ldots, n_s$ such that

(1) $n_j \ge 2$ for all $j \in \{1, 2, \ldots, s\}$,

(2) $n_{i+1} \mid n_i$, $1 \le i \le s - 1$, and

(3) $n_1 n_2 \cdots n_s = n$.

Theorem 3 states that there is a bijection between the set of such sequences and 
the set of isomorphism classes of finite abelian groups of order n (where each sequence 
corresponds to the list of invariant factors of a finite abelian group). 

Before illustrating how to find all such sequences for a specific value of $n$ we make
some general comments. First note that $n_1 \ge n_2 \ge \cdots \ge n_s$, so $n_1$ is the largest
invariant factor. Also, by property (3) each $n_i$ divides $n$. If $p$ is any prime divisor of $n$
then by (3) we see that $p$ must divide $n_i$ for some $i$. Then, by (2), $p$ also divides $n_j$ for
all $j \le i$. It follows that

every prime divisor of $n$ must divide the first invariant factor $n_1$.

In particular, if $n$ is the product of distinct primes (all to the first power)¹ we see that
$n \mid n_1$, hence $n = n_1$. This proves that if $n$ is squarefree, there is only one possible list
of invariant factors for an abelian group of order $n$ (namely, the list $n_1 = n$):

Corollary 4. If $n$ is the product of distinct primes, then up to isomorphism the only
abelian group of order $n$ is the cyclic group of order $n$, $Z_n$.

¹ Such integers are called squarefree since they are not divisible by any square $> 1$.


The factorization of n into prime powers is the first step in determining all possible 
lists of invariant factors for abelian groups of order n. 

Example 

Suppose $n = 180 = 2^2 \cdot 3^2 \cdot 5$. As noted above we must have $2 \cdot 3 \cdot 5 \mid n_1$, so possible
values of $n_1$ are

$$n_1 = 2^2 \cdot 3^2 \cdot 5, \quad 2^2 \cdot 3 \cdot 5, \quad 2 \cdot 3^2 \cdot 5, \quad \text{or} \quad 2 \cdot 3 \cdot 5.$$

For each of these one must work out all possible $n_2$'s (subject to $n_2 \mid n_1$ and $n_1 n_2 \mid n$). For
each resulting pair $n_1, n_2$ one must work out all possible $n_3$'s etc. until all lists satisfying
(1) to (3) are obtained.

For instance, if $n_1 = 2 \cdot 3^2 \cdot 5$, the only number $n_2$ dividing $n_1$ with $n_1 n_2$ dividing $n$
is $n_2 = 2$. In this case $n_1 n_2 = n$, so this list is complete: $2 \cdot 3^2 \cdot 5,\ 2$. The abelian group
corresponding to this list is $Z_{90} \times Z_2$.

If $n_1 = 2 \cdot 3 \cdot 5$, the only candidates for $n_2$ are $n_2 = 2$, 3 or 6. If $n_2 = 2$ or 3, then
since $n_3 \mid n_2$ we would necessarily have $n_3 = n_2$ (and there must be a third term in the
list by property (3)). This leads to a contradiction because $n_1 n_2 n_3$ would be divisible by
$2^3$ or $3^3$ respectively, but $n$ is not divisible by either of these numbers. Thus the only list
of invariant factors whose first term is $2 \cdot 3 \cdot 5$ is $2 \cdot 3 \cdot 5,\ 2 \cdot 3$. The corresponding abelian
group is $Z_{30} \times Z_6$.

Similarly, all permissible lists of invariant factors and the corresponding abelian groups 
of order 180 are easily seen to be the following: 


Invariant Factors               | Abelian Groups
$2^2 \cdot 3^2 \cdot 5$         | $Z_{180}$
$2 \cdot 3^2 \cdot 5,\ 2$       | $Z_{90} \times Z_2$
$2^2 \cdot 3 \cdot 5,\ 3$       | $Z_{60} \times Z_3$
$2 \cdot 3 \cdot 5,\ 2 \cdot 3$ | $Z_{30} \times Z_6$

The process we carried out above was somewhat ad hoc; however, it indicates that
the determination of lists of invariant factors of all abelian groups of a given order $n$
relies strongly on the factorization of $n$. The following theorem (which we shall see
is equivalent to the Fundamental Theorem in the case of finite abelian groups) gives a
more systematic and computationally much faster way of determining all finite abelian
groups of a given order. More specifically, if the factorization of $n$ is

$$n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k},$$

it shows that all permissible lists of invariant factors for abelian groups of order $n$ may
be determined by finding permissible lists for groups of order $p_i^{\alpha_i}$ for each $i$. For a
prime power, $p^\alpha$, we shall see that the problem of determining all permissible lists is
equivalent to the determination of all partitions of $\alpha$ (and does not depend on $p$).

Theorem 5. Let $G$ be an abelian group of order $n > 1$ and let the unique factorization
of $n$ into distinct prime powers be

$$n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}.$$

Then

(1) $G \cong A_1 \times A_2 \times \cdots \times A_k$, where $|A_i| = p_i^{\alpha_i}$

(2) for each $A \in \{A_1, A_2, \ldots, A_k\}$ with $|A| = p^\alpha$,

$$A \cong Z_{p^{\beta_1}} \times Z_{p^{\beta_2}} \times \cdots \times Z_{p^{\beta_t}}$$

with $\beta_1 \ge \beta_2 \ge \cdots \ge \beta_t \ge 1$ and $\beta_1 + \beta_2 + \cdots + \beta_t = \alpha$ (where $t$ and
$\beta_1, \ldots, \beta_t$ depend on $i$)

(3) the decomposition in (1) and (2) is unique, i.e., if $G \cong B_1 \times B_2 \times \cdots \times B_m$,
with $|B_i| = p_i^{\alpha_i}$ for all $i$, then $B_i \cong A_i$ and $B_i$ and $A_i$ have the same invariant
factors.

Definition. The integers $p^{\beta_j}$ described in the preceding theorem are called the
elementary divisors of $G$. The description of $G$ in Theorem 5(1) and 5(2) is called the
elementary divisor decomposition of $G$.

The subgroups $A_i$ described in part (1) of the theorem are the Sylow $p_i$-subgroups
of $G$. Thus (1) says that $G$ is isomorphic to the direct product of its Sylow subgroups
(note that they are normal, since $G$ is abelian, hence unique). Part (1) is often referred
to as The Primary Decomposition Theorem for finite abelian groups.² As with Theorem
3, we shall prove this theorem later.

Note that for $p$ a prime, $p^\beta \mid p^\gamma$ if and only if $\beta \le \gamma$. Furthermore, $p^{\beta_1} \cdots p^{\beta_t} = p^\alpha$
if and only if $\beta_1 + \cdots + \beta_t = \alpha$. Thus the decomposition of $A$ appearing in part
(2) of Theorem 5 is the invariant factor decomposition of $A$ with the "divisibility"
conditions on the integers $p^{\beta_j}$ translated into "additive" conditions on their exponents.
The elementary divisors of $G$ are now seen to be the invariant factors of the Sylow
$p$-subgroups as $p$ runs over all prime divisors of $|G|$.

By Theorem 5, in order to find all abelian groups of order $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$
one must find for each $i$, $1 \le i \le k$, all possible lists of invariant factors for groups
of order $p_i^{\alpha_i}$. The set of elementary divisors of each abelian group is then obtained
by taking one set of invariant factors from each of the $k$ lists. The abelian groups are
the direct products of the cyclic groups whose orders are the elementary divisors (and
distinct lists of elementary divisors give nonisomorphic groups). The advantage of this
process over the one described following Theorem 3 is that it is easier to systematize
how to obtain all possible lists of invariant factors, $p^{\beta_1}, p^{\beta_2}, \ldots, p^{\beta_t}$, for a group of
prime power order $p^\beta$. Conditions (1) to (3) for invariant factors described earlier then
become

(1) $\beta_j \ge 1$ for all $j \in \{1, 2, \ldots, t\}$,

(2) $\beta_i \ge \beta_{i+1}$ for all $i$, and

(3) $\beta_1 + \beta_2 + \cdots + \beta_t = \beta$.

Hence, each list of invariant factors in this case is simply a partition of $\beta$ (ordered in
descending order). In particular, the number of nonisomorphic abelian groups of order
$p^\beta$ (= the number of distinct lists) equals the number of partitions of $\beta$. This number is
independent of the prime $p$. For example the number of abelian groups of order $p^5$ is
obtained from the list of partitions of 5:

² Recall that for abelian groups the Sylow $p$-subgroups are sometimes called the $p$-primary components.


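Partition of 5 (exponents) | Abelian Groups
5                          | $Z_{p^5}$
4, 1                       | $Z_{p^4} \times Z_p$
3, 2                       | $Z_{p^3} \times Z_{p^2}$
3, 1, 1                    | $Z_{p^3} \times Z_p \times Z_p$
2, 2, 1                    | $Z_{p^2} \times Z_{p^2} \times Z_p$
2, 1, 1, 1                 | $Z_{p^2} \times Z_p \times Z_p \times Z_p$
1, 1, 1, 1, 1              | $Z_p \times Z_p \times Z_p \times Z_p \times Z_p$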


Thus there are precisely 7 nonisomorphic abelian groups of order $p^5$, the first in the list being
the cyclic group, $Z_{p^5}$, and the last in the list being the elementary abelian group, $E_{p^5}$.

If $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ and $q_i$ is the number of partitions of $\alpha_i$, we see that the
number of (distinct, nonisomorphic) abelian groups of order $n$ equals $q_1 q_2 \cdots q_k$.
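The count $q_1 q_2 \cdots q_k$ is easy to compute for small $n$. The Python sketch below is illustrative only; the names `partitions`, `prime_factorization` and `count_abelian_groups` are assumptions, not notation from the text. It lists partitions in descending order and multiplies the partition counts of the exponents in the factorization of $n$.

```python
def partitions(n, largest=None):
    """Yield the partitions of n as nonincreasing lists of positive integers."""
    if largest is None:
        largest = n
    if n == 0:
        yield []
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield [first] + rest


def prime_factorization(n):
    """Return {prime: exponent} for an integer n >= 1, by trial division."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def count_abelian_groups(n):
    """Number of isomorphism types of abelian groups of order n."""
    count = 1
    for exponent in prime_factorization(n).values():
        count *= sum(1 for _ in partitions(exponent))
    return count


print(list(partitions(5)))         # the seven partitions of 5, in the order of the table
print(count_abelian_groups(1800))  # 3 * 2 * 2
```

Trial division is adequate here because only the exponents of the prime factorization matter; for large $n$ a faster factorization routine would be needed.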

Example 

If $n = 1800 = 2^3 3^2 5^2$ we list the abelian groups of this order as follows:

Order $p^\beta$ | Partitions of $\beta$ | Abelian Groups
$2^3$           | 3; 2, 1; 1, 1, 1      | $Z_8$, $Z_4 \times Z_2$, $Z_2 \times Z_2 \times Z_2$
$3^2$           | 2; 1, 1               | $Z_9$, $Z_3 \times Z_3$
$5^2$           | 2; 1, 1               | $Z_{25}$, $Z_5 \times Z_5$


We obtain the abelian groups of order 1800 by taking one abelian group from each of the 
three lists (right hand column above) and taking their direct product. Doing this in all 
possible ways gives all isomorphism types: 


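$Z_8 \times Z_9 \times Z_{25}$
$Z_8 \times Z_9 \times Z_5 \times Z_5$
$Z_8 \times Z_3 \times Z_3 \times Z_{25}$
$Z_8 \times Z_3 \times Z_3 \times Z_5 \times Z_5$
$Z_4 \times Z_2 \times Z_9 \times Z_{25}$
$Z_4 \times Z_2 \times Z_9 \times Z_5 \times Z_5$
$Z_4 \times Z_2 \times Z_3 \times Z_3 \times Z_{25}$
$Z_4 \times Z_2 \times Z_3 \times Z_3 \times Z_5 \times Z_5$
$Z_2 \times Z_2 \times Z_2 \times Z_9 \times Z_{25}$
$Z_2 \times Z_2 \times Z_2 \times Z_9 \times Z_5 \times Z_5$
$Z_2 \times Z_2 \times Z_2 \times Z_3 \times Z_3 \times Z_{25}$
$Z_2 \times Z_2 \times Z_2 \times Z_3 \times Z_3 \times Z_5 \times Z_5$.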


By the Fundamental Theorems above, this is a complete list of all abelian groups of order 
1800 — every abelian group of this order is isomorphic to precisely one of the groups 
above and no two of the groups in this list are isomorphic. 


We emphasize that the elementary divisors of $G$ are not invariant factors of $G$ (but
invariant factors of subgroups of $G$). For instance, for the first group in the list above the
elementary divisors 8, 9, 25 do not satisfy the divisibility criterion of a list of invariant factors.

Our next aim is to illustrate how to pass from a list of invariant factors of a finite
abelian group to its list of elementary divisors and vice versa. We show how to determine
these invariants of the group no matter how it is given as a direct product of cyclic groups.
We need the following proposition.

Proposition 6. Let $m, n \in \mathbb{Z}^+$.

(1) $Z_m \times Z_n \cong Z_{mn}$ if and only if $(m, n) = 1$.

(2) If $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ then $Z_n \cong Z_{p_1^{\alpha_1}} \times Z_{p_2^{\alpha_2}} \times \cdots \times Z_{p_k^{\alpha_k}}$.

Proof: Since (2) is an easy exercise using (1) and induction on $k$, we concentrate
on proving (1). Let $Z_m = \langle x \rangle$, $Z_n = \langle y \rangle$ and let $l = \text{l.c.m.}(m, n)$. Note that $l = mn$
if and only if $(m, n) = 1$. Let $x^a y^b$ be a typical element of $Z_m \times Z_n$. Then (as noted
in Example 1, Section 1)

$$(x^a y^b)^l = x^{la} y^{lb} = 1^a 1^b = 1 \quad (\text{because } m \mid l \text{ and } n \mid l).$$

If $(m, n) \ne 1$, every element of $Z_m \times Z_n$ has order at most $l$, hence has order strictly
less than $mn$, so $Z_m \times Z_n$ cannot be isomorphic to $Z_{mn}$.

Conversely, if $(m, n) = 1$, then $|xy| = \text{l.c.m.}(|x|, |y|) = mn$. Thus, by order
considerations, $Z_m \times Z_n = \langle xy \rangle$ is cyclic, completing the proof.
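Part (1) of the proposition can be checked numerically for small $m$ and $n$. The sketch below is an illustration rather than a proof, and the helper names `max_element_order` and `is_cyclic_product` are hypothetical. It writes $Z_m \times Z_n$ additively, computes the order of each pair $(a, b)$ as the l.c.m. of the orders of its components, and tests whether some element has order $mn$.

```python
from math import gcd


def lcm(a, b):
    return a * b // gcd(a, b)


def max_element_order(m, n):
    """Largest order of an element (a, b) of Z_m x Z_n, written additively."""
    best = 1
    for a in range(m):
        for b in range(n):
            order_a = m // gcd(m, a)  # order of a in Z_m (gcd(m, 0) == m gives order 1)
            order_b = n // gcd(n, b)  # order of b in Z_n
            best = max(best, lcm(order_a, order_b))
    return best


def is_cyclic_product(m, n):
    """True when Z_m x Z_n has an element of order mn, i.e. is cyclic."""
    return max_element_order(m, n) == m * n


print(is_cyclic_product(4, 9))   # (4, 9) = 1
print(is_cyclic_product(6, 15))  # (6, 15) = 3
```

In every case the largest element order equals $\text{l.c.m.}(m, n)$, which is the quantity $l$ used in the proof.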

Obtaining Elementary Divisors from Invariant Factors 

Suppose $G$ is given as an abelian group of type $(n_1, n_2, \ldots, n_s)$, that is

$$G \cong Z_{n_1} \times Z_{n_2} \times \cdots \times Z_{n_s}.$$

Let $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k} = n_1 n_2 \cdots n_s$. Factor each $n_i$ as

$$n_i = p_1^{\beta_{i1}} p_2^{\beta_{i2}} \cdots p_k^{\beta_{ik}}, \quad \text{where } \beta_{ij} \ge 0.$$

By the proposition above,

$$Z_{n_i} \cong Z_{p_1^{\beta_{i1}}} \times \cdots \times Z_{p_k^{\beta_{ik}}},$$

for each $i$. If $\beta_{ij} = 0$, $Z_{p_j^{\beta_{ij}}} = 1$ and this factor may be deleted from the direct
product without changing the isomorphism type. Then the elementary divisors of $G$
are precisely the integers

$$p_j^{\beta_{ij}}, \quad 1 \le j \le k,\ 1 \le i \le s \text{ such that } \beta_{ij} \ne 0.$$

For example, if $|G| = 2^3 \cdot 3^2 \cdot 5^2$ and $G$ is of type (30, 30, 2), then

$$G \cong Z_{30} \times Z_{30} \times Z_2.$$

Since $Z_{30} \cong Z_2 \times Z_3 \times Z_5$, $G \cong Z_2 \times Z_3 \times Z_5 \times Z_2 \times Z_3 \times Z_5 \times Z_2$. The elementary
divisors of $G$ are therefore 2, 3, 5, 2, 3, 5, 2, or, grouping like primes together (note that
rearranging the order of the factors in a direct product does not affect the isomorphism
type (Exercise 7 of Section 1)), 2, 2, 2, 3, 3, 5, 5. In particular, $G$ is isomorphic to
the last group in the list in the example above.

If for each $j$ one collects all the factors $Z_{p_j^{\beta_{ij}}}$ together, the resulting direct product
forms the Sylow $p_j$-subgroup, $A_j$, of $G$. Thus the Sylow 2-subgroup of the group in
the preceding paragraph is isomorphic to $Z_2 \times Z_2 \times Z_2$ (i.e., the elementary abelian
group of order 8).

Obtaining Elementary Divisors from any cyclic decomposition 

The same process described above will give the elementary divisors of a finite abelian 
group G whenever G is given as a direct product of cyclic groups (not just when the 
orders of the cyclic components are the invariant factors). For example, if $G = Z_6 \times Z_{15}$,
the list 6, 15 is neither that of the invariant factors (the divisibility condition fails) nor
that of elementary divisors (they are not prime powers). To find the elementary divisors,
factor $6 = 2 \cdot 3$ and $15 = 3 \cdot 5$. Then the prime powers 2, 3, 3, 5 are the elementary
divisors and

$$G \cong Z_2 \times Z_3 \times Z_3 \times Z_5.$$
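The factoring step just described is mechanical, and a short Python sketch may make it concrete. The function names `prime_power_factors` and `elementary_divisors` are illustrative assumptions. Each cyclic order is split into its prime power factors by trial division, and the results are grouped by prime with larger powers first.

```python
def prime_power_factors(n):
    """Return the pairs (p, p**a) with p**a the exact power of p dividing n,
    for example 12 -> [(2, 4), (3, 3)]."""
    pairs = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            q = 1
            while n % p == 0:
                q *= p
                n //= p
            pairs.append((p, q))
        p += 1
    if n > 1:
        pairs.append((n, n))  # whatever is left is a single prime
    return pairs


def elementary_divisors(cyclic_orders):
    """Elementary divisors of Z_{n_1} x ... x Z_{n_s}, grouped by prime,
    with larger powers listed first within each prime."""
    pairs = []
    for n in cyclic_orders:
        pairs.extend(prime_power_factors(n))
    pairs.sort(key=lambda pq: (pq[0], -pq[1]))
    return [q for _, q in pairs]


print(elementary_divisors([6, 15]))      # the example above
print(elementary_divisors([30, 30, 2]))  # the group of type (30, 30, 2)
```

The input may be any list of cyclic orders; it does not have to be a list of invariant factors.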

Obtaining Invariant Factors from Elementary Divisors 

Suppose $G$ is an abelian group of order $n$, where $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ and we are given
the elementary divisors of $G$. The invariant factors of $G$ are obtained by following these
steps:

(1) First group all elementary divisors which are powers of the same prime together.
In this way we obtain $k$ lists of integers (one for each $p_j$).

(2) In each of these $k$ lists arrange the integers in nonincreasing order.

(3) Among these $k$ lists suppose that the longest (i.e., the one with the most terms)
consists of $t$ integers. Make each of the $k$ lists of length $t$ by appending an appropriate
number of 1's at the end of each list.

(4) For each $i \in \{1, 2, \ldots, t\}$ the $i$th invariant factor, $n_i$, is obtained by taking the
product of the $i$th integer in each of the $k$ (ordered) lists.

The point of ordering the lists in this way is to ensure that we have the divisibility
condition $n_{i+1} \mid n_i$.

Suppose, for example, that the elementary divisors of $G$ are given as 2, 3, 2, 25, 3,
2 (so $|G| = 2^3 \cdot 3^2 \cdot 5^2$). Regrouping and increasing each list to have 3 (= $t$) members
gives:


$p = 2$ | $p = 3$ | $p = 5$
2       | 3       | 25
2       | 3       | 1
2       | 1       | 1

so the invariant factors of $G$ are $2 \cdot 3 \cdot 25$, $2 \cdot 3 \cdot 1$, $2 \cdot 1 \cdot 1$ and

$$G \cong Z_{150} \times Z_6 \times Z_2.$$

Note that this is the penultimate group in the list classifying abelian groups of order 
1800 computed above. 
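Steps (1) to (4) translate directly into code. The following Python sketch is one possible rendering under the assumption that the input is a list of prime powers, each at least 2; the names `smallest_prime_factor` and `invariant_factors` are illustrative.

```python
def smallest_prime_factor(q):
    """Smallest prime dividing q (assumes q >= 2)."""
    p = 2
    while q % p != 0:
        p += 1
    return p


def invariant_factors(elementary_divisors):
    """Invariant factors n_1, ..., n_t obtained from a list of elementary divisors."""
    # Step (1): group the prime powers by their prime.
    lists = {}
    for q in elementary_divisors:
        lists.setdefault(smallest_prime_factor(q), []).append(q)
    if not lists:
        return []  # trivial group: no invariant factors
    # Step (2): arrange each list in nonincreasing order.
    for powers in lists.values():
        powers.sort(reverse=True)
    # Step (3): pad every list with 1's to the length t of the longest list.
    t = max(len(powers) for powers in lists.values())
    for powers in lists.values():
        powers.extend([1] * (t - len(powers)))
    # Step (4): the i-th invariant factor is the product of the i-th entries.
    factors = []
    for i in range(t):
        product = 1
        for powers in lists.values():
            product *= powers[i]
        factors.append(product)
    return factors


print(invariant_factors([2, 3, 2, 25, 3, 2]))  # the worked example above
print(invariant_factors([2, 3, 3, 5]))         # Z_6 x Z_15
```

The second call shows that $Z_6 \times Z_{15}$ has invariant factors 30, 3, so it is isomorphic to $Z_{30} \times Z_3$.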

The invariant factor decompositions of the abelian groups of order 1800 are as
follows, where the $i$th group in this list is isomorphic to the $i$th group computed in the
previous list:

$Z_{1800}$
$Z_{360} \times Z_5$
$Z_{600} \times Z_3$
$Z_{120} \times Z_{15}$
$Z_{900} \times Z_2$
$Z_{180} \times Z_{10}$
$Z_{300} \times Z_6$
$Z_{60} \times Z_{30}$
$Z_{450} \times Z_2 \times Z_2$
$Z_{90} \times Z_{10} \times Z_2$
$Z_{150} \times Z_6 \times Z_2$
$Z_{30} \times Z_{30} \times Z_2$.


Using the uniqueness statements of the Fundamental Theorems 3 and 5, we can use
these processes to determine whether any two direct products of finite cyclic groups are
isomorphic. For instance, if one wanted to know whether $Z_6 \times Z_{15} \cong Z_{10} \times Z_9$, first
determine whether they have the same order (both are of order 90) and then (the easiest
way in general) determine whether they have the same elementary divisors:

$Z_6 \times Z_{15}$ has elementary divisors 2, 3, 3, 5 and is isomorphic to $Z_2 \times Z_3 \times Z_3 \times Z_5$
$Z_{10} \times Z_9$ has elementary divisors 2, 5, 9 and is isomorphic to $Z_2 \times Z_5 \times Z_9$.

The lists of elementary divisors are different so (by Theorem 5) they are not isomorphic.
Note that $Z_6 \times Z_{15}$ has no element of order 9 whereas $Z_{10} \times Z_9$ does (cf. Exercise 5).
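The remark about elements of order 9 can be confirmed by brute force. The sketch below is illustrative only and the function name `element_orders` is an assumption. It writes a direct product of cyclic groups additively and tallies how many elements have each order, the order of a tuple being the l.c.m. of the orders of its components.

```python
from itertools import product
from math import gcd


def element_orders(cyclic_orders):
    """Return {order: number of elements of that order} for Z_{n_1} x ... x Z_{n_s}."""
    counts = {}
    for element in product(*(range(n) for n in cyclic_orders)):
        order = 1
        for n, a in zip(cyclic_orders, element):
            component = n // gcd(n, a)                      # order of a in Z_n
            order = order * component // gcd(order, component)  # running l.c.m.
        counts[order] = counts.get(order, 0) + 1
    return counts


orders_a = element_orders([6, 15])
orders_b = element_orders([10, 9])
print(9 in orders_a)  # does Z_6 x Z_15 have an element of order 9?
print(9 in orders_b)  # does Z_10 x Z_9 have one?
```

Enumerating all elements is only practical for small groups; comparing elementary divisors, as in the text, is the efficient test.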

The processes we described above (with some elaboration) form a proof (via
Proposition 6) that for finite abelian groups Theorems 3 and 5 are equivalent (i.e., one implies
the other). We leave the details to the reader.

One can now better understand some of the power and some of the limitations of 
classification theorems. On one hand, given any positive integer n one can explicitly 
describe all abelian groups of order n, a significant achievement. On the other hand,
the amount of information necessary to determine which of the isomorphism types of 
groups of order n a particular group belongs to may be considerable (and is large if n 
is divisible by large powers of primes). 

We close this section with some terminology which will be useful in later sections. 


Definition. 

(1) If $G$ is a finite abelian group of type $(n_1, n_2, \ldots, n_t)$, the integer $t$ is called the
rank of $G$ (the free rank of $G$ is 0 so there will be no confusion).

(2) If $G$ is any group, the exponent of $G$ is the smallest positive integer $n$ such that
$x^n = 1$ for all $x \in G$ (if no such integer exists the exponent of $G$ is $\infty$).

EXERCISES 

1 . In each of parts (a) to (e) give the number of nonisomorphic abelian groups of the specified 
order — do not list the groups: (a) order 100, (b) order 576, (c) order 1155, (d) order 
42875, (e) order 2704. 

2. In each of parts (a) to (e) give the lists of invariant factors for all abelian groups of the
specified order:

(a) order 270, (b) order 9801, (c) order 320, (d) order 105, (e) order 44100.

3. In each of parts (a) to (e) give the lists of elementary divisors for all abelian groups of the
specified order and then match each list with the corresponding list of invariant factors
found in the preceding exercise:

(a) order 270, (b) order 9801, (c) order 320, (d) order 105, (e) order 44100.

4. In each of parts (a) to (d) determine which pairs of abelian groups listed are isomorphic
(here the expression $\{a_1, a_2, \ldots, a_k\}$ denotes the abelian group $Z_{a_1} \times Z_{a_2} \times \cdots \times Z_{a_k}$).

(a) $\{4, 9\}$, $\{6, 6\}$, $\{8, 3\}$, $\{9, 4\}$, $\{6, 4\}$, $\{64\}$.

(b) $\{2^2, 2 \cdot 3^2\}$, $\{2^2 \cdot 3, 2 \cdot 3\}$, $\{2^3 \cdot 3^2\}$, $\{2^2 \cdot 3^2, 2\}$.

(c) $\{5^2 \cdot 7^2, 3^2 \cdot 5 \cdot 7\}$, $\{3^2 \cdot 5^2 \cdot 7, 5 \cdot 7^2\}$, $\{3 \cdot 5^2, 7^2, 3 \cdot 5 \cdot 7\}$,
$\{5^2 \cdot 7, 3^2 \cdot 5, 7^2\}$.

(d) $\{2^2 \cdot 5 \cdot 7, 2^3 \cdot 5^3, 2 \cdot 5^2\}$, $\{2^3 \cdot 5^3 \cdot 7, 2^3 \cdot 5^3\}$, $\{2^2, 2 \cdot 7, 2^3, 5^3, 5^3\}$,
$\{2 \cdot 5^3, 2^2 \cdot 5^3, 2^3, 7\}$.

5. Let $G$ be a finite abelian group of type $(n_1, n_2, \ldots, n_t)$. Prove that $G$ contains an element
of order $m$ if and only if $m \mid n_1$. Deduce that $G$ is of exponent $n_1$.

6. Prove that any finite group has a finite exponent. Give an example of an infinite group with
finite exponent. Does a finite group of exponent $m$ always contain an element of order $m$?

7. Let $p$ be a prime and let $A = \langle x_1 \rangle \times \langle x_2 \rangle \times \cdots \times \langle x_n \rangle$ be an abelian $p$-group, where
$|x_i| = p^{\alpha_i} > 1$ for all $i$. Define the $p$th-power map

$$\varphi : A \to A \quad \text{by} \quad \varphi : x \mapsto x^p.$$

(a) Prove that $\varphi$ is a homomorphism.

(b) Describe the image and kernel of $\varphi$ in terms of the given generators.

(c) Prove both $\ker \varphi$ and $A / \operatorname{im} \varphi$ have rank $n$ (i.e., have the same rank as $A$) and prove
these groups are both isomorphic to the elementary abelian group, $E_{p^n}$, of order $p^n$.

8. Let $A$ be a finite abelian group (written multiplicatively) and let $p$ be a prime. Let

$$A^p = \{ a^p \mid a \in A \} \quad \text{and} \quad A_p = \{ x \mid x^p = 1 \}$$

(so $A^p$ and $A_p$ are the image and kernel of the $p$th-power map, respectively).

(a) Prove that $A/A^p \cong A_p$. [Show that they are both elementary abelian and they have
the same order.]

(b) Prove that the number of subgroups of $A$ of order $p$ equals the number of subgroups
of $A$ of index $p$. [Reduce to the case where $A$ is an elementary abelian $p$-group.]

9. Let $A = Z_{60} \times Z_{45} \times Z_{12} \times Z_{36}$. Find the number of elements of order 2 and the number
of subgroups of index 2 in $A$.

10. Let $n$ and $k$ be positive integers and let $A$ be the free abelian group of rank $n$ (written
additively). Prove that $A/kA$ is isomorphic to the direct product of $n$ copies of $\mathbb{Z}/k\mathbb{Z}$
(here $kA = \{ ka \mid a \in A \}$). [See Exercise 14, Section 1.]

11. Let G be a nontrivial finite abelian group of rank t. 

(a) Prove that the rank of G equals the maximum of the ranks of its Sylow subgroups. 

(b) Prove that G can be generated by t elements but no subset with fewer than t elements 
generates $G$. [One way of doing this is by using part (a) together with Exercise 7.]

12. Let $n$ and $m$ be positive integers with $d = (n, m)$. Let $Z_n = \langle x \rangle$ and $Z_m = \langle y \rangle$. Let $A$
be the central product of $\langle x \rangle$ and $\langle y \rangle$ with an element of order $d$ identified, which has
presentation $\langle x, y \mid x^n = y^m = 1,\ xy = yx,\ x^{n/d} = y^{m/d} \rangle$. Describe $A$ as a direct product
of two cyclic groups.

13. Let $A = \langle x_1 \rangle \times \cdots \times \langle x_r \rangle$ be a finite abelian group with $|x_i| = n_i$ for $1 \le i \le r$.
Find a presentation for $A$. Prove that if $G$ is any group containing commuting elements
$g_1, \ldots, g_r$ such that $g_i^{n_i} = 1$ for $1 \le i \le r$, then there is a unique homomorphism from $A$
to $G$ which sends $x_i$ to $g_i$ for all $i$.

14. For any group $G$ define the dual group of $G$ (denoted $\hat{G}$) to be the set of all homomorphisms
from $G$ into the multiplicative group of roots of unity in $\mathbb{C}$. Define a group operation in
$\hat{G}$ by pointwise multiplication of functions: if $\chi$, $\psi$ are homomorphisms from $G$ into the
group of roots of unity then $\chi\psi$ is the homomorphism given by $(\chi\psi)(g) = \chi(g)\psi(g)$ for
all $g \in G$, where the latter multiplication takes place in $\mathbb{C}$.

(a) Show that this operation on $\hat{G}$ makes $\hat{G}$ into an abelian group. [Show that the identity
is the map $g \mapsto 1$ for all $g \in G$ and the inverse of $\chi \in \hat{G}$ is the map $g \mapsto \chi(g)^{-1}$.]

(b) If $G$ is a finite abelian group, prove that $\hat{G} \cong G$. [Write $G$ as $\langle x_1 \rangle \times \cdots \times \langle x_r \rangle$ and
if $n_i = |x_i|$ define $\chi_i$ to be the homomorphism which sends $x_i$ to $e^{2\pi i / n_i}$ and sends
$x_j$ to 1, for all $j \ne i$. Prove $\chi_i$ has order $n_i$ in $\hat{G}$ and $\hat{G} = \langle \chi_1 \rangle \times \cdots \times \langle \chi_r \rangle$.]

(This result is often phrased: a finite abelian group is self-dual. It implies that the lattice 
diagram of a finite abelian group is the same when it is turned upside down. Note however 
that there is no natural isomorphism between G and its dual (the isomorphism depends on 
a choice of a set of generators for G). This is frequently stated in the form: a finite abelian 
group is noncanonically isomorphic to its dual.) 

15. Let $G = \langle x \rangle \times \langle y \rangle$ where $|x| = 8$ and $|y| = 4$.

(a) Find all pairs $a, b$ in $G$ such that $G = \langle a \rangle \times \langle b \rangle$ (where $a$ and $b$ are expressed in
terms of $x$ and $y$).

(b) Let $H = \langle x^2 y, y^2 \rangle \cong Z_4 \times Z_2$. Prove that there are no elements $a, b$ of $G$ such that
$G = \langle a \rangle \times \langle b \rangle$ and $H = \langle a^2 \rangle \times \langle b^2 \rangle$ (i.e., one cannot pick direct product generators
for $G$ in such a way that some powers of these are direct product generators for $H$).

16. Prove that no finitely generated abelian group is divisible (cf. Exercise 19, Section 2.4). 


5.3 TABLE OF GROUPS OF SMALL ORDER 

At this point we can give a table of the isomorphism types for most of the groups of 
small order. 

Each of the unfamiliar non-abelian groups in the table on the following page will
be constructed in Section 5 on semidirect products (which will also explain the notation
used for them). For the present we give generators and relations for each of them (i.e.,
presentations of them).

The group $Z_3 \rtimes Z_4$ of order 12 can be described by the generators and relations:

$$\langle x, y \mid x^4 = y^3 = 1,\ x^{-1} y x = y^{-1} \rangle,$$

namely, it has a normal Sylow 3-subgroup ($\langle y \rangle$) which is inverted by an element of
order 4 ($x$) acting by conjugation ($x^2$ centralizes $y$).

The group $(Z_3 \times Z_3) \rtimes Z_2$ has generators and relations:

$$\langle x, y, z \mid x^2 = y^3 = z^3 = 1,\ yz = zy,\ x^{-1} y x = y^{-1},\ x^{-1} z x = z^{-1} \rangle,$$

namely, it has a normal Sylow 3-subgroup isomorphic to $Z_3 \times Z_3$ ($\langle y, z \rangle$) inverted by
an element of order 2 ($x$) acting by conjugation.

The group $Z_5 \rtimes Z_4$ of order 20 has generators and relations:

$$\langle x, y \mid x^4 = y^5 = 1,\ x^{-1} y x = y^{-1} \rangle,$$

namely, it has a normal Sylow 5-subgroup ($\langle y \rangle$) which is inverted by an element of
order 4 ($x$) acting by conjugation ($x^2$ centralizes $y$).

Order | No. of Isomorphism Types | Abelian Groups | Non-abelian Groups
1  | 1  | $Z_1$ | none
2  | 1  | $Z_2$ | none
3  | 1  | $Z_3$ | none
4  | 2  | $Z_4$, $Z_2 \times Z_2$ | none
5  | 1  | $Z_5$ | none
6  | 2  | $Z_6$ | $S_3$
7  | 1  | $Z_7$ | none
8  | 5  | $Z_8$, $Z_4 \times Z_2$, $Z_2 \times Z_2 \times Z_2$ | $D_8$, $Q_8$
9  | 2  | $Z_9$, $Z_3 \times Z_3$ | none
10 | 2  | $Z_{10}$ | $D_{10}$
11 | 1  | $Z_{11}$ | none
12 | 5  | $Z_{12}$, $Z_6 \times Z_2$ | $A_4$, $D_{12}$, $Z_3 \rtimes Z_4$
13 | 1  | $Z_{13}$ | none
14 | 2  | $Z_{14}$ | $D_{14}$
15 | 1  | $Z_{15}$ | none
16 | 14 | $Z_{16}$, $Z_8 \times Z_2$, $Z_4 \times Z_4$, $Z_4 \times Z_2 \times Z_2$, $Z_2 \times Z_2 \times Z_2 \times Z_2$ | not listed
17 | 1  | $Z_{17}$ | none
18 | 5  | $Z_{18}$, $Z_6 \times Z_3$ | $D_{18}$, $S_3 \times Z_3$, $(Z_3 \times Z_3) \rtimes Z_2$
19 | 1  | $Z_{19}$ | none
20 | 5  | $Z_{20}$, $Z_{10} \times Z_2$ | $D_{20}$, $Z_5 \rtimes Z_4$, $F_{20}$


The group $F_{20}$ of order 20 has generators and relations:

    $$\langle\, x, y \mid x^4 = y^5 = 1,\ xyx^{-1} = y^2 \,\rangle,$$

namely, it has a normal Sylow 5-subgroup ($\langle y \rangle$) which is squared by an element of order
4 ($x$) acting by conjugation. One can check that this group occurs as the normalizer of
a Sylow 5-subgroup in $S_5$, e.g.,

    $$F_{20} = \langle\, (2\,3\,5\,4),\ (1\,2\,3\,4\,5) \,\rangle.$$

This group is called the Frobenius group of order 20.
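
The claim about $F_{20}$ inside $S_5$ can be checked by brute force. The following is a minimal sketch, not part of the text's development: permutations are stored as tuples on the labels 0 to 4 (the text's labels shifted down by one), products are composed right to left, and the helper names `compose`, `inverse`, `from_cycles` and `generate` are hypothetical names chosen for this illustration.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def from_cycles(n, *cycles):
    """Permutation of {0,...,n-1} from cycles written with labels 1..n."""
    p = list(range(n))
    for cyc in cycles:
        for a, b in zip(cyc, cyc[1:] + cyc[:1]):
            p[a - 1] = b - 1
    return tuple(p)

def generate(gens):
    """All products of the generators (enough for a finite group)."""
    identity = tuple(range(len(gens[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

x = from_cycles(5, (2, 3, 5, 4))
y = from_cycles(5, (1, 2, 3, 4, 5))
F20 = generate([x, y])

# The defining relation x y x^{-1} = y^2.
relation_holds = compose(compose(x, y), inverse(x)) == compose(y, y)

# Normalizer in S_5 of the Sylow 5-subgroup P = <y>, by brute force.
P = generate([y])
normalizer = {
    g for g in permutations(range(5))
    if {compose(compose(g, h), inverse(g)) for h in P} == P
}

print(len(F20), relation_holds, normalizer == F20)   # 20 True True
```

The search over all 120 elements of $S_5$ is only feasible because the group is tiny; it is meant as a sanity check of the presentation, not as a method.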
EXERCISE


1. Prove that $D_{16}$, $Z_2 \times D_8$, $Z_2 \times Q_8$, $Z_4 * D_8$, $QD_{16}$ and $M$ are nonisomorphic non-abelian
groups of order 16 (where $Z_4 * D_8$ is described in Exercise 12, Section 1 and $QD_{16}$ and
$M$ are described in the exercises in Section 2.5).

5.4 RECOGNIZING DIRECT PRODUCTS 

So far we have seen that direct products may be used to both construct “larger” groups 
from “smaller” ones and to decompose finitely generated abelian groups into cyclic 
factors. Even certain non-abelian groups, which may be given in some other form, may 
be decomposed as direct products of smaller groups. The purpose of this section is 
to indicate a criterion to recognize when a group is the direct product of some of its 
subgroups and to illustrate the criterion with some examples. 

Before doing so we introduce some standard notation and elementary results on 
commutators which will streamline the presentation and which will be used again in 
Chapter 6 when we consider nilpotent groups.

Definition. Let $G$ be a group, let $x, y \in G$ and let $A, B$ be nonempty subsets of $G$.

(1) Define $[x, y] = x^{-1}y^{-1}xy$, called the commutator of $x$ and $y$.

(2) Define $[A, B] = \langle\, [a, b] \mid a \in A,\ b \in B \,\rangle$, the group generated by commutators
of elements from $A$ and from $B$.

(3) Define $G' = \langle\, [x, y] \mid x, y \in G \,\rangle$, the subgroup of $G$ generated by commutators
of elements from $G$, called the commutator subgroup of $G$.

The commutator of x and y is 1 if and only if x and y commute, which explains 
the terminology. The following proposition shows how commutators measure the “dif- 
ference” in G between xy and yx. 

Proposition 7. Let $G$ be a group, let $x, y \in G$ and let $H \le G$. Then

(1) $xy = yx[x, y]$ (in particular, $xy = yx$ if and only if $[x, y] = 1$).

(2) $H \trianglelefteq G$ if and only if $[H, G] \le H$.

(3) $\sigma[x, y] = [\sigma(x), \sigma(y)]$ for any automorphism $\sigma$ of $G$, $G'$ char $G$ and $G/G'$ is
abelian.

(4) $G/G'$ is the largest abelian quotient of $G$ in the sense that if $H \trianglelefteq G$ and $G/H$
is abelian, then $G' \le H$. Conversely, if $G' \le H$, then $H \trianglelefteq G$ and $G/H$ is
abelian.

(5) If $\varphi : G \to A$ is any homomorphism of $G$ into an abelian group $A$, then $\varphi$
factors through $G'$, i.e., $G' \le \ker\varphi$ and the following diagram commutes:

    [Diagram: the natural projection $G \to G/G'$ followed by the induced map $G/G' \to A$ equals $\varphi : G \to A$.]



Proof: (1) This is immediate from the definition of [x, y]. 





(2) By definition, $H \trianglelefteq G$ if and only if $g^{-1}hg \in H$ for all $g \in G$ and all $h \in H$.
For $h \in H$, $g^{-1}hg \in H$ if and only if $h^{-1}g^{-1}hg \in H$, so that $H \trianglelefteq G$ if and only if
$[h, g] \in H$ for all $h \in H$ and all $g \in G$. Thus $H \trianglelefteq G$ if and only if $[H, G] \le H$,
which is (2).

(3) Let $\sigma \in \operatorname{Aut}(G)$ be an automorphism of $G$ and let $x, y \in G$. Then

    $$\begin{aligned}
    \sigma([x, y]) &= \sigma(x^{-1}y^{-1}xy) \\
                   &= \sigma(x)^{-1}\sigma(y)^{-1}\sigma(x)\sigma(y) \\
                   &= [\sigma(x), \sigma(y)].
    \end{aligned}$$

Thus for every commutator $[x, y]$ of $G'$, $\sigma([x, y])$ is again a commutator. Since $\sigma$ has
a 2-sided inverse, it follows that it maps the set of commutators bijectively onto itself.
Since the commutators are a generating set for $G'$, $\sigma(G') = G'$, that is, $G'$ char $G$.

To see that $G/G'$ is abelian, let $xG'$ and $yG'$ be arbitrary elements of $G/G'$. By
definition of the group operation in $G/G'$ and since $[x, y] \in G'$ we have

    $$\begin{aligned}
    (xG')(yG') &= (xy)G' \\
               &= (yx[x, y])G' \\
               &= (yx)G' = (yG')(xG'),
    \end{aligned}$$

which completes the proof of (3).

(4) Suppose $H \trianglelefteq G$ and $G/H$ is abelian. Then for all $x, y \in G$ we have
$(xH)(yH) = (yH)(xH)$, so

    $$\begin{aligned}
    1H &= (xH)^{-1}(yH)^{-1}(xH)(yH) \\
       &= x^{-1}y^{-1}xyH \\
       &= [x, y]H.
    \end{aligned}$$

Thus $[x, y] \in H$ for all $x, y \in G$, so that $G' \le H$.

Conversely, if $G' \le H$, then since $G/G'$ is abelian by (3), every subgroup of $G/G'$
is normal. In particular, $H/G' \trianglelefteq G/G'$. By the Lattice Isomorphism Theorem $H \trianglelefteq G$.
By the Third Isomorphism Theorem

    $$G/H \cong (G/G')/(H/G')$$

hence $G/H$ is abelian (being isomorphic to a quotient of the abelian group $G/G'$). This
proves (4).

(5) This is (4) phrased in terms of homomorphisms.

Passing to the quotient by the commutator subgroup of G collapses all commutators 
to the identity so that all elements in the quotient group commute. As (4) indicates, a 
strong converse to this also holds: a quotient of G by H is abelian if and only if the 
commutator subgroup is contained in H (i.e., if and only if G' is mapped to the identity 
in the quotient G/H). 

We shall exhibit a group (of order 96) in the next section with the property that one 
of the elements of its commutator subgroup cannot be written as a single commutator 
[x, y] for any x and y. Thus G' does not necessarily consist only of the set of (single) 
commutators (but is the group generated by these elements). 
Examples

(1) A group G is abelian if and only if G' = 1. 

(2) Sometimes it is possible to compute the commutator subgroup of a group without
actually calculating commutators explicitly. For instance, if $G = D_8$, then since
$Z(D_8) = \langle r^2 \rangle \trianglelefteq D_8$ and $D_8/Z(D_8)$ is abelian (the Klein 4-group), the commutator
subgroup $D_8'$ is a subgroup of $Z(D_8)$. Since $D_8$ is not itself abelian its commutator
subgroup is nontrivial. The only possibility is that $D_8' = Z(D_8)$. By a similar
argument, $Q_8' = Z(Q_8) = \langle -1 \rangle$. More generally, if $G$ is any non-abelian group of
order $p^3$, where $p$ is a prime, $G' = Z(G)$ and $|G'| = p$ (Exercise 7).

(3) Let $D_{2n} = \langle\, r, s \mid r^n = s^2 = 1,\ s^{-1}rs = r^{-1} \,\rangle$. Since $[r, s] = r^{-2}$, we have
$\langle r^{-2} \rangle = \langle r^2 \rangle \le D_{2n}'$. Furthermore, $\langle r^2 \rangle \trianglelefteq D_{2n}$ and the images of $r$ and $s$ in
$D_{2n}/\langle r^2 \rangle$ generate this quotient. They are commuting elements of order $\le 2$, so the
quotient is abelian and $D_{2n}' \le \langle r^2 \rangle$. Thus $D_{2n}' = \langle r^2 \rangle$. Finally, note that if $n$ ($= |r|$)
is odd, $\langle r^2 \rangle = \langle r \rangle$ whereas if $n$ is even, $\langle r^2 \rangle$ is of index 2 in $\langle r \rangle$. Hence $D_{2n}'$ is of
index 2 or 4 in $D_{2n}$ according to whether $n$ is odd or even, respectively.

(4) Since conjugation by $g \in G$ is an automorphism of $G$, $[a^g, b^g] = [a, b]^g$ for all $a, b \in G$
by (3) of the proposition, i.e., conjugates of commutators are also commutators.
For example, once we exhibit an element of one cycle type in $S_n$ as a commutator,
every element of the same cycle type is also a commutator (cf. Section 4.3). For
example, every 5-cycle is a commutator in $S_5$ as follows: labelling the vertices of a
pentagon as $1, \ldots, 5$ we see that $D_{10} \le S_5$ (a subgroup of $A_5$ in fact). By the preceding
example an element of order 5 is a commutator in $D_{10}$, hence also in $S_5$. Explicitly,
$(1\,4\,2\,5\,3) = [(1\,2\,3\,4\,5),\ (2\,5)(4\,3)]$.
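
The index computation in Example 3 and the explicit commutator in Example 4 can be confirmed numerically for small cases. The sketch below is an illustration under stated assumptions: $D_{2n}$ is realized as permutations of the vertices 0 to $n-1$ of an $n$-gon, products are composed right to left, and all function names are hypothetical helpers introduced here.

```python
def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """All products of the generators (enough for a finite group)."""
    gens = list(gens)
    identity = tuple(range(len(gens[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def commutator(x, y):
    """[x, y] = x^{-1} y^{-1} x y."""
    return compose(compose(inverse(x), inverse(y)), compose(x, y))

def dihedral(n):
    """D_2n acting on the vertices 0..n-1 of a regular n-gon."""
    r = tuple((i + 1) % n for i in range(n))   # rotation
    s = tuple((-i) % n for i in range(n))      # reflection fixing vertex 0
    return generate([r, s])

def derived_subgroup(G):
    """Subgroup generated by all commutators of elements of G."""
    return generate({commutator(x, y) for x in G for y in G})

# Index of the commutator subgroup in D_2n for n = 3, ..., 8.
indices = {}
for n in range(3, 9):
    G = dihedral(n)
    indices[n] = len(G) // len(derived_subgroup(G))
print(indices)   # {3: 2, 4: 4, 5: 2, 6: 4, 7: 2, 8: 4}

# The explicit 5-cycle of Example 4, with labels shifted down by one.
x5 = (1, 2, 3, 4, 0)   # (1 2 3 4 5)
y5 = (0, 4, 3, 2, 1)   # (2 5)(4 3)
is_5_cycle_commutator = commutator(x5, y5) == (3, 4, 0, 1, 2)   # (1 4 2 5 3)
print(is_5_cycle_commutator)
```

Computing the derived subgroup from all pairs is quadratic in the group order, which is acceptable only for groups of this size.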

The next result actually follows from the proof of Proposition 3.13 but we isolate 
it explicitly for reference: 

Proposition 8. Let $H$ and $K$ be subgroups of the group $G$. The number of distinct
ways of writing each element of the set $HK$ in the form $hk$, for some $h \in H$ and $k \in K$
is $|H \cap K|$. In particular, if $H \cap K = 1$, then each element of $HK$ can be written
uniquely as a product $hk$, for some $h \in H$ and $k \in K$.

Proof: Exercise. 
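
Although the proof is left as an exercise, the count in Proposition 8 is easy to observe in a small case. The sketch below assumes a particular choice not taken from the text: inside $S_4$ (on the labels 0 to 3) it takes $H$ to be a copy of $D_8$ and $K = \langle (0\,1), (2\,3) \rangle$, which meet in a subgroup of order 2. The helper names are hypothetical.

```python
from collections import Counter

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def generate(gens):
    """All products of the generators (enough for a finite group)."""
    identity = tuple(range(len(gens[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

H = generate([(1, 2, 3, 0), (2, 1, 0, 3)])   # <(0 1 2 3), (0 2)>, a copy of D_8
K = generate([(1, 0, 2, 3), (0, 1, 3, 2)])   # <(0 1), (2 3)>, of order 4

# For each element of the set HK, count the pairs (h, k) with hk equal to it.
ways = Counter(compose(h, k) for h in H for k in K)

print(len(H), len(K), len(H & K), set(ways.values()))   # 8 4 2 {2}
```

Here $HK$ has $8 \cdot 4 / 2 = 16$ elements, so it is a subset of $S_4$ but not a subgroup; the proposition is a statement about the set $HK$ and does not need $HK$ to be a group.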

The main result of this section is the following recognition theorem. 

Theorem 9. Suppose $G$ is a group with subgroups $H$ and $K$ such that

(1) $H$ and $K$ are normal in $G$, and

(2) $H \cap K = 1$.

Then $HK \cong H \times K$.

Proof: Observe that by hypothesis (1), $HK$ is a subgroup of $G$ (see Corollary 3.15).
Let $h \in H$ and let $k \in K$. Since $H \trianglelefteq G$, $k^{-1}hk \in H$, so that $h^{-1}(k^{-1}hk) \in H$.
Similarly, $(h^{-1}k^{-1}h)k \in K$. Since $H \cap K = 1$ it follows that $h^{-1}k^{-1}hk = 1$, i.e.,
$hk = kh$ so that every element of $H$ commutes with every element of $K$.
By the preceding proposition each element of $HK$ can be written uniquely as a
product $hk$, with $h \in H$ and $k \in K$. Thus the map

    $$\varphi : HK \to H \times K, \qquad hk \mapsto (h, k)$$

is well defined. To see that $\varphi$ is a homomorphism note that if $h_1, h_2 \in H$ and $k_1, k_2 \in K$,
then we have seen that $h_2$ and $k_1$ commute. Thus

    $$(h_1k_1)(h_2k_2) = (h_1h_2)(k_1k_2)$$

and the latter product is the unique way of writing $(h_1k_1)(h_2k_2)$ in the form $hk$ with
$h \in H$ and $k \in K$. This shows that

    $$\begin{aligned}
    \varphi(h_1k_1h_2k_2) &= \varphi(h_1h_2k_1k_2) \\
                          &= (h_1h_2, k_1k_2) \\
                          &= (h_1, k_1)(h_2, k_2) = \varphi(h_1k_1)\varphi(h_2k_2)
    \end{aligned}$$

so that $\varphi$ is a homomorphism. The homomorphism $\varphi$ is a bijection since the
representation of each element of $HK$ as a product of the form $hk$ is unique, which proves that
$\varphi$ is an isomorphism.
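
The hypotheses and conclusion of Theorem 9 can be checked mechanically in a small case. The sketch below assumes the specific instance $G = D_{12}$ acting on the six vertices of a hexagon, with $H = \langle s, r^2 \rangle$ and $K = \langle r^3 \rangle$; the function names are hypothetical helpers, and products are composed right to left.

```python
from itertools import product

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """All products of the generators (enough for a finite group)."""
    identity = tuple(range(len(gens[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def is_normal(N, G):
    """True if g x g^{-1} lies in N for every g in G and x in N."""
    return all(compose(compose(g, x), inverse(g)) in N for g in G for x in N)

m = 6                                        # vertices of the hexagon
r = tuple((i + 1) % m for i in range(m))     # rotation of order 6
s = tuple((-i) % m for i in range(m))        # reflection
r2 = compose(r, r)
r3 = compose(r2, r)

G = generate([r, s])       # D_12
H = generate([s, r2])      # symmetries of an inscribed triangle
K = generate([r3])         # the central subgroup of order 2

# Each element of HK should have exactly one expression hk.
decompose = {compose(h, k): (h, k) for h in H for k in K}

# The map hk -> (h, k) should respect componentwise multiplication.
is_hom = all(
    decompose[compose(compose(h1, k1), compose(h2, k2))]
    == (compose(h1, h2), compose(k1, k2))
    for h1, k1, h2, k2 in product(H, K, H, K)
)

print(is_normal(H, G), is_normal(K, G), len(H & K), len(decompose) == len(G), is_hom)
# True True 1 True True
```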

Definition. If $G$ is a group and $H$ and $K$ are normal subgroups of $G$ with $H \cap K = 1$,
we call $HK$ the internal direct product of $H$ and $K$. We shall (when emphasis is called
for) call $H \times K$ the external direct product of $H$ and $K$.

The distinction between internal and external direct product is (by Theorem 9)
purely notational: the elements of the internal direct product are written in the form
$hk$, whereas those of the external direct product are written as ordered pairs $(h, k)$. We
have in previous instances passed between these. For example, when $Z_n = \langle a \rangle$ and
$Z_m = \langle b \rangle$ we wrote $x = (a, 1)$ and $y = (1, b)$ so that every element of $Z_n \times Z_m$ was
written in the form $x^r y^s$.

Examples 

(1) If $n$ is a positive odd integer, we show $D_{4n} \cong D_{2n} \times Z_2$. To see this let

    $$D_{4n} = \langle\, r, s \mid r^{2n} = s^2 = 1,\ srs = r^{-1} \,\rangle$$

be the usual presentation of $D_{4n}$. Let $H = \langle s, r^2 \rangle$ and let $K = \langle r^n \rangle$. Geometrically,
if $D_{4n}$ is the group of symmetries of a regular $2n$-gon, $H$ is the group of symmetries of
the regular $n$-gon inscribed in the $2n$-gon by joining vertex $2i$ to vertex $2i + 2$, for all
$i \bmod 2n$ (and if one lets $r_1 = r^2$, $H$ has the usual presentation of the dihedral group
of order $2n$ with generators $r_1$ and $s$). Note that $H \trianglelefteq D_{4n}$ (it has index 2). Since
$|r| = 2n$, $|r^n| = 2$. Since $srs = r^{-1}$, we have $sr^ns = r^{-n} = r^n$, that is, $s$ centralizes
$r^n$. Since clearly $r$ centralizes $r^n$, $K \le Z(D_{4n})$. Thus $K \trianglelefteq D_{4n}$. Finally, $K \not\le H$
since $r^2$ has odd order (or because $r^n$ sends vertex $i$ into vertex $i + n$, hence does
not preserve the set of even vertices of the $2n$-gon). Thus $H \cap K = 1$ by Lagrange.
Theorem 9 now completes the proof.

(2) Let $I$ be a subset of $\{1, 2, \ldots, n\}$ and let $G$ be the setwise stabilizer of $I$ in $S_n$, i.e.,

    $$G = \{\sigma \in S_n \mid \sigma(i) \in I \text{ for all } i \in I\}.$$

Let $J = \{1, 2, \ldots, n\} - I$ be the complement of $I$ and note that $G$ is also the setwise
stabilizer of $J$. Let $H$ be the pointwise stabilizer of $I$ and let $K$ be the pointwise
stabilizer of $\{1, 2, \ldots, n\} - I$, i.e.,

    $$H = \{\sigma \in G \mid \sigma(i) = i \text{ for all } i \in I\}$$
    $$K = \{\tau \in G \mid \tau(j) = j \text{ for all } j \in J\}.$$

It is easy to see that $H$ and $K$ are normal subgroups of $G$ (in fact they are kernels of
the actions of $G$ on $I$ and $J$, respectively). Since any element of $H \cap K$ fixes all of
$\{1, 2, \ldots, n\}$, we have $H \cap K = 1$. Finally, since every element $\sigma$ of $G$ stabilizes the
sets $I$ and $J$, each cycle in the cycle decomposition of $\sigma$ involves only elements of $I$
or only elements of $J$. Thus $\sigma$ may be written as a product $\sigma_I\sigma_J$, where $\sigma_I \in H$ and
$\sigma_J \in K$. This proves $G = HK$. By Theorem 9, $G \cong H \times K$. Now any permutation
of $J$ can be extended to a permutation in $S_n$ by letting it act as the identity on $I$.
These are precisely the permutations in $H$ (and similarly the permutations in $K$ are
the permutations of $I$ which are the identity on $J$), so

    $$H \cong S_J, \quad K \cong S_I \quad \text{and} \quad G \cong S_m \times S_{n-m},$$

where $m = |I|$ (and, by convention, $S_\emptyset = 1$).

(3) Let $\sigma \in S_n$ and let $I$ be the subset of $\{1, 2, \ldots, n\}$ fixed pointwise by $\sigma$:

    $$I = \{\, i \in \{1, 2, \ldots, n\} \mid \sigma(i) = i \,\}.$$

If $C = C_{S_n}(\sigma)$, then by Exercise 18 of Section 4.3, $C$ stabilizes the set $I$ and its
complement $J$. By the preceding example, $C$ is isomorphic to a subgroup of $H \times K$,
where $H$ is the subgroup of all permutations in $S_n$ fixing $I$ pointwise and $K$ is the set
of all permutations fixing $J$ pointwise. Note that $\sigma \in H$. Thus each element, $\alpha$, of $C$
can be written (uniquely) as $\alpha = \alpha_I\alpha_J$, for some $\alpha_I \in H$ and $\alpha_J \in K$. Note further
that if $\tau$ is any permutation of $\{1, 2, \ldots, n\}$ which fixes each $j \in J$ (i.e., any element
of $K$), then $\sigma$ and $\tau$ commute (since they move no common integers). Thus $C$ contains
all such $\tau$, i.e., $C$ contains the subgroup $K$. This proves that the group $C$ consists of
all elements $\alpha_I\alpha_J \in H \times K$ such that $\alpha_J$ is arbitrary in $K$ and $\alpha_I$ commutes with $\sigma$
in $H$:

    $$\begin{aligned}
    C_{S_n}(\sigma) &= C_H(\sigma) \times K \\
                    &\cong C_{S_J}(\sigma) \times S_I.
    \end{aligned}$$

In particular, if $\sigma$ is an $m$-cycle in $S_n$,

    $$C_{S_n}(\sigma) = \langle \sigma \rangle \times S_{n-m}.$$

The latter group has order $m(n - m)!$, as computed in Section 4.3.
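
The order formula $m(n - m)!$ for the centralizer of an $m$-cycle can be confirmed by exhaustive search in a small symmetric group. This is only a sketch: it assumes $n = 5$, uses the labels 0 to 4, takes $m \ge 2$, and the name `centralizer_order` is a hypothetical helper.

```python
from itertools import permutations
from math import factorial

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def centralizer_order(sigma):
    """Number of permutations of the same degree commuting with sigma."""
    n = len(sigma)
    return sum(
        1 for tau in permutations(range(n))
        if compose(sigma, tau) == compose(tau, sigma)
    )

n = 5
results = []
for m in range(2, n + 1):
    # The m-cycle (0 1 ... m-1), fixing the remaining n - m points.
    sigma = tuple((i + 1) % m if i < m else i for i in range(n))
    results.append((m, centralizer_order(sigma), m * factorial(n - m)))

for m, found, predicted in results:
    print(m, found, predicted)
```

In each printed row the brute-force count and the predicted value $m(n-m)!$ should agree.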


EXERCISES 

Let G be a group. 

1. Prove that if $x, y \in G$ then $[y, x] = [x, y]^{-1}$. Deduce that for any subsets $A$ and $B$ of $G$,
$[A, B] = [B, A]$ (recall that $[A, B]$ is the subgroup of $G$ generated by the commutators
$[a, b]$).

2. Prove that a subgroup $H$ of $G$ is normal if and only if $[G, H] \le H$.

3. Let $a, b, c \in G$. Prove that
(a) $[a, bc] = [a, c](c^{-1}[a, b]c)$
(b) $[ab, c] = (b^{-1}[a, c]b)[b, c]$.

4. Find the commutator subgroups of $S_4$ and $A_4$.

5. Prove that $A_n$ is the commutator subgroup of $S_n$ for all $n \ge 5$.

6. Exhibit a representative of each cycle type of $A_5$ as a commutator in $S_5$.

7. Prove that if $p$ is a prime and $P$ is a non-abelian group of order $p^3$ then $P' = Z(P)$.

8. Assume $x, y \in G$ and both $x$ and $y$ commute with $[x, y]$. Prove that for all $n \in \mathbb{Z}^+$,

    $$(xy)^n = x^n y^n [y, x]^{\frac{n(n-1)}{2}}.$$

9. Prove that if $p$ is an odd prime and $P$ is a group of order $p^3$ then the $p$th power map
$x \mapsto x^p$ is a homomorphism of $P$ into $Z(P)$. If $P$ is not cyclic, show that the kernel of the
$p$th power map has order $p^2$ or $p^3$. Is the squaring map a homomorphism in non-abelian
groups of order 8? Where is the oddness of $p$ needed in the above proof? [Use Exercise 8.]

10. Prove that a finite abelian group is the direct product of its Sylow subgroups. 

11. Prove that if $G = HK$ where $H$ and $K$ are characteristic subgroups of $G$ with $H \cap K = 1$
then $\operatorname{Aut}(G) \cong \operatorname{Aut}(H) \times \operatorname{Aut}(K)$. Deduce that if $G$ is an abelian group of finite order
then $\operatorname{Aut}(G)$ is isomorphic to the direct product of the automorphism groups of its Sylow
subgroups.

12. Use Theorem 4.17 to describe the automorphism group of a finite cyclic group.

13. Prove that $D_{8n}$ is not isomorphic to $D_{4n} \times Z_2$.

14. Let $G = \{(a_{ij}) \in GL_n(F) \mid a_{ij} = 0 \text{ if } i > j, \text{ and } a_{11} = a_{22} = \cdots = a_{nn}\}$, where $F$ is
a field, be the group of upper triangular matrices all of whose diagonal entries are equal.
Prove that $G \cong D \times U$, where $D$ is the group of all nonzero multiples of the identity
matrix and $U$ is the group of upper triangular matrices with 1's down the diagonal.

15. If $A$ and $B$ are normal subgroups of $G$ such that $G/A$ and $G/B$ are both abelian, prove
that $G/(A \cap B)$ is abelian.

16. Prove that if $K$ is a normal subgroup of $G$ then $K' \trianglelefteq G$.

17. If $K$ is a normal subgroup of $G$ and $K$ is cyclic, prove that $G' \le C_G(K)$. [Recall that the
automorphism group of a cyclic group is abelian.]

18. Let $K_1, K_2, \ldots, K_n$ be non-abelian simple groups and let $G = K_1 \times K_2 \times \cdots \times K_n$.
Prove that every normal subgroup of $G$ is of the form $G_I$ for some subset $I$ of $\{1, 2, \ldots, n\}$
(where $G_I$ is defined in Exercise 2 of Section 1). [If $N \trianglelefteq G$ and $x = (a_1, \ldots, a_n) \in N$
with some $a_i \ne 1$, then show that there is some $g_i \in G_i$ not commuting with $a_i$. Show
$[(1, \ldots, g_i, \ldots, 1), x] \in K_i \cap N$ and deduce $K_i \le N$.]

19. A group $H$ is called perfect if $H' = H$ (i.e., $H$ equals its own commutator subgroup).

(a) Prove that every non-abelian simple group is perfect.

(b) Prove that if $H$ and $K$ are perfect subgroups of a group $G$ then $\langle H, K \rangle$ is also perfect.
Extend this to show that the subgroup of $G$ generated by any collection of perfect
subgroups is perfect.

(c) Prove that any conjugate of a perfect subgroup is perfect.

(d) Prove that any group $G$ has a unique maximal perfect subgroup and that this subgroup
is normal.

20. Let $H(F)$ be the Heisenberg group over the field $F$, cf. Exercise 11 of Section 1.4. Find
an explicit formula for the commutator $[X, Y]$, where $X, Y \in H(F)$, and show that the
commutator subgroup of $H(F)$ equals the center of $H(F)$ (cf. Section 2.2, Exercise 14).
5.5 SEMIDIRECT PRODUCTS


In this section we study the “semidirect product” of two groups $H$ and $K$, which is a
generalization of the notion of the direct product of H and K obtained by relaxing the 
requirement that both H and K be normal. This construction will enable us (in certain 
circumstances) to build a “larger” group from the groups H and K in such a way that 
G contains subgroups isomorphic to H and K, respectively, as in the case of direct 
products. In this case the subgroup H will be normal in G but the subgroup K will not 
necessarily be normal (as it is for direct products). Thus, for instance, we shall be able 
to construct non-abelian groups even if H and K are abelian. This construction will 
allow us to enlarge considerably the set of examples of groups at our disposal. As in 
the preceding section, we shall then prove a recognition theorem that will enable us to 
decompose some familiar groups into smaller “factors” from which we shall be able to
derive some classification theorems. 

By way of motivation suppose we already have a group $G$ containing subgroups $H$
and $K$ such that

(a) $H \trianglelefteq G$ (but $K$ is not necessarily normal in $G$), and

(b) $H \cap K = 1$.

It is still true that $HK$ is a subgroup of $G$ (Corollary 3.15) and, by Proposition 8,
every element of $HK$ can be written uniquely as a product $hk$, for some $h \in H$ and
$k \in K$, i.e., there is a bijection between $HK$ and the collection of ordered pairs $(h, k)$,
given by $hk \mapsto (h, k)$ (so the group $H$ appears as the set of elements $(h, 1)$ and $K$
appears as the set of elements $(1, k)$). Given two elements $h_1k_1$ and $h_2k_2$ of $HK$, we
first see how to write their product (in $G$) in the same form:

    $$\begin{aligned}
    (h_1k_1)(h_2k_2) &= h_1k_1h_2(k_1^{-1}k_1)k_2 \\
                     &= h_1(k_1h_2k_1^{-1})k_1k_2 \qquad\qquad (5.1) \\
                     &= h_3k_3,
    \end{aligned}$$

where $h_3 = h_1(k_1h_2k_1^{-1})$ and $k_3 = k_1k_2$. Note that since $H \trianglelefteq G$, $k_1h_2k_1^{-1} \in H$, so
$h_3 \in H$ and $k_3 \in K$.

These calculations were predicated on the assumption that there already existed a
group $G$ containing subgroups $H$ and $K$ with $H \trianglelefteq G$ and $H \cap K = 1$. The basic
idea of the semidirect product is to turn this construction around, namely start with two
(abstract) groups $H$ and $K$ and try to define a group containing (an isomorphic copy
of) them in such a way that (a) and (b) above hold. To do this, we write equation (1),
which defines the multiplication of elements in our group, in a way that makes sense
even if we do not already know there is a group containing $H$ and $K$ as above. The
point is that $k_3$ in equation (1) is obtained only from multiplication in $K$ (namely $k_1k_2$)
and $h_3$ is obtained from multiplying $h_1$ and $k_1h_2k_1^{-1}$ in $H$. If we can understand where
the element $k_1h_2k_1^{-1}$ arises (in terms of $H$ and $K$ and without reference to $G$), then the
group $HK$ will have been described entirely in terms of $H$ and $K$. We can then use
this description to define the group $HK$ using equation (1) to define the multiplication.
Since $H$ is normal in $G$, the group $K$ acts on $H$ by conjugation:

    $$k \cdot h = khk^{-1} \qquad \text{for } h \in H,\ k \in K$$

(we use the symbol $\cdot$ to emphasize the action) so that (1) can be written

    $$(h_1k_1)(h_2k_2) = (h_1\, k_1 \cdot h_2)(k_1k_2). \qquad\qquad (5.2)$$

The action of $K$ on $H$ by conjugation gives a homomorphism $\varphi$ of $K$ into $\operatorname{Aut}(H)$, so
(2) shows that the multiplication in $HK$ depends only on the multiplication in $H$, the
multiplication in $K$ and the homomorphism $\varphi$, hence is defined intrinsically in terms
of $H$ and $K$.

We now use this interpretation to define a group given two groups $H$ and $K$ and a
homomorphism $\varphi$ from $K$ to $\operatorname{Aut}(H)$ (which will turn out to define conjugation in the
resulting group).


Theorem 10. Let $H$ and $K$ be groups and let $\varphi$ be a homomorphism from $K$ into
$\operatorname{Aut}(H)$. Let $\cdot$ denote the (left) action of $K$ on $H$ determined by $\varphi$. Let $G$ be the set of
ordered pairs $(h, k)$ with $h \in H$ and $k \in K$ and define the following multiplication on
$G$:

    $$(h_1, k_1)(h_2, k_2) = (h_1\, k_1 \cdot h_2,\ k_1k_2).$$

(1) This multiplication makes $G$ into a group of order $|G| = |H||K|$.

(2) The sets $\{(h, 1) \mid h \in H\}$ and $\{(1, k) \mid k \in K\}$ are subgroups of $G$ and the
maps $h \mapsto (h, 1)$ for $h \in H$ and $k \mapsto (1, k)$ for $k \in K$ are isomorphisms of
these subgroups with the groups $H$ and $K$ respectively:

    $$H \cong \{(h, 1) \mid h \in H\} \quad \text{and} \quad K \cong \{(1, k) \mid k \in K\}.$$

Identifying $H$ and $K$ with their isomorphic copies in $G$ described in (2) we have

(3) $H \trianglelefteq G$

(4) $H \cap K = 1$

(5) for all $h \in H$ and $k \in K$, $khk^{-1} = k \cdot h = \varphi(k)(h)$.


Proof: It is straightforward to check that $G$ is a group under this multiplication
using the fact that $\cdot$ is an action of $K$ on $H$. For example, the associative law is verified
as follows:

    $$\begin{aligned}
    ((a, x)(b, y))(c, z) &= (a\, x \cdot b,\ xy)(c, z) \\
                         &= (a\, x \cdot b\, (xy) \cdot c,\ xyz) \\
                         &= (a\, x \cdot b\, x \cdot (y \cdot c),\ xyz) \\
                         &= (a\, x \cdot (b\, y \cdot c),\ xyz) \\
                         &= (a, x)(b\, y \cdot c,\ yz) \\
                         &= (a, x)((b, y)(c, z))
    \end{aligned}$$

for all $(a, x), (b, y), (c, z) \in G$. We leave as an exercise the verification that $(1, 1)$ is
the identity of $G$ and that

    $$(h, k)^{-1} = (k^{-1} \cdot h^{-1},\ k^{-1})$$

for each $(h, k) \in G$. The order of the group $G$ is clearly the product of the orders of $H$
and $K$, which proves (1).

Let $\tilde{H} = \{(h, 1) \mid h \in H\}$ and $\tilde{K} = \{(1, k) \mid k \in K\}$. We have

    $$(a, 1)(b, 1) = (a\, 1 \cdot b,\ 1) = (ab, 1)$$

for all $a, b \in H$ and

    $$(1, x)(1, y) = (1, xy)$$

for all $x, y \in K$, which show that $\tilde{H}$ and $\tilde{K}$ are subgroups of $G$ and that the maps in
(2) are isomorphisms.

It is clear that $\tilde{H} \cap \tilde{K} = 1$, which is (4). Now,

    $$\begin{aligned}
    (1, k)(h, 1)(1, k)^{-1} &= ((1, k)(h, 1))(1, k^{-1}) \\
                            &= (k \cdot h,\ k)(1, k^{-1}) \\
                            &= (k \cdot h\ k \cdot 1,\ kk^{-1}) \\
                            &= (k \cdot h,\ 1)
    \end{aligned}$$

so that identifying $(h, 1)$ with $h$ and $(1, k)$ with $k$ by the isomorphisms in (2) we have
$khk^{-1} = k \cdot h$, which is (5).

Finally, we have just seen that (under the identifications in (2)) $K \le N_G(H)$. Since
$G = HK$ and certainly $H \le N_G(H)$, we have $N_G(H) = G$, i.e., $H \trianglelefteq G$, which
proves (3) and completes the proof.
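
The construction in Theorem 10 is concrete enough to carry out by machine when $H$ and $K$ are cyclic. The following sketch assumes a special case not singled out in the text: $H = Z_n$ and $K = Z_m$ written additively, with the generator of $K$ acting on $H$ as multiplication by a residue $u$ satisfying $u^m \equiv 1 \pmod n$. The function name `semidirect` and its return values are hypothetical.

```python
from itertools import product

def semidirect(n, m, u):
    """Semidirect product of Z_n by Z_m, the generator of Z_m acting as h -> u*h."""
    if pow(u, m, n) != 1:
        raise ValueError("need u^m = 1 (mod n) for k -> (h -> u^k h) to be a homomorphism")
    elements = list(product(range(n), range(m)))

    def act(k, h):                      # k . h = phi(k)(h)
        return (pow(u, k, n) * h) % n

    def mul(g1, g2):                    # (h1, k1)(h2, k2) = (h1 + k1.h2, k1 + k2)
        (h1, k1), (h2, k2) = g1, g2
        return ((h1 + act(k1, h2)) % n, (k1 + k2) % m)

    def inv(g):                         # (h, k)^{-1} = (k^{-1}.h^{-1}, k^{-1})
        h, k = g
        k_inv = (-k) % m
        return (act(k_inv, (-h) % n), k_inv)

    return elements, act, mul, inv

# Z_5 with Z_4 acting by squaring (multiplication by 2 in additive notation).
elements, act, mul, inv = semidirect(5, 4, 2)
identity = (0, 0)

associative = all(
    mul(mul(a, b), c) == mul(a, mul(b, c))
    for a, b, c in product(elements, repeat=3)
)
inverses_ok = all(mul(g, inv(g)) == identity == mul(inv(g), g) for g in elements)
# Part (5): conjugating (h, 0) by (0, k) gives (k.h, 0).
conjugation_ok = all(
    mul(mul((0, k), (h, 0)), inv((0, k))) == (act(k, h), 0)
    for h in range(5) for k in range(4)
)
nonabelian = any(mul(a, b) != mul(b, a) for a, b in product(elements, repeat=2))

print(len(elements), associative, inverses_ok, conjugation_ok, nonabelian)
# 20 True True True True
```

Taking $u = 2$ realizes the relation $xyx^{-1} = y^2$ of the group $F_{20}$, while `semidirect(5, 4, 4)` would give the group $Z_5 \rtimes Z_4$ in which the generator of order 4 inverts $y$.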

Definition. Let $H$ and $K$ be groups and let $\varphi$ be a homomorphism from $K$ into $\operatorname{Aut}(H)$.
The group described in Theorem 10 is called the semidirect product of $H$ and $K$ with
respect to $\varphi$ and will be denoted by $H \rtimes_\varphi K$ (when there is no danger of confusion we
shall simply write $H \rtimes K$).

The notation is chosen to remind us that the copy of $H$ in $H \rtimes K$ is the normal
“factor” and that the construction of a semidirect product is not symmetric in $H$ and
$K$ (unlike that of a direct product). Before giving some examples we clarify exactly
when the semidirect product of $H$ and $K$ is their direct product (in particular, we see
that direct products are a special case of semidirect products). See also Exercise 1.

Proposition 11. Let $H$ and $K$ be groups and let $\varphi : K \to \operatorname{Aut}(H)$ be a homomorphism.

Then the following are equivalent:

(1) the identity (set) map between $H \rtimes K$ and $H \times K$ is a group homomorphism
(hence an isomorphism)

(2) $\varphi$ is the trivial homomorphism from $K$ into $\operatorname{Aut}(H)$

(3) $K \trianglelefteq H \rtimes K$.

Proof: (1) $\Rightarrow$ (2) By definition of the group operation in $H \rtimes K$

    $$(h_1, k_1)(h_2, k_2) = (h_1\, k_1 \cdot h_2,\ k_1k_2)$$

for all $h_1, h_2 \in H$ and $k_1, k_2 \in K$. By assumption (1), $(h_1, k_1)(h_2, k_2) = (h_1h_2, k_1k_2)$.
Equating the first factors of these ordered pairs gives $k_1 \cdot h_2 = h_2$ for all $h_2 \in H$ and all
$k_1 \in K$, i.e., $K$ acts trivially on $H$. This is (2).
(2) $\Rightarrow$ (3) If $\varphi$ is trivial, then the action of $K$ on $H$ is trivial, so that the elements of
$H$ commute with those of $K$ by Theorem 10(5). In particular, $H$ normalizes $K$. Since
$K$ normalizes itself, $G = HK$ normalizes $K$, which is (3).

(3) $\Rightarrow$ (1) If $K$ is normal in $H \rtimes K$ then (as in the proof of Theorem 9) for all
$h \in H$ and $k \in K$, $[h, k] \in H \cap K = 1$. Thus $hk = kh$ and the action of $K$ on $H$
is trivial. The multiplication in the semidirect product is then the same as that in the
direct product:

    $$(h_1, k_1)(h_2, k_2) = (h_1h_2, k_1k_2)$$

for all $h_1, h_2 \in H$ and $k_1, k_2 \in K$. This gives (1) and completes the proof.

Examples 

In all examples $H$ and $K$ are groups and $\varphi$ is a homomorphism from $K$ into $\operatorname{Aut}(H)$ with
associated action of $K$ on $H$ denoted by a dot. Let $G = H \rtimes K$ and as in Theorem 10 we
identify $H$ and $K$ as subgroups of $G$. We shall use Propositions 4.16 and 4.17 to determine
homomorphisms $\varphi$ for some specific groups $H$. In each of the following examples the
proof that $\varphi$ is a homomorphism is easy (since $K$ will often be cyclic) so the details are
omitted.

(1) Let $H$ be any abelian group (even of infinite order) and let $K = \langle x \rangle \cong Z_2$ be the
group of order 2. Define $\varphi : K \to \operatorname{Aut}(H)$ by mapping $x$ to the automorphism of
inversion on $H$ so that the associated action is $x \cdot h = h^{-1}$, for all $h \in H$. Then $G$
contains the subgroup $H$ of index 2 and

    $$xhx^{-1} = h^{-1} \qquad \text{for all } h \in H.$$

Of particular interest is the case when $H$ is cyclic: if $H = Z_n$, one recognizes $G$ as
$D_{2n}$ and if $H = \mathbb{Z}$ we denote $G$ by $D_\infty$.

(2) We can generalize the preceding example in a number of ways. One way is to let $H$ be
any abelian group and to let $K = \langle x \rangle \cong Z_{2n}$ be cyclic of order $2n$. Define $\varphi$ again by
mapping $x$ to inversion, so that $x^2$ acts as the identity on $H$. In $G$, $xhx^{-1} = h^{-1}$ and
$x^2hx^{-2} = h$ for all $h \in H$. Thus $x^2 \in Z(G)$. In particular, if $H = Z_3$ and $K = Z_4$,
$G$ is a non-abelian group of order 12 which is not isomorphic to $A_4$ or $D_{12}$ (since its
Sylow 2-subgroup, $K$, is cyclic of order 4).

(3) Following up on the preceding example let $H = \langle h \rangle \cong Z_{2^n}$ and let $K = \langle x \rangle \cong Z_4$
with $xhx^{-1} = h^{-1}$ in $G$. As noted above, $x^2 \in Z(G)$. Since $x$ inverts $h$ (i.e., inverts
$H$), $x$ inverts the unique subgroup $\langle z \rangle$ of order 2 in $H$, where $z = h^{2^{n-1}}$. Thus
$xzx^{-1} = z^{-1} = z$, so $x$ centralizes $z$. It follows that $z \in Z(G)$. Thus $x^2z \in Z(G)$
hence $\langle x^2z \rangle \trianglelefteq G$. Let $\bar{G} = G/\langle x^2z \rangle$. Since $x^2$ and $z$ are distinct commuting
elements of order 2, the order of $x^2z$ is 2, so $|\bar{G}| = \frac{1}{2}|G| = 2^{n+1}$. By factoring out the
product $x^2z$ to form $\bar{G}$ we identify $x^2$ and $h^{2^{n-1}}$ in the quotient. In particular, when
$n = 2$, both $x$ and $h$ have order 4, $x$ inverts $h$ and $h^2 = x^2$. It follows that $\bar{G} \cong Q_8$ in
this case. In general, one can check that $\bar{G}$ has a unique subgroup of order 2 (namely
$\langle \bar{x}^2 \rangle$) which equals the center of $\bar{G}$. The group $\bar{G}$ is called the generalized quaternion
group of order $2^{n+1}$ and is denoted by $Q_{2^{n+1}}$:

    $$Q_{2^{n+1}} = \langle\, h, x \mid h^{2^n} = x^4 = 1,\ x^{-1}hx = h^{-1},\ h^{2^{n-1}} = x^2 \,\rangle.$$

(4) Let $H = \mathbb{Q}$ (under addition) and let $K = \langle x \rangle \cong \mathbb{Z}$. Define $\varphi$ by mapping $x$ to the
map “multiplication by 2” on $H$, so that $x$ acts on $h \in H$ by $x \cdot h = 2h$. Note that
multiplication by 2 is an automorphism of $H$ because it has a 2-sided inverse, namely
multiplication by $\frac{1}{2}$. In the group $G$, $\mathbb{Z} \le \mathbb{Q}$ and the conjugate $x\mathbb{Z}x^{-1}$ of $\mathbb{Z}$ is a
proper subgroup of $\mathbb{Z}$ (namely $2\mathbb{Z}$). Thus $x \notin N_G(\mathbb{Z})$ even though $x\mathbb{Z}x^{-1} \le \mathbb{Z}$ (note
that $x^{-1}\mathbb{Z}x$ is not contained in $\mathbb{Z}$). This shows that in order to prove an element $g$
normalizes a subgroup $A$ in an infinite group it is not sufficient in general to show that
the conjugate of $A$ by $g$ is just contained in $A$ (which is sufficient for finite groups).

(5) For $H$ any group let $K = \operatorname{Aut}(H)$ with $\varphi$ the identity map from $K$ to $\operatorname{Aut}(H)$. The
semidirect product $H \rtimes \operatorname{Aut}(H)$ is called the holomorph of $H$ and will be denoted by
$\operatorname{Hol}(H)$. Some holomorphs are described below; verifications of these isomorphisms
are given as exercises at the end of this chapter.

(a) $\operatorname{Hol}(Z_2 \times Z_2) \cong S_4$.

(b) If $|G| = n$ and $\pi : G \to S_n$ is the left regular representation (Section 4.2), then
$N_{S_n}(\pi(G)) \cong \operatorname{Hol}(G)$. In particular, since the left regular representation of a
generator of $Z_n$ is an $n$-cycle in $S_n$ we obtain that for any $n$-cycle $(1\,2\,\ldots\,n)$:

    $$N_{S_n}(\langle (1\,2\,\ldots\,n) \rangle) \cong \operatorname{Hol}(Z_n) = Z_n \rtimes \operatorname{Aut}(Z_n).$$

Note that the latter group has order $n\varphi(n)$.

(6) Let $p$ and $q$ be primes with $p < q$, let $H = Z_q$ and let $K = Z_p$. We have already seen
that if $p$ does not divide $q - 1$ then every group of order $pq$ is cyclic (see the example
following Proposition 4.16). This is consistent with the fact that if $p$ does not divide
$q - 1$, there is no nontrivial homomorphism from $Z_p$ into $\operatorname{Aut}(Z_q)$ (the latter group is
cyclic of order $q - 1$ by Proposition 4.17). Assume now that $p \mid q - 1$. By Cauchy’s
Theorem, $\operatorname{Aut}(Z_q)$ contains a subgroup of order $p$ (which is unique because $\operatorname{Aut}(Z_q)$
is cyclic). Thus there is a nontrivial homomorphism, $\varphi$, from $K$ into $\operatorname{Aut}(H)$. The
associated group $G = H \rtimes K$ has order $pq$ and $K$ is not normal in $G$ (Proposition 11).
In particular, $G$ is non-abelian. We shall prove shortly that $G$ is (up to isomorphism)
the unique non-abelian group of order $pq$. If $p = 2$, $G$ must be isomorphic to $D_{2q}$.

(7) Let $p$ be an odd prime. We construct two nonisomorphic non-abelian groups of order
$p^3$ (we shall later prove that any non-abelian group of order $p^3$ is isomorphic to one
of these two).

Let $H = Z_p \times Z_p$ and let $K = Z_p$. By Proposition 4.17, $\operatorname{Aut}(H) \cong GL_2(\mathbb{F}_p)$
and $|GL_2(\mathbb{F}_p)| = (p^2 - 1)(p^2 - p)$. Since $p \mid |\operatorname{Aut}(H)|$, by Cauchy’s Theorem $H$
has an automorphism of order $p$. Thus there is a nontrivial homomorphism, $\varphi$, from
$K$ into $\operatorname{Aut}(H)$ and so the associated group $H \rtimes K$ is a non-abelian group of order $p^3$.
More explicitly, if $H = \langle a \rangle \times \langle b \rangle$, and $x$ is a generator for $K$ then $x$ acts on $a$ and $b$
by

    $$x \cdot a = ab \quad \text{and} \quad x \cdot b = b$$

which defines the action of $x$ on all of $H$. With respect to the $\mathbb{F}_p$-basis $a, b$ of the
2-dimensional vector space $H$ the action of $x$ (which can be considered in additive
notation as a nonsingular linear transformation) has matrix

    $$\begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} \in GL_2(\mathbb{F}_p).$$

The resulting semidirect product has the presentation

    $$\langle\, x, a, b \mid x^p = a^p = b^p = 1,\ ab = ba,\ xax^{-1} = ab,\ xbx^{-1} = b \,\rangle$$

(in fact, this group is generated by $\{x, a\}$, and is called the Heisenberg group over
$\mathbb{Z}/p\mathbb{Z}$, cf. Exercise 25).

Next let $H = Z_{p^2}$ and $K = Z_p$. Again by Proposition 4.17, $\operatorname{Aut}(H) \cong Z_{p(p-1)}$,
so $H$ admits an automorphism of order $p$. Thus there is a nontrivial homomorphism,
$\varphi$, from $K$ into $\operatorname{Aut}(H)$ and so the group $H \rtimes K$ is non-abelian and of order $p^3$. More
explicitly, if $H = \langle y \rangle$, and $x$ is a generator for $K$ then $x$ acts on $y$ by

    $$x \cdot y = y^{1+p}.$$

The resulting semidirect product has the presentation

    $$\langle\, x, y \mid x^p = y^{p^2} = 1,\ xyx^{-1} = y^{1+p} \,\rangle.$$

These two groups are not isomorphic (the former contains no element of order
$p^2$, cf. Exercise 25, and the latter clearly does, namely $y$).

(8) Let $H = Q_8 \times (Z_2 \times Z_2) = \langle i, j \rangle \times (\langle a \rangle \times \langle b \rangle)$ and let $K = \langle y \rangle \cong Z_3$. The map
defined by

    $$i \mapsto j \qquad j \mapsto k = ij \qquad a \mapsto b \qquad b \mapsto ab$$

is easily seen to give an automorphism of $H$ of order 3. Let $\varphi$ be the homomorphism
from $K$ to $\operatorname{Aut}(H)$ defined by mapping $y$ to this automorphism, and let $G$ be the
associated semidirect product, so that $y \in G$ acts by

    $$y \cdot i = j \qquad y \cdot j = k \qquad y \cdot a = b \qquad y \cdot b = ab.$$

The group $G = H \rtimes K$ is a non-abelian group of order 96 with the property that the
element $i^2a \in G'$ but $i^2a$ cannot be expressed as a single commutator $[x, y]$, for any
$x, y \in G$ (checking the latter assertion is an elementary calculation).
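
The two groups of order $p^3$ in Example 7 can be told apart by machine for a small prime. The sketch below assumes $p = 3$ and encodes elements in a way chosen for this illustration: a triple $(i, j, k)$ stands for $a^i b^j x^k$ in the first group and a pair $(h, k)$ stands for $y^h x^k$ in the second. The function names are hypothetical.

```python
from itertools import product

p = 3

def mul_heis(g1, g2):
    """(Z_p x Z_p) semidirect Z_p with x.a = ab and x.b = b."""
    (i1, j1, k1), (i2, j2, k2) = g1, g2
    # x^k1 . (a^i2 b^j2) = a^i2 b^(j2 + k1*i2)
    return ((i1 + i2) % p, (j1 + j2 + k1 * i2) % p, (k1 + k2) % p)

def mul_cyc(g1, g2):
    """Z_{p^2} semidirect Z_p with x.y = y^(1+p)."""
    (h1, k1), (h2, k2) = g1, g2
    return ((h1 + pow(1 + p, k1, p * p) * h2) % (p * p), (k1 + k2) % p)

def order(g, mul, identity):
    """Order of g, found by repeated multiplication."""
    n, acc = 1, g
    while acc != identity:
        acc, n = mul(acc, g), n + 1
    return n

heis = list(product(range(p), repeat=3))
cyc = list(product(range(p * p), range(p)))

max_order_heis = max(order(g, mul_heis, (0, 0, 0)) for g in heis)
max_order_cyc = max(order(g, mul_cyc, (0, 0)) for g in cyc)
nonabelian_heis = any(mul_heis(a, b) != mul_heis(b, a) for a in heis for b in heis)
nonabelian_cyc = any(mul_cyc(a, b) != mul_cyc(b, a) for a in cyc for b in cyc)

print(len(heis), max_order_heis, nonabelian_heis)   # 27 3 True
print(len(cyc), max_order_cyc, nonabelian_cyc)      # 27 9 True
```

Both groups have order 27 and are non-abelian, but only the second has an element of order 9, which is the invariant the text uses to distinguish them.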

As in the case of direct products we now prove a recognition theorem for semidirect 
products. This theorem will enable us to “break down” or “factor” all groups of certain 
orders and, as a result, classify groups of those orders. The strategy is discussed in 
greater detail following this theorem. 

Theorem 12. Suppose $G$ is a group with subgroups $H$ and $K$ such that

(1) $H \trianglelefteq G$, and

(2) $H \cap K = 1$.

Let $\varphi : K \to \operatorname{Aut}(H)$ be the homomorphism defined by mapping $k \in K$ to the
automorphism of left conjugation by $k$ on $H$. Then $HK \cong H \rtimes K$. In particular, if
$G = HK$ with $H$ and $K$ satisfying (1) and (2), then $G$ is the semidirect product of $H$
and $K$.

Proof: Note that since $H \trianglelefteq G$, $HK$ is a subgroup of $G$. By Proposition 8 every
element of $HK$ can be written uniquely in the form $hk$, for some $h \in H$ and $k \in K$.
Thus the map $hk \mapsto (h, k)$ is a set bijection from $HK$ onto $H \rtimes K$. The fact that this
map is a homomorphism is the computation at the beginning of this section which led
us to the formulation of the definition of the semidirect product.
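
As an illustration of Theorem 12, one can check by exhaustive computation that $S_4$ is a semidirect product of a normal Klein 4-group by a copy of $S_3$. This instance is an assumption chosen for the sketch and is not taken from the text; permutations act on the labels 0 to 3, products are composed right to left, and the helper names are hypothetical.

```python
from itertools import permutations, product

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def is_normal(N, G):
    """True if g x g^{-1} lies in N for every g in G and x in N."""
    return all(compose(compose(g, x), inverse(g)) in N for g in G for x in N)

identity = (0, 1, 2, 3)
G = set(permutations(range(4)))                       # S_4
# H: the identity together with the fixed-point-free involutions.
H = {identity} | {
    p for p in G
    if compose(p, p) == identity and all(p[i] != i for i in range(4))
}
K = {p for p in G if p[3] == 3}                       # stabilizer of 3, a copy of S_3

# Unique expression of each element of HK as hk.
decompose = {compose(h, k): (h, k) for h in H for k in K}

def semidirect_mul(a, b):
    """(h1, k1)(h2, k2) = (h1 (k1 h2 k1^{-1}), k1 k2)."""
    (h1, k1), (h2, k2) = a, b
    conj = compose(compose(k1, h2), inverse(k1))
    return (compose(h1, conj), compose(k1, k2))

is_hom = all(
    decompose[compose(compose(h1, k1), compose(h2, k2))]
    == semidirect_mul((h1, k1), (h2, k2))
    for h1, k1, h2, k2 in product(H, K, H, K)
)

print(is_normal(H, G), is_normal(K, G), len(H & K), len(decompose), is_hom)
# True False 1 24 True
```

Since $K$ is not normal here, Proposition 11 indicates that the action is nontrivial and the product is not direct.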

Definition. Let $H$ be a subgroup of the group $G$. A subgroup $K$ of $G$ is called a
complement for $H$ in $G$ if $G = HK$ and $H \cap K = 1$.

With this terminology, the criterion for recognizing a semidirect product is simply 
that there must exist a complement for some proper normal subgroup of G. Not every 
group is the semidirect product of two of its proper subgroups (for example, if the group 
is simple), but as we have seen, the notion of a semidirect product greatly increases our 
list of known groups. 
Some Classifications

We now apply Theorem 12 to classify groups of order n for certain values of n. The 
basic idea in each of the following arguments is to 

(a) show every group of order n has proper subgroups H and K satisfying the hypoth- 
esis of Theorem 12 with G = HK 

(b) find all possible isomorphism types for H and K 

(c) for each pair H, K found in (b) find all possible homomorphisms φ : K → Aut(H)

(d) for each triple H, K, φ found in (c) form the semidirect product H ⋊ K (so any
group G of order n is isomorphic to one of these explicitly constructed groups) and 
among all these semidirect products determine which pairs are isomorphic. This 
results in a list of the distinct isomorphism types of groups of order n . 

In order to start this process we must first find subgroups H and K (of an arbitrary 
group G of order n) satisfying the above conditions. In the case of “small” values 
of n we can often do this by Sylow's Theorem. To show normality of H we use the
conjugacy part of Sylow's Theorem or other normality criteria established in Chapter 4
(e.g., Corollary 4.5). Some of this work has already been done in the examples in
Section 4.5. In many of the examples that follow, |H| and |K| are relatively prime, so
H ∩ K = 1 holds by Lagrange's Theorem.

Since H and K are proper subgroups of G one should think of the determination 
of H and K as being achieved inductively. In the examples we discuss, H and K will 
have sufficiently small order that we shall know all possible isomorphism types from 
previous results. For example, in most instances H and K will be of prime or prime 
squared order. 

There will be relatively few possible homomorphisms φ : K → Aut(H) in our
examples, particularly after we take into account certain symmetries (such as replacing 
one generator of K by another when K is cyclic). 

Finally, the semidirect products which emerge from this process will, in our exam- 
ples, be small in number and we shall find that, for the most part, they are (pairwise) not 
isomorphic. In general, this can be a more delicate problem, as Exercise 4 indicates. 

We emphasize that this approach to “factoring” every group of some given order 
n as a semidirect product does not work for arbitrary n. For example, Q_8 is not a
semidirect product since no proper subgroup has a complement (although we saw that
it is a quotient of a semidirect product). Empirically, this process generally works well
when the group order n is not divisible by a large power of any prime. At the other
extreme, only a small percentage of the groups of order p^α for large α (p a prime) are
nontrivial semidirect products.

Example: (Groups of Order pq, p and q primes with p < q)

Let G be any group of order pq, let P ∈ Syl_p(G) and let Q ∈ Syl_q(G). In Example 1 of the
applications of Sylow's Theorems we proved that G ≅ Q ⋊ P, for some φ : P → Aut(Q).
Since P and Q are of prime order, they are cyclic. The group Aut(Q) is cyclic of order
q − 1. If p does not divide q − 1, the only homomorphism from P to Aut(Q) is the trivial
homomorphism, hence the only semidirect product in this case is the direct product, i.e.,
G is cyclic.

Consider now the case when p | q − 1 and let P = ⟨y⟩. Since Aut(Q) is cyclic it
contains a unique subgroup of order p, say ⟨γ⟩, and any homomorphism φ : P → Aut(Q)
must map y to a power of γ. There are therefore p homomorphisms φ_i : P → Aut(Q) given
by φ_i(y) = γ^i, 0 ≤ i ≤ p − 1. Since φ_0 is the trivial homomorphism, Q ⋊_{φ_0} P ≅ Q × P
as before. Each φ_i for i ≠ 0 gives rise to a non-abelian group, G_i, of order pq. It is
straightforward to check that these groups are all isomorphic because for each φ_i, i > 0,
there is some generator y_i of P such that φ_i(y_i) = γ. Thus, up to a choice for the
(arbitrary) generator of P, these semidirect products are all the same (see Exercise 6. See
also Exercise 28 of Section 4.3).
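
The count of homomorphisms P → Aut(Q) can be made concrete. Identifying Aut(Z_q) with the units (Z/qZ)^×, a homomorphism is determined by the image u of a generator of Z_p, and u must satisfy u^p ≡ 1 (mod q). The following is a minimal sketch under that identification; the function name `hom_images` is a hypothetical helper, not notation from the text.

```python
def hom_images(p: int, q: int) -> list[int]:
    """Units u modulo the prime q with u**p == 1 (mod q).

    Each such u is a possible image of a generator of Z_p under a
    homomorphism Z_p -> Aut(Z_q), where u acts on Z_q by x -> u*x.
    """
    return [u for u in range(1, q) if pow(u, p, q) == 1]

# p = 3 divides q - 1 = 6: p homomorphisms, one of them trivial (u = 1).
print(hom_images(3, 7))
# p = 3 does not divide q - 1 = 4: only the trivial homomorphism.
print(hom_images(3, 5))
```

When p divides q − 1 the list has exactly p entries, and the non-identity entries are the generators of the unique subgroup of order p in (Z/qZ)^×.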

Example: (Groups of Order 30) 

By the examples following Sylow's Theorem every group G of order 30 contains a subgroup
H of order 15. By the preceding example H is cyclic and H is normal in G (index 2). By
Sylow's Theorem there is a subgroup K of G of order 2. Thus G = HK and H ∩ K = 1
so G ≅ H ⋊ K, for some φ : K → Aut(H). By Proposition 4.16,

Aut(Z_15) ≅ (Z/15Z)^× ≅ Z_4 × Z_2.

The latter isomorphism can be computed directly, or one can use Exercise 11 of the pre-
ceding section: writing H as ⟨a⟩ × ⟨b⟩ ≅ Z_5 × Z_3, we have (since these two subgroups
are characteristic in H)

Aut(H) ≅ Aut(Z_5) × Aut(Z_3).

In particular, Aut(H) contains precisely three elements of order 2, whose actions on the
group H = ⟨a⟩ × ⟨b⟩ are the following:


{ a ↦ a, b ↦ b^{-1} }      { a ↦ a^{-1}, b ↦ b }      { a ↦ a^{-1}, b ↦ b^{-1} }

Thus there are three nontrivial homomorphisms from K into Aut(H) given by sending the
generator of K into one of these three elements of order 2 (as usual, the trivial homomor-
phism gives the direct product: H × K ≅ Z_30).
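
The three elements of order 2 in Aut(Z_15) can be listed directly. The sketch below assumes the identification of Aut(Z_15) with the units modulo 15 (a unit u acts by x ↦ ux) and reports how each involution acts on the factors of order 5 and order 3 by reducing u modulo 5 and modulo 3. The name `involutions` is a hypothetical helper.

```python
from math import gcd

def involutions(n: int) -> list[int]:
    """Units u != 1 modulo n with u*u == 1 (mod n), i.e. elements of order 2 in (Z/nZ)^x."""
    return [u for u in range(2, n) if gcd(u, n) == 1 and (u * u) % n == 1]

for u in involutions(15):
    # A residue of 1 means the factor is fixed; a residue of -1 means it is inverted.
    print(u, "mod 5:", u % 5, "mod 3:", u % 3)
```

Here u = 11 fixes the factor of order 5 and inverts the factor of order 3, u = 4 does the reverse, and u = 14 inverts both, matching the three automorphisms listed above in that order.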

Let K = ⟨k⟩. If the homomorphism φ_1 : K → Aut(H) is defined by mapping k to
the first automorphism above (so that k·a = a and k·b = b^{-1} gives the action of k on H)
then G_1 = H ⋊_{φ_1} K is easily seen to be isomorphic to Z_5 × D_6 (note that in this semidirect
product k centralizes the element a of H of order 5, so the factorization as a direct product
is ⟨a⟩ × ⟨b, k⟩).

If φ_2 is defined by mapping k to the second automorphism above, then G_2 = H ⋊_{φ_2} K
is easily seen to be isomorphic to Z_3 × D_10 (note that in this semidirect product k centralizes
the element b of H of order 3, so the factorization as a direct product is ⟨b⟩ × ⟨a, k⟩).

If φ_3 is defined by mapping k to the third automorphism above then G_3 = H ⋊_{φ_3} K is
easily seen to be isomorphic to D_30.

Note that these groups are all nonisomorphic since their centers have orders 30 (in the
abelian case), 5 (for G_1), 3 (for G_2), and 1 (for G_3).
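
The orders of the centers quoted above can be confirmed by direct enumeration. The following sketch builds Z_15 ⋊ Z_2 additively, with the generator of Z_2 acting on Z_15 by multiplication by a unit u with u^2 ≡ 1 (mod 15), and counts the central elements. The encoding of group elements as pairs and the function name are assumptions made for this illustration.

```python
from itertools import product

def semidirect_center_order(n: int, u: int) -> int:
    """Order of the center of Z_n x| Z_2, where the generator of Z_2 acts by h -> u*h."""
    assert (u * u) % n == 1, "u must square to 1 so that Z_2 acts"
    elems = list(product(range(n), range(2)))

    def act(k, h):
        # Action of k in Z_2 on h in Z_n.
        return (h * u) % n if k else h

    def mul(x, y):
        # (h1, k1)(h2, k2) = (h1 + k1.h2, k1 + k2), written additively.
        (h1, k1), (h2, k2) = x, y
        return ((h1 + act(k1, h2)) % n, (k1 + k2) % 2)

    return sum(all(mul(z, g) == mul(g, z) for g in elems) for z in elems)

# u = 1: direct product; u = 11: G_1; u = 4: G_2; u = 14: G_3.
for u in (1, 11, 4, 14):
    print(u, semidirect_center_order(15, u))
```

Since the four center orders are distinct, the four groups are pairwise nonisomorphic, as claimed.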

We emphasize that although (in hindsight) this procedure does not give rise to any 
groups we could not already have constructed using only direct products, the argument 
proves that this is the complete list of isomorphism types of groups of order 30. 

Example: (Groups of Order 12) 

Let G be a group of order 12, let V ∈ Syl_2(G) and let T ∈ Syl_3(G). By the discussion of
groups of order 12 in Section 4.5 we know that either V or T is normal in G (for purposes
of illustration we shall not invoke the full force of our results from Chapter 4, namely that
either T ⊴ G or G ≅ A_4). By Lagrange's Theorem V ∩ T = 1. Thus G is a semidirect
product. Note that V ≅ Z_4 or Z_2 × Z_2 and T ≅ Z_3.

Case 1: V ⊴ G

We must determine all possible homomorphisms from T into Aut(V). If V ≅ Z_4,
then Aut(V) ≅ Z_2 and there are no nontrivial homomorphisms from T into Aut(V). Thus
the only group of order 12 with a normal cyclic Sylow 2-subgroup is Z_12.

Assume therefore that V ≅ Z_2 × Z_2. In this case Aut(V) ≅ S_3 and there is a unique
subgroup of Aut(V) of order 3, say ⟨γ⟩. Thus if T = ⟨y⟩, there are three possible
homomorphisms from T into Aut(V):

φ_i : T → Aut(V) defined by φ_i(y) = γ^i, i = 0, 1, 2.

As usual, φ_0 is the trivial homomorphism, which gives rise to the direct product
Z_2 × Z_2 × Z_3. Homomorphisms φ_1 and φ_2 give rise to isomorphic semidirect products
because they differ only in the choice of a generator for T (i.e., φ_1(y) = γ and φ_2(y′) = γ,
where y′ = y^2 and y′ is another choice of generator for T; see also Exercise 6). The
unique non-abelian group in this case is A_4.
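
As a sanity check on Case 1, one can build (Z_2 × Z_2) ⋊ Z_3 explicitly and compare its element orders with those of A_4 (one identity, three elements of order 2, eight of order 3). In the sketch below the matrix GAMMA is one particular choice of an element of order 3 in GL_2(F_2), standing in for the automorphism γ; this choice and the helper names are assumptions of the illustration.

```python
from itertools import product

GAMMA = ((0, 1), (1, 1))  # a matrix of order 3 in GL_2(F_2), playing the role of gamma

def act(k, v):
    """Apply GAMMA to the vector v exactly (k mod 3) times."""
    for _ in range(k % 3):
        v = tuple(sum(GAMMA[i][j] * v[j] for j in range(2)) % 2 for i in range(2))
    return v

def mul(x, y):
    """(v1, k1)(v2, k2) = (v1 + k1.v2, k1 + k2), with V written additively."""
    (v1, k1), (v2, k2) = x, y
    w = act(k1, v2)
    return (((v1[0] + w[0]) % 2, (v1[1] + w[1]) % 2), (k1 + k2) % 3)

def order(x):
    identity = ((0, 0), 0)
    n, y = 1, x
    while y != identity:
        y = mul(y, x)
        n += 1
    return n

G = [(v, k) for v in product(range(2), repeat=2) for k in range(3)]
order_counts = {d: sum(1 for g in G if order(g) == d) for d in (1, 2, 3)}
print(len(G), order_counts)
```

Matching order statistics are consistent with the group being A_4 but do not by themselves prove it; the isomorphism follows from the classification argument in the text.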

Case 2: T ⊴ G

We must determine all possible homomorphisms from V into Aut(T). Note that
Aut(T) = ⟨λ⟩ ≅ Z_2, where λ inverts T. If V = ⟨x⟩ ≅ Z_4, there are precisely two
homomorphisms from V into Aut(T): the trivial homomorphism and the homomorphism
which sends x to λ. As usual, the trivial homomorphism gives rise to the direct product:
Z_3 × Z_4 ≅ Z_12. The nontrivial homomorphism gives the semidirect product which was
discussed in Example 2 following Proposition 11 of this section.

Finally, assume V = ⟨a⟩ × ⟨b⟩ ≅ Z_2 × Z_2. There are precisely three nontrivial
homomorphisms from V into Aut(T) determined by specifying their kernels as one of the
three subgroups of order 2 in V. For example, φ_1(a) = λ and φ_1(b) = λ has kernel ⟨ab⟩,
that is, in this semidirect product both a and b act by inverting T and ab centralizes T. If
φ_2 and φ_3 have kernels ⟨a⟩ and ⟨b⟩, respectively, then one easily checks that the resulting
three semidirect products are all isomorphic to S_3 × Z_2, where the Z_2 direct factor is the
kernel of φ_i. For example,

T ⋊_{φ_1} V = ⟨a, T⟩ × ⟨ab⟩.

In summary, there are precisely 5 groups of order 12, three of which are non-abelian.

Example: (Groups of Order p^3, p an odd prime)

Let G be a group of order p^3, p an odd prime, and assume G is not cyclic. By Exercise 9
of the previous section the map x ↦ x^p is a homomorphism from G into Z(G) and the
kernel of this homomorphism has order p^2 or p^3. In the former case G must contain an
element of order p^2 and in the latter case every nonidentity element of G has order p.

Case 1: G has an element of order p^2

Let x be an element of order p^2 and let H = ⟨x⟩. Note that since H has index
p, H is normal in G by Corollary 4.5. If E is the kernel of the p-th power map, then in
this case E ≅ Z_p × Z_p and E ∩ H = ⟨x^p⟩. Let y be any element of E − H and let
K = ⟨y⟩. By construction, H ∩ K = 1 and so G is isomorphic to Z_{p^2} ⋊ Z_p, for some
φ : K → Aut(H). If φ is the trivial homomorphism, G ≅ Z_{p^2} × Z_p, so we need only
consider the nontrivial homomorphisms. By Proposition 4.17 Aut(H) ≅ Z_{p(p−1)} is cyclic
and so contains a unique subgroup of order p, explicitly given by ⟨γ⟩ where

γ(x) = x^{1+p}.

As usual, up to choice of a generator for the cyclic group K, there is only one nontrivial
homomorphism, φ, from K into Aut(H), given by φ(y) = γ; hence up to isomorphism
there is a unique non-abelian group H ⋊ K in this case. This group is described in Example 7
above.
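
That γ(x) = x^{1+p} has order exactly p in Aut(Z_{p^2}) amounts to the statement that 1 + p has multiplicative order p modulo p^2, which follows from (1 + p)^k ≡ 1 + kp (mod p^2). A short numerical check, with `mult_order` as a hypothetical helper name:

```python
def mult_order(u: int, n: int) -> int:
    """Multiplicative order of the unit u modulo n (u must be coprime to n)."""
    k, x = 1, u % n
    while x != 1:
        x = (x * u) % n
        k += 1
    return k

for p in (3, 5, 7, 11, 13):
    # The automorphism x -> x**(1+p) of Z_{p^2} corresponds to the unit 1 + p.
    print(p, mult_order(1 + p, p * p))
```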

Case 2: every nonidentity element of G has order p 

In this case let H be any subgroup of G of order p^2 (see Exercise 29, Section 4.3).
Necessarily H ≅ Z_p × Z_p. Let K = ⟨y⟩ for any element y of G − H. Since H has index
p, H ⊴ G and since K has order p but is not contained in H, H ∩ K = 1. Then G is
isomorphic to (Z_p × Z_p) ⋊ Z_p, for some φ : K → Aut(H). If φ is trivial, G ≅ Z_p × Z_p × Z_p
(the elementary abelian group), so we may assume φ is nontrivial. By Proposition 4.17,

Aut(H) ≅ GL_2(F_p)

so |Aut(H)| = (p^2 − 1)(p^2 − p). Note that a Sylow p-subgroup of Aut(H) has order
p so all subgroups of order p in Aut(H) are conjugate in Aut(H) by Sylow's Theorem.
Explicitly, (as discussed in Example 7 above) every subgroup of order p in Aut(H) is
conjugate to ⟨γ⟩, where if H = ⟨a⟩ × ⟨b⟩, the automorphism γ is defined by

γ(a) = ab and γ(b) = b.

With respect to the F_p-basis a, b of the 2-dimensional vector space H the automorphism γ
has matrix

\[ \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix} \in GL_2(\mathbb{F}_p). \]

Thus (again quoting Exercise 6) there is a unique isomorphism type of semidirect product
in this case.
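
Both numerical facts used in Case 2 are easy to confirm for small primes: the order formula |GL_2(F_p)| = (p^2 − 1)(p^2 − p), whose p-part is exactly p, and the fact that the matrix of γ has order p. The sketch below counts invertible matrices by brute force; the matrix is written with columns giving the images of the basis vectors, which is an assumed convention, and the function names are hypothetical.

```python
from itertools import product

def gl2_order(p: int) -> int:
    """Number of 2x2 matrices over F_p with nonzero determinant."""
    return sum(1 for a, b, c, d in product(range(p), repeat=4)
               if (a * d - b * c) % p != 0)

def mat_mul(A, B, p):
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2)) % p
                       for j in range(2)) for i in range(2))

def mat_order(A, p: int) -> int:
    """Order of an invertible 2x2 matrix A over F_p."""
    identity = ((1, 0), (0, 1))
    k, X = 1, A
    while X != identity:
        X = mat_mul(X, A, p)
        k += 1
    return k

GAMMA = ((1, 0), (1, 1))  # gamma(a) = ab, gamma(b) = b in the basis a, b
for p in (3, 5, 7):
    print(p, gl2_order(p), (p * p - 1) * (p * p - p), mat_order(GAMMA, p))
```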

Finally, since the two non-abelian groups have different orders for the kernels of the
p-th power maps, they are not isomorphic. A presentation for this group is also given in
Example 7 above.


EXERCISES 

Let H and K be groups, let φ be a homomorphism from K into Aut(H) and, as usual, identify
H and K as subgroups of G = H ⋊_φ K.

1. Prove that C_K(H) = ker φ (recall that C_K(H) = C_G(H) ∩ K).

2. Prove that C_H(K) = N_H(K).

3. In Example 1 following the proof of Proposition 11 prove that every element of G − H
has order 2. Prove that G is abelian if and only if h^2 = 1 for all h ∈ H.

4. Let p = 2 and check that the construction of the two non-abelian groups of order p^3 is
valid in this case. Prove that both resulting groups are isomorphic to D_8.

5. Let G = Hol(Z_2 × Z_2).

(a) Prove that G = H ⋊ K where H = Z_2 × Z_2 and K ≅ S_3. Deduce that |G| = 24.

(b) Prove that G is isomorphic to S_4. [Obtain a homomorphism from G into S_4 by letting
G act on the left cosets of K. Use Exercise 1 to show this representation is faithful.]

6. Assume that K is a cyclic group, H is an arbitrary group and φ_1 and φ_2 are homomorphisms
from K into Aut(H) such that φ_1(K) and φ_2(K) are conjugate subgroups of Aut(H). If K is
infinite assume φ_1 and φ_2 are injective. Prove by constructing an explicit isomorphism that
H ⋊_{φ_1} K ≅ H ⋊_{φ_2} K (in particular, if the subgroups φ_1(K) and φ_2(K) are equal in Aut(H),
then the resulting semidirect products are isomorphic). [Suppose σφ_1(K)σ^{-1} = φ_2(K)
so that for some a ∈ Z we have σφ_1(k)σ^{-1} = φ_2(k)^a for all k ∈ K. Show that the map
ψ : H ⋊_{φ_1} K → H ⋊_{φ_2} K defined by ψ((h, k)) = (σ(h), k^a) is a homomorphism. Show
ψ is bijective by constructing a 2-sided inverse.]

7. This exercise describes thirteen isomorphism types of groups of order 56. (It is not too
difficult to show that every group of order 56 is isomorphic to one of these.)

(a) Prove that there are three abelian groups of order 56.

(b) Prove that every group of order 56 has either a normal Sylow 2-subgroup or a normal
Sylow 7-subgroup.

(c) Construct the following non-abelian groups of order 56 which have a normal Sylow
7-subgroup and whose Sylow 2-subgroup S is as specified:

one group when S ≅ Z_2 × Z_2 × Z_2

two nonisomorphic groups when S ≅ Z_4 × Z_2

one group when S ≅ Z_8

two nonisomorphic groups when S ≅ Q_8

three nonisomorphic groups when S ≅ D_8.

[For a particular S, two groups are not isomorphic if the kernels of the maps from S
into Aut(Z_7) are not isomorphic.]

(d) Let G be a group of order 56 with a nonnormal Sylow 7-subgroup. Prove that if S is
the Sylow 2-subgroup of G then S ≅ Z_2 × Z_2 × Z_2. [Let an element of order 7 act
by conjugation on the seven nonidentity elements of S and deduce that they all have
the same order.]

(e) Prove that there is a unique group of order 56 with a nonnormal Sylow 7-subgroup.
[For existence use the fact that |GL_3(F_2)| = 168; for uniqueness use Exercise 6.]


8. Construct a non-abelian group of order 75. Classify all groups of order 75 (there are three
of them). [Use Exercise 6 to show that the non-abelian group is unique.] (The classification
of groups of order pq^2, where p and q are primes with p < q and p not dividing q − 1,
is quite similar.)

9. Show that the matrix

\[ \begin{pmatrix} 0 & -1 \\ 1 & 4 \end{pmatrix} \]

is an element of order 5 in GL_2(F_19). Use this matrix
to construct a non-abelian group of order 1805 and give a presentation of this group.
Classify groups of order 1805 (there are three isomorphism types). [Use Exercise 6 to
prove uniqueness of the non-abelian group.] (A general method for finding elements
of prime order in GL_n(F_p) is described in the exercises in Section 12.2; this particular
matrix of order 5 in GL_2(F_19) appears in Exercise 16 of that section as an illustration of
the method.)


10. This exercise classifies the groups of order 147 (there are six isomorphism types).

(a) Prove that there are two abelian groups of order 147.

(b) Prove that every group of order 147 has a normal Sylow 7-subgroup.

(c) Prove that there is a unique non-abelian group whose Sylow 7-subgroup is cyclic.

(d) Let $t_1 = \begin{pmatrix} 2 & 0 \\ 0 & 1 \end{pmatrix}$ and $t_2 = \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}$ be elements of GL_2(F_7). Prove P = ⟨t_1, t_2⟩ is
a Sylow 3-subgroup of GL_2(F_7) and that P ≅ Z_3 × Z_3. Deduce that every subgroup
of GL_2(F_7) of order 3 is conjugate in GL_2(F_7) to a subgroup of P.

(e) By Example 3 in Section 1 the group P has four subgroups of order 3 and these
are: P_1 = ⟨t_1⟩, P_2 = ⟨t_2⟩, P_3 = ⟨t_1 t_2⟩, and P_4 = ⟨t_1 t_2^2⟩. For i = 1, 2, 3, 4 let
G_i = (Z_7 × Z_7) ⋊_{φ_i} Z_3, where φ_i is an isomorphism of Z_3 with the subgroup P_i of
Aut(Z_7 × Z_7). For each i describe G_i in terms of generators and relations. Deduce
that G_1 ≅ G_2.

(f) Prove that G_1 is not isomorphic to either G_3 or G_4. [Show that the center of G_1 has
order 7 whereas the centers of G_3 and G_4 are trivial.]

(g) Prove that G_3 is not isomorphic to G_4. [Show that every subgroup of order 7 in G_3
is normal in G_3 but that G_4 has subgroups of order 7 that are not normal.]

(h) Classify the groups of order 147 by showing that the six nonisomorphic groups de-
scribed above (two from part (a), one from part (c) and G_1, G_3, and G_4) are all the
groups of order 147. [Use Exercise 6 and part (d).] (The classification of groups of
order pq^2, where p and q are primes with p < q and p | q − 1, is quite similar.)

11. Classify groups of order 28 (there are four isomorphism types). 

12. Classify the groups of order 20 (there are five isomorphism types). 

13. Classify groups of order 4p, where p is a prime greater than 3. [There are four isomorphism
types when p ≡ 3 (mod 4) and five isomorphism types when p ≡ 1 (mod 4).]

14. This exercise classifies the groups of order 60 (there are thirteen isomorphism types). 
Let G be a group of order 60, let P be a Sylow 5-subgroup of G and let Q be a Sylow 
3-subgroup of G. 

(a) Prove that if P is not normal in G then G ≅ A_5. [See Section 4.5.]

(b) Prove that if P ⊴ G but Q is not normal in G then G ≅ A_4 × Z_5. [Show in this case
that P ≤ Z(G), G/P ≅ A_4, a Sylow 2-subgroup T of G is normal and TQ ≅ A_4.]

(c) Prove that if both P and Q are normal in G then G ≅ Z_15 ⋊ T where T ≅ Z_4 or
Z_2 × Z_2. Show in this case that there are six isomorphism types when T is cyclic
(one abelian) and there are five isomorphism types when T is the Klein 4-group (one
abelian). [Use the same ideas as in the classifications of groups of orders 30 and 20.]

15. Let p be an odd prime. Prove that every element of order 2 in GL_2(F_p) is conjugate to a
diagonal matrix with ±1's on the diagonal. Classify the groups of order 2p^2. [If A is a
2 × 2 matrix with A^2 = I and v_1, v_2 is a basis for the underlying vector space, look at A
acting on the vectors w_1 = v_1 + v_2 and w_2 = v_1 − v_2.]

16. Show that there are exactly 4 distinct homomorphisms from Z_2 into Aut(Z_8). Prove that
the resulting semidirect products are the groups: Z_8 × Z_2, D_16, the quasidihedral group
QD_16 and the modular group M (cf. the exercises in Section 2.5).

17. Show that for any n ≥ 3 there are exactly 4 distinct homomorphisms from Z_2 into Aut(Z_{2^n}).
Prove that the resulting semidirect products give 4 nonisomorphic groups of order 2^{n+1}.
[Recall Exercises 21 to 23 in Section 2.3.] (These four groups together with the cyclic
group and the generalized quaternion group, Q_{2^{n+1}}, are all the groups of order 2^{n+1} which
possess a cyclic subgroup of index 2.)

18. Show that if H is any group then there is a group G that contains H as a normal sub-
group with the property that for every automorphism σ of H there is an element g ∈ G
such that conjugation by g when restricted to H is the given automorphism σ, i.e., every
automorphism of H is obtained as an inner automorphism of G restricted to H.

19. Let H be a group of order n, let K = Aut(H) and form G = Hol(H) = H ⋊_φ K (where φ
is the identity homomorphism). Let G act by left multiplication on the left cosets of K in
G and let π be the associated permutation representation π : G → S_n.

(a) Prove the elements of H are coset representatives for the left cosets of K in G and with
this choice of coset representatives π restricted to H is the regular representation of H.

(b) Prove π(G) is the normalizer in S_n of π(H). Deduce that under the regular repre-
sentation of any finite group H of order n, the normalizer in S_n of the image of H is
isomorphic to Hol(H). [Show |G| = |N_{S_n}(π(H))| using Exercises 1 and 2 above.]

(c) Deduce that the normalizer of the group generated by an n-cycle in S_n is isomorphic
to Hol(Z_n) and has order nφ(n).
20. Let p be an odd prime. Prove that if P is a non-cyclic p-group then P contains a normal
subgroup U with U ≅ Z_p × Z_p. Deduce that for odd primes p a p-group that contains
a unique subgroup of order p is cyclic. (For p = 2 it is a theorem that the generalized
quaternion groups Q_{2^n} are the only non-cyclic 2-groups which contain a unique subgroup
of order 2.) [Proceed by induction on |P|. Let Z be a subgroup of order p in Z(P) and
let P̄ = P/Z. If P̄ is cyclic then P is abelian by Exercise 36 in Section 3.1; show the
result is true for abelian groups. When P̄ is not cyclic use induction to produce a normal
subgroup H̄ of P̄ with H̄ ≅ Z_p × Z_p. Let H be the complete preimage of H̄ in P, so
|H| = p^3. Let H_0 = {x ∈ H | x^p = 1} so that H_0 is a characteristic subgroup of H of
order p^2 or p^3 by Exercise 9 in Section 4. Show that a suitable subgroup of H_0 gives the
desired normal subgroup U.]

21. Let p be an odd prime and let P be a p-group. Prove that if every subgroup of P is normal
then P is abelian. (Note that Q_8 is a non-abelian 2-group with this property, so the result
is false for p = 2.) [Use the preceding exercises and Exercise 15 of Section 4.]

22. Let F be a field, let n be a positive integer and let G be the group of upper triangular
matrices in GL_n(F) (cf. Exercise 16, Section 2.1).

(a) Prove that G is the semidirect product U ⋊ D where U is the set of upper triangular
matrices with 1's down the diagonal (cf. Exercise 17, Section 2.1) and D is the set of
diagonal matrices in GL_n(F).

(b) Let n = 2. Recall that U ≅ F and D ≅ F^× × F^× (cf. Exercise 11 in Section 3.1).
Describe the homomorphism from D into Aut(U) explicitly in terms of these isomor-
phisms (i.e., show how each element of F^× × F^× acts as an automorphism on F).

23. Let K and L be groups, let n be a positive integer, let ρ : K → S_n be a homomorphism
and let H be the direct product of n copies of L. In Exercise 8 of Section 1 an injective
homomorphism ψ from S_n into Aut(H) was constructed by letting the elements of S_n
permute the n factors of H. The composition ψ ∘ ρ is a homomorphism from K into
Aut(H). The wreath product of L by K is the semidirect product H ⋊ K with respect to
this homomorphism and is denoted by L ≀ K (this wreath product depends on the choice
of permutation representation ρ of K; if none is given explicitly, ρ is assumed to be the
left regular representation of K).

(a) Assume K and L are finite groups and ρ is the left regular representation of K. Find
|L ≀ K| in terms of |K| and |L|.

(b) Let p be a prime, let K = L = Z_p and let ρ be the left regular representation of K.
Prove that Z_p ≀ Z_p is a non-abelian group of order p^{p+1} and is isomorphic to a Sylow
p-subgroup of S_{p^2}. [The p copies of Z_p whose direct product makes up H may be
represented by p disjoint p-cycles; these are cyclically permuted by K.]

24. Let n be an integer > 1. Prove the following classification: every group of order n is
abelian if and only if n = p_1^{α_1} p_2^{α_2} ··· p_r^{α_r}, where p_1, ..., p_r are distinct primes, α_i = 1 or
2 for all i ∈ {1, ..., r} and p_i does not divide p_j^{α_j} − 1 for all i and j. [See Exercise 56 in
Section 4.5.]

25. Let H(F_p) be the Heisenberg group over the finite field F_p = Z/pZ (cf. Exercise 20 in
Section 4). Prove that H(F_2) ≅ D_8, and that H(F_p) has exponent p and is isomorphic to
the first non-abelian group in Example 7.
CHAPTER 6


Further Topics in Group Theory 


6.1 p-GROUPS, NILPOTENT GROUPS, AND SOLVABLE GROUPS 

Let p be a prime and let G be a finite group of order p^a n, where p does not divide
n. Recall that a (finite) p-group is any group whose order is a power of p. Sylow’s 
Theorem shows that p-groups abound as subgroups of G and in order to exploit this 
phenomenon to unravel the structure of finite groups it will be necessary to establish 
some basic properties of p-groups. In the next section we shall apply these results in 
many specific instances. 

Before giving the results on p-groups we first recall a definition that has appeared 
in some earlier exercises. 

Definition. A maximal subgroup of a group G is a proper subgroup M of G such that 
there are no subgroups H of G with M < H < G.

By order considerations every proper subgroup of a finite group is contained in 
some maximal subgroup. In contrast, infinite groups may or may not have maximal 
subgroups. For example, pZ is a maximal subgroup of Z whereas Q (under +) has no
maximal subgroups (cf. Exercise 16 at the end of this section).

We now collect all the properties of p-groups we shall need into an omnibus theo- 
rem: 

Theorem 1. Let p be a prime and let P be a group of order p^a, a ≥ 1. Then

(1) The center of P is nontrivial: Z(P) ≠ 1.

(2) If H is a nontrivial normal subgroup of P then H intersects the center non-
trivially: H ∩ Z(P) ≠ 1. In particular, every normal subgroup of order p is
contained in the center.

(3) If H is a normal subgroup of P then H contains a subgroup of order p^b that is
normal in P for each divisor p^b of |H|. In particular, P has a normal subgroup
of order p^b for every b ∈ {0, 1, ..., a}.

(4) If H < P then H < N_P(H) (i.e., every proper subgroup of P is a proper
subgroup of its normalizer in P).

(5) Every maximal subgroup of P is of index p and is normal in P.

Proof: These results rely ultimately on the class equation and it may be useful for 
the reader to review Section 4.3. 
Part 1 is Theorem 8 of Chapter 4 and is also the special case of part 2 when H = P.
We therefore begin by proving (2); we shall not quote Theorem 8 of Chapter 4 although
the argument that follows is only a slight generalization of the one in Chapter 4. Let
H be a nontrivial normal subgroup of P. Recall that for each conjugacy class C of P,
either C ⊆ H or C ∩ H = ∅ because H is normal (this easy fact was shown in a remark
preceding Theorem 4.12). Pick representatives of the conjugacy classes of P:

a_1, a_2, ..., a_r

with a_1, ..., a_k ∈ H and a_{k+1}, ..., a_r ∉ H. Let C_i be the conjugacy class of a_i in P,
for all i. Thus

C_i ⊆ H, 1 ≤ i ≤ k and C_i ∩ H = ∅, k + 1 ≤ i ≤ r.

By renumbering a_1, ..., a_k if necessary we may assume a_1, ..., a_s represent classes of
size 1 (i.e., are in the center of P) and a_{s+1}, ..., a_k represent classes of size > 1. Since
H is the disjoint union of these we have

\[ |H| = |H \cap Z(P)| + \sum_{i=s+1}^{k} \frac{|P|}{|C_P(a_i)|}. \]

Now p divides |H| and p divides each term in the sum Σ_{i=s+1}^{k} |P : C_P(a_i)| so p
divides their difference: |H ∩ Z(P)|. This proves H ∩ Z(P) ≠ 1. If |H| = p, since
H ∩ Z(P) ≠ 1 we must have H ≤ Z(P). This completes the proof of (2).
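
The counting argument above can be watched in action on a small 2-group. The sketch below computes the conjugacy class sizes of the dihedral group of order 2n, encoding r^i s^j as the pair (i, j); for n a power of 2 every class size is a power of 2 and the classes of size 1 make up the center. The encoding and the function name are assumptions of this illustration.

```python
def dihedral_class_sizes(n: int) -> list[int]:
    """Sorted sizes of the conjugacy classes of the dihedral group of order 2n."""
    elems = [(i, j) for i in range(n) for j in range(2)]

    def mul(x, y):
        # r^i1 s^j1 * r^i2 s^j2, using s r = r^-1 s.
        (i1, j1), (i2, j2) = x, y
        return ((i1 + (i2 if j1 == 0 else -i2)) % n, (j1 + j2) % 2)

    def inv(x):
        i, j = x
        return ((-i) % n, 0) if j == 0 else (i, 1)

    seen, sizes = set(), []
    for a in elems:
        if a in seen:
            continue
        cls = {mul(mul(g, a), inv(g)) for g in elems}  # all conjugates g a g^-1
        seen |= cls
        sizes.append(len(cls))
    return sorted(sizes)

print(dihedral_class_sizes(4))  # the group of order 8
print(dihedral_class_sizes(8))  # the group of order 16
```

In both cases the sizes sum to the group order, and subtracting the non-central classes (each of size divisible by 2) leaves the order of the center, which is therefore also divisible by 2.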

Next we prove (3) by induction on a. If a ≤ 1 or H = 1, the result is trivial. Assume
therefore that a > 1 and H ≠ 1. By part 2, H ∩ Z(P) ≠ 1 so by Cauchy's Theorem
H ∩ Z(P) contains a (normal) subgroup Z of order p. Use bar notation to denote
passage to the quotient group P/Z. This quotient has order p^{a−1} and H̄ ⊴ P̄. By
induction, for every nonnegative integer b such that p^b divides |H̄| there is a subgroup
K̄ of H̄ of order p^b that is normal in P̄. If K is the complete preimage of K̄ in P then
|K| = p^{b+1}. The set of all subgroups of H obtained by this process together with the
identity subgroup provides a subgroup of H that is normal in P for each divisor of |H|.
The second assertion of part 3 is the special case H = P. This establishes part 3.

We prove (4) also by induction on |P|. If P is abelian then all subgroups of P
are normal in P and the result is trivial. We may therefore assume |P| > p (in fact,
|P| > p^2 by Corollary 4.9). Let H be a proper subgroup of P. Since all elements
of Z(P) commute with all elements of P, Z(P) normalizes every subgroup of P. By
part 1 we have that Z(P) ≠ 1. If Z(P) is not contained in H, then H is properly
contained in ⟨H, Z(P)⟩ and the latter subgroup is contained in N_P(H) so (4) holds.
We may therefore assume Z(P) ≤ H. Use bar notation to denote passage to the
quotient P/Z(P). Since P̄ has smaller order than P by (1), by induction H̄ is properly
contained in N_P̄(H̄). It follows directly from the Lattice Isomorphism Theorem that
N_P(H) is the complete preimage in P of N_P̄(H̄), hence we obtain proper containment
of H in its normalizer in this case as well. This completes the induction.

To prove (5) let M be a maximal subgroup of P. By definition, M < P so by part
4, M < N_P(M). By definition of maximality we must therefore have N_P(M) = P,
i.e., M ⊴ P. The Lattice Isomorphism Theorem shows that P/M is a p-group with
no proper nontrivial subgroups because M is a maximal subgroup. By part 3, however,
P/M has subgroups of every order dividing |P/M|. The only possibility is |P/M| = p.
This proves (5) and completes the proof of the theorem.

Definition.

(1) For any (finite or infinite) group G define the following subgroups inductively:

Z_0(G) = 1, Z_1(G) = Z(G)

and Z_{i+1}(G) is the subgroup of G containing Z_i(G) such that

Z_{i+1}(G)/Z_i(G) = Z(G/Z_i(G))

(i.e., Z_{i+1}(G) is the complete preimage in G of the center of G/Z_i(G) under
the natural projection). The chain of subgroups

Z_0(G) ≤ Z_1(G) ≤ Z_2(G) ≤ ···

is called the upper central series of G. (The use of the term “upper” indicates
that Z_i(G) ≤ Z_{i+1}(G).)

(2) A group G is called nilpotent if Z_c(G) = G for some c ∈ Z. The smallest such
c is called the nilpotence class of G.

One of the exercises at the end of this section shows that Z_i(G) is a characteristic
(hence normal) subgroup of G for all i. We use this fact freely from now on.

Remarks: 

(1) If G is abelian then G is nilpotent (of class 1, provided |G| > 1), since in this 
case G = Z(G) — Zi(G). One should think of nilpotent groups as lying between 
abelian and solvable groups in the hierarchy of structure (recall that solvable groups 
were introduced in Section 3.4; we shall discuss solvable groups further at the end 
of this section): 

cyclic groups C abelian groups C nilpotent groups C solvable groups C all groups 

(all of the above containments are proper, as we shall verify shortly). 

(2) For any finite group there must, by order considerations, be an integer n such that 

Z n (G ) = Z„ +1 (G ) = Z„ +2 (G ) = • - - . 

For example, Z n (Sf) — 1 for all n £ Once two terms in the upper central 
series are the same, the chain stabilizes at that point (i.e., all terms thereafter are 
equal to these two). For example, if G — Z 2 x £3, 

Z(G) = Zi(G) = Z 2 (G) = Z n (G) has order 2 for all n. 

By definition, Z n (G) is a proper subgroup of G for all n for non-nilpotent groups. 

(3) For infinite groups G it may happen that all $Z_i(G)$ are proper subgroups of G (so 
G is not nilpotent) but 

$G = \bigcup_{i=0}^{\infty} Z_i(G)$. 

Groups for which this holds are called hypernilpotent; they enjoy some (but 
not all) of the properties of nilpotent groups. While we shall be dealing mainly 
with finite nilpotent groups, results that do not involve the notion of order, Sylow 
subgroups etc. also hold for infinite groups. Even for infinite groups one of the 
main techniques for dealing with nilpotent groups is induction on the nilpotence 
class. 
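
The examples in Remark (2) can be checked by brute force. The following is a minimal sketch, assuming groups are given as sets of permutation tuples; the helper names (`compose`, `generate`, `upper_central_series`, `orders`) are illustrative and do not come from the text. It uses the fact that g lies in $Z_{i+1}(G)$ exactly when every commutator [g, x] lies in $Z_i(G)$.

```python
def compose(p, q):
    """Permutation i -> p[q[i]] (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens):
    """Closure of a list of generators under composition (small finite groups only)."""
    identity = tuple(range(len(gens[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def commutator(x, y):
    """[x, y] = x^-1 y^-1 x y."""
    return compose(compose(inverse(x), inverse(y)), compose(x, y))

def upper_central_series(G):
    """Return [Z_0, Z_1, ...] up to the point where the chain stabilizes."""
    identity = tuple(range(len(next(iter(G)))))
    series = [{identity}]
    while True:
        Z = series[-1]
        # g is in Z_{i+1} exactly when gZ_i is central in G/Z_i,
        # i.e. when every commutator [g, x] lies in Z_i.
        nxt = {g for g in G if all(commutator(g, x) in Z for x in G)}
        if nxt == Z:
            return series
        series.append(nxt)

def orders(G):
    return [len(Z) for Z in upper_central_series(G)]

S3 = generate([(1, 0, 2), (1, 2, 0)])
D8 = generate([(1, 2, 3, 0), (0, 3, 2, 1)])            # symmetries of a square
Z2xS3 = generate([(1, 0, 2, 3, 4), (0, 1, 3, 2, 4), (0, 1, 3, 4, 2)])

print(orders(S3), orders(D8), orders(Z2xS3))
```

Under these assumptions the series for $S_3$ never leaves the identity, the series for $Z_2 \times S_3$ stalls at a subgroup of order 2, and the series for $D_8$ reaches the whole group after two steps.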

Proposition 2. Let p be a prime and let P be a group of order $p^a$ with $a \ge 2$. Then P is 
nilpotent of nilpotence class at most $a - 1$. 

Proof: For each $i \ge 0$, $P/Z_i(P)$ is a p-group, so 

if $|P/Z_i(P)| > 1$ then $Z(P/Z_i(P)) \ne 1$ 

by Theorem 1(1). Thus if $Z_i(P) \ne P$ then $|Z_{i+1}(P)| \ge p\,|Z_i(P)|$ and so $|Z_{i+1}(P)| \ge p^{i+1}$. 
In particular, $|Z_a(P)| \ge p^a$, so $P = Z_a(P)$. Thus P is nilpotent of class $\le a$. 
The only way P could be of nilpotence class exactly equal to a would be if $|Z_i(P)| = p^i$ 
for all i. In this case, however, $Z_{a-2}(P)$ would have index $p^2$ in P, so $P/Z_{a-2}(P)$ 
would be abelian (by Corollary 4.9). But then $P/Z_{a-2}(P)$ would equal its center and 
so $Z_{a-1}(P)$ would equal P, a contradiction. This proves that the class of P is $\le a - 1$. 

Example 

Both $D_8$ and $Q_8$ are nilpotent of class 2. More generally, $D_{2^n}$ is nilpotent of class $n - 1$. 
This can be proved inductively by showing that $|Z(D_{2^n})| = 2$ and $D_{2^n}/Z(D_{2^n}) \cong D_{2^{n-1}}$ 
for $n \ge 3$ (the details are left as an exercise). If n is not a power of 2, $D_{2n}$ is not nilpotent 
(cf. Exercise 10). 
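
The dihedral example can be explored numerically. The sketch below is only an illustration under stated assumptions: elements of $D_{2m}$ are modelled as pairs (a, e) standing for $r^a s^e$ with the relation $s r = r^{-1} s$, and the function name `nilpotence_class` and its parameter m (the number of rotations, so the group has order 2m) are hypothetical.

```python
def mul(x, y, m):
    """(r^a s^e)(r^b s^f) in D_{2m}, using s r = r^-1 s."""
    (a, e), (b, f) = x, y
    return ((a + b) % m if e == 0 else (a - b) % m, e ^ f)

def inv(x, m):
    a, e = x
    # Rotations invert by negating the exponent; reflections are involutions.
    return ((-a) % m, 0) if e == 0 else (a, 1)

def commutator(x, y, m):
    """[x, y] = x^-1 y^-1 x y."""
    return mul(mul(inv(x, m), inv(y, m), m), mul(x, y, m), m)

def nilpotence_class(m):
    """Nilpotence class of D_{2m}, or None if the upper central series stalls below G."""
    G = [(a, e) for a in range(m) for e in (0, 1)]
    Z, c = {(0, 0)}, 0
    while len(Z) < len(G):
        nxt = {g for g in G if all(commutator(g, x, m) in Z for x in G)}
        if nxt == Z:
            return None          # series stabilized at a proper subgroup
        Z, c = nxt, c + 1
    return c

for m in (4, 8, 16, 6):
    print(2 * m, nilpotence_class(m))
```

For orders 8, 16 and 32 the computed class is one less than the exponent of 2 in the order, in line with the example, while for order 12 the series stalls.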

We now give some equivalent (and often more workable) characterizations of nilpo- 
tence for finite groups: 

Theorem 3. Let G be a finite group, let $p_1, p_2, \dots, p_s$ be the distinct primes dividing 
its order and let $P_i \in Syl_{p_i}(G)$, $1 \le i \le s$. Then the following are equivalent: 

(1) G is nilpotent 

(2) if $H < G$ then $H < N_G(H)$, i.e., every proper subgroup of G is a proper 
subgroup of its normalizer in G 

(3) $P_i \trianglelefteq G$ for $1 \le i \le s$, i.e., every Sylow subgroup is normal in G 

(4) $G \cong P_1 \times P_2 \times \cdots \times P_s$. 

Proof: The proof that (1) implies (2) is the same argument as for p-groups — the 
only fact we needed was if G is nilpotent then so is G/Z(G) — so the details are omitted 
(cf. the exercises). 

To show that (2) implies (3) let $P = P_i$ for some i and let $N = N_G(P)$. Since 
$P \trianglelefteq N$, Corollary 4.20 gives that P is characteristic in N. Since P char $N \trianglelefteq N_G(N)$ 
we get that $P \trianglelefteq N_G(N)$. This means $N_G(N) \le N$ and hence $N_G(N) = N$. By (2) we 
must therefore have $N = G$, which gives (3). 

Next we prove (3) implies (4). For any t, $1 \le t \le s$, we show inductively that 

$P_1 P_2 \cdots P_t \cong P_1 \times P_2 \times \cdots \times P_t$. 

Note first that each $P_i$ is normal in G so $P_1 \cdots P_t$ is a subgroup of G. Let H be the 
product $P_1 \cdots P_{t-1}$ and let $K = P_t$, so by induction $H \cong P_1 \times \cdots \times P_{t-1}$. In particular, 
$|H| = |P_1| \cdot |P_2| \cdots |P_{t-1}|$. Since $|K| = |P_t|$, the orders of H and K are relatively 
prime. Lagrange’s Theorem implies $H \cap K = 1$. By definition, $P_1 \cdots P_t = HK$, 
hence Theorem 5.9 gives 

$HK \cong H \times K = (P_1 \times \cdots \times P_{t-1}) \times P_t \cong P_1 \times \cdots \times P_t$ 

which completes the induction. Now take t = s to obtain (4). 

Finally, to prove (4) implies (1) use Exercise 1 of Section 5.1 to obtain 

$Z(P_1 \times \cdots \times P_s) \cong Z(P_1) \times \cdots \times Z(P_s)$. 

By Exercise 14 in Section 5.1, 

$G/Z(G) = (P_1/Z(P_1)) \times \cdots \times (P_s/Z(P_s))$. 

Thus the hypotheses of (4) also hold for G/Z(G). By Theorem 1, if $P_i \ne 1$ then 
$Z(P_i) \ne 1$, so if $G \ne 1$, $|G/Z(G)| < |G|$. By induction, G/Z(G) is nilpotent, so by 
Exercise 6, G is nilpotent. This completes the proof. 

Note that the first part of the Fundamental Theorem of Finite Abelian Groups 
(Theorem 5 in Section 5.2) follows immediately from the above theorem (we shall give 
another proof later as a consequence of the Chinese Remainder Theorem): 

Corollary 4. A finite abelian group is the direct product of its Sylow subgroups. 

Next we prove a proposition which will be used later to show that the multiplicative 
group of a finite field is cyclic (without using the Fundamental Theorem of Finite Abelian 
Groups). 

Proposition 5. If G is a finite group such that for all positive integers n dividing its 
order, G contains at most n elements x satisfying $x^n = 1$, then G is cyclic. 

Proof: Let $|G| = p_1^{\alpha_1} \cdots p_s^{\alpha_s}$ and let $P_i$ be a Sylow $p_i$-subgroup of G for 
$i = 1, 2, \dots, s$. Since $p_i^{\alpha_i}$ divides |G| and the $p_i^{\alpha_i}$ elements of $P_i$ are solutions of 
$x^{p_i^{\alpha_i}} = 1$, by hypothesis $P_i$ must contain all solutions to this equation in G. It follows 
that $P_i$ is the unique (hence normal) Sylow $p_i$-subgroup of G. By Theorem 3, G is the direct 
product of its Sylow subgroups. By Theorem 1, each $P_i$ possesses a normal subgroup 
$M_i$ of index $p_i$. Since $|M_i| = p_i^{\alpha_i - 1}$ and G has at most $p_i^{\alpha_i - 1}$ solutions to 
$x^{p_i^{\alpha_i - 1}} = 1$, by Lagrange’s Theorem (Corollary 9, Section 3.2) $M_i$ contains all elements 
x of G satisfying $x^{p_i^{\alpha_i - 1}} = 1$. Thus any element x of $P_i$ not contained in $M_i$ satisfies 
$x^{p_i^{\alpha_i}} = 1$ but $x^{p_i^{\alpha_i - 1}} \ne 1$, i.e., x is an element of order $p_i^{\alpha_i}$. This proves $P_i$ is 
cyclic for all i, so G is the direct product of cyclic groups of relatively prime order, hence is cyclic. 
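
As an illustration of the hypothesis in Proposition 5, the following sketch tests it on the unit groups $(\mathbb{Z}/m\mathbb{Z})^\times$. The choice of these groups and the function names are assumptions made for the example, not part of the text. For m = 13 (a field) the hypothesis holds and the group is cyclic; for m = 8 the equation $x^2 = 1$ has four solutions, so the hypothesis fails, and the group is indeed not cyclic.

```python
from math import gcd

def units(m):
    """Elements of the multiplicative group of integers mod m."""
    return [x for x in range(1, m) if gcd(x, m) == 1]

def satisfies_hypothesis(m):
    """True if x^n = 1 has at most n solutions for every n dividing the group order."""
    G = units(m)
    return all(
        sum(1 for x in G if pow(x, n, m) == 1) <= n
        for n in range(1, len(G) + 1)
        if len(G) % n == 0
    )

def is_cyclic(m):
    """True if some unit has order equal to the order of the whole group."""
    G = units(m)

    def order(x):
        k, y = 1, x
        while y != 1:
            y, k = y * x % m, k + 1
        return k

    return any(order(x) == len(G) for x in G)

for m in (13, 9, 8):
    print(m, satisfies_hypothesis(m), is_cyclic(m))
```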

The next proposition is called Frattini’s Argument. We shall apply it to give another 
characterization of finite nilpotent groups. It will also be a valuable tool in the next 
section. 

Proposition 6. (Frattini’s Argument) Let G be a finite group, let H be a normal subgroup 
of G and let P be a Sylow p-subgroup of H. Then $G = H N_G(P)$ and $|G : H|$ divides 
$|N_G(P)|$. 

Proof: By Corollary 3.15, $H N_G(P)$ is a subgroup of G and $H N_G(P) = N_G(P) H$ 
since H is a normal subgroup of G. Let $g \in G$. Since $P^g \le H^g = H$, both P and $P^g$ 
are Sylow p-subgroups of H. By Sylow’s Theorem applied in H, there exists $x \in H$ 
such that $P^g = P^x$. Thus $g x^{-1} \in N_G(P)$ and so $g \in N_G(P) x$. Since g was an arbitrary 
element of G, this proves $G = N_G(P) H$. 

Apply the Second Isomorphism Theorem to $G = N_G(P) H$ to conclude that 

$|G : H| = |N_G(P) : N_G(P) \cap H|$ 

so $|G : H|$ divides $|N_G(P)|$, completing the proof. 
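
Frattini’s Argument can be sanity-checked on a small case. The sketch below assumes $G = S_4$, $H = A_4$ and P the subgroup generated by one 3-cycle (a Sylow 3-subgroup of $A_4$); these choices and all helper names are illustrative only.

```python
from itertools import permutations

def compose(p, q):
    """Permutation i -> p[q[i]] (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def is_even(p):
    inversions = sum(p[i] > p[j] for i in range(len(p)) for j in range(i + 1, len(p)))
    return inversions % 2 == 0

def conjugate(S, g):
    """The set g S g^-1."""
    return {compose(compose(g, x), inverse(g)) for x in S}

G = set(permutations(range(4)))            # S4
H = {g for g in G if is_even(g)}           # A4, normal in S4
c = (1, 2, 0, 3)                           # a 3-cycle on the points 0, 1, 2
P = {(0, 1, 2, 3), c, compose(c, c)}       # Sylow 3-subgroup of A4

N = {g for g in G if conjugate(P, g) == P}          # normalizer of P in G
HN = {compose(h, n) for h in H for n in N}          # the product set H N_G(P)

print(len(N), HN == G, len(N) % (len(G) // len(H)) == 0)
```

Here the normalizer has order 6, the product set is all of $S_4$, and the index 2 divides 6, as the proposition predicts.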

Proposition 7. A finite group is nilpotent if and only if every maximal subgroup is 
normal. 

Proof: Let G be a finite nilpotent group and let M be a maximal subgroup of G. 
As in the proof of Theorem 1, since $M < N_G(M)$ (by Theorem 3(2)) maximality of M 
forces $N_G(M) = G$, i.e., $M \trianglelefteq G$. 

Conversely, assume every maximal subgroup of the finite group G is normal. Let 
P be a Sylow p-subgroup of G. We prove $P \trianglelefteq G$ and conclude that G is nilpotent by 
Theorem 3(3). If P is not normal in G let M be a maximal subgroup of G containing 
$N_G(P)$. By hypothesis, $M \trianglelefteq G$ hence by Frattini’s Argument $G = M N_G(P)$. Since 
$N_G(P) \le M$ we have $M N_G(P) = M$, a contradiction. This establishes the converse. 

Commutators and the Lower Central Series 

For the sake of completeness we include the definition of the lower central series of a 
group and state its relation to the upper central series. Since we shall not be using these 
results in the future, the proofs are left as (straightforward) exercises. 

Recall that the commutator of two elements x, y in a group G is defined as 

$[x, y] = x^{-1} y^{-1} x y$, 

and the commutator of two subgroups H and K of G is 

$[H, K] = \langle\, [h, k] \mid h \in H,\ k \in K \,\rangle$. 

Basic properties of commutators and the commutator subgroup were established in 
Section 5.4. 

Definition. For any (finite or infinite) group G define the following subgroups inductively: 

$G^0 = G$, $G^1 = [G, G]$ and $G^{i+1} = [G, G^i]$. 

The chain of groups 

$G^0 \ge G^1 \ge G^2 \ge \cdots$ 

is called the lower central series of G. (The term “lower” indicates that $G^i \ge G^{i+1}$.) 

As with the upper central series we include in the exercises at the end of this section 
the verification that $G^i$ is a characteristic subgroup of G for all i. The next theorem 
shows the relation between the upper and lower central series of a group. 

Theorem 8. A group G is nilpotent if and only if $G^n = 1$ for some $n \ge 0$. More 
precisely, G is nilpotent of class c if and only if c is the smallest nonnegative integer 
such that $G^c = 1$. If G is nilpotent of class c then 

$G^{c-i} \le Z_i(G)$ for all $i \in \{0, 1, \dots, c\}$. 


Proof: This is proved by a straightforward induction on the length of either the 
upper or lower central series. 

The terms of the upper and lower central series do not necessarily coincide in 
general although in some groups this does occur. 


Remarks : 

(1) If G is abelian, we have already seen that $G' = G^1 = 1$ so the lower central series 
terminates in the identity after one term. 

(2) As with the upper central series, for any finite group there must, by order 
considerations, be an integer n such that 

$G^n = G^{n+1} = G^{n+2} = \cdots$. 

For non-nilpotent groups, $G^n$ is a nontrivial subgroup of G. For example, in 
Section 5.4 we showed that $S_3' = S_3^1 = A_3$. Since $S_3$ is not nilpotent, we must have 
$S_3^2 = A_3$. In fact 

$(1\,2\,3) = [(1\,2), (1\,3\,2)] \in [S_3, S_3^1] = S_3^2$. 

Once two terms in the lower central series are the same, the chain stabilizes at that 
point, i.e., all terms thereafter are equal to these two. Thus $S_3^i = A_3$ for all $i \ge 2$. 
Note that $S_3$ is an example where the lower central series has two distinct terms 
whereas all terms in the upper central series are equal to the identity (in particular, 
for non-nilpotent groups these series need not have the same length). 

Solvable Groups and the Derived Series 

Recall that in Section 3.4 a solvable group was defined as one possessing a series: 

$1 = H_0 \trianglelefteq H_1 \trianglelefteq \cdots \trianglelefteq H_s = G$ 

such that each factor $H_{i+1}/H_i$ is abelian. We now give another characterization of 
solvability in terms of a descending series of characteristic subgroups. 

Definition. For any group G define the following sequence of subgroups inductively: 

$G^{(0)} = G$, $G^{(1)} = [G, G]$ and $G^{(i+1)} = [G^{(i)}, G^{(i)}]$ for all $i \ge 1$. 

This series of subgroups is called the derived or commutator series of G. 

The terms of this series are also often written as: $G^{(1)} = G'$, $G^{(2)} = G''$, etc. Again 
it is left as an exercise to show that each $G^{(i)}$ is characteristic in G for all i. 

It is important to note that although $G^{(0)} = G^0$ and $G^{(1)} = G^1$, it is not in general 
true that $G^{(i)} = G^i$. The difference is that the definition of the (i+1)st term in the lower 
central series is the commutator of the ith term with the whole group G whereas the 
(i+1)st term in the derived series is the commutator of the ith term with itself. Hence 

$G^{(i)} \le G^i$ for all i 

and the containment can be proper. For example, in $G = S_3$ we have already seen that 
$G^1 = G' = A_3$ and $G^2 = [S_3, A_3] = A_3$, whereas $G^{(2)} = [A_3, A_3] = 1$ ($A_3$ being 
abelian). 
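
The contrast between the two series can be made concrete by computing the orders of their terms. This is a brute-force sketch under the assumption that groups are small sets of permutation tuples; `lower_central_orders` and `derived_orders` are hypothetical helper names.

```python
def compose(p, q):
    """Permutation i -> p[q[i]] (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generate(gens, n):
    """Subgroup of S_n generated by gens (closure under composition)."""
    gens = list(gens)
    identity = tuple(range(n))
    elements, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def commutator_subgroup(H, K, n):
    """Subgroup generated by all [h, k] = h^-1 k^-1 h k with h in H, k in K."""
    gens = {compose(compose(inverse(h), inverse(k)), compose(h, k)) for h in H for k in K}
    return generate(gens, n)

def lower_central_orders(G, n, steps):
    term, out = G, [len(G)]
    for _ in range(steps):
        term = commutator_subgroup(G, term, n)      # next term is [G, current term]
        out.append(len(term))
    return out

def derived_orders(G, n, steps):
    term, out = G, [len(G)]
    for _ in range(steps):
        term = commutator_subgroup(term, term, n)   # next term is [current, current]
        out.append(len(term))
    return out

S3 = generate([(1, 0, 2), (1, 2, 0)], 3)
S4 = generate([(1, 0, 2, 3), (1, 2, 3, 0)], 4)
print(lower_central_orders(S3, 3, 3), derived_orders(S3, 3, 3))
print(lower_central_orders(S4, 4, 3), derived_orders(S4, 4, 3))
```

For $S_3$ the lower central series stalls at a subgroup of order 3 while the derived series reaches the identity after two steps, matching the example above.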

Theorem 9. A group G is solvable if and only if $G^{(n)} = 1$ for some $n \ge 0$. 

Proof: Assume first that G is solvable and so possesses a series 

$1 = H_0 \trianglelefteq H_1 \trianglelefteq \cdots \trianglelefteq H_s = G$ 

such that each factor $H_{i+1}/H_i$ is abelian. We prove by induction that $G^{(i)} \le H_{s-i}$. This 
is true for i = 0, so assume $G^{(i)} \le H_{s-i}$. Then 

$G^{(i+1)} = [G^{(i)}, G^{(i)}] \le [H_{s-i}, H_{s-i}]$. 

Since $H_{s-i}/H_{s-i-1}$ is abelian, by Proposition 5.7(4), $[H_{s-i}, H_{s-i}] \le H_{s-i-1}$. Thus 
$G^{(i+1)} \le H_{s-i-1}$, which completes the induction. Since $H_0 = 1$ we have $G^{(s)} = 1$. 

Conversely, if $G^{(n)} = 1$ for some $n \ge 0$, Proposition 5.7(4) shows that if we take 
$H_i$ to be $G^{(n-i)}$ then $H_i$ is a normal subgroup of $H_{i+1}$ with abelian quotient, so the 
derived series itself satisfies the defining condition for solvability of G. This completes 
the proof. 

If G is solvable, the smallest nonnegative n for which $G^{(n)} = 1$ is called the 
solvable length of G. The derived series is a series of shortest length whose successive 
quotients are abelian and it has the additional property that it consists of subgroups that 
are characteristic in the whole group (as opposed to each just being normal in the next 
in the initial definition of solvability). Its “intrinsic” definition also makes it easier to 
work with in many instances, as the following proposition (which reproves some results 
and exercises from Section 3.4) illustrates. 

Proposition 10. Let G and K be groups, let H be a subgroup of G and let $\varphi : G \to K$ 
be a surjective homomorphism. 

(1) $H^{(i)} \le G^{(i)}$ for all $i \ge 0$. In particular, if G is solvable, then so is H, i.e., 
subgroups of solvable groups are solvable (and the solvable length of H is less 
than or equal to the solvable length of G). 

(2) $\varphi(G^{(i)}) = K^{(i)}$. In particular, homomorphic images and quotient groups of 
solvable groups are solvable (of solvable length less than or equal to that of the 
domain group). 

(3) If N is normal in G and both N and G/N are solvable then so is G. 

Proof: Part 1 follows from the observation that since $H \le G$, by definition of 
commutator subgroups, $[H, H] \le [G, G]$, i.e., $H^{(1)} \le G^{(1)}$. Then, by induction, 

$H^{(i)} \le G^{(i)}$ for all $i \in \mathbb{Z}^+$. 

In particular, if $G^{(n)} = 1$ for some n, then also $H^{(n)} = 1$. This establishes (1). 

To prove (2) note that by definition of commutators, 

$\varphi([x, y]) = [\varphi(x), \varphi(y)]$ 

so by induction $\varphi(G^{(i)}) \le K^{(i)}$. Since $\varphi$ is surjective, every commutator in K is the 
image of a commutator in G, hence again by induction we obtain equality for all i. 
Again, if $G^{(n)} = 1$ for some n then $K^{(n)} = 1$. This proves (2). 

Finally, if G/N and N are solvable, of lengths n and m respectively then by (2) 
applied to the natural projection $\varphi : G \to G/N$ we obtain 

$\varphi(G^{(n)}) = (G/N)^{(n)} = 1N$ 

i.e., $G^{(n)} \le N$. Thus $G^{(n+m)} = (G^{(n)})^{(m)} \le N^{(m)} = 1$. Theorem 9 shows that G is 
solvable, which completes the proof. 

Some additional conditions under which finite groups are solvable are the following: 

Theorem 11. Let G be a finite group. 

(1) (Burnside) If $|G| = p^a q^b$ for some primes p and q, then G is solvable. 

(2) (Philip Hall) If for every prime p dividing |G| we factor the order of G as 
$|G| = p^a m$ where (p, m) = 1, and G has a subgroup of order m, then G is 
solvable (i.e., if for all primes p, G has a subgroup whose index equals the order 
of a Sylow p-subgroup, then G is solvable; such subgroups are called Sylow 
p-complements). 

(3) (Feit-Thompson) If |G| is odd then G is solvable. 

(4) (Thompson) If for every pair of elements $x, y \in G$, $\langle x, y \rangle$ is a solvable group, 
then G is solvable. 

We shall prove Burnside’s Theorem in Chapter 19 and deduce Philip Hall’s gener- 
alization of it. As mentioned in Section 3.5, the proof of the Feit-Thompson Theorem 
takes 255 pages. Thompson’s Theorem was first proved as a consequence of a 475 page 
paper (that in turn relies ultimately on the Feit-Thompson Theorem). 

A Proof of the Fundamental Theorem of Finite Abelian Groups 

We sketch a group-theoretic proof of the result that every finite abelian group is a 
direct product of cyclic groups (i.e., Parts 1 and 2 of Theorem 5, Section 5.2) — the 
Classification of Finitely Generated Abelian Groups (Theorem 3, Section 5.2) will be 
derived as a consequence of a more general theorem in Chapter 12. 



By Corollary 4 it suffices to prove that for p a prime, any abelian p-group is a direct 
product of cyclic groups (the divisibility condition in Theorem 5.5 is trivially achieved 
by reordering factors). Let A be an abelian p- group. We proceed by induction on |A|. 

If E is an elementary abelian p-group (i.e., $x^p = 1$ for all $x \in E$), we first prove 
the following result: 

for any $x \in E$, there exists $M \le E$ with $E = M \times \langle x \rangle$. 

If x = 1, let M = E. Otherwise let M be a subgroup of E of maximal order subject to 
the condition that x not be an element of M. If M is not of index p in E, let $\overline{E} = E/M$. 
Then $\overline{E}$ is elementary abelian and there exists $\overline{y} \in \overline{E} - \langle \overline{x} \rangle$. Since $\overline{y}$ has order p, we 
also have $\overline{x} \notin \langle \overline{y} \rangle$. The complete preimage of $\langle \overline{y} \rangle$ in E is a subgroup of E that does 
not contain x and whose order is larger than the order of M, contrary to the choice of 
M. This proves $|E : M| = p$, hence 

$E = M \langle x \rangle$ and $M \cap \langle x \rangle = 1$. 

By the recognition theorem for direct products, Theorem 5.9, $E = M \times \langle x \rangle$, as asserted. 

Now let $\varphi : A \to A$ be defined by $\varphi(x) = x^p$ (see Exercise 7, Section 5.2). Then 
$\varphi$ is a homomorphism since A is abelian. Denote the kernel of $\varphi$ by K and denote the 
image of $\varphi$ by H. By definition $K = \{x \in A \mid x^p = 1\}$ and H is the subgroup of A 
consisting of pth powers. Note that both K and A/H are elementary abelian. By the 
First Isomorphism Theorem 

$|A : H| = |K|$. 


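By induction, 

$H = \langle h_1 \rangle \times \cdots \times \langle h_r \rangle \cong Z_{p^{\alpha_1}} \times \cdots \times Z_{p^{\alpha_r}}$, $\alpha_i \ge 1$, $i = 1, 2, \dots, r$. 

By definition of $\varphi$, there exist elements $g_i \in A$ such that $g_i^p = h_i$, $1 \le i \le r$. Let 
$A_0 = \langle g_1, \dots, g_r \rangle$. It is an exercise to see that 

(a) $A_0 = \langle g_1 \rangle \times \cdots \times \langle g_r \rangle$, 

(b) $A_0/H = \langle g_1 H \rangle \times \cdots \times \langle g_r H \rangle$ is elementary abelian of order $p^r$, and 

(c) $H \cap K = \langle h_1^{p^{\alpha_1 - 1}} \rangle \times \cdots \times \langle h_r^{p^{\alpha_r - 1}} \rangle$ is elementary abelian of order $p^r$. 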

If K is contained in H, then $|K| = |K \cap H| = p^r = |A_0 : H|$. In this case by 
comparing orders we see that $A_0 = A$ and the theorem is proved. Assume therefore 
that K is not a subgroup of H and use the bar notation to denote passage to the quotient 
group A/H. Let $x \in K - H$, so $|\overline{x}| = |x| = p$. By the initial remark of the proof 
applied to the elementary abelian p-group $E = \overline{A}$, there is a subgroup $\overline{M}$ of $\overline{A}$ such 
that 

$\overline{A} = \overline{M} \times \langle \overline{x} \rangle$. 

If M is the complete preimage in A of $\overline{M}$, then since x has order p and $x \notin M$, we have 
$\langle x \rangle \cap M = 1$. By the recognition theorem for direct products, 

$A = M \times \langle x \rangle$. 

By induction, M is a direct product of cyclic groups, hence so is A. This completes the 
proof. 
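
The p-th power map at the heart of this proof is easy to examine on a concrete abelian p-group. The sketch below writes $A = Z_{m_1} \times \cdots \times Z_{m_k}$ additively, so that the map becomes multiplication by p; the function name `p_power_map_data` and the sample groups are assumptions chosen for illustration.

```python
from itertools import product

def p_power_map_data(moduli, p):
    """Return (A, K, H): the group, the kernel and the image of x -> p*x."""
    A = list(product(*(range(m) for m in moduli)))
    zero = tuple(0 for _ in moduli)

    def phi(x):
        # Additive form of the p-th power map x -> x^p.
        return tuple(p * xi % m for xi, m in zip(x, moduli))

    K = {x for x in A if phi(x) == zero}
    H = {phi(x) for x in A}
    return A, K, H

A, K, H = p_power_map_data((4, 2), 2)       # A = Z_4 x Z_2, p = 2
print(len(A), len(K), len(H), len(A) // len(H) == len(K))
```

In this example K has order 4 and H has order 2, so $|A : H| = |K|$ as the First Isomorphism Theorem requires, and K is not contained in H, which is the second case treated in the proof.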

The uniqueness of the decomposition of a finite abelian group into a direct product 
of cyclic groups (Part 3 of Theorem 5.5) can also be proved by induction using the pth-
power map (i.e., using Exercise 7, Section 5.2). This is essentially the procedure we 
follow in Section 12.1 for the uniqueness part of the proof of the Fundamental Theorem 
of Finitely Generated Abelian Groups. 

EXERCISES 

1. Prove that $Z_i(G)$ is a characteristic subgroup of G for all i. 

2. Prove Parts 2 and 4 of Theorem 1 for G a finite nilpotent group, not necessarily a p-group. 

3. If G is finite prove that G is nilpotent if and only if it has a normal subgroup of each order 
dividing \G\, and is cyclic if and only if it has a unique subgroup of each order dividing | G \ . 

4. Prove that a maximal subgroup of a finite nilpotent group has prime index. 

5. Prove Parts 2 and 4 of Theorem 1 for G an infinite nilpotent group. 

6. Show that if G/Z(G) is nilpotent then G is nilpotent. 

7. Prove that subgroups and quotient groups of nilpotent groups are nilpotent (your proof 
should work for infinite groups). Give an explicit example of a group G which possesses 
a normal subgroup H such that both H and G/H are nilpotent but G is not nilpotent. 

8. Prove that if p is a prime and P is a non-abelian group of order $p^3$ then $|Z(P)| = p$ and 
$P/Z(P) \cong Z_p \times Z_p$. 

9. Prove that a finite group G is nilpotent if and only if whenever $a, b \in G$ with $(|a|, |b|) = 1$ 
then $ab = ba$. [Use Part 4 of Theorem 3.] 

10. Prove that $D_{2n}$ is nilpotent if and only if n is a power of 2. [Use Exercise 9.] 

11. Give another proof of Proposition 5 under the additional assumption that G is abelian by 
invoking the Fundamental Theorem of Finite Abelian Groups. 

12. Find the upper and lower central series for A4 and S4. 

13. Find the upper and lower central series for $A_n$ and $S_n$, $n \ge 5$. 

14. Prove that $G^i$ is a characteristic subgroup of G for all i. 

15. Prove that $Z_i(D_{2^n}) =$ 

16. Prove that $\mathbb{Q}$ has no maximal subgroups. [Recall Exercise 21, Section 3.2.] 

17. Prove that $G^{(i)}$ is a characteristic subgroup of G for all i. 

18. Show that if $G'/G''$ and $G''/G'''$ are both cyclic then $G'' = G'''$. [You may assume $G''' = 1$. 
Then $G/G''$ acts by conjugation on the cyclic group $G''$.] 

19. Show that there is no group whose commutator subgroup is isomorphic to S4. [Use the 
preceding exercise.] 

20. Let p be a prime, let P be a p-subgroup of the finite group G, let N be a normal subgroup 
of G whose order is relatively prime to p and let $\overline{G} = G/N$. Prove the following: 

(a) $N_{\overline{G}}(\overline{P}) = \overline{N_G(P)}$. [Use Frattini’s Argument.] 

(b) $C_{\overline{G}}(\overline{P}) = \overline{C_G(P)}$. [Use part (a).] 

For any group G the Frattini subgroup of G (denoted by $\Phi(G)$) is defined to be the intersection 
of all the maximal subgroups of G (if G has no maximal subgroups, set $\Phi(G) = G$). The next 
few exercises deal with this important subgroup. 

21. Prove that $\Phi(G)$ is a characteristic subgroup of G. 

22. Prove that if $N \trianglelefteq G$ then $\Phi(N) \le \Phi(G)$. Give an explicit example where this containment 
does not hold if N is not normal in G. 

23. Compute $\Phi(S_3)$, $\Phi(A_4)$, $\Phi(S_4)$, $\Phi(A_5)$ and $\Phi(S_5)$. 

24. Say an element x of G is a nongenerator if for every proper subgroup H of G, $\langle x, H \rangle$ 
is also a proper subgroup of G. Prove that $\Phi(G)$ is the set of nongenerators of G (here 
|G| > 1). 

25. Let G be a finite group. Prove that $\Phi(G)$ is nilpotent. [Use Frattini’s Argument to prove 
that every Sylow subgroup of $\Phi(G)$ is normal in G.] 

26. Let p be a prime, let P be a finite p-group and let $\overline{P} = P/\Phi(P)$. 

(a) Prove that $\overline{P}$ is an elementary abelian p-group. [Show that $P' \le \Phi(P)$ and that 
$x^p \in \Phi(P)$ for all $x \in P$.] 

(b) Prove that if N is any normal subgroup of P such that P/N is elementary abelian 
then $\Phi(P) \le N$. State this (universal) property in terms of homomorphisms and 
commutative diagrams. 

(c) Let $\overline{P}$ be elementary abelian of order $p^r$ (by (a)). Deduce from Exercise 24 that if 
$\overline{x_1}, \overline{x_2}, \dots, \overline{x_r}$ are any basis for the r-dimensional vector space $\overline{P}$ over $\mathbb{F}_p$ and if $x_i$ 
is any element of the coset $\overline{x_i}$, then $P = \langle x_1, x_2, \dots, x_r \rangle$. Show conversely that 
if $y_1, y_2, \dots, y_s$ is any set of generators for P, then $s \ge r$ (you may assume that 
every minimal generating set for an r-dimensional vector space has r elements, i.e., 
every basis has r elements). Deduce Burnside’s Basis Theorem: a set $y_1, \dots, y_s$ is 
a minimal generating set for P if and only if $\overline{y_1}, \dots, \overline{y_s}$ is a basis of $\overline{P} = P/\Phi(P)$. 
Deduce that any minimal generating set for P has r elements. 

(d) Prove that if $P/\Phi(P)$ is cyclic then P is cyclic. Deduce that if $P/P'$ is cyclic then 
so is P. 

(e) Let $\sigma$ be any automorphism of P of prime order q with $q \ne p$. Show that if $\sigma$ fixes 
the coset $x\Phi(P)$ then $\sigma$ fixes some element of this coset (note that since $\Phi(P)$ is 
characteristic in P every automorphism of P induces an automorphism of $P/\Phi(P)$). 
[Use the observation that $\sigma$ acts as a permutation of order 1 or q on the $p^a$ elements in 
the coset $x\Phi(P)$.] 

(f) Use parts (e) and (c) to deduce that every nontrivial automorphism of P of order 
prime to p induces a nontrivial automorphism on $P/\Phi(P)$. Deduce that any group 
of automorphisms of P which has order prime to p is isomorphic to a subgroup of 
$\mathrm{Aut}(\overline{P}) = GL_r(\mathbb{F}_p)$. 

27. Generalize part (d) of the preceding exercise as follows: let p be a prime, let P be a p-group 
and let $\overline{P} = P/\Phi(P)$ be elementary abelian of order $p^r$. Prove that P has exactly 

$\dfrac{p^r - 1}{p - 1}$ 

maximal subgroups. [Since every maximal subgroup of P contains $\Phi(P)$, the maximal 
subgroups of P are, by the Lattice Isomorphism Theorem, in bijective correspondence 
with the maximal subgroups of the elementary abelian group $\overline{P}$. It therefore suffices to 
show that the number of maximal subgroups of an elementary abelian p-group of order 
$p^r$ is as stated above. One way of doing this is to use the result that an abelian group is 
isomorphic to its dual group (cf. Exercise 14 in Section 5.2) so the number of subgroups 
of index p equals the number of subgroups of order p.] 

28. Prove that if p is a prime and $P = Z_p \times Z_{p^2}$ then $|\Phi(P)| = p$ and $P/\Phi(P) \cong Z_p \times Z_p$. 
Deduce that P has p + 1 maximal subgroups. 

29. Prove that if p is a prime and P is a non-abelian group of order $p^3$ then $\Phi(P) = Z(P)$ 
and $P/\Phi(P) \cong Z_p \times Z_p$. Deduce that P has p + 1 maximal subgroups. 

30. Let p be an odd prime, let $P_1 = Z_p \times Z_{p^2}$ and let $P_2$ be the non-abelian group of order $p^3$ 
which has an element of order $p^2$. Prove that $P_1$ and $P_2$ have the same lattice of subgroups. 

31. For any group G a minimal normal subgroup is a normal subgroup M of G such that the 
only normal subgroups of G which are contained in M are 1 and M. Prove that every 
minimal normal subgroup of a finite solvable group is an elementary abelian p-group 
for some prime p. [If M is a minimal normal subgroup of G, consider its characteristic 
subgroups: $M'$ and $\langle x^p \mid x \in M \rangle$.] 

32. Prove that every maximal subgroup of a finite solvable group has prime power index. [Let 
H be a maximal subgroup of G and let M be a minimal normal subgroup of G (cf. 
the preceding exercise). Apply induction to G/M and consider separately the two cases: 
$M \le H$ and $M \not\le H$.] 

33. Let $\pi$ be any set of primes. A subgroup H of a finite group G is called a Hall $\pi$-subgroup of G 
if the only primes dividing |H| are in the set $\pi$ and |H| is relatively prime to |G : H|. (Note 
that if $\pi = \{p\}$, Hall $\pi$-subgroups are the same as Sylow p-subgroups. Hall subgroups 
were introduced in Exercise 10 of Section 3.3.) Prove the following generalization of 
Sylow’s Theorem for solvable groups: if G is a finite solvable group then for every set $\pi$ 
of primes, G has a Hall $\pi$-subgroup and any two Hall $\pi$-subgroups (for the same set $\pi$) 
are conjugate in G. [Fix $\pi$ and proceed by induction on |G|, proving both existence 
and conjugacy at once. Let M be a minimal normal subgroup of G, so M is a p-group for 
some prime p. If $p \in \pi$, apply induction to G/M. If $p \notin \pi$, reduce to the case $|G| = p^a n$, 
where $p^a = |M|$ and n is the order of a Hall $\pi$-subgroup of G. In this case let N/M be 
a minimal normal subgroup of G/M, so N/M is a q-group for some prime $q \ne p$. Let 
$Q \in Syl_q(N)$. If $Q \trianglelefteq G$ argue as before with Q in place of M. If Q is not normal in G, 
use Frattini’s Argument to show $N_G(Q)$ is a Hall $\pi$-subgroup of G and establish conjugacy 
in this case too.] 

The following result shows how to produce normal p-subgroups of some groups on which 
the elements of order prime to p act faithfully by conjugation. Exercise 26(f) then applies to 
restrict these actions and give some information about the structure of the group. 

34. Let p be a prime dividing the order of the finite solvable group G. Assume G has no 
nontrivial normal subgroups of order prime to p. Let P be the largest normal p-subgroup 
of G (cf. Exercise 37, Section 4.5). Note that Exercise 31 above shows that $P \ne 1$. Prove 
that $C_G(P) \le P$, i.e., $C_G(P) = Z(P)$. [Let $N = C_G(P)$ and use the preceding exercise 
to show $N = Z(P) \times H$ for some Hall $\pi$-subgroup H of N; here $\pi$ is the set of all prime 
divisors of |N| except for p. Show $H \trianglelefteq G$ to obtain the desired conclusion: H = 1.] 

35. Prove that if G is a finite group in which every proper subgroup is nilpotent, then G 
is solvable. [Show that a minimal counterexample is simple. Let M and N be distinct 
maximal subgroups chosen with $|M \cap N|$ as large as possible and apply Part 2 of Theorem 
3 to show that $M \cap N = 1$. Now apply the methods of Exercise 53 in Section 4.5.] 

36. Let p be a prime, let V be a nonzero finite dimensional vector space over the field of p 
elements and let $\varphi$ be an element of GL(V) of order a power of p (i.e., V is a nontrivial 
elementary abelian p-group and $\varphi$ is an automorphism of V of p-power order). Prove that 
there is some nonzero element $v \in V$ such that $\varphi(v) = v$, i.e., $\varphi$ has a nonzero fixed point 
on V. 

37. Let V be a finite dimensional vector space over the field of 2 elements and let $\varphi$ be an 
element of GL(V) of order 2 (i.e., V is a nontrivial elementary abelian 2-group and $\varphi$ is an 
automorphism of V of order 2). Prove that the map $v \mapsto v + \varphi(v)$ is a homomorphism from 
V to itself. Show that every element in the image of this map is fixed by $\varphi$. Deduce that the 
subspace of elements of V which are fixed by $\varphi$ has dimension $\ge \frac{1}{2}$(dimension V). (Note 
that if G is the semidirect product of V with $\langle \varphi \rangle$, where $V \trianglelefteq G$ and $\varphi$ acts by conjugation 
on V by sending each $v \in V$ to $\varphi(v)$, then the fixed points of $\varphi$ on V are $C_V(\varphi)$ and the 
above map is simply the commutator map: $v \mapsto [v, \varphi]$. In this terminology the problem 
is to show that $|C_V(\varphi)|^2 \ge |V|$.) 

38. Use the preceding exercise to prove that if P is a 2-group which has a cyclic center and 
M is a subgroup of index 2 in P, then the center of M has rank $\le 2$. [The group P/M of 
order 2 acts by conjugation on the $\mathbb{F}_2$ vector space $\{z \in Z(M) \mid z^2 = 1\}$ and the fixed 
points of this action are in the center of P.] 


6.2 APPLICATIONS IN GROUPS OF MEDIUM ORDER 

The purpose of this section is to work through a number of examples which illustrate 
many of the techniques we have developed. These examples use Sylow’s Theorems 
extensively and demonstrate how they are applied in the study of finite groups. Motivated 
by the Hölder Program we address primarily the problem of showing that for certain 
n every group of order n has a proper, nontrivial normal subgroup (i.e., there are no 
simple groups of order n). In most cases we shall stop once this has been accomplished. 
However readers should be aware that in the process of achieving this result we shall 
already have determined a great deal of information about arbitrary groups of given 
order n for the n that we consider. This information could be built upon to classify 
groups of these orders (but in general this requires techniques beyond the simple use of 
semidirect products to construct groups). 

Since for p a prime we have already proved that there are no simple p-groups 
(other than the cyclic group of order p, Z p ) and since the structure of p-groups can be 
very complicated (recall the table in Section 5.3), we shall not study the structure of 
p -groups explicitly. Rather, the theory of p-groups developed in the preceding section 
will be applied to subgroups of groups of non-prime-power order. 

Finally, for certain n (e.g., 60, 168, 360, 504, ...) there do exist simple groups of 
order n so, of course, we cannot force every group of these orders to be nonsimple. 
As in Section 4.5 we can, in certain cases, prove there is a unique simple group of 
order n and unravel some of its internal structure (Sylow numbers, etc.). We shall study 
simple groups of order 168 as an additional test case. Thus the Sylow Theorems will 
be applied in a number of different contexts to show how groups of a given order may 
be manipulated. 

We shall end this section with some comments on the existence problem for groups, 
particularly for finite simple groups. 

For n < 10000 there are 60 odd, non-prime-power numbers for which the congruence 
conditions of Sylow’s Theorems do not force at least one of the Sylow subgroups 
to be normal, i.e., $n_p$ can be > 1 for all primes p | n (recall that $n_p$ denotes the number 
of Sylow p-subgroups). For example, no numbers of the form pq, where p and q are 
distinct primes, occur in our list by results of Section 4.5. In contrast, for even numbers 
< 500 there are already 46 candidates for orders of simple groups (the congruence 
conditions allow many more possibilities). Many of our numerical examples arise from 
these lists of numbers and we often use odd numbers because the Sylow congruence 
conditions allow fewer values for $n_p$. The purpose of these examples is to illustrate the 
use of the results we have proved. Many of these examples can be dealt with by more 
advanced techniques (for example, the Feit-Thompson Theorem proves that there are 
no simple groups of odd composite order). 

As we saw in the case n = 30 in Section 4.5, even though Sylow’s Theorem 
permitted $n_5 = 6$ and $n_3 = 10$, further examination showed that any group of order 30 
must have both $n_5 = 1$ and $n_3 = 1$. Thus the congruence part of Sylow’s Theorem is a 
sufficient but by no means necessary condition for normality of a Sylow subgroup. For 
many n (e.g., n = 120) we can prove that there are no simple groups of order n, so there 
is a nontrivial normal subgroup but this subgroup may not be a Sylow subgroup. For 
example, $S_5$ and $SL_2(\mathbb{F}_5)$ both have order 120. The group $S_5$ has a unique nontrivial 
proper normal subgroup of order 60 ($A_5$) and $SL_2(\mathbb{F}_5)$ has a unique nontrivial proper 
normal subgroup of order 2 ($Z(SL_2(\mathbb{F}_5)) \cong Z_2$), neither of which is a Sylow subgroup. 
Our techniques for producing normal subgroups must be flexible enough to cover such 
diverse possibilities. In this section we shall examine Sylow subgroups for different 
primes dividing n, intersections of Sylow subgroups, normalizers of p-subgroups and 
many other less obvious subgroups. The elementary methods we outline are by no 
means exhaustive, even for groups of “medium” order. 

Some Techniques 

Before listing some techniques for producing normal subgroups in groups of a given 
(“medium”) order we note that in all the problems where one deals with groups of 
order n, for some specific n, it is first necessary to factor n into prime powers and then 
to compute the permissible values of $n_p$, for all primes p dividing n. We emphasize
the need to be comfortable computing mod p when carrying out the last step. The 
techniques we describe may be listed as follows: 

(1) Counting elements. 

(2) Exploiting subgroups of small index.

(3) Permutation representations. 

(4) Playing p-subgroups off against each other for different primes p. 

(5) Studying normalizers of intersections of Sylow p-subgroups. 
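The preliminary step named above (factor n, then compute the permissible values of $n_p$ mod p) can be sketched in a few lines of Python. This is an illustrative sketch, not part of the original text, and the name `sylow_numbers` is an assumption.

```python
def sylow_numbers(n):
    """Map each prime p | n to the values d with d | n / p^a and d % p == 1."""
    allowed, rest, p = {}, n, 2
    while rest > 1:
        if rest % p == 0:
            cofactor = n
            while cofactor % p == 0:
                cofactor //= p          # strip the full power of p from n
            while rest % p == 0:
                rest //= p              # remove p from what is left to factor
            allowed[p] = [d for d in range(1, cofactor + 1)
                          if cofactor % d == 0 and d % p == 1]
        p += 1
    return allowed

# The order studied later in this section.
print(sylow_numbers(168))
```

For n = 168 the values returned for p = 3 are 1, 4, 7 and 28, which is the list the later analysis narrows down using simplicity and the minimal index.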

Counting Elements 

Let G be a group of order n, let p be a prime dividing n and let $P \in Syl_p(G)$. If
|P| = p, then every nonidentity element of P has order p and every element of G of
order p lies in some conjugate of P. By Lagrange’s Theorem distinct conjugates of P
intersect in the identity, hence in this case the number of elements of G of order p is
$n_p(p - 1)$.

If Sylow p-subgroups for different primes p have prime order and we assume none 
of these is normal, we can sometimes show that the number of elements of prime order 
is > |G|. This contradiction would show that at least one of the $n_p$’s must be 1 (i.e.,
some Sylow subgroup is normal in G). 

This is the argument we used (in Section 4.5) to prove that there are no simple
groups of order 30. For another example, suppose $|G| = 105 = 3 \cdot 5 \cdot 7$. If G were
simple, we must have $n_3 = 7$, $n_5 = 21$ and $n_7 = 15$. Thus

the number of elements of order 3 is $7 \cdot 2 = 14$

the number of elements of order 5 is $21 \cdot 4 = 84$

the number of elements of order 7 is $15 \cdot 6 = 90$

the number of elements of prime order is 188 > |G|.
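The count for order 105 can be checked directly. The Python sketch below is an illustration added to the text, with variable names chosen for this example; it uses only the values of $n_p$ quoted above and the formula $n_p(p - 1)$ for Sylow subgroups of prime order.

```python
# Hypothetical simple group of order 105 = 3 * 5 * 7: every n_p is forced to exceed 1.
order = 105
forced = {3: 7, 5: 21, 7: 15}   # the values of n_p quoted in the text

# Each Sylow p-subgroup has prime order p, so distinct ones meet trivially
# and together contribute n_p * (p - 1) elements of order p.
elements_of_order = {p: n_p * (p - 1) for p, n_p in forced.items()}
total = sum(elements_of_order.values())
print(elements_of_order, total, total > order)
```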

Sometimes counting elements of prime order does not lead to too many elements.
However, there may be so few elements remaining that there must be a normal subgroup
involving these elements. This was (in essence) the technique used in Section 4.5 to
show that in a group of order 12 either $n_2 = 1$ or $n_3 = 1$. This technique works
particularly well when G has a Sylow p-subgroup P of order p such that $N_G(P) = P$.
For example, let |G| = 56. If G were simple, the only possibility for the number of
Sylow 7-subgroups is 8, so

the number of elements of order 7 is $8 \cdot 6 = 48$.

Thus there are 56 - 48 = 8 elements remaining in G. Since a Sylow 2-subgroup
contains 8 elements (none of which have order 7), there can be at most one Sylow
2-subgroup, hence G has a normal Sylow 2-subgroup.

Exploiting Subgroups of Small Index 

Recall that the results of Section 4.2 show that if G has a subgroup H of index k,
then there is a homomorphism from G into the symmetric group $S_k$ whose kernel is
contained in H. If k > 1, this kernel is a proper normal subgroup of G and if we are
trying to prove that G is not simple, we may, by way of contradiction, assume that this
kernel is the identity. Then, by the First Isomorphism Theorem, G is isomorphic to a
subgroup of $S_k$. In particular, the order of G divides k!. This argument shows that if k
is the smallest integer with |G| dividing k! for a finite simple group G then G contains
no proper subgroups of index less than k. This smallest permissible index k should be
calculated at the outset of the study of groups of a given order n. In the examples we
consider this is usually quite easy: n will often factor as

\[ p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_s^{\alpha_s} \quad \text{with} \quad p_1 < p_2 < \cdots < p_s \]

and $\alpha_s$ is usually equal to 1 or 2 in our examples. In this case the minimal index of a
proper subgroup will have to be at least $p_s$ (respectively $2p_s$) and this is often its exact
value.

For example, there is no simple group of order 3393, because if
$n = 3393 = 3^2 \cdot 13 \cdot 29$, then the minimal index of a proper subgroup is 29 (n does not divide 28!
because 29 does not divide 28!). However any simple group of order 3393 must have
$n_3 = 13$, so for $P \in Syl_3(G)$, $N_G(P)$ has index 13, a contradiction.
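The smallest k with n dividing k! is easy to compute by direct search. The sketch below is an illustrative addition, and `minimal_index` is a name chosen here, not one from the text. It finds this k for n = 3393 and lists the permissible values of $n_3$, so the clash between index 13 and minimal index 29 is visible.

```python
from math import factorial

def minimal_index(n):
    """Smallest k with n | k!; a simple group of order n has no proper subgroup of index < k."""
    k = 1
    while factorial(k) % n != 0:
        k += 1
    return k

n = 3393                      # 3^2 * 13 * 29
k = minimal_index(n)
cofactor = n // 9             # the part of n prime to 3
# Permissible numbers of Sylow 3-subgroups; any value > 1 is the index of a normalizer.
n_3 = [d for d in range(1, cofactor + 1) if cofactor % d == 0 and d % 3 == 1]
print(k, n_3)
```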

Permutation Representations 

This method is a refinement of the preceding one. As above, if G is a simple group of
order n with a proper subgroup of index k, then G is isomorphic to a subgroup of $S_k$.
We may identify G with this subgroup and so assume $G \le S_k$. Rather than relying only
on Lagrange's Theorem for our contradiction (this was what we did for the preceding
technique) we can sometimes show by calculating within $S_k$ that $S_k$ contains no simple
subgroup of order n. Two restrictions which may enable one to show such a result are

(1) if G contains an element or subgroup of a particular order, so must $S_k$, and

(2) if $P \in Syl_p(G)$ and if P is also a Sylow p-subgroup of $S_k$, then $|N_G(P)|$ must
divide $|N_{S_k}(P)|$.

Condition (2) arises frequently when p is a prime, k = p or p + 1 and G has a
subgroup of index k. In this case $p^2$ does not divide k!, so Sylow p-subgroups of G are
also Sylow p-subgroups of $S_k$. Since now Sylow p-subgroups of $S_k$ are precisely the
groups generated by a p-cycle, and distinct Sylow p-subgroups intersect in the identity,

\[ \text{the no. of Sylow } p\text{-subgroups of } S_k = \frac{\text{the no. of } p\text{-cycles}}{\text{the no. of } p\text{-cycles in a Sylow } p\text{-subgroup}} = \frac{k(k-1)\cdots(k-p+1)}{p(p-1)}. \]

This number gives the index in $S_k$ of the normalizer of a Sylow p-subgroup of $S_k$. Thus
for k = p or p + 1

\[ |N_{S_k}(P)| = p(p-1) \qquad (k = p \text{ or } k = p + 1) \]

(cf. also the corresponding discussion for centralizers of elements in symmetric groups
in Section 4.3 and the last exercises in Section 4.3). This proves, under the above
hypotheses, that $|N_G(P)|$ must divide p(p - 1).

For example, if G were a simple group of order $396 = 2^2 \cdot 3^2 \cdot 11$, we must have
$n_{11} = 12$, so if $P \in Syl_{11}(G)$, $|G : N_G(P)| = 12$ and $|N_G(P)| = 33$. Since G has
a subgroup of index 12, G is isomorphic to a subgroup of $S_{12}$. But then (considering
G as actually contained in $S_{12}$) $P \in Syl_{11}(S_{12})$ and $|N_{S_{12}}(P)| = 110$. Since
$N_G(P) \le N_{S_{12}}(P)$, this would imply 33 | 110, clearly impossible, so we cannot have a simple
group of order 396.
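The normalizer order used in this example follows from the p-cycle count above, and the count itself can be confirmed by brute force for small k. The following Python sketch is an illustration, not part of the original text; both function names are assumptions, and the brute-force check is only practical for small symmetric groups.

```python
from itertools import permutations
from math import factorial

def sylow_normalizer_order_in_Sk(p, k):
    """Order of N_{S_k}(P) for P generated by a p-cycle, in the cases k = p or k = p + 1."""
    assert k in (p, p + 1)
    falling = factorial(k) // factorial(k - p)   # k (k-1) ... (k-p+1)
    n_sylow = falling // (p * (p - 1))           # number of Sylow p-subgroups of S_k
    return factorial(k) // n_sylow

def count_p_cycles(p, k):
    """Brute-force count of p-cycles in S_k (only sensible for small k)."""
    count = 0
    for perm in permutations(range(k)):
        moved = [i for i in range(k) if perm[i] != i]
        if len(moved) != p:
            continue
        # Follow the cycle through the first moved point and measure its length.
        length, j = 1, perm[moved[0]]
        while j != moved[0]:
            length, j = length + 1, perm[j]
        count += length == p
    return count

normalizer = sylow_normalizer_order_in_Sk(11, 12)
print(normalizer, normalizer % 33)   # a nonzero remainder means 33 does not divide it
```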

We can sometimes squeeze a little bit more out of this method by working in $A_k$
rather than $S_k$. This slight improvement helps only occasionally and only for groups of
even order. It is based on the following observations (the first of which we have made 
earlier in the text). 

Proposition 12.

(1) If G has no subgroup of index 2 and $G \le S_k$, then $G \le A_k$.

(2) If $P \in Syl_p(S_k)$ for some odd prime p, then $P \in Syl_p(A_k)$ and
$|N_{A_k}(P)| = \frac{1}{2}|N_{S_k}(P)|$.

Proof: The first assertion follows from the Second Isomorphism Theorem: if G is
not contained in $A_k$, then $A_k < GA_k$ so we must have $GA_k = S_k$. But now

\[ 2 = |S_k : A_k| = |GA_k : A_k| = |G : G \cap A_k| \]

so G has a subgroup, $G \cap A_k$, of index 2.

To prove (2) note that if $P \in Syl_p(S_k)$, for some odd prime p, by (1) (or order
considerations) $P \le A_k$, hence $P \in Syl_p(A_k)$ as well. By Frattini’s Argument (Proposition 6)

\[ S_k = N_{S_k}(P) A_k \]

so, in particular, $N_{S_k}(P)$ is not contained in $A_k$. This forces $N_{S_k}(P) \cap A_k$ ($= N_{A_k}(P)$)
to be a subgroup of index 2 in $N_{S_k}(P)$.

For example, there is no simple group of order 264. Suppose G were a simple
group of order $264 = 2^3 \cdot 3 \cdot 11$. We must have $n_{11} = 12$. As usual, G would be
isomorphic to a subgroup of $S_{12}$. Since G is simple (hence contains no subgroup of
index 2), $G \le A_{12}$. Let $P \in Syl_{11}(G)$. Since $n_{11} = 12 = |G : N_G(P)|$, we have
$|N_G(P)| = 22$. As above,

\[ |N_{A_{12}}(P)| = \tfrac{1}{2}|N_{S_{12}}(P)| = \tfrac{1}{2} \cdot 11(11 - 1) = 55; \]

however, 22 does not divide 55, a contradiction to $N_G(P) \le N_{A_{12}}(P)$.
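The arithmetic for order 264 also shows why passing to $A_{12}$ is needed here. In the short Python sketch below (an illustrative addition with ad hoc variable names), the divisibility test in $S_{12}$ is passed, and only the halved normalizer in $A_{12}$ produces the contradiction.

```python
p = 11
normalizer_in_S12 = p * (p - 1)               # |N_{S_12}(P)| for k = p + 1
normalizer_in_A12 = normalizer_in_S12 // 2    # Proposition 12(2)
normalizer_in_G = 264 // 12                   # n_11 = 12 = |G : N_G(P)|

# Working in S_12 alone gives no contradiction: 22 divides 110.
passes_in_S12 = normalizer_in_S12 % normalizer_in_G == 0
# Working in A_12 does: 22 does not divide 55.
contradiction_in_A12 = normalizer_in_A12 % normalizer_in_G != 0
print(passes_in_S12, contradiction_in_A12)
```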

Finally, we emphasize that we have only barely touched upon the combinatorial 
information available from certain permutation representations. Whenever possible in 
the remaining examples we shall illustrate other applications of this technique. 

Playing p-Subgroups Off Against Each Other for Different Primes p 

Suppose p and q are distinct primes such that every group of order pq is cyclic. This
is equivalent to $p \nmid q - 1$, where p < q. If G has a Sylow q-subgroup Q of order
q and $p \mid |N_G(Q)|$, applying Cauchy’s Theorem in $N_G(Q)$ gives a group P of order
p normalizing Q (note that P need not be a Sylow p-subgroup of G). Thus PQ is a
group and if PQ is abelian, we obtain

\[ PQ \le N_G(P) \quad \text{and so} \quad q \mid |N_G(P)|. \]

(A symmetric argument applies if Sylow p-subgroups of G have order p and q divides
the order of a Sylow p-normalizer.) This numerical information alone may be sufficient
to force $N_G(P) = G$ (i.e., $P \trianglelefteq G$), or at least to force $N_G(P)$ to have index smaller than
the minimal index permitted by permutation representations, giving a contradiction by
a preceding technique.

For example, there are no simple groups of order 1785. If there were, let G be
a simple group of order $1785 = 3 \cdot 5 \cdot 7 \cdot 17$. The only possible value for $n_{17}$ is
35, so if Q is a Sylow 17-subgroup, $|G : N_G(Q)| = 35$. Thus $|N_G(Q)| = 3 \cdot 17$.
Let P be a Sylow 3-subgroup of $N_G(Q)$. The group PQ is abelian since 3 does not
divide 17 - 1, so $Q \le N_G(P)$ and $17 \mid |N_G(P)|$. In this case $P \in Syl_3(G)$. The
permissible values of $n_3$ are 7, 85 and 595; however, since $17 \mid |N_G(P)|$, we cannot
have $17 \mid |G : N_G(P)| = n_3$. Thus $n_3 = 7$. But G has no proper subgroup of index
< 17 (the minimal index of a proper subgroup is 17 for this order), a contradiction.
Alternatively, if $n_3 = 7$, then $|N_G(P)| = 3 \cdot 5 \cdot 17$, and by Sylow’s Theorem applied in
$N_G(P)$ we have $Q \trianglelefteq N_G(P)$. This contradicts the fact that $|N_G(Q)| = 3 \cdot 17$.
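The numerical steps of this argument can be traced in code. The sketch below is an illustration added to the text; the helper `allowed` is a hypothetical name. It computes the nontrivial permissible values of $n_{17}$ and $n_3$, discards the values of $n_3$ divisible by 17, and compares what remains with the minimal index.

```python
from math import factorial

def allowed(order, p):
    """Values d > 1 with d | order / p^a and d % p == 1 (candidates for n_p in a simple group)."""
    m = order
    while m % p == 0:
        m //= p
    return [d for d in range(2, m + 1) if m % d == 0 and d % p == 1]

order = 3 * 5 * 7 * 17                     # 1785
n_17 = allowed(order, 17)
n_3 = allowed(order, 3)
# 17 divides |N_G(P)|, so 17 cannot divide the index n_3 = |G : N_G(P)|.
n_3_remaining = [d for d in n_3 if d % 17 != 0]
# Smallest k with order | k!, the minimal index of a proper subgroup.
min_index = next(k for k in range(1, order + 1) if factorial(k) % order == 0)
print(n_17, n_3, n_3_remaining, min_index)
```

Every remaining value of $n_3$ is smaller than the minimal index, which is the contradiction drawn in the text.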

We can refine this method by not requiring P and Q to be of prime order. Namely,
if p and q are distinct primes dividing |G| such that $Q \in Syl_q(G)$ and $p \mid |N_G(Q)|$,
let $P \in Syl_p(N_G(Q))$. We can then apply Sylow’s Theorems in $N_G(Q)$ to see whether
$P \trianglelefteq N_G(Q)$, and if so, force $N_G(P)$ to be of small index. If P is a Sylow p-subgroup of
the whole group G, we can use the congruence part of Sylow’s Theorem to put further
restrictions on $|N_G(P)|$ (as we did in the preceding example). If P is not a Sylow
p-subgroup of G, then by the second part of Sylow’s Theorem $P \le P^* \in Syl_p(G)$. In
this case since $P < P^*$, Theorem 1(4) shows that $P < N_{P^*}(P)$. Thus $N_G(P)$ (which
contains $N_{P^*}(P)$) has order divisible by a larger power of p than divides |P| (as well
as being divisible by |Q|).

For example, there are no simple groups of order 3675. If there were, let G be
a simple group of order $3675 = 3 \cdot 5^2 \cdot 7^2$. The only possibility for $n_7$ is 15, so for
$Q \in Syl_7(G)$, $|G : N_G(Q)| = 15$ and $|N_G(Q)| = 245 = 5 \cdot 7^2$. Let $N = N_G(Q)$ and
let $P \in Syl_5(N)$. By the congruence conditions of Sylow’s Theorem applied in N we
get $P \trianglelefteq N$. Since |P| = 5, P is not itself a Sylow 5-subgroup of G so P is contained
in some Sylow 5-subgroup $P^*$ of G. Since P is of index 5 in the 5-group $P^*$, $P \trianglelefteq P^*$
by Theorem 1, that is $P^* \le N_G(P)$. This proves

\[ \langle N, P^* \rangle \le N_G(P) \quad \text{so} \quad 7^2 \cdot 5^2 \mid |N_G(P)|. \]

Thus $|G : N_G(P)|$ divides 3, which is impossible since P is not normal and G has no
subgroup of index 3.
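The numbers in the order 3675 argument can be recomputed as follows. This Python sketch is an illustrative addition with variable names chosen for the example. It checks that $n_7 = 15$ is forced, that a Sylow 5-subgroup of N is normal in N, and that the resulting bound on $|G : N_G(P)|$ falls below the minimal index.

```python
from math import factorial, gcd

order = 3 * 5**2 * 7**2                                   # 3675
n_7 = [d for d in range(2, order // 49 + 1)
       if (order // 49) % d == 0 and d % 7 == 1]           # simple => n_7 > 1
N_order = order // n_7[0]                                  # |N_G(Q)|
n_5_in_N = [d for d in range(1, N_order // 5 + 1)
            if (N_order // 5) % d == 0 and d % 5 == 1]     # Sylow 5-numbers inside N
# N_G(P) contains N (order 245) and P* (order 25), so its order is a multiple of both.
lower_bound = N_order * 25 // gcd(N_order, 25)
max_index = order // lower_bound
min_index = next(k for k in range(1, order + 1) if factorial(k) % order == 0)
print(n_7, n_5_in_N, max_index, min_index)
```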

Studying Normalizers of Intersections of Sylow p-Subgroups

One of the reasons the counting arguments in the first method above do not immediately
generalize to Sylow subgroups which are not of prime order is because if $P \in Syl_p(G)$
for some prime p and $|P| = p^a$, $a \ge 2$, then it need not be the case that distinct
conjugates of P intersect in the identity subgroup. If distinct conjugates of P do
intersect in the identity, we can again count to find that the number of elements of
p-power order is $n_p(|P| - 1)$.

Suppose, however, there exists $R \in Syl_p(G)$ with $R \ne P$ and $P \cap R \ne 1$. Let
$P_0 = P \cap R$. Then $P_0 < P$ and $P_0 < R$, hence by Theorem 1

\[ P_0 < N_P(P_0) \quad \text{and} \quad P_0 < N_R(P_0). \]

One can try to use this to prove that the normalizer in G of $P_0$ is sufficiently large (i.e.,
of sufficiently small index) to obtain a contradiction by previous methods (note that this
normalizer is a proper subgroup since $P_0 \ne 1$).

One special case where this works particularly well is when $|P_0| = p^{a-1}$, i.e., the
two Sylow p-subgroups R and P have large intersection. In this case set $N = N_G(P_0)$.
Then by the above reasoning (i.e., since $P_0$ is a maximal subgroup of the p-groups P
and R), $P_0 \trianglelefteq P$ and $P_0 \trianglelefteq R$, that is,

N has 2 distinct Sylow p-subgroups: P and R.

In particular, $|N| = p^a k$, where (by Sylow’s Theorem) $k \ge p + 1$.

Recapitulating, if Sylow p-subgroups pairwise intersect in the identity, then counting
elements of p-power order is possible; otherwise there is some intersection of Sylow
p-subgroups whose normalizer is “large.” Since for an arbitrary group order one cannot
necessarily tell which of these two phenomena occurs, it may be necessary to split the
nonsimplicity argument into two (mutually exclusive) cases and derive a contradiction
in each. This process is especially amenable when the order of a Sylow p-subgroup is
$p^2$ (for example, this line of reasoning was used to count elements of 2-power order in
the proof that a simple group of order 60 is isomorphic to $A_5$: Proposition 23, Section
4.5).

Before proceeding with an example we state a lemma which gives a sufficient
condition to force a nontrivial Sylow intersection.

Lemma 13. In a finite group G if $n_p \not\equiv 1 \pmod{p^2}$, then there are distinct Sylow
p-subgroups P and R of G such that $P \cap R$ is of index p in both P and R (hence is
normal in each).

Proof: The argument is an easy refinement of the proof of the congruence part of
Sylow’s Theorem (cf. the exercises at the end of Section 4.5). Let P act by conjugation
on the set $Syl_p(G)$. Let $\mathcal{O}_1, \dots, \mathcal{O}_s$ be the orbits under this action with $\mathcal{O}_1 = \{P\}$. If
$p^2$ divides $|P : P \cap R|$ for all Sylow p-subgroups R of G different from P, then each
$\mathcal{O}_i$ has size divisible by $p^2$, i = 2, 3, ..., s. In this case, since $n_p$ is the sum of the
lengths of the orbits we would have $n_p = 1 + kp^2$, contrary to assumption. Thus for
some $R \in Syl_p(G)$, $|P : P \cap R| = p$.

For example, there are no simple groups of order 1053. If there were, let G be a
simple group of order $1053 = 3^4 \cdot 13$ and let $P \in Syl_3(G)$. We must have $n_3 = 13$.
But $13 \not\equiv 1 \pmod{3^2}$ so there exist $P, R \in Syl_3(G)$ such that $|P \cap R| = 3^3$. Let
$N = N_G(P \cap R)$, so by the above arguments $P, R \le N$. Thus $3^4 \mid |N|$ and $|N| > 3^4$.
The only possibility is N = G, i.e., $P \cap R \trianglelefteq G$, a contradiction.
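The two numerical facts behind this example, that 13 is not 1 mod 9 and that no proper divisor of 1053 is a multiple of 81 larger than 81, can be checked in a few lines. The Python sketch below is an illustrative addition with ad hoc names.

```python
order = 3**4 * 13                       # 1053
p, sylow_order = 3, 3**4
# Divisors of 13 that are 1 mod 3 and exceed 1 (simplicity rules out n_3 = 1).
n_3 = [d for d in (1, 13) if d % p == 1 and d > 1]
# Hypothesis of Lemma 13: n_3 is not congruent to 1 mod p^2.
lemma_applies = all(d % p**2 != 1 for d in n_3)
# N = N_G(P ∩ R) contains two distinct Sylow 3-subgroups, so 81 | |N| and |N| > 81.
possible_N = [d for d in range(1, order + 1)
              if order % d == 0 and d % sylow_order == 0 and d > sylow_order]
print(n_3, lemma_applies, possible_N)
```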

Simple Groups of Order 168 

We now show how many of our techniques can be used to unravel the structure of 
and then classify certain simple groups by classifying the simple groups of order 168. 
Because there are no nontrivial normal subgroups in simple groups, this process departs 
from the methods in Section 5.5, but the overall approach typifies methods used in the 
study of finite simple groups. 

We begin by assuming there is a simple group G of order $168 = 2^3 \cdot 3 \cdot 7$. We
first work out many of its properties: the number and structure of its Sylow subgroups,
the conjugacy classes, etc. All of these calculations are based only on the order and
simplicity of G. We use these results to first prove the uniqueness of G; and ultimately
we prove the existence of the simple group of order 168.

Because |G| does not divide 6! we have

(1) G has no proper subgroup of index less than 7,

since otherwise the action of G on the cosets of the subgroup would give a (necessarily
injective since G is simple) homomorphism from G into some $S_n$ with $n \le 6$.

The simplicity of G and Sylow’s Theorem also immediately imply that 

(2) $n_7 = 8$, so the normalizer of a Sylow 7-subgroup has order 21. In particular, no
element of order 2 normalizes a Sylow 7-subgroup and G has no elements of order 14.

If G had an element of order 21 then the normalizer of a Sylow 3-subgroup of G would
have order divisible by 7. Thus $n_3$ would be relatively prime to 7. Since then $n_3 \mid 8$ we
would have $n_3 = 4$ contrary to (1). This proves:

(3) G has no elements of order 21. 

By Sylow’s Theorem $n_3 = 7$ or 28; we next rule out the former possibility. Assume
$n_3 = 7$, let $P \in Syl_3(G)$ and let T be a Sylow 2-subgroup of the group $N_G(P)$ of order
24. Each Sylow 3-subgroup normalizes some Sylow 7-subgroup of G so P normalizes
a Sylow 7-subgroup R of G. For every $t \in T$ we also have that $P = tPt^{-1}$ normalizes
$tRt^{-1}$. The subgroup T acts by conjugation on the set of eight Sylow 7-subgroups of G
and since no element of order 2 in G normalizes a Sylow 7-subgroup by (2), it follows
that T acts transitively, i.e., every Sylow 7-subgroup of G is one of the $tRt^{-1}$. Hence
P normalizes every Sylow 7-subgroup of G, i.e., P is contained in the intersection
of the normalizers of all Sylow 7-subgroups. But this intersection is a proper normal
subgroup of G, so it must be trivial. This contradiction proves:

(4) $n_3 = 28$ and the normalizer of a Sylow 3-subgroup has order 6.

Since $n_2 = 7$ or 21, we have $n_2 \not\equiv 1 \bmod 8$, so by Exercise 21 there is a pair of distinct
Sylow 2-subgroups that have nontrivial intersection; over all such pairs let $T_1$ and $T_2$
be chosen with $U = T_1 \cap T_2$ of maximal order. We next prove

(5) U is a Klein 4-group and $N_G(U) \cong S_4$.

Let $N = N_G(U)$. Since |U| = 2 or 4 and N permutes the nonidentity elements of
U by conjugation, a subgroup of order 7 in N would commute with some element of
order 2 in U, contradicting (2). It follows that the order of N is not divisible by 7. By
Exercise 13, N has more than one Sylow 2-subgroup, hence $|N| = 2^a \cdot 3$, where a = 2
or 3. Let $P \in Syl_3(N)$. Since P is a Sylow 3-subgroup of G, by (4) the group $N_N(P)$
has order 3 or 6 (with P as its unique subgroup of order 3). Thus by Sylow’s Theorem
N must have four Sylow 3-subgroups, and these are permuted transitively by N under
conjugation. Since any group of order 12 must have either a normal Sylow 2-subgroup
or a normal Sylow 3-subgroup (cf. Section 4.5), |N| = 24. Let K be the kernel of N
acting by conjugation on its four Sylow 3-subgroups, so K is the intersection of the
normalizers of the Sylow 3-subgroups of N. If K = 1 then $N \cong S_4$ as asserted; so
consider when $K \ne 1$. Since $K \le N_N(P)$, the group K has order dividing 6, and
since P does not normalize another Sylow 3-subgroup, P is not contained in K. It
follows that |K| = 2. But now N/K is a group of order 12 which is seen to have more
than one Sylow 2-subgroup and four Sylow 3-subgroups, contrary to the property of
groups of order 12 cited earlier. This proves $N \cong S_4$. Since $S_4$ has a unique nontrivial
normal 2-subgroup, $V_4$, (5) holds. Since $N \cong S_4$, it follows that N contains a Sylow
2-subgroup of G and also that $N_N(P) \cong S_3$ (so also $N_G(P) \cong S_3$ by (4)). Hence we
obtain

(6) Sylow 2-subgroups of G are isomorphic to $D_8$, and

(7) the normalizer in G of a Sylow 3-subgroup is isomorphic to $S_3$ and so G has no
elements of order 6.


By (2) and (7), no element of order 2 commutes with an element of odd prime order.
If $T \in Syl_2(G)$, then $T \cong D_8$ by (6), so $Z(T) = \langle z \rangle$ where z is an element of order
2. Then $T \le C_G(z)$ and $|C_G(z)|$ has no odd prime factors by what was just said,
so $C_G(z) = T$. Since any element normalizing T would normalize its center, hence
commute with z, it follows that Sylow 2-subgroups of G are self-normalizing. This
gives

(8) $n_2 = 21$ and $C_G(z) = T$, where $T \in Syl_2(G)$ and $Z(T) = \langle z \rangle$.

Since $|C_G(z)| = 8$, the element z in (8) has 21 conjugates. By (6), G has one conjugacy
class of elements of order 4, which by (6) and (8) contains 42 elements. By (2) there are
48 elements of order 7, and by (4) there are 56 elements of order 3. These account for
all 167 nonidentity elements of G, and so every element of order 2 must be conjugate
to z, i.e.,

(9) G has a unique conjugacy class of elements of order 2. 
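The tally leading to (9) can be laid out explicitly. The sketch below is an illustrative addition; the comments record one reading of how each count arises from (2), (4), (6) and (8), and the dictionary keys are element orders.

```python
order = 168
counts = {
    2: order // 8,        # conjugates of z, since |C_G(z)| = 8
    4: 21 * 2,            # two elements of order 4 in each of the 21 Sylow 2-subgroups
    7: 8 * (7 - 1),       # n_7 = 8 subgroups of order 7 meeting trivially
    3: 28 * (3 - 1),      # n_3 = 28 subgroups of order 3 meeting trivially
}
accounted_for = sum(counts.values())
print(accounted_for, order - 1)
```

Since the four counts already exhaust the nonidentity elements, no element of order 2 can lie outside the class of z.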

Continuing with the same notation, let $T \in Syl_2(G)$ with $U \le T$ and let W be the other
Klein 4-group in T. It follows from Sylow’s Theorem that U and W are not conjugate
in G since they are not conjugate in $N_G(T) = T$ (cf. Exercise 50 in Section 4.5). We
argue next that

(10) $N_G(W) \cong S_4$.

To see this let $W = \langle z, w \rangle$ where, as before, $\langle z \rangle = Z(T)$. Since w is conjugate in
G to z, $C_G(w) = T_0$ is another Sylow 2-subgroup of G containing W but different
from T. Thus $W = T \cap T_0$. Since U was an arbitrary maximal intersection of Sylow
2-subgroups of G, the argument giving (5) implies (10).

We now record results which we have proved or which are easy consequences of
(1) to (10).

Proposition 14. If G is a simple group of order 168, then the following hold:

(1) $n_2 = 21$, $n_3 = 28$ and $n_7 = 8$

(2) Sylow 2-subgroups of G are dihedral, Sylow 3- and 7-subgroups are cyclic

(3) G is isomorphic to a subgroup of $A_7$ and G has no proper subgroup of index $\le 6$

(4) the conjugacy classes of G are the following: the identity; two classes of
elements of order 7 each of which contains 24 elements (represented by any
element of order 7 and its inverse); one class of elements of order 3 containing
56 elements; one class of elements of order 4 containing 42 elements; one class
of elements of order 2 containing 21 elements

(in particular, every element of G has order a power of a prime)

(5) if $T \in Syl_2(G)$ and U, W are the two Klein 4-groups in T, then U and W are
not conjugate in G and $N_G(U) \cong N_G(W) \cong S_4$

(6) G has precisely three conjugacy classes of maximal subgroups, two of which
are isomorphic to $S_4$ and one of which is isomorphic to the non-abelian group
of order 21.

All of the calculations above were predicated on the assumption that there exists a
simple group of order 168. The fact that none of these arguments leads to a contradiction
does not prove the existence of such a group, but rather just gives strong evidence that
there may be a simple group of this order. We next illustrate how the internal subgroup
structure of G gives rise to a geometry on which G acts, and so leads to a proof that a
simple group of order 168 is unique, if it exists (which we shall also show).

Continuing the above notation let $U_1, \dots, U_7$ be the conjugates of U and let
$W_1, \dots, W_7$ be the conjugates of W. Call the $U_i$ points and the $W_j$ lines. Define
an “incidence relation” by specifying that

the point $U_i$ is on the line $W_j$ if and only if $U_i$ normalizes $W_j$.

Note that $U_i$ normalizes $W_j$ if and only if $U_i W_j \cong D_8$, which in turn occurs if and
only if $W_j$ normalizes $U_i$. In each point or line stabilizer (which is isomorphic to $S_4$)
there is a unique normal 4-group, V, and precisely three other (nonnormal) 4-groups
$A_1, A_2, A_3$. The groups $VA_i$ are the three Sylow 2-subgroups of the $S_4$. We therefore
have:

(11) each line contains exactly 3 points and each point lies on exactly 3 lines. 

Since any two nonnormal 4-groups in an $S_4$ generate the $S_4$, hence uniquely determine
the other two Klein groups in that $S_4$, we obtain

(12) any 2 points on a line uniquely determine the line (and the third point on it).

Since there are 7 points and 7 lines, elementary counting now shows that

(13) each pair of points lies on a unique line, and each pair of lines intersects in a 
unique point. 

(This configuration of points and lines thus satisfies axioms for what is termed a projective
plane.) It is now straightforward to show that the incidence geometry is uniquely
determined and may be represented by the graph in Figure 1, where points are vertices
and lines are the six sides and medians of the triangle together with the inscribed
circle (see Exercise 27). This incidence geometry is called the projective plane of order
2 or the Fano Plane, and will be denoted by $\mathcal{F}$. (Generally, a projective plane of “order”
N has $N^2 + N + 1$ points, and the same number of lines.) Note that at this point the
projective plane $\mathcal{F}$ does exist (we have explicitly exhibited points and lines satisfying
(11) to (13)) even though the group G is not yet known to exist.



Figure 1 

An automorphism of this plane is any permutation of points and lines that preserves
the incidence relation. For example, any of the six symmetries of the triangle in Figure 1
gives an automorphism of $\mathcal{F}$, but we shall see that $\mathcal{F}$ has many more automorphisms than
these.

Each $g \in G$ acts by conjugation on the set of points and lines, and this action
preserves the incidence relation. Only the identity element in G fixes all points and so
via this action the group G would be isomorphic to a subgroup of $\mathrm{Aut}(\mathcal{F})$,
the group of all automorphisms of $\mathcal{F}$.

Any automorphism of $\mathcal{F}$ that fixes two points on a line as well as a third point not
on that line is easily seen to fix all points. Thus any automorphism of $\mathcal{F}$ is uniquely
determined by its action on any three noncollinear points. Since one easily computes
that there are 168 such triples, $\mathcal{F}$ has at most 168 automorphisms. This proves

if the simple group G exists it is unique and $G \cong \mathrm{Aut}(\mathcal{F})$.
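The count of 168 ordered noncollinear triples can be confirmed by enumeration. The Python sketch below is an illustrative addition. It assumes one particular labelling of the plane (not taken from Figure 1): the points are 1, ..., 7 and three distinct points are collinear exactly when their bitwise XOR is 0, so for instance {1, 2, 3} and {3, 5, 6} are lines.

```python
from itertools import permutations

# Assumed labelling: the seven points are 1..7, and three distinct points are
# collinear exactly when their bitwise XOR is 0 (for instance 1, 2, 3).
points = range(1, 8)

def collinear(a, b, c):
    return a ^ b ^ c == 0

# Ordered triples of distinct points that do not lie on a common line.
noncollinear_triples = [t for t in permutations(points, 3) if not collinear(*t)]
print(len(noncollinear_triples))
```

The same count follows by hand: there are 7 choices for the first point, 6 for the second, and 4 for a third point off the line through the first two.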

Two steps in the classification process yet remain: to prove that $\mathcal{F}$ does have 168
automorphisms and to prove $\mathrm{Aut}(\mathcal{F})$ is indeed a simple group. Although one can do
these graph-theoretically, we adopt an approach following ideas from the theory of
“algebraic groups.” Let V be a 3-dimensional vector space over the field of 2 elements,
$\mathbb{F}_2$, so V is the elementary abelian 2-group $Z_2 \times Z_2 \times Z_2$ of order 8. By Proposition 17
in Section 4.4, $\mathrm{Aut}(V) = GL(V) \cong GL_3(\mathbb{F}_2)$ has order 168. Call the seven
1-dimensional subspaces (i.e., the nontrivial cyclic subgroups) of V points, call the seven
2-dimensional subspaces (i.e., the subgroups of order 4) lines, and say the point p is
incident to the line L if $p \subset L$. Then the points and lines are easily seen to satisfy the
same axioms (11) to (13) above, hence to represent the Fano Plane. Since GL(V) acts
faithfully on these points and lines preserving incidence, $\mathrm{Aut}(\mathcal{F})$ has order at least 168.
In light of the established upper bound for $|\mathrm{Aut}(\mathcal{F})|$ this proves

$\mathrm{Aut}(\mathcal{F}) \cong GL(V) \cong GL_3(\mathbb{F}_2)$ and $\mathrm{Aut}(\mathcal{F})$ has order 168.
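The vector space model is small enough to verify exhaustively. The following Python sketch is an illustration, not part of the original text. It encodes the nonzero vectors of V as the bitmasks 1, ..., 7 (an encoding chosen here, with vector addition as XOR), builds the seven lines, checks axioms (11) to (13), and counts both the incidence-preserving permutations of the points and the invertible 3 x 3 matrices over $\mathbb{F}_2$.

```python
from itertools import combinations, permutations

# Points: the seven nonzero vectors of F_2^3, encoded as bitmasks 1..7.
points = list(range(1, 8))
# Lines: the 2-dimensional subspaces, recorded by their nonzero vectors {a, b, a+b}.
lines = {frozenset((a, b, a ^ b)) for a, b in combinations(points, 2)}

# Axioms (11)-(13): 3 lines through each point, and unique joins and meets.
assert len(lines) == 7
assert all(sum(p in L for L in lines) == 3 for p in points)
assert all(sum({a, b} <= L for L in lines) == 1 for a, b in combinations(points, 2))
assert all(len(L & M) == 1 for L, M in combinations(lines, 2))

def is_automorphism(perm):
    """perm lists the images of points 1..7; check that every line maps to a line."""
    image = dict(zip(points, perm))
    return all(frozenset(image[p] for p in L) in lines for L in lines)

aut_count = sum(is_automorphism(perm) for perm in permutations(points))

# Invertible 3x3 matrices over F_2: the three rows must be linearly independent.
gl_count = sum(
    1
    for r0 in points
    for r1 in points if r1 != r0
    for r2 in points if r2 not in (r0, r1, r0 ^ r1)
)
print(aut_count, gl_count)
```

The brute-force search over all 5040 point permutations is independent of the matrix count, so agreement of the two numbers is a genuine check of the equality of orders proved above.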

Finally we prove that GL(V) is a simple group. By way of contradiction assume
H is a proper nontrivial normal subgroup of GL(V). Let $\Omega$ be the 7 points and let N be
the stabilizer in GL(V) of some point in $\Omega$. Since GL(V) acts transitively on $\Omega$, N has
index 7. Since the intersection of all conjugates of N fixes all points, this intersection is
the identity. Thus $H \not\le N$, and so GL(V) = HN. Since $|H : H \cap N| = |HN : N|$
we have $7 \mid |H|$. Since GL(V) is isomorphic to a subgroup of $S_7$ and since Sylow
7-subgroups of $S_7$ have normalizers of order 42, GL(V) does not have a normal Sylow
7-subgroup, so by Sylow’s Theorem $n_7(GL(V)) = 8$. A normal Sylow 7-subgroup of
H would be characteristic in H, hence normal in GL(V), so also H does not have a
unique Sylow 7-subgroup. Since $n_7(H) \equiv 1 \bmod 7$ and $n_7(H) \le n_7(GL(V)) = 8$ we
must have $n_7(H) = 8$. This implies |H| is divisible by 8, so $56 \mid |H|$, and since H
is proper we must have |H| = 56. By usual counting arguments (cf. Exercise 7(b) of
Section 5.5) H has a normal, hence characteristic, Sylow 2-subgroup, which is therefore
normal in GL(V). But then GL(V) would have a unique Sylow 2-subgroup. Since
the set of upper triangular matrices and the set of lower triangular matrices are two
subgroups of $GL_3(\mathbb{F}_2)$ each of order 8, we have a contradiction. In summary we have
now proven the following theorem.

Theorem 15. Up to isomorphism there is a unique simple group of order 168, $GL_3(\mathbb{F}_2)$,
which is also the automorphism group of the projective plane $\mathcal{F}$.
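With an explicit model of the group in hand, the element counts of Proposition 14 can be checked by direct computation. The Python sketch below is an illustrative addition. It represents a 3 x 3 matrix over $\mathbb{F}_2$ as a tuple of three row bitmasks (a convention chosen for this example) and tabulates the orders of all invertible matrices.

```python
from collections import Counter
from itertools import product

IDENTITY = (1, 2, 4)          # row i is the bitmask 1 << i

def mat_mul(a, b):
    """Product of 3x3 matrices over F_2; each matrix is a tuple of three row bitmasks."""
    rows = []
    for row in a:
        acc = 0
        for j in range(3):
            if row >> j & 1:
                acc ^= b[j]   # add row j of b (addition in F_2 is XOR)
        rows.append(acc)
    return tuple(rows)

def is_invertible(m):
    """Rows are linearly independent: each avoids the span of the previous ones."""
    r0, r1, r2 = m
    return r0 != 0 and r1 not in (0, r0) and r2 not in (0, r0, r1, r0 ^ r1)

def order(m):
    """Multiplicative order of an invertible matrix m."""
    k, power = 1, m
    while power != IDENTITY:
        power, k = mat_mul(power, m), k + 1
    return k

group = [m for m in product(range(8), repeat=3) if is_invertible(m)]
order_counts = Counter(order(m) for m in group)
print(len(group), sorted(order_counts.items()))
```

Dividing the number of elements of order 7 by 6 and the number of order 3 by 2 recovers the Sylow numbers $n_7$ and $n_3$ recorded in Proposition 14.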

Note that we might just as well have called the $W_j$ points and the $U_i$ lines. This
“duality” between points and lines together with the uniqueness of a simple group of
order 168 may be used to prove the existence of an outer automorphism of G that
interchanges points and lines, i.e., conjugates U to W.

Many families of finite simple groups can be classified by analogous methods. 
In more general settings geometric structures known as buildings play the role of the 
projective plane (which is a special case of a building of type $A_2$). In this context the
subgroups $N_G(U)$ and $N_G(W)$ are parabolic subgroups of G, and U, W are their unipotent
radicals respectively. In particular, all the simple linear groups (cf. Section 3.4)
are characterized by the structure and intersections of their parabolic subgroups, or 
equivalently, by their action on an associated building. 

Remarks on the Existence Problem for Groups 

As in other areas of mathematics (such as the theory of differential equations) one 
may hypothesize the existence of a mathematical system (e.g., solution to an equation) 
and derive a great deal of information about this proposed system. In general, if after 
considerable effort no contradiction is reached based on the initial hypothesis one begins 
to suspect that there actually is a system which does satisfy the conditions hypothesized. 
However, no amount of consistent data will prove existence. Suppose we carried out 
an analysis of a hypothetical simple group G of order $3^3 \cdot 7 \cdot 13 \cdot 409$ analogous to our
analysis of a simple group of order 168 (which we showed to exist). After a certain
amount of effort we could show that there are unique possible Sylow numbers:

\[ n_3 = 7 \cdot 409 \qquad n_7 = 3^2 \cdot 13 \cdot 409 \qquad n_{13} = 3^2 \cdot 7 \cdot 409 \qquad n_{409} = 3^2 \cdot 7 \cdot 13. \]

We could further show that such a G would have no elements of order pq, p and 
q distinct primes, no elements of order 9, and that distinct Sylow subgroups would 
intersect in the identity. We could then count the elements in Sylow p-subgroups for 
all primes p and we would find that these would total to exactly |G|. At this point
we would have the complete subgroup structure and class equation for G. We might 
then guess that there is a simple group of this order, but the Feit-Thompson Theorem 
asserts that there are no simple groups of odd composite order. (Note, however, that 
the configuration for a possible simple group of order $3^3 \cdot 7 \cdot 13 \cdot 409$ is among the
cases that must be dealt with in the proof of the Feit-Thompson Theorem, so quoting 
this result in this instance is actually circular. We prove no simple group of this order 
exists in Section 19.3; see also Exercise 29.) The point is that even though we have as 
much data in this case as we had in the order 168 situation (i.e., Proposition 14), we 
cannot prove existence without some new techniques. 
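The numerical consistency claimed for the hypothetical group of order $3^3 \cdot 7 \cdot 13 \cdot 409$ can be checked directly. The Python sketch below is an illustrative addition. It takes the Sylow numbers quoted above, verifies the Sylow congruence and divisibility conditions, and counts elements under the stated assumption that distinct Sylow subgroups intersect in the identity.

```python
order = 3**3 * 7 * 13 * 409
sylow = {            # p: (order of a Sylow p-subgroup, n_p as quoted in the text)
    3: (27, 7 * 409),
    7: (7, 3**2 * 13 * 409),
    13: (13, 3**2 * 7 * 409),
    409: (409, 3**2 * 7 * 13),
}
# Each n_p is 1 mod p and divides the p'-part of the order.
for p, (size, n_p) in sylow.items():
    assert n_p % p == 1 and (order // size) % n_p == 0

# Distinct Sylow subgroups are assumed to meet trivially, so each contributes
# size - 1 nonidentity elements not shared with any other.
nonidentity = sum(n_p * (size - 1) for size, n_p in sylow.values())
print(nonidentity + 1, order)
```

The two printed numbers agree, which illustrates the point of the passage: a perfectly consistent count is compatible with the nonexistence of the group.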

When we are dealing with nonsimple groups we have at least one method of building 
larger groups from smaller ones: semidirect products. Even though this method is fairly 
restrictive it conveys the notion that nonsimple groups may be built up from smaller 
groups in some constructive fashion. This process breaks down completely for simple 
groups; and so this demarcation of techniques reinforces our appreciation for the Hölder
Program: determining the simple groups, and finding how these groups are put together
to form larger groups.

The study of simple groups, as illustrated in the preceding discussion of groups of 
order 168, uses many of the same tools as the study of nonsimple groups (to unravel their 
subgroup structures, etc.) but also requires other techniques for their construction. As 
we mentioned at the end of that discussion, these often involve algebraic or geometric 
methods which construct simple groups as automorphisms of mathematical structures 
that have intrinsic interest, and thereby link group theory to other areas of mathematics 
and science in fascinating ways. Thus while we have come a long way in the analysis 
of finite groups, there are a number of different areas in this branch of mathematics on 
which we have just touched. 

The analysis of infinite groups generally involves quite different methods, and in 
the next section we introduce some of these. 

EXERCISES 


Counting elements: 

1. Prove that for fixed $P \in Syl_p(G)$ if $P \cap R = 1$ for all $R \in Syl_p(G) - \{P\}$, then $P_1 \cap P_2 = 1$ 
whenever $P_1$ and $P_2$ are distinct Sylow p-subgroups of G. Deduce in this case that the 
number of nonidentity elements of p-power order in G is $(|P| - 1)\,|G : N_G(P)|$. 

2. In the group S3 x S3 exhibit a pair of Sylow 2-subgroups that intersect in the identity and 
exhibit another pair that intersect in a group of order 2. 

3. Prove that if |G| = 380 then G is not simple. [Just count elements of odd prime order.] 

4. Prove that there are no simple groups of order 80, 351, 3875 or 5313. 

5. Let G be a solvable group of order pm, where p is a prime not dividing m, and let 
$P \in Syl_p(G)$. If $N_G(P) = P$, prove that G has a normal subgroup of order m. Where 
was the solvability of G needed in the proof? (This result is true for nonsolvable groups 
as well — it is a special case of Burnside’s N/C-Theorem.) 

Exploiting subgroups of small index: 

6. Prove that there are no simple groups of order 2205, 4125, 5103, 6545 or 6435. 

Permutation representations: 

7. Prove that there are no simple groups of order 1755 or 5265. [Use Sylow 3-subgroups to 
show $G \le S_{13}$ and look at the normalizer of a Sylow 13-subgroup.] 

8. Prove that there are no simple groups of order 792 or 918. 

9. Prove that there are no simple groups of order 336. 

Playing p -subgroups off against each other: 

10. Prove that there are no simple groups of order 4095, 4389, 5313 or 6669. 

11. Prove that there are no simple groups of order 4851 or 5145. 

12. Prove that there are no simple groups of order 9555. [Let $Q \in Syl_{13}(G)$ and let 
$P \in Syl_7(N_G(Q))$. Argue that $Q \le N_G(P)$. Why is this a contradiction?] 

Normalizers of Sylow intersections: 

13. Let G be a group with more than one Sylow p-subgroup. Over all pairs of distinct Sylow 
p-subgroups let P and Q be chosen so that $|P \cap Q|$ is maximal. Show that $N_G(P \cap Q)$ 
has more than one Sylow p-subgroup and that any two distinct Sylow p-subgroups of 
$N_G(P \cap Q)$ intersect in the subgroup $P \cap Q$. (Thus $|N_G(P \cap Q)|$ is divisible by $p \cdot |P \cap Q|$ 
and by some prime other than p. Note that Sylow p-subgroups of $N_G(P \cap Q)$ need not 
be Sylow in G.) 

14. Prove that there are no simple groups of order 144, 525, 2025 or 3159. 

General exercises: 

15. Classify groups of order 105. 

16. Prove that there are no non-abelian simple groups of odd order < 10000. 

17. (a) Prove that there is no simple group of order 420. 

(b) Prove that there are no simple groups of even order < 500 except for orders 2, 60, 
168 and 360. 

18. Prove that if G is a group of order 36 then G has either a normal Sylow 2-subgroup or a 
normal Sylow 3-subgroup. 

19. Show that a group of order 12 with no subgroup of order 6 is isomorphic to $A_4$. 

20. Show that a group of order 24 with no element of order 6 is isomorphic to $S_4$. 

21. Generalize Lemma 13 by proving that if $n_p \not\equiv 1 \pmod{p^k}$ then there are distinct Sylow 
p-subgroups P and R of G such that $P \cap R$ is of index $\le p^{k-1}$ in both P and R. 

22. Suppose over all pairs of distinct Sylow p-subgroups of G, P and R are chosen with 
$|P \cap R|$ maximal. Prove that $N_G(P \cap R)$ is not a p-group. 

23. Let A and B be normal subsets of a Sylow p-subgroup P of G. Prove that if A and B are 
conjugate in G then they are conjugate in $N_G(P)$. 

24. Let G be a group of order pqr where p, q and r are primes with p < q < r. Prove that a 
Sylow r-subgroup of G is normal. 

25. Let G be a simple group of order $p^2qr$ where p, q and r are primes. Prove that |G| = 60. 

26. Prove or construct a counterexample to the assertion: if G is a group of order 168 with 
more than one Sylow 7-subgroup then G is simple. 

27. Show that if T is any set of points and lines satisfying properties (11) to (13) in the 
subsection on simple groups of order 168 then the graph of incidences for T is uniquely 
determined and is the same as Figure 1 (up to relabeling points and lines). [Take a line 
and any point not on this line. Depict the line as the base of an equilateral triangle and 
the point as the vertex of this triangle not on the base. Use the axioms to show that the 
incidences of the remaining points and lines are then uniquely determined as in Figure 1.] 

28. Let G be a simple group of order $3^3 \cdot 7 \cdot 13 \cdot 409$. Compute all permissible values of $n_p$ 
for each $p \in \{3, 7, 13, 409\}$ and reduce to the case where there is a unique possible value 
for each $n_p$. 

29. Given the information on the Sylow numbers for a hypothetical simple group of order 
$3^3 \cdot 7 \cdot 13 \cdot 409$, prove that there is no such group. [Work with the permutation representation 
of degree 819.] 

30. Suppose G is a simple group of order 720. Find as many properties of G as you can (Sylow 
numbers, isomorphism type of Sylow subgroups, conjugacy classes, etc.). Is there such a 
group? 





6.3 A WORD ON FREE GROUPS 


In this section we introduce the basic theory of so-called free groups. This will enable 
us to make precise the notions of generators and relations which were used in earlier 
chapters. The results of this section rely only on the basic theory of homomorphisms. 

The basic idea of a free group F(S) generated by a set S is that there are no relations 
satisfied by any of the elements in S (S is “free” of relations). For example, if S is the 
set {a, b} then the elements of the free group on the two generators a and b are of the 
form a, aa, ab, abab, bab, etc., called words in a and b, together with the inverses of 
these elements, and all these elements are considered distinct. If we group like terms 
together, then we obtain elements of the familiar form $a$, $b^{-3}$, $aba^{-1}b^2$, etc. Such 
elements are multiplied by concatenating their words (for example, the product of aba 
and $b^{-1}a^3b$ would simply be $abab^{-1}a^3b$). It is natural at the outset (even before we 
know S is contained in some group) to simply define F(S) to be the set of all words in S, 
where two such expressions are multiplied in F(S) by concatenating them. Although 
in essence this is what we do, it is necessary to be more formal in order to prove that 
this concatenation operation is well defined and associative. After all, even the familiar 
notation $a^n$ for the product $a \cdot a \cdots a$ (n terms) is permissible only because we know that 
this product is independent of the way it is bracketed (cf. the generalized associative law 
in Section 1.1). The formal construction of F(S) is carried out below for an arbitrary 
set S. 

One important property reflecting the fact that there are no relations that must be 
satisfied by the generators in S is that any map from the set S to a group G can be 
uniquely extended to a homomorphism from the group F(S) to G (basically since we 
have specified where the generators must go and the images of all the other elements 
are uniquely determined by the homomorphism property — the fact that there are 
no relations to worry about means that we can specify the images of the generators 
arbitrarily). This is frequently referred to as the universal property of the free group 
and in fact characterizes the group F(S). 

The notion of “freeness” occurs in many algebraic systems and it may already be 
familiar (using a different terminology) from elementary vector space theory. When 
the algebraic systems are vector spaces, F(S) is simply the vector space which has S 
as a basis. Every vector in this space is a unique linear combination of the elements of 
S (the analogue of a “word”). Any set map from the basis S to another vector space 
V extends uniquely to a linear transformation (i.e., vector space homomorphism) from 
F(S) to V. 

Before beginning the construction of F(S) we mention that one often sees the 
universal property described in the language of commutative diagrams. In this form it 
reads (for groups) as follows: given any set map $\varphi$ from the set S to a group G there is a 
unique homomorphism $\Phi : F(S) \to G$ such that $\Phi|_S = \varphi$, i.e., such that the following 
diagram commutes: 

[Commutative diagram: the inclusion $S \to F(S)$, the set map $\varphi : S \to G$, and the homomorphism $\Phi : F(S) \to G$.] 


As mentioned above, the only difficulty with the construction of F(S) is the ver- 
ification that the concatenation operation on the words in F(S) is well defined and 
associative. To prove the associative property for multiplication of words we return to 
the most basic level where all the exponents in the words of S are ±1. 

We first introduce inverses for elements of S and an identity. 

Let $S^{-1}$ be any set disjoint from S such that there is a bijection from S to $S^{-1}$. 
For each $s \in S$ denote its corresponding element in $S^{-1}$ by $s^{-1}$ and similarly for each 
$t \in S^{-1}$ let the corresponding element of S be denoted by $t^{-1}$ (so $(s^{-1})^{-1} = s$). Take 
a singleton set not contained in $S \cup S^{-1}$ and call it {1}. Let $1^{-1} = 1$ and for any 
$x \in S \cup S^{-1} \cup \{1\}$ let $x^1 = x$. 

Next we describe the elements of the free group on the set S. A word on S is by 
definition a sequence 

$(s_1, s_2, s_3, \ldots)$ where $s_i \in S \cup S^{-1} \cup \{1\}$ and $s_i = 1$ for all i sufficiently large 

(that is, for each sequence there is an N such that $s_i = 1$ for all $i \ge N$). Thus we can 
think of a word as a finite product of elements of S and their inverses (where repetitions 
are allowed). Next, in order to assure uniqueness of expressions we consider only words 
which have no obvious “cancellations” between adjacent terms (such as $baa^{-1}b = bb$). 
The word $(s_1, s_2, s_3, \ldots)$ is said to be reduced if 

(1) $s_{i+1} \ne s_i^{-1}$ for all i with $s_i \ne 1$, and 

(2) if $s_k = 1$ for some k, then $s_i = 1$ for all $i \ge k$. 


The reduced word (1, 1, 1, ...) is called the empty word and is denoted by 1. We 
now simplify the notation by writing the reduced word $(s_1^{\epsilon_1}, s_2^{\epsilon_2}, \ldots, s_n^{\epsilon_n}, 1, 1, 1, \ldots)$, 
$s_i \in S$, $\epsilon_i = \pm 1$, as $s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}$. Note that by definition, reduced words $r_1^{\delta_1} r_2^{\delta_2} \cdots r_m^{\delta_m}$ 
and $s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}$ are equal if and only if n = m and $r_i = s_i$, $\delta_i = \epsilon_i$ for $1 \le i \le n$. Let F(S) be 
the set of reduced words on S and embed S into F(S) by 

$s \mapsto (s, 1, 1, 1, \ldots)$. 

Under this set injection we identify S with its image and henceforth consider S as a 
subset of F(S). Note that if $S = \emptyset$, F(S) = {1}. 

We are now in a position to introduce the binary operation on F(S). The principal 
technical difficulty is to ensure that the product of two reduced words is again a reduced 
word. Although the definition appears to be complicated it is simply the formal rule 
for “successive cancellation” of juxtaposed terms which are inverses of each other 
(e.g., $ab^{-1}a$ times $a^{-1}ba$ should reduce to aa). Let $r_1^{\delta_1} r_2^{\delta_2} \cdots r_m^{\delta_m}$ and $s_1^{\epsilon_1} \cdots s_n^{\epsilon_n}$ be 
reduced words and assume first that $m \le n$. Let k be the smallest integer in the range 
$1 \le k \le m + 1$ such that $s_k^{\epsilon_k} \ne r_{m-k+1}^{-\delta_{m-k+1}}$. Then the product of these reduced words is 
defined to be: 

\[
(r_1^{\delta_1} \cdots r_m^{\delta_m})(s_1^{\epsilon_1} \cdots s_n^{\epsilon_n}) =
\begin{cases}
r_1^{\delta_1} \cdots r_{m-k+1}^{\delta_{m-k+1}} s_k^{\epsilon_k} \cdots s_n^{\epsilon_n}, & \text{if } k \le m \\
s_{m+1}^{\epsilon_{m+1}} \cdots s_n^{\epsilon_n}, & \text{if } k = m+1 \le n \\
1, & \text{if } k = m+1 \text{ and } m = n.
\end{cases}
\]

The product is defined similarly when m > n, so in either case it results in a reduced 
word. 
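
The "successive cancellation" rule can be sketched in code. This is a minimal illustration, not the text's formal construction: a reduced word is modeled as a Python tuple of letters, each letter a pair (generator, exponent) with exponent +1 or -1, and the empty tuple plays the role of the empty word 1. The names `inverse_letter`, `is_reduced` and `multiply` are assumptions made for this sketch.

```python
def inverse_letter(letter):
    """Inverse of a single letter: s^e becomes s^(-e)."""
    gen, exp = letter
    return (gen, -exp)

def is_reduced(word):
    """A word is reduced if no letter is followed by its own inverse."""
    return all(word[i + 1] != inverse_letter(word[i]) for i in range(len(word) - 1))

def multiply(r, s):
    """Product of reduced words r and s by successive cancellation.

    j counts the letters cancelled at the junction; in the notation of
    the text (for m <= n), j = k - 1.
    """
    m, n = len(r), len(s)
    j = 0
    while j < min(m, n) and s[j] == inverse_letter(r[m - 1 - j]):
        j += 1
    return r[: m - j] + s[j:]

# The example from the text: a b^-1 a times a^-1 b a reduces to aa.
a, A = ("a", 1), ("a", -1)
b, B = ("b", 1), ("b", -1)
u = (a, B, a)
v = (A, b, a)
print(multiply(u, v))
```

Because the loop works for either ordering of the lengths, the same function covers the case m > n that the text handles "similarly".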
Theorem 16. F(S) is a group under the binary operation defined above. 


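Proof: One easily checks that 1 is an identity and that the inverse of the reduced 
word $s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}$ is the reduced word $s_n^{-\epsilon_n} s_{n-1}^{-\epsilon_{n-1}} \cdots s_1^{-\epsilon_1}$. The difficult part of the proof 
is the verification of the associative law. This can be done by induction on the “length” 
of the words involved and considering various cases or one can proceed as follows: For 
each $s \in S \cup S^{-1} \cup \{1\}$ define $\sigma_s : F(S) \to F(S)$ by 

\[
\sigma_s(s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}) =
\begin{cases}
s \cdot s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}, & \text{if } s_1^{\epsilon_1} \ne s^{-1} \\
s_2^{\epsilon_2} s_3^{\epsilon_3} \cdots s_n^{\epsilon_n}, & \text{if } s_1^{\epsilon_1} = s^{-1}.
\end{cases}
\]

Since $\sigma_{s^{-1}} \circ \sigma_s$ is the identity map of $F(S) \to F(S)$, $\sigma_s$ is a permutation of F(S). Let 
A(S) be the subgroup of the symmetric group on the set F(S) which is generated by 
$\{\sigma_s \mid s \in S\}$. It is easy to see that the map 

\[
s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n} \mapsto \sigma_{s_1}^{\epsilon_1} \circ \sigma_{s_2}^{\epsilon_2} \circ \cdots \circ \sigma_{s_n}^{\epsilon_n}
\]

is a (set) bijection between F(S) and A(S) which respects their binary operations. Since 
A(S) is a group, hence associative, so is F(S). 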
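
The idea of the proof can be tested on small cases. The sketch below is an illustration under assumptions, not a proof: words are tuples of (generator, exponent) letters, `sigma` plays the role of $\sigma_s$, and `act` applies the composite of the $\sigma$'s attached to a word. The check is restricted to reduced words of length at most 3 on two generators, an arbitrary bound chosen to keep the enumeration small.

```python
from itertools import product

GENERATORS = ("a", "b")  # illustrative choice of the set S
LETTERS = [(g, e) for g in GENERATORS for e in (1, -1)]

def inverse_letter(letter):
    gen, exp = letter
    return (gen, -exp)

def multiply(r, s):
    """Product of reduced words by successive cancellation at the junction."""
    j = 0
    while j < min(len(r), len(s)) and s[j] == inverse_letter(r[len(r) - 1 - j]):
        j += 1
    return r[: len(r) - j] + s[j:]

def sigma(letter, word):
    """The map sigma_s: prepend the letter, cancelling if word starts with its inverse."""
    if word and word[0] == inverse_letter(letter):
        return word[1:]
    return (letter,) + word

def act(w, v):
    """Apply sigma for each letter of w to v, rightmost letter of w first."""
    for letter in reversed(w):
        v = sigma(letter, v)
    return v

def reduced_words(max_len):
    """All reduced words of length at most max_len."""
    words = []
    for n in range(max_len + 1):
        for w in product(LETTERS, repeat=n):
            if all(w[i + 1] != inverse_letter(w[i]) for i in range(n - 1)):
                words.append(w)
    return words

WORDS = reduced_words(3)

# The composite of sigma maps attached to w sends v to the product w * v ...
action_matches_product = all(act(w, v) == multiply(w, v) for w in WORDS for v in WORDS)

# ... so associativity of the product mirrors associativity of composition.
associative = all(
    multiply(multiply(u, v), w) == multiply(u, multiply(v, w))
    for u in WORDS for v in WORDS for w in WORDS
)
print(len(WORDS), action_matches_product, associative)
```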


The universal property of free groups now follows easily. 


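Theorem 17. Let G be a group, S a set and $\varphi : S \to G$ a set map. Then there is a unique 
group homomorphism $\Phi : F(S) \to G$ such that the following diagram commutes: 

[Commutative diagram: the inclusion $S \to F(S)$, the set map $\varphi : S \to G$, and the homomorphism $\Phi : F(S) \to G$.] 

Proof: Such a map $\Phi$ must satisfy $\Phi(s_1^{\epsilon_1} s_2^{\epsilon_2} \cdots s_n^{\epsilon_n}) = \varphi(s_1)^{\epsilon_1} \varphi(s_2)^{\epsilon_2} \cdots \varphi(s_n)^{\epsilon_n}$ 
if it is to be a homomorphism (which proves uniqueness), and it is straightforward to 
check that this map is in fact a homomorphism (which proves existence). 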
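
The formula in the proof translates directly into code. The following sketch assumes a concrete target group, the symmetric group on three points with permutations stored as tuples, and an arbitrarily chosen set map `phi`; both are illustrative choices, as are the helper names. Words are tuples of (generator, exponent) letters.

```python
def inverse_letter(letter):
    gen, exp = letter
    return (gen, -exp)

def multiply(r, s):
    """Product of reduced words in F(S) by successive cancellation."""
    j = 0
    while j < min(len(r), len(s)) and s[j] == inverse_letter(r[len(r) - 1 - j]):
        j += 1
    return r[: len(r) - j] + s[j:]

def compose(p, q):
    """Composition of permutations stored as tuples: (p o q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)

def invert(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def extend(phi, identity, op, inv):
    """Return the map Phi on words determined by the set map phi on generators."""
    def Phi(word):
        result = identity
        for gen, exp in word:
            g = phi[gen] if exp == 1 else inv(phi[gen])
            result = op(result, g)
        return result
    return Phi

# Hypothetical set map from S = {a, b} into the symmetric group on {0, 1, 2}.
identity = (0, 1, 2)
phi = {"a": (1, 0, 2), "b": (0, 2, 1)}
Phi = extend(phi, identity, compose, invert)

u = (("a", 1), ("b", -1), ("a", 1))
v = (("a", -1), ("b", 1), ("a", 1))
print(Phi(multiply(u, v)) == compose(Phi(u), Phi(v)))
```

Any choice of images for a and b would do here, which is the content of the universal property: there are no relations among the generators that the images have to respect.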

Corollary 18. F(S) is unique up to a unique isomorphism which is the identity map 
on the set S. 

Proof: This follows from the universal property. Suppose F(S) and F'(S) are 
two free groups generated by S. Since S is contained in both F(S) and F'(S), we have 
natural injections $S \hookrightarrow F'(S)$ and $S \hookrightarrow F(S)$. By the universal property in the theorem, 
it follows that we have unique associated group homomorphisms $\Phi : F(S) \to F'(S)$ 
and $\Phi' : F'(S) \to F(S)$ which are both the identity on S. The composite $\Phi'\Phi$ is a 
homomorphism from F(S) to F(S) which is the identity on S, so by the uniqueness 
statement in the theorem, it must be the identity map. Similarly $\Phi\Phi'$ is the identity, so 
$\Phi$ is an isomorphism (with inverse $\Phi'$), which proves the corollary. 
Definition. The group F(S) is called the free group on the set S. A group F is a free 
group if there is some set S such that $F \cong F(S)$; in this case we call S a set of free 
generators (or a free basis) of F. The cardinality of S is called the rank of the free 
group. 


One can now simplify expressions in a free group by using exponential notation, so 
we write $a^3b^{-2}$ instead of the formal reduced word $aaab^{-1}b^{-1}$. Expressions like aba, 
however, cannot be simplified in the free group on {a, b}. We mention one important 
theorem in this area. 

Theorem 19. (Schreier) Subgroups of a free group are free. 


This is not trivial to prove and we do not include a proof. There is a nice proof of 
this result using covering spaces (cf. Trees by J.-P. Serre, Springer-Verlag, 1980). 

Presentations 

Let G be any group. Then G is a homomorphic image of a free group: take S = G 
and $\varphi$ as the identity map from G to G; then Theorem 17 produces a (surjective) 
homomorphism from F(G) onto G. More generally, if S is any subset of G such 
that $G = \langle S \rangle$, then again there is a unique surjective homomorphism from F(S) onto 
G which is the identity on S. (Note that we can now independently formulate the 
notion that a subset generates a group by noting that $G = \langle S \rangle$ if and only if the map 
$\pi : F(S) \to G$ which extends the identity map of S to G is surjective.) 

Definition. Let S be a subset of a group G such that $G = \langle S \rangle$. 

(1) A presentation for G is a pair (S, R), where R is a set of words in F(S) such that 
the normal closure of $\langle R \rangle$ in F(S) (the smallest normal subgroup containing 
$\langle R \rangle$) equals the kernel of the homomorphism $\pi : F(S) \to G$ (where $\pi$ extends 
the identity map from S to S). The elements of S are called generators and those 
of R are called relations of G. 

(2) We say G is finitely generated if there is a presentation (S, R) such that S is a 
finite set and we say G is finitely presented if there is a presentation (S, R) with 
both S and R finite sets. 

Note that if (S, R) is a presentation, the kernel of the map $F(S) \to G$ is not $\langle R \rangle$ 
itself but rather the (much larger) group generated by R and all conjugates of elements 
in R. Note that even for a fixed set S a group will have many different presentations (we 
can always throw redundant relations into R, for example). If G is finitely presented 
with $S = \{s_1, s_2, \ldots, s_n\}$ and $R = \{w_1, w_2, \ldots, w_k\}$, we write (as we have in preceding 
chapters): 

$G = \langle s_1, s_2, \ldots, s_n \mid w_1 = w_2 = \cdots = w_k = 1 \rangle$ 
and if w is the word $w_1 w_2^{-1}$ we shall write $w_1 = w_2$ instead of w = 1. 
Examples 

(1) Every finite group is finitely presented. To see this let $G = \{g_1, \ldots, g_n\}$ be a finite 
group. Let S = G and let $\pi : F(S) \to G$ be the homomorphism extending the identity 
map of S. Let $R_0$ be the set of words $g_i g_j g_k^{-1}$, where $i, j = 1, \ldots, n$ and $g_i g_j = g_k$ in 
G. Clearly $R_0 \subseteq \ker \pi$. If N is the normal closure of $R_0$ in F(S) and $\tilde{G} = F(S)/N$, 
then G is a homomorphic image of $\tilde{G}$ (i.e., $\pi$ factors through N). Moreover, the set of 
elements $\{\tilde{g}_i \mid i = 1, \ldots, n\}$ (the images of the $g_i$ in $\tilde{G}$) is closed under multiplication. 
Since this set generates $\tilde{G}$, it must equal $\tilde{G}$. Thus $|\tilde{G}| = |G|$ and so $N = \ker \pi$ and 
$(S, R_0)$ is a presentation of G. 

This illustrates a sufficient condition for (S, R) to be a presentation for a given finite 
group G: 

(i) S must be a generating set for G, and 

(ii) any group generated by S satisfying the relations in R must have order $\le |G|$. 

(2) Abelian groups can be presented easily. For instance 

$Z \cong F(\{a\}) = \langle a \rangle$, 

$Z \times Z \cong \langle a, b \mid [a, b] = 1 \rangle$, 

$Z_n \times Z_m \cong \langle a, b \mid a^n = b^m = [a, b] = 1 \rangle$. 

(Recall $[a, b] = a^{-1}b^{-1}ab$.) 

(3) Some familiar non-abelian groups introduced in earlier chapters have simple 
presentations: 

$D_{2n} = \langle r, s \mid r^n = s^2 = 1,\ s^{-1}rs = r^{-1} \rangle$ 

$Q_8 = \langle i, j \mid i^4 = 1,\ j^2 = i^2,\ j^{-1}ij = i^{-1} \rangle$. 

To check, for example, the presentation for $D_{2n}$ note that the relations in the 
presentation $\langle r, s \mid r^n = s^2 = 1,\ s^{-1}rs = r^{-1} \rangle$ imply that this group has a normal subgroup 
(generated by r) of order $\le n$ whose quotient is generated by s (which has order $\le 2$). 
Thus any group with these generators and relations has order at most 2n. Since we 
already know of the existence of the group $D_{2n}$ of order 2n satisfying these conditions, 
the abstract presentation must equal $D_{2n}$. 

(4) As mentioned in Section 1.2, in general it is extremely difficult even to determine if a 
given set of generators and relations is or is not the identity group (let alone determine 
whether it is some other nontrivial finite group). For example, in the following two 
presentations the first group is an infinite group and the second is the identity group 
(cf. Trees , Chapter 1): 

$\langle x_1, x_2, x_3, x_4 \mid x_2x_1x_2^{-1} = x_1^2,\ x_3x_2x_3^{-1} = x_2^2,\ x_4x_3x_4^{-1} = x_3^2,\ x_1x_4x_1^{-1} = x_4^2 \rangle$ 
$\langle x_1, x_2, x_3 \mid x_2x_1x_2^{-1} = x_1^2,\ x_3x_2x_3^{-1} = x_2^2,\ x_1x_3x_1^{-1} = x_3^2 \rangle$. 

(5) It is easy to see that $S_n$ is generated by the transpositions (1 2), (2 3), ..., (n-1 n), 
and that these satisfy the relations 

$((i\ i{+}1)(i{+}1\ i{+}2))^3 = 1$ and $[(i\ i{+}1), (j\ j{+}1)] = 1$, whenever $|i - j| \ge 2$ 

(here $|i - j|$ denotes the absolute value of the integer $i - j$). One can prove by induction 
on n that these form a presentation of $S_n$: 

$S_n \cong \langle t_1, \ldots, t_{n-1} \mid t_i^2 = 1,\ (t_i t_{i+1})^3 = 1$, and $[t_i, t_j] = 1$ 

whenever $|i - j| \ge 2$, $1 \le i, j \le n - 1 \rangle$. 
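
The relations in Example 5 are easy to confirm by machine for small n. The sketch below assumes permutations of {0, ..., n-1} stored as tuples and uses illustrative helper names. It checks only that the adjacent transpositions satisfy the stated relations and generate a group of order n!; it does not show that the relations suffice to present $S_n$, which is the part that needs the induction mentioned above.

```python
from math import factorial

def compose(p, q):
    """Composition of permutations stored as tuples: (p o q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)

def invert(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def power(p, k):
    result = tuple(range(len(p)))
    for _ in range(k):
        result = compose(result, p)
    return result

def transposition(i, n):
    """The adjacent transposition (i i+1) of {1, ..., n}, stored 0-indexed."""
    p = list(range(n))
    p[i - 1], p[i] = p[i], p[i - 1]
    return tuple(p)

def commutator(x, y):
    """[x, y] = x^-1 y^-1 x y, as recalled in Example 2."""
    return compose(compose(invert(x), invert(y)), compose(x, y))

def generated_group(gens):
    """Closure of the generators under right multiplication (finite case)."""
    identity = tuple(range(len(gens[0])))
    seen = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for t in gens:
            h = compose(g, t)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen

def relations_hold(n):
    """Check t_i^2 = 1, (t_i t_{i+1})^3 = 1 and [t_i, t_j] = 1 for |i - j| >= 2."""
    identity = tuple(range(n))
    t = {i: transposition(i, n) for i in range(1, n)}
    squares = all(power(t[i], 2) == identity for i in t)
    braids = all(power(compose(t[i], t[i + 1]), 3) == identity for i in range(1, n - 1))
    commute = all(
        commutator(t[i], t[j]) == identity
        for i in t for j in t if abs(i - j) >= 2
    )
    return squares and braids and commute

for n in (3, 4, 5):
    gens = [transposition(i, n) for i in range(1, n)]
    print(n, relations_hold(n), len(generated_group(gens)) == factorial(n))
```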
As mentioned in Section 1.6 we can use presentations of a group to find 
homomorphisms between groups or to find automorphisms of a group. We did this in classifying 
groups of order 6, for example, when we proved that any non-abelian group of order 6 
was generated by an element of order 3 and an element of order 2 inverting it; thus there is 
a homomorphism from $S_3$ onto any non-abelian group of order 6 (hence an isomorphism, 
by computing orders). More generally, suppose G is presented by, say, generators a, b 
with relations $r_1, \ldots, r_k$. If a', b' are any elements of a group H satisfying these 
relations, there is a homomorphism from G into H. Namely, if $\pi : F(\{a, b\}) \to G$ is the 
presentation homomorphism, we can define $\pi' : F(\{a, b\}) \to H$ by $\pi'(a) = a'$ and 
$\pi'(b) = b'$. Then $\ker \pi \le \ker \pi'$ so $\pi'$ factors through $\ker \pi$ and we obtain 

$G \cong F(\{a, b\})/\ker \pi \to H$. 

In particular, if $\langle a', b' \rangle = H = G$, this homomorphism is an automorphism of G. 
Conversely, any automorphism must send a set of generators to another set of generators 
satisfying the same relations. For example, $D_8 = \langle a, b \mid a^2 = b^4 = 1,\ aba = b^{-1} \rangle$ 
and any pair a', b' of elements, where a' is a noncentral element of order 2 and b' is of 
order 4, satisfies the same relations. Since there are four noncentral elements of order 
2 and two elements of order 4, $D_8$ has 8 automorphisms. 

Similarly, any pair of elements of order 4 in $Q_8$ which are not equal or inverses of 
each other necessarily generate $Q_8$ and satisfy the relations given in Example 3 above. 
It is easy to check that there are 24 such pairs, so 

$|Aut(Q_8)| = 24$. 
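
The count for $D_8$ can be reproduced by brute force. This sketch assumes a concrete model of $D_8$ as permutations of the four vertices of a square (rotation `r` and reflection `s`, both illustrative names) and enumerates all pairs (a', b') that satisfy the relations $a^2 = b^4 = 1$, $aba = b^{-1}$ and generate the whole group.

```python
def compose(p, q):
    """Composition of permutations stored as tuples: (p o q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)

def invert(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generated_group(gens):
    """Closure of the generators under right multiplication (finite case)."""
    identity = tuple(range(len(gens[0])))
    seen = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for t in gens:
            h = compose(g, t)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen

def order(g):
    """Order of a permutation."""
    identity = tuple(range(len(g)))
    k, h = 1, g
    while h != identity:
        h = compose(h, g)
        k += 1
    return k

IDENTITY = (0, 1, 2, 3)
r = (1, 2, 3, 0)  # rotation of the square: vertex i goes to i + 1 (mod 4)
s = (0, 3, 2, 1)  # reflection: vertex i goes to -i (mod 4)
D8 = generated_group([r, s])

def satisfies_relations(a, b):
    """Check a^2 = 1, b^4 = 1 and aba = b^-1."""
    b2 = compose(b, b)
    return (
        compose(a, a) == IDENTITY
        and compose(b2, b2) == IDENTITY
        and compose(compose(a, b), a) == invert(b)
    )

# Each generating pair satisfying the relations determines one automorphism.
pairs = [
    (a, b)
    for a in D8 for b in D8
    if satisfies_relations(a, b) and generated_group([a, b]) == D8
]

center = {z for z in D8 if all(compose(z, g) == compose(g, z) for g in D8)}
noncentral_involutions = [g for g in D8 if order(g) == 2 and g not in center]
order_four = [g for g in D8 if order(g) == 4]

print(len(D8), len(pairs), len(noncentral_involutions), len(order_four))
```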

Free objects can be constructed in (many, but not all) other categories. For instance, 
a monoid is a set together with a binary operation satisfying all of the group axioms 
except the axiom specifying the existence of inverses. Free objects in the category of 
monoids play a fundamental role in theoretical computer science where they model the 
behavior of machines (Turing machines, etc.). We shall encounter free algebras (i.e., 
polynomial algebras) and free modules in later chapters. 

EXERCISES 

1. Let $F_1$ and $F_2$ be free groups of finite rank. Prove that $F_1 \cong F_2$ if and only if they have the 
same rank. What facts do you need in order to extend your proof to infinite ranks (where 
the result is also true)? 

2. Prove that if |S| > 1 then F(S) is non-abelian. 

3. Prove that the commutator subgroup of the free group on 2 generators is not finitely gener- 
ated (in particular, subgroups of finitely generated groups need not be finitely generated). 

4. Prove that every nonidentity element of a free group is of infinite order. 

5. Establish a finite presentation for $A_4$ using 2 generators. 

6. Establish a finite presentation for $S_4$ using 2 generators. 

7. Prove that the following is a presentation for the quaternion group of order 8: 

$Q_8 = \langle a, b \mid a^2 = b^2,\ a^{-1}ba = b^{-1} \rangle$. 

8. Use presentations to find the orders of the automorphism groups of the groups $Z_2 \times Z_4$ 
and $Z_4 \times Z_4$. 
9. Prove that $Aut(Q_8) \cong S_4$. 

10. This exercise exhibits an automorphism of $S_6$ that is not inner (hence, together with 
Exercise 19 in Section 4.4 it shows that $|Aut(S_6) : Inn(S_6)| = 2$). Let $t_1' = (1\ 2)(3\ 4)(5\ 6)$, 
$t_2' = (1\ 4)(2\ 5)(3\ 6)$, $t_3' = (1\ 3)(2\ 4)(5\ 6)$, $t_4' = (1\ 2)(3\ 6)(4\ 5)$, and $t_5' = (1\ 4)(2\ 3)(5\ 6)$. 
Show that $t_1', \ldots, t_5'$ satisfy the following relations: 

$(t_i')^2 = 1$ for all i, 

$(t_i' t_j')^2 = 1$ for all i and j with $|i - j| \ge 2$, and 
$(t_i' t_{i+1}')^3 = 1$ for all $i \in \{1, 2, 3, 4\}$. 

Deduce that $S_6 = \langle t_1', \ldots, t_5' \rangle$ and that the map 

$(1\ 2) \mapsto t_1'$, $(2\ 3) \mapsto t_2'$, $(3\ 4) \mapsto t_3'$, $(4\ 5) \mapsto t_4'$, $(5\ 6) \mapsto t_5'$ 

extends to an automorphism of $S_6$ (which is clearly not inner since it does not send 
transpositions to transpositions). [Use the presentation for $S_6$ described in Example 5.] 

11. Let S be a set. The group with presentation (S, R), where $R = \{[s, t] \mid s, t \in S\}$ is called 
the free abelian group on S; denote it by A(S). Prove that A(S) has the following 
universal property: if G is any abelian group and $\varphi : S \to G$ is any set map, then there is 
a unique group homomorphism $\Phi : A(S) \to G$ such that $\Phi|_S = \varphi$. Deduce that if A is a 
free abelian group on a set of cardinality n then 

$A \cong Z \times Z \times \cdots \times Z$ (n factors). 

12. Let S be a set and let c be a positive integer. Formulate the notion of a free nilpotent group 
on S of nilpotence class c and prove it has the appropriate universal property with respect 
to nilpotent groups of class $\le c$. 

13. Prove that there cannot be a nilpotent group N generated by two elements with the property 
that every nilpotent group which is generated by two elements is a homomorphic image 
of N (i.e., the specification of the class c in the preceding problem was necessary). 
Part II 


RING THEORY 


The theory of groups is concerned with general properties of certain objects having 
an algebraic structure defined by a single binary operation. The study of rings is 
concerned with objects possessing two binary operations (called addition and multipli- 
cation) related by the distributive laws. We first study analogues for the basic points 
of development in the structure theory of groups. In particular, we introduce subrings, 
quotient rings, ideals (which are the analogues of normal subgroups) and ring homo- 
morphisms. We then focus on questions about general rings which arise naturally from 
the presence of two binary operations. Questions concerning multiplicative inverses 
lead to the notion of fields and eventually to the construction of some specific fields 
such as finite fields. The study of the arithmetic (divisibility, greatest common divisors, 
etc.) of rings such as the familiar ring of integers, Z, leads to the notion of primes and 
unique factorizations in Chapter 8. The results of Chapters 7 and 8 are then applied to 
rings of polynomials in Chapter 9. 

The basic theory of rings developed in Part II is the cornerstone for the remaining 
four parts of the book. The theory of ring actions (modules) comprises Part III of the 
book. There we shall see how the structure of rings is reflected in the structure of the 
objects on which they act and this will enable us to prove some powerful classification 
theorems. The structure theory of rings, in particular of polynomial rings, forms the 
basis in Part IV for the theory of fields and polynomial equations over fields. There the 
rich interplay among ring theory, field theory and group theory leads to many beautiful 
results on the structure of fields and the theory of roots of polynomials. Part V continues 
the study of rings and applications of ring theory to such topics as geometry and the 
theory of extensions. In Part VI the study of certain specific kinds of rings (group rings) 
and the objects (modules) on which they act again gives deep classification theorems 
whose consequences are then exploited to provide new results and insights into finite 
groups. 
CHAPTER 7 


Introduction to Rings 


7.1 BASIC DEFINITIONS AND EXAMPLES 
Definition. 

(1) A ring R is a set together with two binary operations + and $\times$ (called addition 
and multiplication) satisfying the following axioms: 

(i) (R, +) is an abelian group, 

(ii) $\times$ is associative: $(a \times b) \times c = a \times (b \times c)$ for all $a, b, c \in R$, 

(iii) the distributive laws hold in R: for all $a, b, c \in R$ 

$(a + b) \times c = (a \times c) + (b \times c)$ and $a \times (b + c) = (a \times b) + (a \times c)$. 

(2) The ring R is commutative if multiplication is commutative. 

(3) The ring R is said to have an identity (or contain a 1) if there is an element 
$1 \in R$ with 

$1 \times a = a \times 1 = a$ for all $a \in R$. 


We shall usually write simply ab rather than $a \times b$ for $a, b \in R$. The additive 
identity of R will always be denoted by 0 and the additive inverse of the ring element 
a will be denoted by —a. 

The condition that R be a group under addition is a fairly natural one, but it may 
seem artificial to require that this group be abelian. One motivation for this is that if the 
ring R has a 1, the commutativity under addition is forced by the distributive laws. To 
see this, compute the product (1 + 1)(a + b) in two different ways, using the distributive 
laws (but not assuming that addition is commutative). One obtains 

(1 + 1)(a + b) = 1(a + b) + 1(a + b) = 1a + 1b + 1a + 1b = a + b + a + b 
and 

(1 + 1)(a + b) = (1 + 1)a + (1 + 1)b = 1a + 1a + 1b + 1b = a + a + b + b. 

Since R is a group under addition, this implies b + a = a + b, i.e., that R under addition 
is necessarily commutative. 

Fields are one of the most important examples of rings. Note that their definition 
below is just another formulation of the one given in Section 1.4. 
Definition. A ring R with identity 1, where $1 \ne 0$, is called a division ring (or skew 
field) if every nonzero element $a \in R$ has a multiplicative inverse, i.e., there exists 
$b \in R$ such that ab = ba = 1. A commutative division ring is called a field. 

More examples of rings follow. 

Examples 

(1) The simplest examples of rings are the trivial rings obtained by taking R to be any 
commutative group (denoting the group operation by +) and defining the multiplication 
$\times$ on R by $a \times b = 0$ for all $a, b \in R$. It is easy to see that this multiplication defines 
a commutative ring. In particular, if R = {0} is the trivial group, the resulting ring R 
is called the zero ring, denoted R = 0. Except for the zero ring, a trivial ring does 
not contain an identity (R = 0 is the only ring where 1 = 0; we shall often exclude 
this ring by imposing the condition $1 \ne 0$). Although trivial rings have two binary 
operations, multiplication adds no new structure to the additive group and the theory of 
rings gives no information which could not already be obtained from (abelian) group 
theory. 

(2) The ring of integers, Z, under the usual operations of addition and multiplication is a 
commutative ring with identity (the integer 1). The ring axioms (as with the additive 
group axioms) follow from the basic axioms for the system of natural numbers. Note 
that under multiplication $Z - \{0\}$ is not a group (in fact, there are very few multiplicative 
inverses to elements in this ring). We shall come back to the question of these inverses 
shortly. 

(3) Similarly, the rational numbers, Q, the real numbers, R, and the complex numbers, C, 
are commutative rings with identity (in fact they are fields). The ring axioms for each 
of these follow ultimately from the ring axioms for Z. We shall verify this when we 
construct Q from Z (Section 7.5) and C from R (Example 1, Section 13.1); both of 
these constructions will be special cases of more general processes. The construction 
of R from Q (and subsequent verification of the ring axioms) is carried out in basic 
analysis texts. 

(4) The quotient group Z/nZ is a commutative ring with identity (the element 1) under the 
operations of addition and multiplication of residue classes (frequently referred to as 
“modular arithmetic”). We saw that the additive abelian group axioms followed from 
the general principles of the theory of quotient groups (indeed this was the prototypical 
quotient group). We shall shortly prove that the remaining ring axioms (in particular, 
the fact that multiplication of residue classes is well defined) follow analogously from 
the general theory of quotient rings. 

In all of the examples so far the rings have been commutative. Historically, one of the first 
noncommutative rings was discovered in 1843 by Sir William Rowan Hamilton (1805-1865). 
This ring, which is a division ring, was extremely influential in the subsequent 
development of mathematics and it continues to play an important role in certain areas of 
mathematics and physics. 

(5) (The (real) Hamilton Quaternions) Let H be the collection of elements of the form
a + bi + cj + dk where a, b, c, d ∈ R are real numbers (loosely, “polynomials in
1, i, j, k with real coefficients”) where addition is defined “componentwise” by

$$(a + bi + cj + dk) + (a' + b'i + c'j + d'k) = (a + a') + (b + b')i + (c + c')j + (d + d')k$$

and multiplication is defined by expanding $(a + bi + cj + dk)(a' + b'i + c'j + d'k)$
using the distributive law (being careful about the order of terms) and simplifying
using the relations

$$i^2 = j^2 = k^2 = -1, \quad ij = -ji = k, \quad jk = -kj = i, \quad ki = -ik = j$$

(where the real number coefficients commute with i, j and k). For example,

$$(1 + i + 2j)(j + k) = 1(j + k) + i(j + k) + 2j(j + k) = j + k + ij + ik + 2j^2 + 2jk$$
$$= j + k + k + (-j) + 2(-1) + 2(i) = -2 + 2i + 2k.$$


The fact that H is a ring may be proved by a straightforward, albeit lengthy, check
of the axioms (associativity of multiplication is particularly tedious). The Hamilton
Quaternions are a noncommutative ring with identity (1 = 1 + 0i + 0j + 0k). Similarly,
one can define the ring of rational Hamilton Quaternions by taking a, b, c, d to be
rational numbers above. Both the real and rational Hamilton Quaternions are division
rings, where inverses of nonzero elements are given by

$$(a + bi + cj + dk)^{-1} = \frac{a - bi - cj - dk}{a^2 + b^2 + c^2 + d^2}.$$



(6) One important class of rings is obtained by considering rings of functions. Let X 
be any nonempty set and let A be any ring. The collection, R, of all (set) functions 
f : X → A is a ring under the usual definition of pointwise addition and multiplication
of functions: (f + g)(x) = f(x) + g(x) and (fg)(x) = f(x)g(x). Each ring axiom
for R follows directly from the corresponding axiom for A. The ring R is commutative 
if and only if A is commutative and R has a 1 if and only if A has a 1 (in which case 
the 1 of R is necessarily the constant function 1 on X). 

If X and A have more structure, we may form other rings of functions which 
respect those structures. For instance, if A is the ring of real numbers ℝ and X is
the closed interval [0, 1] in ℝ we may form the ring of all continuous functions from
[0, 1] to ℝ (here we need basic limit theorems to guarantee that sums and products of
continuous functions are continuous); this is a commutative ring with 1.

(7) An example of a ring which does not have an identity is the ring 2Z of even integers
under usual addition and multiplication of integers (the sum and product of even 
integers is an even integer). 

Another example which arises naturally in analysis is constructed as follows. A 
function f : R → R is said to have compact support if there are real numbers a, b
(depending on f) such that f(x) = 0 for all x ∉ [a, b] (i.e., f is zero outside some
bounded interval). The set of all functions f : R → R with compact support is a
commutative ring without identity (since an identity could not have compact support).
Similarly, the set of all continuous functions f : R → R with compact support is a
commutative ring without identity. 
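
The multiplication rule for the Hamilton Quaternions in Example 5 can be checked mechanically. The following is a minimal Python sketch, not part of the text's development: it assumes a quaternion a + bi + cj + dk is stored as the 4-tuple (a, b, c, d), and the helper names qmul and qinv are hypothetical. It reproduces the worked product (1 + i + 2j)(j + k), shows that ij and ji differ, and tests the inverse formula on one element.

```python
from fractions import Fraction

def qmul(p, q):
    """Multiply quaternions stored as tuples (a, b, c, d) = a + bi + cj + dk.

    The formula is the expansion of the product by the distributive law,
    simplified with i^2 = j^2 = k^2 = -1, ij = k, jk = i, ki = j.
    """
    a, b, c, d = p
    e, f, g, h = q
    return (
        a * e - b * f - c * g - d * h,  # real part
        a * f + b * e + c * h - d * g,  # coefficient of i
        a * g - b * h + c * e + d * f,  # coefficient of j
        a * h + b * g - c * f + d * e,  # coefficient of k
    )

def qinv(p):
    """Inverse of a nonzero quaternion: conjugate divided by a^2 + b^2 + c^2 + d^2."""
    a, b, c, d = (Fraction(x) for x in p)
    n = a * a + b * b + c * c + d * d
    if n == 0:
        raise ZeroDivisionError("0 has no multiplicative inverse")
    return (a / n, -b / n, -c / n, -d / n)

i, j, k = (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)
x = (1, 1, 2, 0)                      # 1 + i + 2j
print(qmul(x, (0, 0, 1, 1)))          # the worked example (1 + i + 2j)(j + k)
print(qmul(i, j), qmul(j, i))         # ij = k but ji = -k
print(qmul(x, qinv(x)))               # the identity 1 = (1, 0, 0, 0)
```

Exact rational arithmetic (Fraction) is used in qinv so that the product with the inverse is exactly 1 rather than a floating-point approximation; this corresponds to working in the rational Hamilton Quaternions.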


In the next section we give three important ways of constructing “larger” rings
from a given ring (analogous to Example 6 above) and thus greatly expand our list
of examples. Before doing so we mention some basic properties of arbitrary rings.
The ring Z is a good example to keep in mind, although this ring has a good deal
more algebraic structure than a general ring (for example, it is commutative and has 
an identity). Nonetheless, its basic arithmetic holds for general rings as the following 
result shows. 


Sec. 7.1 Basic Definitions and Examples 





Proposition 1. Let R be a ring. Then

(1) 0a = a0 = 0 for all a ∈ R.

(2) (-a)b = a(-b) = -(ab) for all a, b ∈ R (recall -a is the additive inverse of
a).

(3) (-a)(-b) = ab for all a, b ∈ R.

(4) if R has an identity 1, then the identity is unique and -a = (-1)a.

Proof: These all follow from the distributive laws and cancellation in the additive
group R. For example, (1) follows from 0a = (0 + 0)a = 0a + 0a. The equality
(-a)b = -(ab) in (2) follows from ab + (-a)b = (a + (-a))b = 0b = 0. The rest
follow similarly and are left to the reader.

This proposition shows that because of the distributive laws the additive and mul- 
tiplicative structures of a ring behave well with respect to one another, just as in the 
familiar example of the integers. 

Unlike the integers, however, general rings may possess many elements that have 
multiplicative inverses or may have nonzero elements a and b whose product is zero. 
These two properties of elements, which relate to the multiplicative structure of a ring, 
are given special names. 

Definition. Let R be a ring. 

(1) A nonzero element a of R is called a zero divisor if there is a nonzero element 
b in R such that either ab = 0 or ba = 0. 

(2) Assume R has an identity 1 ≠ 0. An element u of R is called a unit in R if there
is some v in R such that uv = vu = 1. The set of units in R is denoted $R^\times$.

It is easy to see that the units in a ring R form a group under multiplication so $R^\times$
will be referred to as the group of units of R. In this terminology a field is a commutative
ring F with identity 1 ≠ 0 in which every nonzero element is a unit, i.e., $F^\times = F - \{0\}$.

Observe that a zero divisor can never be a unit. Suppose for example that a is a
unit in R and that ab = 0 for some nonzero b in R. Then va = 1 for some v ∈ R, so
b = 1b = (va)b = v(ab) = v0 = 0, a contradiction. Similarly, if ba = 0 for some
nonzero b then a cannot be a unit.

This shows in particular that fields contain no zero divisors. 

Examples 

(1) The ring Z of integers has no zero divisors and its only units are ±1, i.e., $Z^\times = \{\pm 1\}$.
Note that every nonzero integer has an inverse in the larger ring Q, so the property of
being a unit depends on the ring in which an element is viewed.

(2) Let n be an integer ≥ 2. In the ring Z/nZ the elements u for which u and n are
relatively prime are units (we shall prove this in the next chapter). Thus our use of the
notation $(Z/nZ)^\times$ is consistent with the definition of the group of units in an arbitrary
ring.

If, on the other hand, a is a nonzero integer and a is not relatively prime to n then
we show that a is a zero divisor in Z/nZ. To see this let d be the g.c.d. of a and n and
let b = n/d. By assumption d > 1 so 0 < b < n, i.e., b ≠ 0 in Z/nZ. But by construction n
divides ab, that is, ab = 0 in Z/nZ. This shows that every nonzero element of Z/nZ
is either a unit or a zero divisor. Furthermore, every nonzero element is a unit if and
only if every integer a in the range 0 < a < n is relatively prime to n. This happens
if and only if n is a prime, i.e., Z/nZ is a field if and only if n is a prime.

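(3) If R is the ring of all functions from the closed interval [0,1] to ℝ then the units of R
are the functions that are not zero at any point (for such f its inverse is the function
1/f). If f is not a unit and not zero then f is a zero divisor because if we define

$$g(x) = \begin{cases} 0, & \text{if } f(x) \neq 0 \\ 1, & \text{if } f(x) = 0 \end{cases}$$

then g is not the zero function but f(x)g(x) = 0 for all x.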

(4) If R is the ring of all continuous functions from the closed interval [0,1] to ℝ then
the units of R are still the functions that are not zero at any point, but now there are
functions that are neither units nor zero divisors. For instance, f(x) = x - 1/2 has only
one zero (at x = 1/2) so f is not a unit. On the other hand, if gf = 0 then g must
be zero for all x ≠ 1/2, and the only continuous function with this property is the zero
function. Hence f is neither a unit nor a zero divisor. Similarly, no function with
only a finite (or countable) number of zeros on [0,1] is a zero divisor. This ring also
contains many zero divisors. For instance let

$$f(x) = \begin{cases} 0, & 0 \le x \le \tfrac{1}{2} \\ x - \tfrac{1}{2}, & \tfrac{1}{2} \le x \le 1 \end{cases}$$

and let g(x) = f(1 - x). Then f and g are nonzero continuous functions whose
product is the zero function.

(5) Let D be a rational number that is not a perfect square in Q and define

$$Q(\sqrt{D}) = \{a + b\sqrt{D} \mid a, b \in Q\}$$

as a subset of C. This set is clearly closed under subtraction, and the identity
$(a + b\sqrt{D})(c + d\sqrt{D}) = (ac + bdD) + (ad + bc)\sqrt{D}$ shows that it is also closed under
multiplication. Hence $Q(\sqrt{D})$ is a subring of C (even a subring of R if D > 0), so in
particular is a commutative ring with identity. It is easy to show that the assumption
that D is not a square implies that every element of $Q(\sqrt{D})$ may be written uniquely
in the form $a + b\sqrt{D}$. This assumption also implies that if a and b are not both 0 then
$a^2 - Db^2$ is nonzero, and since $(a + b\sqrt{D})(a - b\sqrt{D}) = a^2 - Db^2$ it follows that if
$a + b\sqrt{D} \neq 0$ (i.e., one of a or b is nonzero) then

$$\frac{a - b\sqrt{D}}{a^2 - Db^2}$$

is the inverse of $a + b\sqrt{D}$ in $Q(\sqrt{D})$. This shows that every nonzero element in this
commutative ring is a unit, i.e., $Q(\sqrt{D})$ is a field (called a quadratic field, cf. Section 13.2).

The rational number D may be written $D = f^2 D'$ for some rational number f and
a unique integer D' where D' is not divisible by the square of any integer greater than
1, i.e., D' is either -1 or ±1 times the product of distinct primes in Z (for example,
$8/5 = (2/5)^2 \cdot 10$). Call D' the squarefree part of D. Then $\sqrt{D} = f\sqrt{D'}$, and so
$Q(\sqrt{D}) = Q(\sqrt{D'})$. Thus there is no loss in assuming that D is a squarefree integer
(i.e., f = 1) in the definition of the quadratic field $Q(\sqrt{D})$.
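
The product and inverse formulas for the quadratic field in Example 5 lend themselves to a short computational check. The sketch below is an illustration only: it assumes an element a + b√D is stored as the pair (a, b) of rationals, and the names qd_mul and qd_inv are hypothetical.

```python
from fractions import Fraction

def qd_mul(x, y, D):
    """Product of x = a + b*sqrt(D) and y = c + d*sqrt(D), each stored as a pair."""
    a, b = x
    c, d = y
    return (a * c + b * d * D, a * d + b * c)

def qd_inv(x, D):
    """Inverse (a - b*sqrt(D)) / (a^2 - D*b^2) of a nonzero element a + b*sqrt(D)."""
    a, b = Fraction(x[0]), Fraction(x[1])
    norm = a * a - D * b * b
    if norm == 0:
        # Cannot happen for (a, b) != (0, 0) when D is not a square in Q.
        raise ZeroDivisionError("element is zero (or D is a perfect square)")
    return (a / norm, -b / norm)

x = (Fraction(3), Fraction(4))        # 3 + 4*sqrt(D)
for D in (-1, 2, Fraction(8, 5)):
    # Each product x * x^(-1) should come out as (1, 0), i.e. the identity.
    print(D, qd_inv(x, D), qd_mul(x, qd_inv(x, D), D))
```

The guard in qd_inv marks exactly where the hypothesis that D is not a perfect square is used: for D = 4, say, the nonzero pair (2, 1) has a² - Db² = 0 and no inverse exists.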
Rings having some of the same characteristics as the integers Z are given a name:

Definition. A commutative ring with identity 1 ≠ 0 is called an integral domain if it
has no zero divisors.

The absence of zero divisors in integral domains gives these rings a cancellation
property:

Proposition 2. Assume a, b and c are elements of any ring with a not a zero divisor. If
ab = ac, then either a = 0 or b = c (i.e., if a ≠ 0 we can cancel the a’s). In particular,
if a, b, c are any elements in an integral domain and ab = ac, then either a = 0 or
b = c.


Proof: If ab = ac then a(b - c) = 0 so either a = 0 or b - c = 0. The second
statement follows from the first and the definition of an integral domain.

Corollary 3. Any finite integral domain is a field. 

Proof: Let R be a finite integral domain and let a be a nonzero element of R. By 
the cancellation law the map x ↦ ax is an injective function. Since R is finite this map
is also surjective. In particular, there is some b ∈ R such that ab = 1, i.e., a is a unit
in R. Since a was an arbitrary nonzero element, R is a field. 
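
Both the unit/zero-divisor dichotomy in Z/nZ (Example 2 above) and the argument of Corollary 3 can be explored numerically. The following Python sketch is an illustration under stated assumptions, not a proof: residues are represented by the integers 0, ..., n - 1, and the function names are hypothetical.

```python
from math import gcd

def classify(n):
    """Split the nonzero residues of Z/nZ into units and zero divisors."""
    units = [a for a in range(1, n) if gcd(a, n) == 1]
    zero_divisors = [a for a in range(1, n) if gcd(a, n) > 1]
    return units, zero_divisors

def annihilator_witness(a, n):
    """For gcd(a, n) = d > 1 return b = n/d, a nonzero residue with a*b = 0 in Z/nZ."""
    return n // gcd(a, n)

def inverse_by_surjectivity(a, n):
    """Find b with a*b = 1 by listing the image of x -> a*x, as in Corollary 3.

    Returns None when 1 is not in the image (so a is not a unit).
    """
    image = [(a * x) % n for x in range(n)]
    return image.index(1) if 1 in image else None

print(classify(12))
print(annihilator_witness(8, 12), (8 * annihilator_witness(8, 12)) % 12)
print([inverse_by_surjectivity(a, 7) for a in range(1, 7)])
```

For the prime n = 7 every nonzero a yields an inverse, in line with Z/7Z being a field; for a composite modulus such as n = 6 the map x ↦ 2x is not injective and the search returns None.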

A remarkable result of Wedderburn is that a finite division ring is necessarily
commutative, i.e., is a field. A proof of this theorem is outlined in the exercises at the end
of Section 13.6. 

In Section 5 we study the relation between zero divisors and units in greater detail. 
We shall see that every nonzero element of a commutative ring that is not a zero divisor 
has a multiplicative inverse in some larger ring. This gives another perspective on the 
cancellation law in Proposition 2. 

Having defined the notion of a ring, there is a natural notion of a subring. 

Definition. A subring of the ring R is a subgroup of R that is closed under multipli- 
cation. 

In other words, a subset S of a ring R is a subring if the operations of addition and 
multiplication in R when restricted to S give S the structure of a ring. To show that a 
subset of a ring R is a subring it suffices to check that it is nonempty and closed under 
subtraction and under multiplication.

Examples 

A number of the examples above were also subrings. 

(1) Z is a subring of Q and Q is a subring of R. The property “is a subring of” is clearly
transitive.

(2) 2Z is a subring of Z, as is nZ for any integer n. The ring Z/nZ is not a subring (or a
subgroup) of Z for any n ≥ 2.

(3) The ring of all continuous functions from R to R is a subring of the ring of all functions
from R to R. The ring of all differentiable functions from R to R is a subring of both 
of these. 

(4) S = Z + Zi + Zj + Zk, the integral Quaternions, form a subring of either the real or 
the rational Quaternions — it is easy to check that multiplying two such quaternions 
together gives another quaternion with integer coefficients. This ring (which is not a 
division ring) can be used to give proofs for a number of results in number theory. 

(5) If R is a subring of a field F that contains the identity of F then R is an integral 
domain. The converse of this is also true, namely any integral domain is contained in 
a field (cf. Section 5). 


Example: (Quadratic Integer Rings) 

Let D be a squarefree integer. It is immediate from the addition and multiplication that the
subset $Z[\sqrt{D}] = \{a + b\sqrt{D} \mid a, b \in Z\}$ forms a subring of the quadratic field $Q(\sqrt{D})$
defined earlier. If D ≡ 1 mod 4 then the slightly larger subset

$$Z\left[\frac{1+\sqrt{D}}{2}\right] = \left\{a + b\,\frac{1+\sqrt{D}}{2} \;\middle|\; a, b \in Z\right\}$$

is also a subring: closure under addition is immediate and

$$\left(a + b\,\frac{1+\sqrt{D}}{2}\right)\left(c + d\,\frac{1+\sqrt{D}}{2}\right) = \left(ac + bd\,\frac{D-1}{4}\right) + (ad + bc + bd)\,\frac{1+\sqrt{D}}{2}$$

together with the congruence on D shows closure under multiplication.

Define

$$O = O_{Q(\sqrt{D})} = Z[\omega] = \{a + b\omega \mid a, b \in Z\},$$

where

$$\omega = \begin{cases} \sqrt{D}, & \text{if } D \equiv 2, 3 \bmod 4 \\ \dfrac{1+\sqrt{D}}{2}, & \text{if } D \equiv 1 \bmod 4, \end{cases}$$

called the ring of integers in the quadratic field $Q(\sqrt{D})$. The terminology comes from the
fact that the elements of the subring O of the field $Q(\sqrt{D})$ have many properties analogous
to those of the subring of integers Z in the field of rational numbers Q (and are the integral
closure of Z in $Q(\sqrt{D})$ as explained in Section 15.3).

In the special case when D = — 1 we obtain the ring Z[i] of Gaussian integers, which 
are the complex numbers a + bi ∈ C with a and b both integers. These numbers were
originally introduced by Gauss around 1800 in order to state the biquadratic reciprocity law
which deals with the beautiful relations that exist among fourth powers modulo primes. 
We shall shortly see another useful application of the algebraic structure of this ring to 
number theoretic questions. 

Define the field norm $N : Q(\sqrt{D}) \to Q$ by

$$N(a + b\sqrt{D}) = (a + b\sqrt{D})(a - b\sqrt{D}) = a^2 - Db^2 \in Q,$$

which, as previously mentioned, is nonzero if $a + b\sqrt{D} \neq 0$. This norm gives a measure
of “size” in the field $Q(\sqrt{D})$. For instance when D = -1 the norm of a + bi is $a^2 + b^2$,
which is the square of the length of this complex number considered as a vector in the
complex plane. We shall use the field norm in this and subsequent examples to establish
many properties of the rings O.

It is easy to check that N is multiplicative, i.e., that $N(\alpha\beta) = N(\alpha)N(\beta)$ for all
$\alpha, \beta \in Q(\sqrt{D})$. On the subring O it is also easy to see that the field norm is given by

$$N(a + b\omega) = (a + b\omega)(a + b\bar{\omega}) = \begin{cases} a^2 - Db^2, & \text{if } D \equiv 2, 3 \bmod 4 \\ a^2 + ab + \dfrac{1-D}{4}\,b^2, & \text{if } D \equiv 1 \bmod 4 \end{cases}$$

where

$$\bar{\omega} = \begin{cases} -\sqrt{D}, & \text{if } D \equiv 2, 3 \bmod 4 \\ \dfrac{1-\sqrt{D}}{2}, & \text{if } D \equiv 1 \bmod 4. \end{cases}$$

It follows that $N(\alpha)$ is in fact an integer for every $\alpha \in O$.

We may use this norm to characterize the units in O. If $\alpha = a + b\omega \in O$ has field norm
$N(\alpha) = \pm 1$, the previous formula shows that $(a + b\omega)^{-1} = \pm(a + b\bar{\omega})$, which is again
an element of O and so α is a unit in O. Suppose conversely that α is a unit in O, say
αβ = 1 for some β ∈ O. Then the multiplicative property of the field norm implies that
N(α)N(β) = N(αβ) = N(1) = 1. Since both N(α) and N(β) are integers, each must be
±1. Hence,


the element α is a unit in O if and only if N(α) = ±1.

In particular the determination of the integer solutions to the equation $x^2 - Dy^2 = \pm 1$
(called Pell’s equation in elementary number theory) is essentially equivalent to the
determination of the units in the ring O.

When D = -1, the units in the Gaussian integers Z[i] are the elements a + bi with
$a^2 + b^2 = \pm 1$, a, b ∈ Z, so the group of units consists of {±1, ±i}. When D = -3, the
units in $Z[(1 + \sqrt{-3})/2]$ are determined by the integers a, b with $a^2 + ab + b^2 = \pm 1$, i.e.,
with $(2a + b)^2 + 3b^2 = \pm 4$, from which it is easy to see that the group of units is a group
of order 6 given by $\{\pm 1, \pm\rho, \pm\rho^2\}$ where $\rho = (-1 + \sqrt{-3})/2$. For any other D < 0 it is
similarly straightforward to see that the only units are {±1}.

By contrast, when D > 0 it can be shown that the group of units $O^\times$ is always infinite.
For example, it is easy to check that $1 + \sqrt{2}$ is a unit in the ring $O = Z[\sqrt{2}]$ (with field
norm -1) and that $\{\pm(1 + \sqrt{2})^n \mid n \in Z\}$ is an infinite set of distinct units (in fact the full
group of units in this case, but this is harder to prove).
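
The norm criterion for units can be tried out on small cases. The Python sketch below is only an experiment under stated assumptions: an element a + bω of O is stored as the integer pair (a, b), D is assumed squarefree, and the names norm, mul and small_units are hypothetical. A bounded search cannot prove that a unit group is finite or infinite; it only lists the units with small coordinates.

```python
def norm(a, b, D):
    """Field norm of a + b*omega in the ring of integers O of Q(sqrt(D)), D squarefree."""
    if D % 4 == 1:                       # omega = (1 + sqrt(D))/2
        return a * a + a * b + (1 - D) // 4 * b * b
    return a * a - D * b * b             # omega = sqrt(D)

def mul(x, y, D):
    """Product of x = a + b*omega and y = c + d*omega, written again as a pair."""
    a, b = x
    c, d = y
    if D % 4 == 1:                       # omega^2 = omega + (D - 1)/4
        return (a * c + b * d * (D - 1) // 4, a * d + b * c + b * d)
    return (a * c + b * d * D, a * d + b * c)   # omega^2 = D

def small_units(D, bound):
    """All a + b*omega with |a|, |b| <= bound and norm +1 or -1."""
    r = range(-bound, bound + 1)
    return [(a, b) for a in r for b in r if norm(a, b, D) in (1, -1)]

print(small_units(-1, 3))                # the four units of Z[i]
print(len(small_units(-3, 3)))           # six units when D = -3
print(small_units(-5, 3))                # only +1 and -1

x = (1, 0)
for _ in range(5):                       # successive powers of 1 + sqrt(2)
    x = mul(x, (1, 1), 2)
    print(x, norm(x[0], x[1], 2))
```

The test D % 4 == 1 relies on Python returning a nonnegative remainder for negative D, so D = -3 is treated as congruent to 1 mod 4, as in the text.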


EXERCISES 


Let R be a ring with 1. 

1. Show that $(-1)^2 = 1$ in R.

2. Prove that if u is a unit in R then so is —u. 

3. Let R be a ring with identity and let S be a subring of R containing the identity. Prove that 
if u is a unit in S then u is a unit in R. Show by example that the converse is false. 

4. Prove that the intersection of any nonempty collection of subrings of a ring is also a subring.

5. Decide which of the following (a) - (f) are subrings of Q:

(a) the set of all rational numbers with odd denominators (when written in lowest terms) 

(b) the set of all rational numbers with even denominators (when written in lowest terms) 

(c) the set of nonnegative rational numbers 

(d) the set of squares of rational numbers 

(e) the set of all rational numbers with odd numerators (when written in lowest terms)

(f) the set of all rational numbers with even numerators (when written in lowest terms).

6. Decide which of the following are subrings of the ring of all functions from the closed 
interval [0,1] to R: 

(a) the set of all functions f(x) such that f(q) = 0 for all q ∈ Q ∩ [0, 1]

(b) the set of all polynomial functions 

(c) the set of all functions which have only a finite number of zeros, together with the 
zero function 

(d) the set of all functions which have an infinite number of zeros 

(e) the set of all functions f such that lim f(x) = 0

(f) the set of all rational linear combinations of the functions sin nx and cos mx, where
m, n ∈ {0, 1, 2, ...}.

7. The center of a ring R is {z ∈ R | zr = rz for all r ∈ R} (i.e., is the set of all elements
which commute with every element of R). Prove that the center of a ring is a subring that
contains the identity. Prove that the center of a division ring is a field.

8. Describe the center of the real Hamilton Quaternions H. Prove that {a + bi | a, b ∈ ℝ} is
a subring of H which is a field but is not contained in the center of H.

9. For a fixed element a ∈ R define C(a) = {r ∈ R | ra = ar}. Prove that C(a) is a subring
of R containing a. Prove that the center of R is the intersection of the subrings C(a) over
all a ∈ R.

10. Prove that if D is a division ring then C(a) is a division ring for all a ∈ D (cf. the preceding
exercise).

11. Prove that if R is an integral domain and $x^2 = 1$ for some x ∈ R then x = ±1.

12. Prove that any subring of a field which contains the identity is an integral domain. 

13. An element x in R is called nilpotent if $x^m = 0$ for some $m \in Z^+$.

(a) Show that if $n = a^k b$ for some integers a and b then $\overline{ab}$ is a nilpotent element of
Z/nZ.

(b) If a ∈ Z is an integer, show that the element $\bar{a} \in Z/nZ$ is nilpotent if and only if
every prime divisor of n is also a divisor of a. In particular, determine the nilpotent
elements of Z/72Z explicitly.

(c) Let R be the ring of functions from a nonempty set X to a field F. Prove that R 
contains no nonzero nilpotent elements. 

14. Let x be a nilpotent element of the commutative ring R (cf. the preceding exercise). 

(a) Prove that x is either zero or a zero divisor. 

(b) Prove that rx is nilpotent for all r ∈ R.

(c) Prove that 1 + x is a unit in R.

(d) Deduce that the sum of a nilpotent element and a unit is a unit.

15. A ring R is called a Boolean ring if $a^2 = a$ for all a ∈ R. Prove that every Boolean ring
is commutative. 

16. Prove that the only Boolean ring that is an integral domain is Z/2Z. 

17. Let R and S be rings. Prove that the direct product R × S is a ring under componentwise
addition and multiplication. Prove that R × S is commutative if and only if both R and
S are commutative. Prove that R × S has an identity if and only if both R and S have
identities.

18. Prove that {(r, r) | r ∈ R} is a subring of R × R.

19. Let I be any nonempty index set and let $R_i$ be a ring for each i ∈ I. Prove that the direct
product $\prod_{i \in I} R_i$ is a ring under componentwise addition and multiplication.

20. Let R be the collection of sequences $(a_1, a_2, a_3, \ldots)$ of integers $a_1, a_2, a_3, \ldots$ where all
but finitely many of the $a_i$ are 0 (called the direct sum of infinitely many copies of Z).
Prove that R is a ring under componentwise addition and multiplication which does not 
have an identity. 

21. Let X be any nonempty set and let P(X) be the set of all subsets of X (the power set of
X). Define addition and multiplication on P(X) by

$$A + B = (A - B) \cup (B - A) \quad \text{and} \quad A \times B = A \cap B$$

i.e., addition is symmetric difference and multiplication is intersection.

(a) Prove that P(X) is a ring under these operations (P(X) and its subrings are often
referred to as rings of sets).

(b) Prove that this ring is commutative, has an identity and is a Boolean ring. 

22. Give an example of an infinite Boolean ring. 

23. Let D be a squarefree integer, and let O be the ring of integers in the quadratic field $Q(\sqrt{D})$.
For any positive integer f prove that the set $O_f = Z[f\omega] = \{a + bf\omega \mid a, b \in Z\}$ is a
subring of O containing the identity. Prove that $[O : O_f] = f$ (index as additive abelian
groups). Prove conversely that a subring of O containing the identity and having finite
index f in O (as additive abelian group) is equal to $O_f$. (The ring $O_f$ is called the order
of conductor f in the field $Q(\sqrt{D})$. The ring of integers O is called the maximal order in
$Q(\sqrt{D})$.)

24. Show for D = 3, 5, 6, and 7 that the group of units $O^\times$ of the quadratic integer ring O is
infinite by exhibiting an explicit unit of infinite (multiplicative) order in each ring.

25. Let I be the ring of integral Hamilton Quaternions and define

$$N : I \to Z \quad \text{by} \quad N(a + bi + cj + dk) = a^2 + b^2 + c^2 + d^2$$

(the map N is called a norm).

(a) Prove that $N(\alpha) = \alpha\bar{\alpha}$ for all α ∈ I, where if α = a + bi + cj + dk then
$\bar{\alpha} = a - bi - cj - dk$.

(b) Prove that $N(\alpha\beta) = N(\alpha)N(\beta)$ for all α, β ∈ I.

(c) Prove that an element of I is a unit if and only if it has norm +1. Show that $I^\times$ is
isomorphic to the quaternion group of order 8. [The inverse in the ring of rational
quaternions of a nonzero element α is $\bar{\alpha}/N(\alpha)$.]


26. Let K be a field. A discrete valuation on K is a function $v : K^\times \to Z$ satisfying

(i) v(ab) = v(a) + v(b) (i.e., v is a homomorphism from the multiplicative group of
nonzero elements of K to Z),

(ii) v is surjective, and

(iii) v(x + y) ≥ min{v(x), v(y)} for all $x, y \in K^\times$ with x + y ≠ 0.

The set $R = \{x \in K^\times \mid v(x) \geq 0\} \cup \{0\}$ is called the valuation ring of v.

(a) Prove that R is a subring of K which contains the identity. (In general, a ring R is
called a discrete valuation ring if there is some field K and some discrete valuation v
on K such that R is the valuation ring of v.)

(b) Prove that for each nonzero element x ∈ K either x or $x^{-1}$ is in R.

(c) Prove that an element x is a unit of R if and only if v(x) = 0.

27. A specific example of a discrete valuation ring (cf. the preceding exercise) is obtained
when p is a prime, K = Q and

$$v_p : Q^\times \to Z \quad \text{by} \quad v_p\!\left(\frac{a}{b}\right) = \alpha \quad \text{where} \quad \frac{a}{b} = p^{\alpha}\,\frac{c}{d}, \quad p \nmid c \text{ and } p \nmid d.$$

Prove that the corresponding valuation ring R is the ring of all rational numbers whose
denominators are relatively prime to p. Describe the units of this valuation ring.

28. Let R be a ring with 1 ≠ 0. A nonzero element a is called a left zero divisor in R if there is
a nonzero element x ∈ R such that ax = 0. Symmetrically, b ≠ 0 is a right zero divisor if
there is a nonzero y ∈ R such that yb = 0 (so a zero divisor is an element which is either
a left or a right zero divisor). An element u ∈ R has a left inverse in R if there is some
s ∈ R such that su = 1. Symmetrically, v has a right inverse if vt = 1 for some t ∈ R.

(a) Prove that u is a unit if and only if it has both a right and a left inverse (i.e., u must 

have a two-sided inverse). 

(b) Prove that if u has a right inverse then u is not a right zero divisor. 

(c) Prove that if u has more than one right inverse then u is a left zero divisor. 

(d) Prove that if R is a finite ring then every element that has a right inverse is a unit (i.e., 
has a two-sided inverse). 

29. Let A be any commutative ring with identity 1 ≠ 0. Let R be the set of all group
homomorphisms of the additive group A to itself with addition defined as pointwise addition of
functions and multiplication defined as function composition. Prove that these operations 
make R into a ring with identity. Prove that the units of R are the group automorphisms 
of A (cf. Exercise 20, Section 1.6). 

30. Let A = Z × Z × Z × ··· be the direct product of copies of Z indexed by the positive integers
(so A is a ring under componentwise addition and multiplication) and let R be the ring of
all group homomorphisms from A to itself as described in the preceding exercise. Let φ
be the element of R defined by $\varphi(a_1, a_2, a_3, \ldots) = (a_2, a_3, \ldots)$. Let ψ be the element of
R defined by $\psi(a_1, a_2, a_3, \ldots) = (0, a_1, a_2, a_3, \ldots)$.

(a) Prove that φψ is the identity of R but ψφ is not the identity of R (i.e., ψ is a right
inverse for φ but not a left inverse).

(b) Exhibit infinitely many right inverses for φ.

(c) Find a nonzero element π in R such that φπ = 0 but πφ ≠ 0.

(d) Prove that there is no nonzero element λ ∈ R such that λφ = 0 (i.e., φ is a left zero
divisor but not a right zero divisor).
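
The p-adic valuation of Exercise 27 is easy to compute, and doing so can help build intuition before attempting the proofs. The following Python sketch is an illustration only: the function names valuation and in_valuation_ring are hypothetical, and numerical checks of properties (i) and (iii) on a few inputs are of course not proofs.

```python
from fractions import Fraction

def valuation(x, p):
    """p-adic valuation of a nonzero rational x = p^alpha * c/d with p dividing neither c nor d."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("the valuation is only defined on nonzero rationals")
    num, den, alpha = x.numerator, x.denominator, 0
    while num % p == 0:      # powers of p in the numerator count positively
        num //= p
        alpha += 1
    while den % p == 0:      # powers of p in the denominator count negatively
        den //= p
        alpha -= 1
    return alpha

def in_valuation_ring(x, p):
    """Membership in R = {x : v_p(x) >= 0} together with 0."""
    return x == 0 or valuation(x, p) >= 0

x, y = Fraction(18, 5), Fraction(5, 12)
print(valuation(x, 3), valuation(y, 3), valuation(x * y, 3))   # additivity on a product
print(valuation(Fraction(3, 4) + Fraction(5, 4), 2))           # a sum can have larger valuation
print(in_valuation_ring(Fraction(1, 3), 2), in_valuation_ring(Fraction(1, 2), 2))
```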


7.2 EXAMPLES: POLYNOMIAL RINGS, MATRIX RINGS, 

AND GROUP RINGS 

We introduce here three important types of rings: polynomial rings, matrix rings, and 
group rings. We shall see in the course of the text that these three classes of rings are 
often related. For example, we shall see in Part VI that the group ring of a group G over 
the complex numbers C is a direct product of matrix rings over C. 

These rings also have many important applications, in addition to being interesting 
in their own right. In Part III we shall use polynomial rings to prove some classification
theorems for matrices which, in particular, determine when a matrix is similar to a
diagonal matrix. In Part VI we shall use group rings to study group actions and to prove
some additional important classification theorems.

Polynomial Rings

Fix a commutative ring R with identity. We define the ring of polynomials in a form 
which may already be familiar, at least for polynomials with real coefficients. A defi- 
nition in terms of Cartesian products is given in Appendix I. Let x be an indeterminate. 
The formal sum 

$$a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$$

with n ≥ 0 and each $a_i \in R$ is called a polynomial in x with coefficients $a_i$ in R.
If $a_n \neq 0$, then the polynomial is said to be of degree n, $a_n x^n$ is called the leading
term, and $a_n$ is called the leading coefficient (where the leading coefficient of the zero
polynomial is taken to be 0). The polynomial is monic if $a_n = 1$. The set of all such
polynomials is called the ring of polynomials in the variable x with coefficients in R
and will be denoted R[x].

The operations of addition and multiplication which make R[x] into a ring are the
same operations familiar from elementary algebra: addition is “componentwise”

$$(a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0) + (b_n x^n + b_{n-1} x^{n-1} + \cdots + b_1 x + b_0)$$
$$= (a_n + b_n) x^n + (a_{n-1} + b_{n-1}) x^{n-1} + \cdots + (a_1 + b_1) x + (a_0 + b_0)$$

(here $a_n$ or $b_n$ may be zero in order for addition of polynomials of different degrees
to be defined). Multiplication is performed by first defining $(ax^i)(bx^j) = abx^{i+j}$ for
polynomials with only one nonzero term and then extending to all polynomials by the
distributive laws (usually referred to as “expanding out and collecting like terms”):

$$(a_0 + a_1 x + a_2 x^2 + \cdots) \times (b_0 + b_1 x + b_2 x^2 + \cdots)$$
$$= a_0 b_0 + (a_0 b_1 + a_1 b_0) x + (a_0 b_2 + a_1 b_1 + a_2 b_0) x^2 + \cdots$$

(in general, the coefficient of $x^k$ in the product will be $\sum_{i=0}^{k} a_i b_{k-i}$). These operations
make sense since R is a ring so the sums and products of the coefficients are defined.
An easy verification proves that R[x] is indeed a ring with these definitions of addition
and multiplication.

The ring R appears in R[x] as the constant polynomials. Note that by definition of
the multiplication, R[x] is a commutative ring with identity (the identity 1 from R).

The coefficient ring R above was assumed to be a commutative ring since that is the 
situation we shall be primarily interested in, but note that the definition of the addition 
and multiplication in R[x] above would be valid even if R were not commutative or
did not have an identity. If the coefficient ring R is the integers Z (respectively, the
rationals Q) the polynomial ring Z[x] (respectively, Q[x]) is the ring of polynomials
with integer (rational) coefficients familiar from elementary algebra.

Another example is the polynomial ring Z/3Z[x] of polynomials in x with
coefficients in Z/3Z. This ring consists of nonnegative powers of x with coefficients 0, 1,
and 2 with calculations on the coefficients performed modulo 3. For example, if

$$p(x) = x^2 + 2x + 1 \quad \text{and} \quad q(x) = x^3 + x + 2$$

then

$$p(x) + q(x) = x^3 + x^2$$

and

$$p(x)q(x) = x^5 + 2x^4 + 2x^3 + x^2 + 2x + 2.$$

The ring in which the coefficients are taken makes a substantial difference in the
behavior of polynomials. For example, the polynomial $x^2 + 1$ is not a perfect square in
the polynomial ring Z[x], but is a perfect square in the polynomial ring Z/2Z[x], since
$(x + 1)^2 = x^2 + 2x + 1 = x^2 + 1$ in this ring.
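
The computations in Z/3Z[x] and Z/2Z[x] above can be reproduced with a few lines of code. The sketch below is a minimal illustration, assuming a polynomial is stored as a list of integer coefficients with the constant term first; the names trim, poly_add and poly_mul are hypothetical. The inner loop of poly_mul is the rule that the coefficient of $x^k$ in a product is $\sum_{i=0}^{k} a_i b_{k-i}$, with each coefficient reduced modulo n.

```python
def trim(coeffs):
    """Drop trailing zero coefficients; the zero polynomial becomes []."""
    coeffs = list(coeffs)
    while coeffs and coeffs[-1] == 0:
        coeffs.pop()
    return coeffs

def poly_add(p, q, n):
    """Sum in (Z/nZ)[x]; polynomials are coefficient lists, constant term first."""
    size = max(len(p), len(q))
    p = list(p) + [0] * (size - len(p))
    q = list(q) + [0] * (size - len(q))
    return trim([(a + b) % n for a, b in zip(p, q)])

def poly_mul(p, q, n):
    """Product in (Z/nZ)[x]: the coefficient of x^k is the sum of a_i * b_(k-i)."""
    if not p or not q:
        return []
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] = (out[i + j] + a * b) % n
    return trim(out)

p = [1, 2, 1]        # x^2 + 2x + 1
q = [2, 1, 0, 1]     # x^3 + x + 2
print(poly_add(p, q, 3))              # coefficients of x^3 + x^2
print(poly_mul(p, q, 3))              # coefficients of x^5 + 2x^4 + 2x^3 + x^2 + 2x + 2
print(poly_mul([1, 1], [1, 1], 2))    # (x + 1)^2 = x^2 + 1 in Z/2Z[x]
```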

Proposition 4. Let R be an integral domain and let p(x), q(x) be nonzero elements of
R[x]. Then

(1) degree p(x)q(x) = degree p(x) + degree q(x),

(2) the units of R[x] are just the units of R,

(3) R[x] is an integral domain.

Proof: If R has no zero divisors then neither does R[x]; if p(x) and q(x) are
polynomials with leading terms $a_n x^n$ and $b_m x^m$, respectively, then the leading term of
p(x)q(x) is $a_n b_m x^{n+m}$, and $a_n b_m \neq 0$. This proves (3) and also verifies (1). If p(x) is
a unit, say p(x)q(x) = 1 in R[x], then degree p(x) + degree q(x) = 0, so both p(x)
and q(x) are elements of R, hence are units in R since their product is 1. This proves
(2).
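
The hypothesis that R is an integral domain is genuinely needed in Proposition 4. As an illustration added here (it is not an example from the text), take the coefficient ring Z/4Z, in which 2 is a zero divisor: then $(2x + 1)^2 = 4x^2 + 4x + 1 = 1$, so 2x + 1 is a unit of degree 1 and the degree formula fails. The Python sketch below checks this, assuming polynomials are stored as coefficient lists with the constant term first; the helper names are hypothetical.

```python
def poly_mul_mod(p, q, n):
    """Multiply coefficient lists (constant term first) with coefficients reduced mod n."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] = (out[i + j] + a * b) % n
    while out and out[-1] == 0:   # discard vanished leading terms
        out.pop()
    return out

def degree(p):
    """Degree of a nonzero polynomial given as a trimmed coefficient list."""
    return len(p) - 1

f = [1, 2]                        # 2x + 1
print(poly_mul_mod(f, f, 4), degree(poly_mul_mod(f, f, 4)))   # over Z/4Z
print(poly_mul_mod(f, f, 5), degree(poly_mul_mod(f, f, 5)))   # over the field Z/5Z
```

Over Z/5Z, which is an integral domain, the same square has degree 2 = 1 + 1, as the proposition predicts.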


If the ring R has zero divisors then so does R[x], because R ⊂ R[x]. Also, if f(x)
is a zero divisor in R[x] (i.e., f(x)g(x) = 0 for some nonzero g(x) ∈ R[x]) then in
fact cf(x) = 0 for some nonzero c ∈ R (cf. Exercise 2).

If S is a subring of R then S[x] is a subring of R[x]. For instance, Z[x] is a subring
of Q[x]. Some other examples of subrings of R[x] are the set of all polynomials in $x^2$
(i.e., in which only even powers of x appear) and the set of all polynomials with zero
constant term (the latter subring does not have an identity).

Polynomial rings, particularly those over fields, will be studied extensively in Chap- 
ter 9. 

Matrix Rings 

Fix an arbitrary ring R and let n be a positive integer. Let $M_n(R)$ be the set of all n × n
matrices with entries from R. The element $(a_{ij})$ of $M_n(R)$ is an n × n square array
of elements of R whose entry in row i and column j is $a_{ij} \in R$. The set of matrices
becomes a ring under the usual rules by which matrices of real numbers are added and
multiplied. Addition is componentwise: the i, j entry of the matrix $(a_{ij}) + (b_{ij})$ is
$a_{ij} + b_{ij}$. The i, j entry of the matrix product $(a_{ij}) \times (b_{ij})$ is $\sum_{k=1}^{n} a_{ik} b_{kj}$ (note that
these matrices need to be square in order that multiplication of any two elements be
defined). It is a straightforward calculation to check that these operations make $M_n(R)$
into a ring. When R is a field we shall prove that $M_n(R)$ is a ring by less computational
means in Part III.

Note that if R is any nontrivial ring (even a commutative one) and $n \ge 2$ then
$M_n(R)$ is not commutative: if $ab \neq 0$ in R let A be the matrix with a in position 1,1
and zeros elsewhere and let B be the matrix with b in position 1,2 and zeros elsewhere;
then ab is the (nonzero) entry in position 1,2 of AB whereas BA is the zero matrix.
These two matrices also show that $M_n(R)$ has zero divisors for all nonzero rings R
whenever $n \ge 2$.
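
The two matrices A and B can be written down explicitly for n = 2. This is a small sketch over $\mathbb{Z}$ with the assumed values a = 2 and b = 3; the helper `mat_mul` is illustrative and simply implements the entry formula for the matrix product.

```python
def mat_mul(A, B):
    """Product of two n x n matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


a, b = 2, 3
A = [[a, 0], [0, 0]]    # a in position 1,1
B = [[0, b], [0, 0]]    # b in position 1,2

print(mat_mul(A, B))    # [[0, 6], [0, 0]]
print(mat_mul(B, A))    # [[0, 0], [0, 0]]
```

Since BA = 0 with A and B both nonzero, the same pair exhibits both the failure of commutativity and the existence of zero divisors.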

An element $(a_{ij})$ of $M_n(R)$ is called a scalar matrix if for some $a \in R$, $a_{ii} = a$
for all $i \in \{1, \dots, n\}$ and $a_{ij} = 0$ for all $i \neq j$ (i.e., all diagonal entries equal a and
all off-diagonal entries are 0). The set of scalar matrices is a subring of $M_n(R)$. This
subring is a copy of R (i.e., is “isomorphic” to R): if the matrix A has the element a
along the main diagonal and the matrix B has the element b along the main diagonal
then the matrix A + B has a + b along the diagonal and AB has ab along the diagonal
(and all other entries 0). If R is commutative, the scalar matrices commute with all
elements of $M_n(R)$. If R has a 1, then the scalar matrix with 1's down the diagonal
(the $n \times n$ identity matrix) is the 1 of $M_n(R)$. In this case the units in $M_n(R)$ are the
invertible $n \times n$ matrices and the group of units is denoted $GL_n(R)$, the general linear
group of degree n over R.

If S is a subring of R then $M_n(S)$ is a subring of $M_n(R)$. For instance $M_n(\mathbb{Z})$ is a subring
of $M_n(\mathbb{Q})$ and $M_n(2\mathbb{Z})$ is a subring of both of these. Another example of a subring
of $M_n(R)$ is the set of upper triangular matrices: $\{(a_{ij}) \mid a_{pq} = 0 \text{ whenever } p > q\}$
(the set of matrices all of whose entries below the main diagonal are zero); one easily
checks that the sum and product of upper triangular matrices is upper triangular.

Group Rings 

Fix a commutative ring R with identity $1 \neq 0$ and let $G = \{g_1, g_2, \dots, g_n\}$ be any finite
group with group operation written multiplicatively. Define the group ring, RG, of G
with coefficients in R to be the set of all formal sums

\[ a_1 g_1 + a_2 g_2 + \cdots + a_n g_n, \qquad a_i \in R, \ 1 \le i \le n. \]

If $g_1$ is the identity of G we shall write $a_1 g_1$ simply as $a_1$. Similarly, we shall write the
element $1g$ for $g \in G$ simply as g.

Addition is defined “componentwise”:

\[ (a_1 g_1 + a_2 g_2 + \cdots + a_n g_n) + (b_1 g_1 + b_2 g_2 + \cdots + b_n g_n)
   = (a_1 + b_1) g_1 + (a_2 + b_2) g_2 + \cdots + (a_n + b_n) g_n. \]

Multiplication is performed by first defining $(a g_i)(b g_j) = (ab) g_k$, where the product
ab is taken in R and $g_i g_j = g_k$ is the product in the group G. This product is then
extended to all formal sums by the distributive laws so that the coefficient of $g_k$ in the
product $(a_1 g_1 + \cdots + a_n g_n) \times (b_1 g_1 + \cdots + b_n g_n)$ is $\sum_{g_i g_j = g_k} a_i b_j$. It is straightforward
to check that these operations make RG into a ring (again, commutativity of R is not
needed). The associativity of multiplication follows from the associativity of the group
operation in G. The ring RG is commutative if and only if G is a commutative group.

Example 

Let $G = D_8$ be the dihedral group of order 8 with the usual generators r, s ($r^4 = s^2 = 1$
and $rs = sr^{-1}$) and let $R = \mathbb{Z}$. The elements $\alpha = r + r^2 - 2s$ and $\beta = -3r^2 + rs$ are
typical members of $\mathbb{Z}D_8$. Their sum and product are then

\[ \alpha + \beta = r - 2r^2 - 2s + rs \]

\begin{align*}
\alpha\beta &= (r + r^2 - 2s)(-3r^2 + rs) \\
 &= r(-3r^2 + rs) + r^2(-3r^2 + rs) - 2s(-3r^2 + rs) \\
 &= -3r^3 + r^2 s - 3 + r^3 s + 6r^2 s - 2r^3 \\
 &= -3 - 5r^3 + 7r^2 s + r^3 s.
\end{align*}
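
The computation above can be replayed mechanically. The sketch below is one possible encoding, not the text's: an element $r^i s^j$ of $D_8$ is stored as the pair `(i, j)`, an element of $\mathbb{Z}D_8$ as a dictionary from such pairs to integer coefficients, and the helper names (`d8_mul`, `ring_add`, `ring_mul`) are assumptions made for illustration.

```python
from collections import defaultdict


def d8_mul(g, h):
    """Multiply r^a s^b by r^c s^d in D_8, using the relation s r = r^{-1} s."""
    (a, b), (c, d) = g, h
    sign = -1 if b else 1
    return ((a + sign * c) % 4, (b + d) % 2)


def ring_add(x, y):
    """Add two elements of ZD_8 stored as {group element: coefficient}."""
    total = defaultdict(int)
    for elem in (x, y):
        for g, coeff in elem.items():
            total[g] += coeff
    return {g: c for g, c in total.items() if c != 0}


def ring_mul(x, y):
    """Multiply two elements of ZD_8 by extending d8_mul distributively."""
    total = defaultdict(int)
    for g, a in x.items():
        for h, b in y.items():
            total[d8_mul(g, h)] += a * b
    return {g: c for g, c in total.items() if c != 0}


# (i, j) stands for r^i s^j.
alpha = {(1, 0): 1, (2, 0): 1, (0, 1): -2}    # r + r^2 - 2s
beta = {(2, 0): -3, (1, 1): 1}                # -3r^2 + rs

print(ring_add(alpha, beta))    # coefficients of r - 2r^2 - 2s + rs
print(ring_mul(alpha, beta))    # coefficients of -3 - 5r^3 + 7r^2 s + r^3 s
```

Calling `ring_mul(beta, alpha)` gives a different element, consistent with the remark that RG is commutative only when G is.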

The ring R appears in RG as the “constant” formal sums, i.e., the R-multiples of the
identity of G (note that the definition of the addition and multiplication in RG restricted
to these elements is just the addition and multiplication in R). These elements of R
commute with all elements of RG. The identity of R is the identity of RG.

The group G also appears in RG (the element $g_i$ appears as $1g_i$; for example,
$r, s \in D_8$ are also elements of the group ring $\mathbb{Z}D_8$ above), and multiplication in the ring
RG restricted to G is just the group operation. In particular, each element of G has a
multiplicative inverse in the ring RG (namely, its inverse in G). This says that G is a
subgroup of the group of units of RG.

If $|G| > 1$ then RG always has zero divisors. For example, let g be any element
of G of order $m > 1$. Then

\[ (1 - g)(1 + g + \cdots + g^{m-1}) = 1 - g^m = 1 - 1 = 0 \]

so $1 - g$ is a zero divisor (note that by definition of RG neither of the formal sums in
the above product is zero).
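
As a concrete instance (an illustration chosen here, not taken from the text), let g = s in $D_8$, which has order m = 2:

```latex
\[ (1 - s)(1 + s) = 1 + s - s - s^2 = 1 - s^2 = 1 - 1 = 0 \quad \text{in } \mathbb{Z}D_8, \]
```

even though neither $1 - s$ nor $1 + s$ is the zero element of $\mathbb{Z}D_8$.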

If S is a subring of R then SG is a subring of RG. For instance, $\mathbb{Z}G$ (called the
integral group ring of G) is a subring of QG (the rational group ring of G). Furthermore, 
if H is a subgroup of G then RH is a subring of RG. The set of all elements of RG 
whose coefficients sum to zero is a subring (without identity). If |G| > 1, the set of 
elements with zero “constant term” (i.e., the coefficient of the identity of G is zero) is 
not a subring (it is not closed under multiplication). 

Note that the group ring $\mathbb{R}Q_8$ is not the same ring as the Hamilton Quaternions $\mathbb{H}$
even though the latter contains a copy of the quaternion group $Q_8$ (under multiplication).
One difference is that the unique element of order 2 in $Q_8$ (usually denoted by $-1$) is not
the additive inverse of 1 in $\mathbb{R}Q_8$. In other words, if we temporarily denote the identity
of the group $Q_8$ by $g_1$ and the unique element of order 2 by $g_2$, then $g_1 + g_2$ is not zero
in $\mathbb{R}Q_8$ whereas $1 + (-1)$ is zero in $\mathbb{H}$. Furthermore, as noted above, the group ring
$\mathbb{R}Q_8$ contains zero divisors hence is not a division ring.

Group rings over fields will be studied extensively in Chapter 18. 

EXERCISES 


Let R be a commutative ring with 1.

1. Let $p(x) = 2x^3 - 3x^2 + 4x - 5$ and let $q(x) = 7x^3 + 33x - 4$. In each of parts (a), (b)
and (c) compute $p(x) + q(x)$ and $p(x)q(x)$ under the assumption that the coefficients of
the two given polynomials are taken from the specified ring (where the integer coefficients
are taken mod n in parts (b) and (c)):

(a) $R = \mathbb{Z}$, (b) $R = \mathbb{Z}/2\mathbb{Z}$, (c) $R = \mathbb{Z}/3\mathbb{Z}$.

2. Let $p(x) = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$ be an element of the polynomial ring R[x].
Prove that p(x) is a zero divisor in R[x] if and only if there is a nonzero $b \in R$ such that
$bp(x) = 0$. [Let $g(x) = b_m x^m + b_{m-1} x^{m-1} + \cdots + b_0$ be a nonzero polynomial of minimal
degree such that $g(x)p(x) = 0$. Show that $b_m a_n = 0$ and so $a_n g(x)$ is a polynomial of
degree less than m that also gives 0 when multiplied by p(x). Conclude that $a_n g(x) = 0$.
Apply a similar argument to show by induction on i that $a_{n-i} g(x) = 0$ for $i = 0, 1, \dots, n$,
and show that this implies $b_m p(x) = 0$.]

3. Define the set R[[x]] of formal power series in the indeterminate x with coefficients from
R to be all formal infinite sums

\[ \sum_{n=0}^{\infty} a_n x^n = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \cdots. \]

Define addition and multiplication of power series in the same way as for power series
with real or complex coefficients, i.e., extend polynomial addition and multiplication to
power series as though they were “polynomials of infinite degree”:

\[ \sum_{n=0}^{\infty} a_n x^n + \sum_{n=0}^{\infty} b_n x^n = \sum_{n=0}^{\infty} (a_n + b_n) x^n \]

\[ \sum_{n=0}^{\infty} a_n x^n \times \sum_{n=0}^{\infty} b_n x^n = \sum_{n=0}^{\infty} \Bigl( \sum_{k=0}^{n} a_k b_{n-k} \Bigr) x^n. \]

(The term “formal” is used here to indicate that convergence is not considered, so that
formal power series need not represent functions on R.)

(a) Prove that R[[x]] is a commutative ring with 1.

(b) Show that $1 - x$ is a unit in R[[x]] with inverse $1 + x + x^2 + \cdots$.

(c) Prove that $\sum_{n=0}^{\infty} a_n x^n$ is a unit in R[[x]] if and only if $a_0$ is a unit in R.

4. Prove that if R is an integral domain then the ring of formal power series R[[x]] is also an
integral domain.

5. Let F be a field and define the ring F((x)) of formal Laurent series with coefficients from
F by

\[ F((x)) = \Bigl\{ \sum_{n \ge N}^{\infty} a_n x^n \ \Big|\ a_n \in F \text{ and } N \in \mathbb{Z} \Bigr\}. \]

(Every element of F((x)) is a power series in x plus a polynomial in 1/x, i.e., each element
of F((x)) has only a finite number of terms with negative powers of x.)

(a) Prove that F((x)) is a field.

(b) Define the map

\[ v : F((x))^{\times} \to \mathbb{Z} \quad \text{by} \quad v\Bigl( \sum_{n \ge N}^{\infty} a_n x^n \Bigr) = N \]

where $a_N$ is the first nonzero coefficient of the series (i.e., N is the “order of zero or
pole of the series at 0”). Prove that v is a discrete valuation on F((x)) whose discrete
valuation ring is F[[x]], the ring of formal power series (cf. Exercise 26, Section 1).

6. Let S be a ring with identity $1 \neq 0$. Let $n \in \mathbb{Z}^+$ and let A be an $n \times n$ matrix with entries
from S whose i, j entry is $a_{ij}$. Let $E_{ij}$ be the element of $M_n(S)$ whose i, j entry is 1 and
whose other entries are all 0.

(a) Prove that $E_{ij}A$ is the matrix whose $i$th row equals the $j$th row of A and all other rows
are zero.

(b) Prove that $AE_{ij}$ is the matrix whose $j$th column equals the $i$th column of A and all
other columns are zero.

(c) Deduce that $E_{pq}AE_{rs}$ is the matrix whose p, s entry is $a_{qr}$ and all other entries are
zero.

7. Prove that the center of the ring $M_n(R)$ is the set of scalar matrices (cf. Exercise 7, Section
1). [Use the preceding exercise.]

8. Let S be any ring and let $n \ge 2$ be an integer. Prove that if A is any strictly upper triangular
matrix in $M_n(S)$ then $A^n = 0$ (a strictly upper triangular matrix is one whose entries on
and below the main diagonal are all zero).

9. Let $\alpha = r + r^2 - 2s$ and $\beta = -3r^2 + rs$ be the two elements of the integral group ring
$\mathbb{Z}D_8$ described in this section. Compute the following elements of $\mathbb{Z}D_8$:

(a) $\beta\alpha$, (b) $\alpha^2$, (c) $\alpha\beta - \beta\alpha$, (d) $\beta\alpha\beta$.

10. Consider the following elements of the integral group ring $\mathbb{Z}S_3$:

\[ \alpha = 3(1\ 2) - 5(2\ 3) + 14(1\ 2\ 3) \quad \text{and} \quad \beta = 6(1) + 2(2\ 3) - 7(1\ 3\ 2) \]

(where (1) is the identity of $S_3$). Compute the following elements:

(a) $\alpha + \beta$, (b) $2\alpha - 3\beta$, (c) $\alpha\beta$, (d) $\beta\alpha$, (e) $\alpha^2$.

11. Repeat the preceding exercise under the assumption that the coefficients of $\alpha$ and $\beta$ are in
$\mathbb{Z}/3\mathbb{Z}$ (i.e., $\alpha, \beta \in \mathbb{Z}/3\mathbb{Z}S_3$).

12. Let $G = \{g_1, \dots, g_n\}$ be a finite group. Prove that the element $N = g_1 + g_2 + \cdots + g_n$ is
in the center of the group ring RG (cf. Exercise 7, Section 1).

13. Let $\mathcal{K} = \{k_1, \dots, k_m\}$ be a conjugacy class in the finite group G.

(a) Prove that the element $K = k_1 + \cdots + k_m$ is in the center of the group ring RG (cf.
Exercise 7, Section 1). [Check that $g^{-1}Kg = K$ for all $g \in G$.]

(b) Let $\mathcal{K}_1, \dots, \mathcal{K}_r$ be the conjugacy classes of G and for each $\mathcal{K}_i$ let $K_i$ be the element
of RG that is the sum of the members of $\mathcal{K}_i$. Prove that an element $\alpha \in RG$ is in the
center of RG if and only if $\alpha = a_1 K_1 + a_2 K_2 + \cdots + a_r K_r$ for some $a_1, a_2, \dots, a_r \in R$.


7.3 RING HOMOMORPHISMS AND QUOTIENT RINGS

A ring homomorphism is a map from one ring to another that respects the additive and 
multiplicative structures: 

Definition. Let R and S be rings.

(1) A ring homomorphism is a map $\varphi : R \to S$ satisfying

(i) $\varphi(a + b) = \varphi(a) + \varphi(b)$ for all $a, b \in R$ (so $\varphi$ is a group homomorphism
on the additive groups) and

(ii) $\varphi(ab) = \varphi(a)\varphi(b)$ for all $a, b \in R$.

(2) The kernel of the ring homomorphism $\varphi$, denoted $\ker \varphi$, is the set of elements
of R that map to 0 in S (i.e., the kernel of $\varphi$ viewed as a homomorphism of
additive groups).

(3) A bijective ring homomorphism is called an isomorphism.

If the context is clear we shall simply use the term “homomorphism” instead of
“ring homomorphism.” Similarly, if A and B are rings, $A \cong B$ will always mean an
isomorphism of rings unless otherwise stated.

Examples

(1) The map $\varphi : \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ defined by sending an even integer to 0 and an odd integer
to 1 is a ring homomorphism. The map is additive since the sum of two even or odd
integers is even and the sum of an even integer and an odd integer is odd. The map is
multiplicative since the product of two odd integers is odd and the product of an even
integer with any integer is even. The kernel of $\varphi$ (the fiber of $\varphi$ above $0 \in \mathbb{Z}/2\mathbb{Z}$) is
the set of even integers. The fiber of $\varphi$ above $1 \in \mathbb{Z}/2\mathbb{Z}$ is the set of odd integers.

(2) For $n \in \mathbb{Z}$ the maps $\varphi_n : \mathbb{Z} \to \mathbb{Z}$ defined by $\varphi_n(x) = nx$ are not in general ring
homomorphisms because $\varphi_n(xy) = nxy$ whereas $\varphi_n(x)\varphi_n(y) = nxny = n^2 xy$. Hence $\varphi_n$
is a ring homomorphism only when $n^2 = n$, i.e., n = 0, 1. Note however that $\varphi_n$ is
always a group homomorphism on the additive groups. Thus care should be exercised
when dealing with rings to be sure to check that both ring operations are preserved.
Note that $\varphi_0$ is the zero homomorphism and $\varphi_1$ is the identity homomorphism.

(3) Let $\varphi : \mathbb{Q}[x] \to \mathbb{Q}$ be the map from the ring of polynomials in x with rational
coefficients to the rationals defined by $\varphi(p(x)) = p(0)$ (i.e., mapping the polynomial
to its constant term). Then $\varphi$ is a ring homomorphism since the constant term of the
sum of two polynomials is the sum of their constant terms and the constant term of
the product of two polynomials is the product of their constant terms. The fiber above
$a \in \mathbb{Q}$ consists of the set of polynomials with a as constant term. In particular, the
kernel of $\varphi$ consists of the polynomials with constant term 0.
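
Example (2) lends itself to a quick numerical check. The sketch below tests the two homomorphism conditions for $x \mapsto nx$ on a finite sample of integers; the function name and the sample range are assumptions for illustration, and a finite sample can only refute multiplicativity, not prove it.

```python
def check_on_sample(n, sample=range(-5, 6)):
    """Test additivity and multiplicativity of x -> n*x on a finite sample of integers."""
    def phi(x):
        return n * x

    additive = all(phi(x + y) == phi(x) + phi(y) for x in sample for y in sample)
    multiplicative = all(phi(x * y) == phi(x) * phi(y) for x in sample for y in sample)
    return additive, multiplicative


for n in range(-2, 4):
    print(n, check_on_sample(n))
```

Every n passes the additive test, while the multiplicative test survives only for n = 0 and n = 1, in line with the condition $n^2 = n$.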

Proposition 5. Let R and S be rings and let $\varphi : R \to S$ be a homomorphism.

(1) The image of $\varphi$ is a subring of S.

(2) The kernel of $\varphi$ is a subring of R. Furthermore, if $\alpha \in \ker \varphi$ then $r\alpha$ and
$\alpha r \in \ker \varphi$ for every $r \in R$, i.e., $\ker \varphi$ is closed under multiplication by elements
from R.

Proof: (1) If $s_1, s_2 \in \operatorname{im} \varphi$ then $s_1 = \varphi(r_1)$ and $s_2 = \varphi(r_2)$ for some $r_1, r_2 \in R$.
Then $\varphi(r_1 - r_2) = s_1 - s_2$ and $\varphi(r_1 r_2) = s_1 s_2$. This shows $s_1 - s_2, s_1 s_2 \in \operatorname{im} \varphi$, so the
image of $\varphi$ is closed under subtraction and under multiplication, hence is a subring of
S.

(2) If $\alpha, \beta \in \ker \varphi$ then $\varphi(\alpha) = \varphi(\beta) = 0$. Hence $\varphi(\alpha - \beta) = 0$ and $\varphi(\alpha\beta) = 0$,
so $\ker \varphi$ is closed under subtraction and under multiplication, so is a subring of R.
Similarly, for any $r \in R$ we have $\varphi(r\alpha) = \varphi(r)\varphi(\alpha) = \varphi(r) 0 = 0$, and also
$\varphi(\alpha r) = \varphi(\alpha)\varphi(r) = 0 \varphi(r) = 0$, so $r\alpha, \alpha r \in \ker \varphi$.

In the case of a homomorphism $\varphi$ of groups we saw that the fibers of the homomorphism
have the structure of a group naturally isomorphic to the image of $\varphi$, which
led to the notion of a quotient group by a normal subgroup. An analogous result is true
for a homomorphism of rings.

Let $\varphi : R \to S$ be a ring homomorphism with kernel I. Since R and S are in
particular additive abelian groups, $\varphi$ is in particular a homomorphism of abelian groups
and the fibers of $\varphi$ are the additive cosets $r + I$ of the kernel I (more precisely, if r is
any element of R mapping to $a \in S$, $\varphi(r) = a$, then the fiber of $\varphi$ over a is the coset
$r + I$ of the kernel I). These fibers have the structure of a ring naturally isomorphic to
the image of $\varphi$: if X is the fiber over $a \in S$ and Y is the fiber over $b \in S$, then X + Y is
the fiber over a + b and XY is the fiber over ab. In terms of cosets of the kernel I this
addition and multiplication is

\[ (r + I) + (s + I) = (r + s) + I \tag{7.1} \]

\[ (r + I) \times (s + I) = (rs) + I. \tag{7.2} \]

As in the case for groups, the verification that these operations define a ring structure
on the collection of cosets of the kernel I ultimately rests on the corresponding ring
properties of S. This ring of cosets is called the quotient ring of R by $I = \ker \varphi$ and
is denoted R/I. Note that the additive structure of the ring R/I is just the additive
quotient group of the additive abelian group R by the (necessarily normal) subgroup
I. When I is the kernel of some homomorphism $\varphi$ this additive abelian quotient group
also has a multiplicative structure, defined by (7.2), which makes R/I into a ring.

As in the case for groups, we can also consider whether (7.1) and (7.2) can be used to
define a ring structure on the collection of cosets of an arbitrary subgroup I of R. Note
that since R is an abelian additive group, the subgroup I is necessarily normal so that
the quotient R/I of cosets of I is automatically an additive abelian group. The question
then is whether this quotient group also has a multiplicative structure induced from the
multiplication in R, defined by (7.2). The answer is no in general (just as the answer is no
in trying to form the quotient by an arbitrary subgroup of a group), which leads to the
notion of an ideal in R (the analogue for rings of a normal subgroup of a group). We
shall then see that the ideals of R are exactly the kernels of the ring homomorphisms
of R (the analogue for rings of the characterization of normal subgroups as the kernels
of group homomorphisms).

Let I be an arbitrary subgroup of the additive group R. We consider when the
multiplication of cosets in (7.2) is well defined and makes the additive abelian group R/I
into a ring. The statement that the multiplication in (7.2) is well defined is the statement
that the multiplication is independent of the particular representatives r and s chosen,
i.e., that we obtain the same coset on the right if instead we use the representatives $r + \alpha$
and $s + \beta$ for any $\alpha, \beta \in I$. In other words, we must have

\[ (r + \alpha)(s + \beta) + I = rs + I \tag{$*$} \]

for all $r, s \in R$ and all $\alpha, \beta \in I$.

Letting r = s = 0, we see that I must be closed under multiplication, i.e., I must
be a subring of R.

Next, by letting s = 0 and letting r be arbitrary, we see that we must have $r\beta \in I$
for every $r \in R$ and every $\beta \in I$, i.e., that I must be closed under multiplication on the
left by elements from R. Letting r = 0 and letting s be arbitrary, we see similarly that
I must be closed under multiplication on the right by elements from R.
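
To see the condition fail in a specific case, take $R = \mathbb{Q}$ and the additive subgroup $I = \mathbb{Z}$, which is a subring but is not closed under multiplication by elements of $\mathbb{Q}$. The sketch below (an illustration chosen here, not from the text; the helper name is an assumption) multiplies two pairs of representatives of the same cosets and compares the results.

```python
from fractions import Fraction


def same_coset_mod_Z(x, y):
    """True when x and y lie in the same coset of Z in Q, i.e. x - y is an integer."""
    return (x - y).denominator == 1


r = s = Fraction(1, 2)
alpha, beta = 1, 0                      # elements of the subgroup I = Z

# r and r + alpha represent the same coset, as do s and s + beta.
print(same_coset_mod_Z(r, r + alpha), same_coset_mod_Z(s, s + beta))

prod_1 = r * s                          # 1/4
prod_2 = (r + alpha) * (s + beta)       # 3/4
print(prod_1, prod_2, same_coset_mod_Z(prod_1, prod_2))
```

The two products differ by 1/2, which is not in $\mathbb{Z}$, so the product of cosets would depend on the representatives chosen.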

Conversely, if I is closed under multiplication on the left and on the right by
elements from R then the relation ($*$) is satisfied for all $\alpha, \beta \in I$. Hence this is a
necessary and sufficient condition for the multiplication in (7.2) to be well defined.

Finally, if the multiplication of cosets defined by (7.2) is well defined, then this
multiplication makes the additive quotient group R/I into a ring. Each ring axiom in
the quotient follows directly from the corresponding axiom in R. For example, one of
the distributive laws is verified as follows:

\begin{align*}
(r + I)[(s + I) + (t + I)] &= (r + I)[(s + t) + I] \\
 &= r(s + t) + I = (rs + rt) + I \\
 &= (rs + I) + (rt + I) \\
 &= [(r + I)(s + I)] + [(r + I)(t + I)].
\end{align*}

This shows that the quotient R/I of the ring R by a subgroup I has a natural ring
structure if and only if I is also closed under multiplication on the left and on the right
by elements from R (so in particular must be a subring of R since it is closed under
multiplication). As mentioned, such subrings I are called the ideals of R:

Definition. Let R be a ring, let I be a subset of R and let $r \in R$.

(1) $rI = \{ra \mid a \in I\}$ and $Ir = \{ar \mid a \in I\}$.

(2) A subset I of R is a left ideal of R if

(i) I is a subring of R, and

(ii) I is closed under left multiplication by elements from R, i.e., $rI \subseteq I$
for all $r \in R$.

Similarly I is a right ideal if (i) holds and in place of (ii) one has

(ii)' I is closed under right multiplication by elements from R, i.e., $Ir \subseteq I$
for all $r \in R$.

(3) A subset I that is both a left ideal and a right ideal is called an ideal (or, for
added emphasis, a two-sided ideal) of R.

For commutative rings the notions of left, right and two-sided ideal coincide. We
emphasize that to prove a subset I of a ring R is an ideal it is necessary to prove that I is
nonempty, closed under subtraction and closed under multiplication by all the elements
of R (and not just by elements of I). If R has a 1 then $(-1)a = -a$ so in this case I is
an ideal if it is nonempty, closed under addition and closed under multiplication by all
the elements of R.

Note also that the last part of Proposition 5 proves that the kernel of any ring
homomorphism is an ideal.

We summarize the preceding discussion in the following proposition.

Proposition 6. Let R be a ring and let I be an ideal of R. Then the (additive) quotient
group R/I is a ring under the binary operations:

\[ (r + I) + (s + I) = (r + s) + I \quad \text{and} \quad (r + I) \times (s + I) = (rs) + I \]

for all $r, s \in R$. Conversely, if I is any subgroup such that the above operations are
well defined, then I is an ideal of R.

Definition. When I is an ideal of R the ring R/I with the operations in the previous
proposition is called the quotient ring of R by I.


Theorem 7.

(1) (The First Isomorphism Theorem for Rings) If $\varphi : R \to S$ is a homomorphism
of rings, then the kernel of $\varphi$ is an ideal of R, the image of $\varphi$ is a subring of S
and $R/\ker \varphi$ is isomorphic as a ring to $\varphi(R)$.

(2) If I is any ideal of R, then the map

\[ R \to R/I \quad \text{defined by} \quad r \mapsto r + I \]

is a surjective ring homomorphism with kernel I (this homomorphism is called
the natural projection of R onto R/I). Thus every ideal is the kernel of a ring
homomorphism and vice versa.

Proof: This is just a matter of collecting previous calculations. If I is the kernel of
$\varphi$, then the cosets (under addition) of I are precisely the fibers of $\varphi$. In particular, the
cosets $r + I$, $s + I$ and $rs + I$ are the fibers of $\varphi$ over $\varphi(r)$, $\varphi(s)$ and $\varphi(rs)$, respectively.
Since $\varphi$ is a ring homomorphism $\varphi(r)\varphi(s) = \varphi(rs)$, hence $(r + I)(s + I) = rs + I$.
Multiplication of cosets is well defined and so I is an ideal and R/I is a ring. The
correspondence $r + I \mapsto \varphi(r)$ is a bijection between the rings R/I and $\varphi(R)$ which
respects addition and multiplication, hence is a ring isomorphism.

If I is any ideal, then R/I is a ring (in particular is an abelian group) and the map
$\pi : r \mapsto r + I$ is a group homomorphism with kernel I. It remains to check that $\pi$ is a
ring homomorphism. This is immediate from the definition of multiplication in R/I:

\[ \pi : rs \mapsto rs + I = (r + I)(s + I) = \pi(r)\pi(s). \]


As with groups we shall often use the bar notation for reduction mod I: $\bar{r} = r + I$.
With this notation the addition and multiplication in the quotient ring R/I become
simply $\bar{r} + \bar{s} = \overline{r + s}$ and $\bar{r}\,\bar{s} = \overline{rs}$.
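
A finite case makes the First Isomorphism Theorem easy to inspect. The sketch below uses the reduction map $\mathbb{Z}/12\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z}$ (an example chosen here for illustration; it is well defined because 4 divides 12) and checks that the fibers are the cosets of the kernel and that multiplying representatives respects the fibers.

```python
N, M = 12, 4                      # phi : Z/12Z -> Z/4Z, reduction mod 4


def phi(r):
    return r % M


kernel = [r for r in range(N) if phi(r) == 0]

# Group the elements of Z/12Z into the fibers of phi.
fibers = {}
for r in range(N):
    fibers.setdefault(phi(r), []).append(r)

# Each fiber is the additive coset (representative + kernel).
cosets_match = all(
    sorted((fiber[0] + k) % N for k in kernel) == fiber
    for fiber in fibers.values()
)

# Products of representatives always land in the fiber over the product of the
# images, so multiplication of cosets does not depend on the representatives.
well_defined = all(
    phi((x * y) % N) == (a * b) % M
    for a, X in fibers.items() for b, Y in fibers.items()
    for x in X for y in Y
)

print(kernel, len(fibers), cosets_match, well_defined)
```

Here the kernel has 3 elements and there are 4 fibers, matching the 4 elements of the image $\mathbb{Z}/4\mathbb{Z}$.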


Examples

Let R be a ring.

(1) The subrings R and {0} are ideals. An ideal I is proper if $I \neq R$. The ideal {0} is
called the trivial ideal and is denoted by 0.

(2) It is immediate that $n\mathbb{Z}$ is an ideal of $\mathbb{Z}$ for any $n \in \mathbb{Z}$ and these are the only ideals of
$\mathbb{Z}$ since in particular these are the only subgroups of $\mathbb{Z}$. The associated quotient ring
is $\mathbb{Z}/n\mathbb{Z}$ (which explains the choice of notation and which we have now proved is a
ring), introduced in Chapter 0. For example, if n = 15 then the elements of $\mathbb{Z}/15\mathbb{Z}$
are the cosets $\bar{0}, \bar{1}, \dots, \overline{13}, \overline{14}$. To add (or multiply) in the quotient, simply choose any
representatives for the two cosets, add (multiply, respectively) these representatives
in the integers $\mathbb{Z}$, and take the corresponding coset containing this sum (product,
respectively). For example, $\bar{7} + \overline{11} = \overline{18}$ and $\overline{18} = \bar{3}$, so $\bar{7} + \overline{11} = \bar{3}$ in $\mathbb{Z}/15\mathbb{Z}$.
Similarly, $\bar{7}\,\overline{11} = \overline{77} = \bar{2}$ in $\mathbb{Z}/15\mathbb{Z}$. We could also express this by writing
$7 + 11 \equiv 3 \bmod 15$, $7(11) \equiv 2 \bmod 15$.

The natural projection $\mathbb{Z} \to \mathbb{Z}/n\mathbb{Z}$ is called reduction mod n and will be discussed
further at the end of these examples.

(3) Let $R = \mathbb{Z}[x]$ be the ring of polynomials in x with integer coefficients. Let I be the
collection of polynomials whose terms are of degree at least 2 (i.e., having no terms
of degree 0 or degree 1) together with the zero polynomial. Then I is an ideal: the
sum of two such polynomials again has terms of degree at least 2 and the product of a
polynomial whose terms are of degree at least 2 with any polynomial again only has
terms of degree at least 2. Two polynomials p(x), q(x) are in the same coset of I if
and only if they differ by a polynomial whose terms are of degree at least 2, i.e., if
and only if p(x) and q(x) have the same constant and first degree terms. For example,
the polynomials $3 + 5x + x^3 + x^5$ and $3 + 5x - x^4$ are in the same coset of I. It
follows easily that a complete set of representatives for the quotient R/I is given by
the polynomials a + bx of degree at most 1.

Addition and multiplication in the quotient are again performed by representatives.
For example,

\[ (\overline{1 + 3x}) + (\overline{-4 + 5x}) = \overline{-3 + 8x} \]

and

\[ (\overline{1 + 3x})(\overline{-4 + 5x}) = \overline{-4 - 7x + 15x^2} = \overline{-4 - 7x}. \]

Note that in this quotient ring R/I we have $\bar{x}\,\bar{x} = \overline{x^2} = \bar{0}$, for example, so that
R/I has zero divisors, even though $R = \mathbb{Z}[x]$ does not.

(4) Let A be a ring, let X be any nonempty set and let R be the ring of all functions from
X to A. For each fixed $c \in X$ the map

\[ E_c : R \to A \quad \text{defined by} \quad E_c(f) = f(c) \]

(called evaluation at c) is a ring homomorphism because the operations in R are
pointwise addition and multiplication of functions. The kernel of $E_c$ is given by
$\{f \in R \mid f(c) = 0\}$ (the set of functions from X to A that vanish at c). Also, $E_c$ is
surjective: given any $a \in A$ the constant function f(x) = a maps to a under evaluation
at c. Thus $R/\ker E_c \cong A$.

Similarly, let X be the closed interval [0,1] in $\mathbb{R}$ and let R be the ring of all
continuous real valued functions on [0,1]. For each $c \in [0,1]$, evaluation at c is
a surjective ring homomorphism (since R contains the constant functions) and so
$R/\ker E_c \cong \mathbb{R}$. The kernel of $E_c$ is the ideal of all continuous functions whose graph
crosses the x-axis at c. More generally, the fiber of $E_c$ above the real number $y_0$ is the
set of all continuous functions that pass through the point $(c, y_0)$.

(5) The map from the polynomial ring R[x] to R defined by $p(x) \mapsto p(0)$ (evaluation at 0)
is a ring homomorphism whose kernel is the set of all polynomials whose constant term
is zero, i.e., p(0) = 0. We can compose this homomorphism with any homomorphism
from R to another ring S to obtain a ring homomorphism from R[x] to S. For example,
let $R = \mathbb{Z}$ and consider the homomorphism $\mathbb{Z}[x] \to \mathbb{Z}/2\mathbb{Z}$ defined by the composition
$p(x) \mapsto p(0) \mapsto p(0) \bmod 2 \in \mathbb{Z}/2\mathbb{Z}$. The kernel of this composite map is given by
$\{p(x) \in \mathbb{Z}[x] \mid p(0) \in 2\mathbb{Z}\}$, i.e., the set of all polynomials with integer coefficients
whose constant term is even. The other fiber of this homomorphism is the coset
of polynomials whose constant term is odd, as we determined earlier. Since the
homomorphism is clearly surjective, the quotient ring is $\mathbb{Z}/2\mathbb{Z}$.

(6) Fix some $n \in \mathbb{Z}$ with $n \ge 2$ and consider the noncommutative ring $M_n(R)$. If J
is any ideal of R then $M_n(J)$, the $n \times n$ matrices whose entries come from J, is a
two-sided ideal of $M_n(R)$. This ideal is the kernel of the surjective homomorphism
$M_n(R) \to M_n(R/J)$ which reduces each entry of a matrix mod J, i.e., which maps
each entry $a_{ij}$ to $\overline{a_{ij}}$ (here bar denotes passage to R/J). For instance, when n = 3 and
$R = \mathbb{Z}$, the set of $3 \times 3$ matrices whose entries are all even is the two-sided ideal $M_3(2\mathbb{Z})$
of $M_3(\mathbb{Z})$ and the quotient $M_3(\mathbb{Z})/M_3(2\mathbb{Z})$ is isomorphic to $M_3(\mathbb{Z}/2\mathbb{Z})$. If the ring R
has an identity then the exercises below show that every two-sided ideal of $M_n(R)$ is
of the form $M_n(J)$ for some two-sided ideal J of R.

(7) Let R be a commutative ring with 1 and let $G = \{g_1, \dots, g_n\}$ be a finite group. The
map from the group ring RG to R defined by $\sum_{i=1}^{n} a_i g_i \mapsto \sum_{i=1}^{n} a_i$ is easily seen to
be a homomorphism, called the augmentation map. The kernel of the augmentation
map, the augmentation ideal, is the set of elements of RG whose coefficients sum to
0. For example, $g_i - g_j$ is an element of the augmentation ideal for all i, j. Since the
augmentation map is surjective, the quotient ring is isomorphic to R.

Another ideal in RG is $\{\sum_{i=1}^{n} a g_i \mid a \in R\}$, i.e., the formal sums whose coefficients
are all equal (equivalently, all R-multiples of the element $g_1 + \cdots + g_n$).

(8) Let R be a commutative ring with identity $1 \neq 0$ and let $n \in \mathbb{Z}$ with $n \ge 2$. We exhibit
some one-sided ideals in the ring $M_n(R)$. For each $j \in \{1, 2, \dots, n\}$ let $L_j$ be the set
of all $n \times n$ matrices in $M_n(R)$ with arbitrary entries in the $j$th column and zeros in all
other columns. It is clear that $L_j$ is closed under subtraction. It follows directly from
the definition of matrix multiplication that for any matrix $T \in M_n(R)$ and any $A \in L_j$
the product TA has zero entries in the $i$th column for all $i \neq j$. This shows $L_j$ is a left
ideal of $M_n(R)$. Moreover, $L_j$ is not a right ideal (hence is not a two-sided ideal). To
see this, let $E_{pq}$ be the matrix with 1 in the $p$th row and $q$th column and zeros elsewhere
($p, q \in \{1, \dots, n\}$). Then $E_{1j} \in L_j$ but $E_{1j}E_{ji} = E_{1i} \notin L_j$ if $i \neq j$, so $L_j$ is not
closed under right multiplication by arbitrary ring elements. An analogous argument
shows that if $R_j$ is the set of all $n \times n$ matrices in $M_n(R)$ with arbitrary entries in the
$j$th row and zeros in all other rows, then $R_j$ is a right ideal which is not a left ideal.
These one-sided ideals will play an important role in Part VI.
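
The one-sided behaviour in example (8) can be observed for n = 2 and $R = \mathbb{Z}$. The following is a small sketch with matrices chosen here for illustration; `mat_mul` and `in_L` are assumed helper names, and columns are indexed from 0 in the code, so `in_L(A, 0)` tests membership in $L_1$.

```python
def mat_mul(A, B):
    """Product of two n x n matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def in_L(A, j):
    """True when every entry of A outside column j (0-indexed) is zero."""
    return all(entry == 0
               for row in A
               for col, entry in enumerate(row) if col != j)


T = [[5, -1], [2, 7]]       # an arbitrary element of M_2(Z)
A = [[3, 0], [4, 0]]        # an element of L_1: only the first column is nonzero
E11 = [[1, 0], [0, 0]]
E12 = [[0, 1], [0, 0]]

print(in_L(mat_mul(T, A), 0))        # True: left multiplication stays in L_1
print(in_L(mat_mul(E11, E12), 0))    # False: E11 * E12 = E12 is not in L_1
```

A single pair (T, A) only illustrates closure under left multiplication; the general statement follows from the entry formula for TA as described in the text.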

Example: (The Reduction Homomorphism)

The canonical projection map from $\mathbb{Z}$ to $\mathbb{Z}/n\mathbb{Z}$ obtained by factoring out by the ideal $n\mathbb{Z}$ of
$\mathbb{Z}$ is usually referred to as “reducing modulo n.” The fact that this is a ring homomorphism
has important consequences for elementary number theory. For example, suppose we are
trying to solve the equation

\[ x^2 + y^2 = 3z^2 \]

in integers x, y and z (such problems are frequently referred to as Diophantine equations
after Diophantus, who was one of the first to systematically examine the existence of
integer solutions of equations). Suppose such integers exist. Observe first that we may
assume x, y and z have no factors in common, since otherwise we could divide through this
equation by the square of this common factor and obtain another set of integer solutions
smaller than the initial ones. This equation simply states a relation between these elements
in the ring $\mathbb{Z}$. As such, the same relation must also hold in any quotient ring as well.
In particular, this relation must hold in $\mathbb{Z}/n\mathbb{Z}$ for any integer n. The choice n = 4 is
particularly efficacious, for the following reason: the squares mod 4 are just $0^2, 1^2, 2^2, 3^2$,
i.e., 0, 1 (mod 4). Reading the above equation mod 4 (that is, considering this equation in
the quotient ring $\mathbb{Z}/4\mathbb{Z}$), we must have

\[ \begin{Bmatrix} 0 \\ 1 \end{Bmatrix} + \begin{Bmatrix} 0 \\ 1 \end{Bmatrix} \equiv 3 \begin{Bmatrix} 0 \\ 1 \end{Bmatrix} \equiv \begin{Bmatrix} 0 \\ 3 \end{Bmatrix} \pmod 4 \]

where the $\begin{Bmatrix} 0 \\ 1 \end{Bmatrix}$, for example, indicates that either a 0 or a 1 may be taken. Checking
the few possibilities shows that we must take the 0 each time. This means that each
of x, y and z must be even integers (squares of the odd integers gave us 1 mod 4). But
this contradicts the assumption of no common factors for these integers, and shows
that this equation has no solutions in nonzero integers.
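
The "few possibilities" mod 4 are easy to enumerate by machine. The sketch below lists the squares in $\mathbb{Z}/4\mathbb{Z}$ and then every residue triple (x, y, z) satisfying the equation mod 4; the variable names are illustrative.

```python
from itertools import product

# Squares in Z/4Z.
squares_mod_4 = sorted({(k * k) % 4 for k in range(4)})
print(squares_mod_4)    # [0, 1]

# All residue triples (x, y, z) mod 4 with x^2 + y^2 = 3 z^2 in Z/4Z.
solutions = [(x, y, z) for x, y, z in product(range(4), repeat=3)
             if (x * x + y * y - 3 * z * z) % 4 == 0]

# Every solution has x, y and z all even.
print(all(x % 2 == 0 and y % 2 == 0 and z % 2 == 0 for x, y, z in solutions))   # True
```

Changing the modulus 4 to another n gives the analogous finite search in $\mathbb{Z}/n\mathbb{Z}$.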

Note that even had solutions existed, this technique gives information about the 
possible residues of the solutions mod n (since we could just as well have examined 
the possibilities mod n as mod 4) and note that for each choice of n we have only 
a finite problem to solve because there are only finitely many residue classes mod 
n. Together with the Chinese Remainder Theorem (described in Section 6), we can 
then determine the possible solutions modulo very large integers, which greatly assists 
in finding them numerically (when they exist). We also observe that this technique 
has a number of limitations — for example, there are equations which have solutions 
modulo every integer, but which do not have integer solutions. An easy example (but 
extremely hard to verify that it does indeed have this property) is the equation 

\[ 3x^3 + 4y^3 + 5z^3 = 0. \]

As a final example of this technique, we mention that the map from the ring 
Z[x] of polynomials with integer coefficients to the ring Z/pZ[x] of polynomials with 
coefficients in Z/pZ for a prime p given by reducing the coefficients modulo p is a 
ring homomorphism. This example of reduction will be used in Chapter 9 in trying to 
determine whether polynomials can be factored. 
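
As an illustration of reduction mod p, here is a small Python sketch that represents a 
polynomial by its list of coefficients (constant term first) and checks, for one pair of 
polynomials, that reducing a product agrees with multiplying the reductions. The helper names 
poly_mul, poly_add and reduce_mod are assumptions made for this example only. 

```python
def poly_mul(f, g):
    """Multiply polynomials given as coefficient lists [c0, c1, ...]."""
    if not f or not g:
        return []
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out

def poly_add(f, g):
    """Add polynomials given as coefficient lists."""
    n = max(len(f), len(g))
    return [(f[i] if i < len(f) else 0) + (g[i] if i < len(g) else 0)
            for i in range(n)]

def reduce_mod(f, p):
    """Reduce every coefficient mod p and drop trailing zero coefficients."""
    r = [c % p for c in f]
    while r and r[-1] == 0:
        r.pop()
    return r

p = 5
f, g = [2, 1], [3, 1]   # x + 2 and x + 3

# Reducing after multiplying agrees with multiplying the reductions.
lhs = reduce_mod(poly_mul(f, g), p)
rhs = reduce_mod(poly_mul(reduce_mod(f, p), reduce_mod(g, p)), p)
print(lhs == rhs)   # True
print(lhs)          # [1, 0, 1], i.e. x^2 + 1 = (x + 2)(x + 3) in Z/5Z[x]
```

The last line also hints at the use mentioned above: x^2 + 1 does not factor in Z[x] but its 
reduction mod 5 does. 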


The following theorem gives the remaining Isomorphism Theorems for rings. Each 
of these may be proved as follows: first use the corresponding theorem from group 
theory to obtain an isomorphism of additive groups (or correspondence of groups, 
in the case of the Fourth Isomorphism Theorem) and then check that this group 
isomorphism (or correspondence, respectively) is a multiplicative map, and so defines a 
ring isomorphism. In each case the verification is immediate from the definition of 
multiplication in quotient rings. For example, the map that gives the isomorphism 
in (2) below is defined by φ : r + I ↦ r + J. This map is multiplicative since 
(r_1 + I)(r_2 + I) = r_1 r_2 + I by the definition of the multiplication in the quotient ring 
R/I, and r_1 r_2 + I ↦ r_1 r_2 + J = (r_1 + J)(r_2 + J) by the definition of the multiplication 
in the quotient ring R/J, i.e., φ((r_1 + I)(r_2 + I)) = φ(r_1 + I)φ(r_2 + I). The proofs for the other parts of 
the theorem are similar. 


Theorem 8. Let R be a ring. 

(1) (The Second Isomorphism Theorem for Rings) Let A be a subring and let B be 
an ideal of R. Then A + B = {a + b | a ∈ A, b ∈ B} is a subring of R, A ∩ B 
is an ideal of A and (A + B)/B ≅ A/(A ∩ B). 

(2) (The Third Isomorphism Theorem for Rings) Let I and J be ideals of R with 
I ⊆ J. Then J/I is an ideal of R/I and (R/I)/(J/I) ≅ R/J. 

(3) (The Fourth or Lattice Isomorphism Theorem for Rings) Let I be an ideal of R. 
The correspondence A ↔ A/I is an inclusion preserving bijection between the 
set of subrings A of R that contain I and the set of subrings of R/I. Furthermore, 
A (a subring containing I) is an ideal of R if and only if A/I is an ideal of R/I. 

Example 

Let R = Z and let I be the ideal 12Z. The quotient ring \bar{R} = R/I = Z/12Z has 
ideals \bar{R}, 2Z/12Z, 3Z/12Z, 4Z/12Z, 6Z/12Z, and 0 = 12Z/12Z corresponding to 
the ideals R = Z, 2Z, 3Z, 4Z, 6Z and 12Z = I of R containing I, respectively. 
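
The list of ideals of Z/12Z in this example can be confirmed by brute force. The following 
Python sketch (names illustrative) tests every nonempty subset of Z/12Z against the ideal 
axioms and compares each ideal found with the image of dZ for a divisor d of 12. 

```python
from itertools import combinations

def is_ideal(subset, n):
    """Check the ideal axioms for a subset of Z/nZ by brute force."""
    s = set(subset)
    if 0 not in s:
        return False
    closed_sub = all((a - b) % n in s for a in s for b in s)
    absorbs = all((r * a) % n in s for r in range(n) for a in s)
    return closed_sub and absorbs

def ideals(n):
    """All ideals of Z/nZ, found by testing every nonempty subset."""
    return [set(c)
            for k in range(1, n + 1)
            for c in combinations(range(n), k)
            if is_ideal(c, n)]

for I in ideals(12):
    d = 12 // len(I)   # candidate generator: an ideal with 12/d elements
    # The final column reports whether I really is the image of dZ.
    print(d, sorted(I), I == set(range(0, 12, d)))
```

Exactly six ideals are found, one for each divisor d = 1, 2, 3, 4, 6, 12 of 12, in agreement 
with the Lattice Isomorphism Theorem. 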

If I and J are ideals in the ring R then the set of sums a + b with a ∈ I and b ∈ J 
is not only a subring of R (as in the Second Isomorphism Theorem for Rings), but is an 
ideal in R (the set is clearly closed under sums and r(a + b) = ra + rb ∈ I + J since 
ra ∈ I and rb ∈ J). We can also define the product of two ideals: 

Definition. Let I and J be ideals of R. 

(1) Define the sum of I and J by I + J = {a + b | a ∈ I, b ∈ J}. 

(2) Define the product of I and J, denoted by IJ, to be the set of all finite sums of 
elements of the form ab with a ∈ I and b ∈ J. 

(3) For any n ≥ 1, define the n-th power of I, denoted by I^n, to be the set consisting 
of all finite sums of elements of the form a_1 a_2 ⋯ a_n with a_i ∈ I for all i. 
Equivalently, I^n is defined inductively by defining I^1 = I, and I^n = I I^{n−1} for 
n = 2, 3, .... 

It is easy to see that the sum I + J of the ideals I and J is the smallest ideal of R 
containing both I and J and that the product IJ is an ideal contained in I ∩ J (but may 
be strictly smaller, cf. the exercises). Note also that the elements of the product ideal IJ 
are finite sums of products of elements ab from I and J. The set {ab | a ∈ I, b ∈ J} 
consisting just of products of elements from I and J is in general not closed under 
addition, hence is not in general an ideal. 

Examples 

(1) Let I = 6Z and J = 10Z in Z. Then I + J consists of all integers of the form 6x + 10y 
with x, y ∈ Z. Since every such integer is divisible by 2, the ideal I + J is contained 
in 2Z. On the other hand, 2 = 6(2) + 10(−1) shows that the ideal I + J contains the 
ideal 2Z, so that 6Z + 10Z = 2Z. In general, mZ + nZ = dZ, where d is the greatest 
common divisor of m and n. The product IJ consists of all finite sums of elements of 
the form (6x)(10y) with x, y ∈ Z, which clearly gives the ideal 60Z. 

(2) Let I be the ideal in Z[x] consisting of the polynomials with integer coefficients whose 
constant term is even (cf. Example 5). The two polynomials 2 and x are contained in 
I, so both 4 = 2 · 2 and x^2 = x · x are elements of the product ideal I^2 = II, as is 
their sum x^2 + 4. It is easy to check, however, that x^2 + 4 cannot be written as a single 
product p(x)q(x) of two elements of I. 
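
The computation in Example 1 can be spot-checked numerically. Since the ideals involved are 
infinite, the Python sketch below only examines a finite window of coefficients; the bound 
and the function names are choices made for this illustration, not part of the text. 

```python
from math import gcd

def sum_ideal_sample(m, n, bound):
    """Elements m*x + n*y with |x|, |y| <= bound (a finite sample of mZ + nZ)."""
    return {m * x + n * y
            for x in range(-bound, bound + 1)
            for y in range(-bound, bound + 1)}

def product_generators_sample(m, n, bound):
    """Products (m*x)(n*y) with |x|, |y| <= bound (generators of (mZ)(nZ))."""
    return {(m * x) * (n * y)
            for x in range(-bound, bound + 1)
            for y in range(-bound, bound + 1)}

d = gcd(6, 10)
sample = sum_ideal_sample(6, 10, 5)
print(d)                                               # 2
print(all(v % d == 0 for v in sample), d in sample)    # True True

prods = product_generators_sample(6, 10, 5)
print(all(v % 60 == 0 for v in prods), 60 in prods)    # True True
```

Every sampled element of 6Z + 10Z is a multiple of 2 and 2 itself occurs, while every 
generator of the product is a multiple of 60 and 60 itself occurs, consistent with 
6Z + 10Z = 2Z and (6Z)(10Z) = 60Z. 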


EXERCISES 


Let R be a ring with identity 1 ≠ 0. 

1. Prove that the rings 2Z and 3Z are not isomorphic. 

2. Prove that the rings Z[x] and Q[x] are not isomorphic. 

3. Find all homomorphic images of Z. 

4. Find all ring homomorphisms from Z to Z/30Z. In each case describe the kernel and the 
image. 

5. Describe all ring homomorphisms from the ring Z × Z to Z. In each case describe the 
kernel and the image. 

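6. Decide which of the following are ring homomorphisms from M_2(Z) to Z: 

(a) \begin{pmatrix} a & b \\ c & d \end{pmatrix} ↦ a (projection onto the 1,1 entry) 

(b) \begin{pmatrix} a & b \\ c & d \end{pmatrix} ↦ a + d (the trace of the matrix) 

(c) \begin{pmatrix} a & b \\ c & d \end{pmatrix} ↦ ad − bc (the determinant of the matrix). 

7. Let R = { \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} | a, b, d ∈ Z } be the subring of M_2(Z) 
of upper triangular matrices. Prove that the map 

    φ : R → Z × Z defined by φ : \begin{pmatrix} a & b \\ 0 & d \end{pmatrix} ↦ (a, d) 

is a surjective homomorphism and describe its kernel. 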

8. Decide which of the following are ideals of the ring Z x Z: 

(a) {(a, a) | a ∈ Z} 

(b) {(2a, 2b) | a, b ∈ Z} 

(c) {(2a, 0) | a ∈ Z} 

(d) {(a, −a) | a ∈ Z}. 

9. Decide which of the sets in Exercise 6 of Section 1 are ideals of the ring of all functions 
from [0, 1] to ℝ. 


10. Decide which of the following are ideals of the ring Z[x]: 

(a) the set of all polynomials whose constant term is a multiple of 3 

(b) the set of all polynomials whose coefficient of x^2 is a multiple of 3 

(c) the set of all polynomials whose constant term, coefficient of x and coefficient of x^2 
are zero 

(d) Z[x^2] (i.e., the polynomials in which only even powers of x appear) 

(e) the set of polynomials whose coefficients sum to zero 

(f) the set of polynomials p(x) such that p′(0) = 0, where p′(x) is the usual first derivative 
of p(x) with respect to x. 


11. Let R be the ring of all continuous real valued functions on the closed interval [0, 1]. Prove 
that the map φ : R → ℝ defined by φ(f) = ∫_0^1 f(t) dt is a homomorphism of additive 
groups but not a ring homomorphism. 


12. Let D be an integer that is not a perfect square in Z and let 
S = { \begin{pmatrix} a & b \\ Db & a \end{pmatrix} | a, b ∈ Z }. 

(a) Prove that S is a subring of M_2(Z). 

(b) If D is not a perfect square in Z prove that the map φ : Z[√D] → S defined by 
φ(a + b√D) = \begin{pmatrix} a & b \\ Db & a \end{pmatrix} is a ring isomorphism. 

(c) If D ≡ 1 mod 4 is squarefree, prove that the set 
{ \begin{pmatrix} a & b \\ (D - 1)b/4 & a + b \end{pmatrix} | a, b ∈ Z } 
is a subring of M_2(Z) and is isomorphic to the quadratic integer ring O. 

13. Prove that the ring M_2(ℝ) contains a subring that is isomorphic to ℂ. 

14. Prove that the ring M_4(ℝ) contains a subring that is isomorphic to the real Hamilton 
Quaternions, ℍ. 

15. Let X be a nonempty set and let P(X) be the Boolean ring of all subsets of X defined in 
Exercise 21 of Section 1. Let R be the ring of all functions from X into Z/2Z. For each 
A ∈ P(X) define the function 

    χ_A : X → Z/2Z by χ_A(x) = 1 if x ∈ A and χ_A(x) = 0 if x ∉ A 

(χ_A is called the characteristic function of A with values in Z/2Z). Prove that the map 
P(X) → R defined by A ↦ χ_A is a ring isomorphism. 

16. Let φ : R → S be a surjective homomorphism of rings. Prove that the image of the center 
of R is contained in the center of S (cf. Exercise 7 of Section 1). 

17. Let R and S be nonzero rings with identity and denote their respective identities by 1_R 
and 1_S. Let φ : R → S be a nonzero homomorphism of rings. 

(a) Prove that if φ(1_R) ≠ 1_S then φ(1_R) is a zero divisor in S. Deduce that if S is an 
integral domain then every ring homomorphism from R to S sends the identity of R 
to the identity of S. 

(b) Prove that if φ(1_R) = 1_S then φ(u) is a unit in S and that φ(u^{−1}) = φ(u)^{−1} for each 
unit u of R. 

18. (a) If I and J are ideals of R prove that their intersection I ∩ J is also an ideal of R. 

(b) Prove that the intersection of an arbitrary nonempty collection of ideals is again an 
ideal (do not assume the collection is countable). 

19. Prove that if I_1 ⊆ I_2 ⊆ ⋯ are ideals of R then ⋃_{n=1}^{∞} I_n is an ideal of R. 

20. Let I be an ideal of R and let S be a subring of R. Prove that I ∩ S is an ideal of S. Show 
by example that not every ideal of a subring S of a ring R need be of the form I ∩ S for 
some ideal I of R. 

21. Prove that every (two-sided) ideal of M_n(R) is equal to M_n(J) for some (two-sided) ideal 
J of R. [Use Exercise 6(c) of Section 2 to show first that the set of entries of matrices in 
an ideal of M_n(R) form an ideal in R.] 

22. Let a be an element of the ring R. 

(a) Prove that {x ∈ R | ax = 0} is a right ideal and {y ∈ R | ya = 0} is a left ideal (called 
respectively the right and left annihilators of a in R). 

(b) Prove that if L is a left ideal of R then {x ∈ R | xa = 0 for all a ∈ L} is a two-sided 
ideal (called the left annihilator of L in R). 

23. Let S be a subring of R and let I be an ideal of R. Prove that if S ∩ I = 0 then \bar{S} ≅ S, 
where the bar denotes passage to R/I. 

24. Let φ : R → S be a ring homomorphism. 

(a) Prove that if J is an ideal of S then φ^{−1}(J) is an ideal of R. Apply this to the special 
case when R is a subring of S and φ is the inclusion homomorphism to deduce that if 
J is an ideal of S then J ∩ R is an ideal of R. 

(b) Prove that if φ is surjective and I is an ideal of R then φ(I) is an ideal of S. Give an 
example where this fails if φ is not surjective. 

25. Assume R is a commutative ring with 1. Prove that the Binomial Theorem 

    (a + b)^n = \sum_{k=0}^{n} \binom{n}{k} a^k b^{n-k} 

holds in R, where the binomial coefficient \binom{n}{k} is interpreted in R as the sum 1 + 1 + ⋯ + 1 
of the identity 1 in R taken \binom{n}{k} times. 

26. The characteristic of a ring R is the smallest positive integer n such that 1 + 1 + ⋯ + 1 = 0 
(n times) in R; if no such integer exists the characteristic of R is said to be 0. For example, 
Z/nZ is a ring of characteristic n for each positive integer n and Z is a ring of characteristic 
0. 

(a) Prove that the map Z → R defined by 

    k ↦ 1 + 1 + ⋯ + 1 (k times)       if k > 0 
    k ↦ 0                             if k = 0 
    k ↦ −1 − 1 − ⋯ − 1 (−k times)     if k < 0 

is a ring homomorphism whose kernel is nZ, where n is the characteristic of R (this 
explains the use of the terminology “characteristic 0” instead of the archaic phrase 
“characteristic ∞” for rings in which no sum of 1’s is zero). 

(b) Determine the characteristics of the rings Q, Z[x], Z/nZ[x]. 

(c) Prove that if p is a prime and if R is a commutative ring of characteristic p, then 
(a + b)^p = a^p + b^p for all a, b ∈ R. 

27. Prove that a nonzero Boolean ring has characteristic 2 (cf. Exercise 15, Section 1). 

28. Prove that an integral domain has characteristic p, where p is either a prime or 0 (cf. 
Exercise 26). 

29. Let R be a commutative ring. Recall (cf. Exercise 13, Section 1) that an element x ∈ R 
is nilpotent if x^n = 0 for some n ∈ Z^+. Prove that the set of nilpotent elements forms an 
ideal, called the nilradical of R and denoted by 𝔑(R). [Use the Binomial Theorem to 
show 𝔑(R) is closed under addition.] 


30. Prove that if R is a commutative ring and 𝔑(R) is its nilradical (cf. the preceding exercise) 
then zero is the only nilpotent element of R/𝔑(R), i.e., prove that 𝔑(R/𝔑(R)) = 0. 


31. Prove that the elements \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix} and 
\begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix} are nilpotent elements of M_2(Z) whose 
sum is not nilpotent (note that these two matrices do not commute). Deduce that the set 
of nilpotent elements in the noncommutative ring M_2(Z) is not an ideal. 


32. Let φ : R → S be a homomorphism of rings. Prove that if x is a nilpotent element of R 
then φ(x) is nilpotent in S. 

33. Assume R is commutative. Let p(x) = a_n x^n + a_{n−1} x^{n−1} + ⋯ + a_1 x + a_0 be an element 
of the polynomial ring R[x]. 

(a) Prove that p(x) is a unit in R[x] if and only if a_0 is a unit and a_1, a_2, ..., a_n are 
nilpotent in R. [See Exercise 14 of Section 1.] 

(b) Prove that p(x) is nilpotent in R[x] if and only if a_0, a_1, ..., a_n are nilpotent elements 
of R. 


34. Let I and J be ideals of R. 

(a) Prove that I + J is the smallest ideal of R containing both I and J. 

(b) Prove that IJ is an ideal contained in I ∩ J. 

(c) Give an example where IJ ≠ I ∩ J. 

(d) Prove that if R is commutative and if I + J = R then IJ = I ∩ J. 

35. Let I, J, K be ideals of R. 

(a) Prove that I(J + K) = IJ + IK and (I + J)K = IK + JK. 

(b) Prove that if J ⊆ I then I ∩ (J + K) = J + (I ∩ K). 

36. Show that if I is the ideal of all polynomials in Z[x] with zero constant term then 
I^n = {a_n x^n + a_{n+1} x^{n+1} + ⋯ + a_{n+m} x^{n+m} | a_i ∈ Z, m ≥ 0} is the set of polynomials 
whose first nonzero term has degree at least n. 

37. An ideal N is called nilpotent if N^n is the zero ideal for some n ≥ 1. Prove that the ideal 
pZ/p^m Z is a nilpotent ideal in the ring Z/p^m Z. 


7.4 PROPERTIES OF IDEALS 

Throughout this section R is a ring with identity 1 ≠ 0. 

Definition. Let A be any subset of the ring R. 

(1) Let (A) denote the smallest ideal of R containing A, called the ideal generated 
by A. 

(2) Let RA denote the set of all finite sums of elements of the form ra with r ∈ R 
and a ∈ A, i.e., RA = {r_1 a_1 + r_2 a_2 + ⋯ + r_n a_n | r_i ∈ R, a_i ∈ A, n ∈ Z^+} 
(where the convention is RA = 0 if A = ∅). 
Similarly, AR = {a_1 r_1 + a_2 r_2 + ⋯ + a_n r_n | r_i ∈ R, a_i ∈ A, n ∈ Z^+} and 
RAR = {r_1 a_1 r′_1 + r_2 a_2 r′_2 + ⋯ + r_n a_n r′_n | r_i, r′_i ∈ R, a_i ∈ A, n ∈ Z^+}. 

(3) An ideal generated by a single element is called a principal ideal. 

(4) An ideal generated by a finite set is called a finitely generated ideal. 

When A = {a} or {a_1, a_2, ...}, etc., we shall drop the set brackets and simply write 
(a), (a_1, a_2, ...) for (A), respectively. 

The notion of ideals generated by subsets of a ring is analogous to that of subgroups 
generated by subsets of a group (Section 2.4). Since the intersection of any nonempty 
collection of ideals of R is also an ideal (cf. Exercise 18, Section 3) and A is always 
contained in at least one ideal (namely R), we have 

    (A) = ⋂_{I an ideal, A ⊆ I} I, 

i.e., (A) is the intersection of all ideals of R that contain the set A. 

The left ideal generated by A is the intersection of all left ideals of R that contain 
A. This left ideal is obtained from A by closing A under all the operations that define 
a left ideal. It is immediate from the definition that RA is closed under addition and 
under left multiplication by any ring element. Since R has an identity, RA contains 
A. Thus RA is a left ideal of R which contains A. Conversely, any left ideal which 
contains A must contain all finite sums of elements of the form ra, r ∈ R and a ∈ A 
and so must contain RA. Thus RA is precisely the left ideal generated by A. Similarly, 
AR is the right ideal generated by A and RAR is the (two-sided) ideal generated by 
A. In particular, 

if R is commutative then RA = AR = RAR = (A). 
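
In a small commutative ring the set RA can be computed directly from its definition. The 
following Python sketch does this for R = Z/nZ by repeatedly adding terms ra until nothing 
new appears; the function name generated_ideal and the choice of Z/30Z are assumptions made 
for illustration. 

```python
def generated_ideal(A, n):
    """RA in R = Z/nZ: all finite sums of products r*a with r in R and a in A."""
    ideal = {0}   # the empty sum; this is also the convention RA = 0 for empty A
    changed = True
    while changed:
        # Extend every sum found so far by one more term r*a.
        new = {(s + r * a) % n for s in ideal for r in range(n) for a in A}
        changed = not new <= ideal
        ideal |= new
    return ideal

print(sorted(generated_ideal({12, 18}, 30)))   # [0, 6, 12, 18, 24]
print(sorted(generated_ideal(set(), 30)))      # [0]
```

Here the ideal of Z/30Z generated by 12 and 18 is the principal ideal generated by 6, the 
greatest common divisor of 12, 18 and 30. 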

When R is a commutative ring and a ∈ R, the principal ideal (a) generated by 
a is just the set of all R-multiples of a. If R is not commutative, however, the set 
{ras | r, s ∈ R} is not necessarily the two-sided ideal generated by a since it need not 
be closed under addition (in this case the ideal generated by a is the ideal RaR, which 
consists of all finite sums of elements of the form ras, r, s ∈ R). 

The formation of principal ideals in a commutative ring is a particularly simple way 
of creating ideals, similar to generating cyclic subgroups of a group. Notice that the 
element b ∈ R belongs to the ideal (a) if and only if b = ra for some r ∈ R, i.e., if and 
only if b is a multiple of a or, put another way, a divides b in R. Also, b ∈ (a) if and 
only if (b) ⊆ (a). Thus containment relations between ideals, in particular between 
principal ideals, are seen to capture some of the arithmetic of general commutative rings. 
Commutative rings in which all ideals are principal are among the easiest to study and 
these will play an important role in Chapters 8 and 9. 

Examples 

(1) The trivial ideal 0 and the ideal R are both principal: 0 = (0) and R = (1). 

(2) In Z we have nZ = Zn = (n) = (−n) for all integers n. Thus our notation for aR 
is consistent with the definition of nZ we have been using. As noted in the preceding 
section, these are all the ideals of Z so every ideal of Z is principal. For positive integers 
n and m, nZ ⊆ mZ if and only if m divides n in Z, so the lattice of ideals containing 
nZ is the same as the lattice of divisors of n. Furthermore, the ideal generated by two 
nonzero integers n and m is the principal ideal generated by their greatest common 
divisor, d: (n, m) = (d). The notation for (n, m) as the greatest common divisor of 
n and m is thus consistent with the same notation for the ideal generated by n and m 
(although a principal generator for the ideal generated by n and m is determined only 
up to a ± sign; we could make it unique by choosing a nonnegative generator). In 
particular, n and m are relatively prime if and only if (n, m) = (1). 

(3) We show that the ideal (2, x) generated by 2 and x in Z[x] is not a principal ideal. 
Observe that (2, x) = {2p(x) + xq(x) | p(x), q(x) ∈ Z[x]} and so this ideal consists 
precisely of the polynomials with integer coefficients whose constant term is even 
(as discussed in Example 5 in the preceding section); in particular, this is a proper 
ideal. Assume by way of contradiction that (2, x) = (a(x)) for some a(x) ∈ Z[x]. 
Since 2 ∈ (a(x)) there must be some p(x) such that 2 = p(x)a(x). The degree of 
p(x)a(x) equals degree p(x) + degree a(x), hence both p(x) and a(x) must be constant 
polynomials, i.e., integers. Since 2 is a prime number, a(x), p(x) ∈ {±1, ±2}. If 
a(x) were ±1 then every polynomial would be a multiple of a(x), contrary to (a(x)) 
being a proper ideal. The only possibility is a(x) = ±2. But now x ∈ (a(x)) = (2) = 
(−2) and so x = 2q(x) for some polynomial q(x) with integer coefficients, clearly 
impossible. This contradiction proves that (2, x) is not principal. 

Note that the symbol (A) is ambiguous if the ring is not specified: the ideal 
generated by 2 and x in Q[x] is the entire ring (1) since it contains the element 
(1/2) · 2 = 1. 

We shall see in Chapter 9 that for any field F, all ideals of F[x] are principal. 

(4) If R is the ring of all functions from the closed interval [0, 1] into ℝ let M be the ideal 
{f | f(1/2) = 0} (the kernel of evaluation at 1/2). Let g(x) be the function which is zero 
at x = 1/2 and 1 at all other points. Then f = fg for all f ∈ M so M is a principal 
ideal with generator g. In fact, any function which is zero at 1/2 and nonzero at all other 
points is another generator for the same ideal M. 

On the other hand, if R is the ring of all continuous functions from [0, 1] to ℝ then 
{f | f(1/2) = 0} is not principal nor is it even finitely generated (cf. the exercises). 

(5) If G is a finite group and R is a commutative ring with 1 then the augmentation ideal 
is generated by the set {g − 1 | g ∈ G}, although this need not be a minimal set of 
generators. For example, if G is a cyclic group with generator σ, then the augmentation 
ideal is a principal ideal with generator σ − 1. 

Proposition 9. Let I be an ideal of R. 

(1) I = R if and only if I contains a unit. 

(2) Assume R is commutative. Then R is a field if and only if its only ideals are 0 
and R. 

Proof: (1) If I = R then I contains the unit 1. Conversely, if u is a unit in I with 
inverse v, then for any r ∈ R 

    r = r · 1 = r(vu) = (rv)u ∈ I 

hence R = I. 

(2) The ring R is a field if and only if every nonzero element is a unit. If R is a 
field every nonzero ideal contains a unit, so by the first part R is the only nonzero ideal. 
Conversely, if 0 and R are the only ideals of R let u be any nonzero element of R. By 
hypothesis (u) = R and so 1 ∈ (u). Thus there is some v ∈ R such that 1 = vu, i.e., u 
is a unit. Every nonzero element of R is therefore a unit and so R is a field. 

Corollary 10. If R is a field then any nonzero ring homomorphism from R into another 
ring is an injection. 

Proof: The kernel of a ring homomorphism is an ideal. The kernel of a nonzero 
homomorphism is a proper ideal hence is 0 by the proposition. 

These results show that the ideal structure of fields is trivial. Our approach to 
studying an algebraic structure through its homomorphisms will still play a fundamental 
role in field theory (Part IV) when we study injective homomorphisms (embeddings) of 
one field into another and automorphisms of fields (isomorphisms of a field to itself). 

If D is a ring with identity 1 ≠ 0 in which the only left ideals and the only right 
ideals are 0 and D, then D is a division ring. Conversely, the only (left, right or 
two-sided) ideals in a division ring D are 0 and D, which gives an analogue of Proposition 
9(2) if R is not commutative (see the exercises). However, if F is a field, then for 
any n ≥ 2 the only two-sided ideals in the matrix ring M_n(F) are 0 and M_n(F), even 
though this is not a division ring (it does have proper, nontrivial, left and right ideals: 
cf. Section 3), which shows that Proposition 9(2) does not hold for noncommutative 
rings. Rings whose only two-sided ideals are 0 and the whole ring (which are called 
simple rings) will be studied in Chapter 18. 

One important class of ideals are those which are not contained in any other proper 
ideal: 

Definition. An ideal M in an arbitrary ring S is called a maximal ideal if M ≠ S and 
the only ideals containing M are M and S. 

A general ring need not have maximal ideals. For example, take any abelian group 
which has no maximal subgroups (for example, Q, cf. Exercise 16, Section 6.1) and 
make it into a trivial ring by defining ab = 0 for all a, b. In such a ring the ideals are 
simply the subgroups and so there are no maximal ideals. The zero ring has no maximal 
ideals, hence any result involving maximal ideals forces a ring to be nonzero. The next 
proposition shows that rings with an identity 1 ≠ 0 always possess maximal ideals. 
Like many such general existence theorems (e.g., the result that a finitely generated 
group has maximal subgroups or that every vector space has a basis) the proof relies 
on Zorn’s Lemma (see Appendix I). In many specific rings, however, the presence of 
maximal ideals is often obvious, independent of Zorn’s Lemma. 

Proposition 11. In a ring with identity every proper ideal is contained in a maximal 
ideal. 

Proof: Let R be a ring with identity and let I be a proper ideal (so R cannot be the 
zero ring, i.e., 1 ≠ 0). Let S be the set of all proper ideals of R which contain I. Then 
S is nonempty (I ∈ S) and is partially ordered by inclusion. If C is a chain in S, define 
J to be the union of all ideals in C: 

    J = ⋃_{A ∈ C} A. 

We first show that J is an ideal. Certainly J is nonempty because C is nonempty; 
specifically, 0 ∈ J since 0 is in every ideal A. If a, b ∈ J, then there are ideals 
A, B ∈ C such that a ∈ A and b ∈ B. By definition of a chain either A ⊆ B or B ⊆ A. 
In either case a − b lies in the larger of A and B, so a − b ∈ J and J is closed under 
subtraction. Since each A ∈ C is closed under left and right multiplication by elements 
of R, so is J. This proves J is an ideal. 

If J is not a proper ideal then 1 ∈ J. In this case, by definition of J we must 
have 1 ∈ A for some A ∈ C. This is a contradiction because each A is a proper ideal 
(A ∈ C ⊆ S). This proves that each chain has an upper bound in S. By Zorn’s Lemma 
S has a maximal element which is therefore a maximal (proper) ideal containing I. 

For commutative rings the next result characterizes maximal ideals by the structure 
of their quotient rings. 

Proposition 12. Assume R is commutative. The ideal M is a maximal ideal if and only 
if the quotient ring R/M is a field. 

Proof: This follows from the Lattice Isomorphism Theorem together with 
Proposition 9(2). The ideal M is maximal if and only if there are no ideals I with M ⊊ I ⊊ R. 
By the Lattice Isomorphism Theorem the ideals of R containing M correspond 
bijectively with the ideals of R/M, so M is maximal if and only if the only ideals of R/M 
are 0 and R/M. By Proposition 9(2) we see that M is maximal if and only if R/M is 
a field. 

The proposition above indicates how to construct some fields: take the quotient 
of any commutative ring R with identity by a maximal ideal in R. We shall use this 
in Part IV to construct all finite fields by taking quotients of the ring Z[x] by maximal 
ideals. 


Examples 

(1) Let n be a nonnegative integer. The ideal nZ of Z is a maximal ideal if and only if 
Z/nZ is a field. We saw in Section 3 that this is the case if and only if n is a prime 
number. This also follows directly from the containment of ideals of Z described in 
Example 2 above. 

(2) The ideal (2, x) is a maximal ideal in Z[x] because its quotient ring is the field Z/2Z; 
cf. Example 3 above and Example 5 at the end of Section 3. 

(3) The ideal (x) in Z[x] is not a maximal ideal because (x) ⊊ (2, x) ⊊ Z[x]. The quotient 
ring Z[x]/(x) is isomorphic to Z (the ideal (x) in Z[x] is the kernel of the surjective 
ring homomorphism from Z[x] to Z given by evaluation at 0). Since Z is not a field, 
we see again that (x) is not a maximal ideal in Z[x]. 

(4) Let R be the ring of all functions from [0, 1] to ℝ and for each a ∈ [0, 1] let M_a be the 
kernel of evaluation at a. Since evaluation is a surjective homomorphism from R to 
ℝ, we see that R/M_a ≅ ℝ and hence M_a is a maximal ideal. Similarly, the kernel of 
evaluation at any fixed point is a maximal ideal in the ring of continuous real valued 
functions on [0, 1]. 

(5) If F is a field and G is a finite group, then the augmentation ideal I is a maximal 
ideal of the group ring FG (cf. Example 7 at the end of the preceding section). The 
augmentation ideal is the kernel of the augmentation map which is a surjective 
homomorphism onto the field F (i.e., FG/I ≅ F, a field). Note that Proposition 12 does 
not apply directly since FG need not be commutative; however, the implication in 
Proposition 12 that I is a maximal ideal if R/I is a field holds for arbitrary rings. 
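
Example 1 lends itself to a direct numerical check. The Python sketch below, with 
illustrative function names, tests for each n ≥ 2 whether every nonzero residue mod n has a 
multiplicative inverse and compares the answer with primality of n. 

```python
def is_field_Zn(n):
    """True if every nonzero element of Z/nZ has a multiplicative inverse (n >= 2)."""
    if n < 2:
        return False
    return all(any((a * b) % n == 1 for b in range(1, n))
               for a in range(1, n))

def is_prime(n):
    """Trial-division primality test."""
    return n >= 2 and all(n % d for d in range(2, n) if d * d <= n)

print([n for n in range(2, 20) if is_field_Zn(n)])   # [2, 3, 5, 7, 11, 13, 17, 19]
```

By Proposition 12 these are exactly the values of n in this range for which nZ is a maximal 
ideal of Z. 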

Definition. Assume R is commutative. An ideal P is called a prime ideal if P ≠ R 
and whenever the product ab of two elements a, b ∈ R is an element of P, then at least 
one of a and b is an element of P. 

The notion of a maximal ideal is fairly intuitive but the definition of a prime ideal 
may seem a little strange. It is, however, a natural generalization of the notion of a 
“prime” in the integers Z. Let n be a nonnegative integer. According to the above 
definition the ideal nZ is a prime ideal provided n ≠ 1 (to ensure that the ideal is 
proper) and provided every time the product ab of two integers is an element of nZ, 
at least one of a, b is an element of nZ. Put another way, if n ≠ 0, it must have the 
property that whenever n divides ab, n must divide a or divide b. This is equivalent to 
the usual definition that n is a prime number. Thus the prime ideals of Z are just the 
ideals pZ of Z generated by prime numbers p together with the ideal 0. 

For the integers Z there is no difference between the maximal ideals and the nonzero 
prime ideals. This is not true in general, but we shall see shortly that every maximal 
ideal is a prime ideal. First we translate the notion of prime ideals into properties of 
quotient rings as we did for maximal ideals in Proposition 12. Recall that an integral 
domain is a commutative ring with identity 1 ≠ 0 that has no zero divisors. 

Proposition 13. Assume R is commutative. Then the ideal P is a prime ideal in R if 
and only if the quotient ring R/P is an integral domain. 

Proof: This proof is simply a matter of translating the definition of a prime ideal 
into the language of quotients. The ideal P is prime if and only if P ≠ R and whenever 
ab ∈ P, then either a ∈ P or b ∈ P. Use the bar notation for elements of R/P: 
\bar{r} = r + P. Note that r ∈ P if and only if the element \bar{r} is zero in the quotient ring 
R/P. Thus in the terminology of quotients P is a prime ideal if and only if R/P ≠ 0 and 
whenever \bar{a}\bar{b} = \overline{ab} = 0, then either \bar{a} = 0 or \bar{b} = 0, i.e., R/P is an integral domain. 

It follows in particular that a commutative ring with identity is an integral domain 
if and only if 0 is a prime ideal. 

Corollary 14. Assume R is commutative. Every maximal ideal of R is a prime ideal. 

Proof: If M is a maximal ideal then R/M is a field by Proposition 12. A field is 
an integral domain so the corollary follows from Proposition 13. 

Examples 

(1) The principal ideals generated by primes in Z are both prime and maximal ideals. The 
zero ideal in Z is prime but not maximal. 

(2) The ideal (x) is a prime ideal in Z[x] since Z[x]/(x) ≅ Z. This ideal is not a maximal 
ideal. The ideal 0 is a prime ideal in Z[x], but is not a maximal ideal. 
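
The definitions of prime and maximal ideals can be tested directly in a finite ring. The 
Python sketch below works in Z/12Z and takes the ideals to be the images dZ/12Z for the 
divisors d of 12 (as in the example following Theorem 8); the function names are 
illustrative. 

```python
def ideal(d, n):
    """The ideal dZ/nZ of Z/nZ, for a positive divisor d of n."""
    return frozenset(range(0, n, d))

def is_prime_ideal(P, n):
    """P is proper, and ab in P forces a in P or b in P."""
    if len(P) == n:
        return False
    return all(a in P or b in P
               for a in range(n) for b in range(n)
               if (a * b) % n in P)

def is_maximal_ideal(M, ideals, n):
    """M is proper and no ideal lies strictly between M and the whole ring."""
    if len(M) == n:
        return False
    return not any(M < I and len(I) < n for I in ideals)

n = 12
divisors = [d for d in range(1, n + 1) if n % d == 0]
ideals = [ideal(d, n) for d in divisors]
maximal = [d for d in divisors if is_maximal_ideal(ideal(d, n), ideals, n)]
prime = [d for d in divisors if is_prime_ideal(ideal(d, n), n)]
print(maximal, prime)   # [2, 3] [2, 3]
```

Both maximal ideals 2Z/12Z and 3Z/12Z are prime, as Corollary 14 requires. The zero ideal of 
Z/12Z is not prime, reflecting the fact that Z/12Z is not an integral domain. 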


EXERCISES 


Let R be a ring with identity 1 ≠ 0. 

1. Let L_j be the left ideal of M_n(R) consisting of arbitrary entries in the j-th column and zero 
in all other entries and let E_{ij} be the element of M_n(R) whose i, j entry is 1 and whose 
other entries are all 0. Prove that L_j = M_n(R)E_{ij} for any i. [See Exercise 6, Section 2.] 

2. Assume R is commutative. Prove that the augmentation ideal in the group ring RG is 
generated by {g − 1 | g ∈ G}. Prove that if G = ⟨σ⟩ is cyclic then the augmentation ideal 
is generated by σ − 1. 

3. (a) Let p be a prime and let G be an abelian group of order p^n. Prove that the nilradical 
of the group ring F_p G is the augmentation ideal (cf. Exercise 29, Section 3). [Use 
the preceding exercise.] 

(b) Let G = {g_1, ..., g_n} be a finite group and assume R is commutative. Prove that if r 
is any element of the augmentation ideal of RG then r(g_1 + ⋯ + g_n) = 0. [Use the 
preceding exercise.] 

4. Assume R is commutative. Prove that R is a field if and only if 0 is a maximal ideal. 

5. Prove that if M is an ideal such that R/M is a field then M is a maximal ideal (do not 
assume R is commutative). 

6. Prove that R is a division ring if and only if its only left ideals are (0) and R. (The analogous 
result holds when “left” is replaced by “right.”) 

7. Let R be a commutative ring with 1. Prove that the principal ideal generated by x in the 
polynomial ring R[x] is a prime ideal if and only if R is an integral domain. Prove that 
(x) is a maximal ideal if and only if R is a field. 

8. Let R be an integral domain. Prove that (a) = (b) for some elements a, b ∈ R, if and only 
if a = ub for some unit u of R. 

9. Let R be the ring of all continuous functions on [0, 1] and let I be the collection of functions 
fix ) in R with /(1/3) = / (1/2) = 0. Prove that / is an ideal of R but is not a prime 
ideal. 
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10 . Assume R is commutative. Prove that if P is a prime ideal of R and P contains no zero 
divisors then R is an integral domain. 

11. Assume R is commutative. Let I and J be ideals of R and assume P is a prime ideal of R 
that contains I J (for example, if P contains in J). Prove either I or J is contained in P. 

12 . Assume R is commutative and suppose I = (ai , aa a„) and J = (by , t> 2 , . . . , b m ) are 

two finitely generated ideals in R. Prove that the product ideal I J is finitely generated by 
the elements tiibj for i = 1,2 n, and j = 1. 2, .... m. 

13. Let φ : R → S be a homomorphism of commutative rings.

(a) Prove that if P is a prime ideal of S then either φ^{-1}(P) = R or φ^{-1}(P) is a prime ideal
of R. Apply this to the special case when R is a subring of S and φ is the inclusion
homomorphism to deduce that if P is a prime ideal of S then P ∩ R is either R or a
prime ideal of R.

(b) Prove that if M is a maximal ideal of S and φ is surjective then φ^{-1}(M) is a maximal
ideal of R. Give an example to show that this need not be the case if φ is not surjective.

14. Assume R is commutative. Let x be an indeterminate, let f(x) be a monic polynomial
in R[x] of degree n ≥ 1 and use the bar notation to denote passage to the quotient ring
R[x]/(f(x)).

(a) Show that every element of R[x]/(f(x)) is of the form \overline{p(x)} for some polynomial
p(x) ∈ R[x] of degree less than n, i.e.,

R[x]/(f(x)) = {\overline{a_0 + a_1 x + ··· + a_{n-1} x^{n-1}} | a_0, a_1, ..., a_{n-1} ∈ R}.

[If f(x) = x^n + b_{n-1} x^{n-1} + ··· + b_0 then \overline{x^n} = \overline{−(b_{n-1} x^{n-1} + ··· + b_0)}. Use this
to reduce powers of \overline{x} in the quotient ring.]

(b) Prove that if p(x) and q(x) are distinct polynomials in R[x] which are both of degree
less than n, then \overline{p(x)} ≠ \overline{q(x)}. [Otherwise p(x) − q(x) is an R[x]-multiple of the
monic polynomial f(x).]

(c) If f(x) = a(x)b(x) where both a(x) and b(x) have degree less than n, prove that \overline{a(x)}
is a zero divisor in R[x]/(f(x)).

(d) If f(x) = x^n − a for some nilpotent element a ∈ R, prove that \overline{x} is nilpotent in
R[x]/(f(x)).

(e) Let p be a prime, assume R = F_p and f(x) = x^p − a for some a ∈ F_p. Prove that
\overline{x − a} is nilpotent in R[x]/(f(x)). [Use Exercise 26(c) of Section 3.]

15. Let x^2 + x + 1 be an element of the polynomial ring E = F_2[x] and use the bar notation
to denote passage to the quotient ring F_2[x]/(x^2 + x + 1).

(a) Prove that \overline{E} has 4 elements: \overline{0}, \overline{1}, \overline{x} and \overline{x + 1}.

(b) Write out the 4 × 4 addition table for \overline{E} and deduce that the additive group \overline{E} is
isomorphic to the Klein 4-group.

(c) Write out the 4 × 4 multiplication table for \overline{E} and prove that \overline{E}^× is isomorphic to the
cyclic group of order 3. Deduce that \overline{E} is a field.

16. Let x^4 − 16 be an element of the polynomial ring E = Z[x] and use the bar notation to
denote passage to the quotient ring Z[x]/(x^4 − 16).

(a) Find a polynomial of degree ≤ 3 that is congruent to 7x^13 − 11x^9 + 5x^5 − 2x^3 + 3
modulo (x^4 − 16).

(b) Prove that \overline{x − 2} and \overline{x + 2} are zero divisors in \overline{E}.

17. Let x^3 − 2x + 1 be an element of the polynomial ring E = Z[x] and use the bar notation to
denote passage to the quotient ring Z[x]/(x^3 − 2x + 1). Let p(x) = 2x^7 − 7x^5 + 4x^3 − 9x + 1
and let q(x) = (x − 1)^4.

(a) Express each of the following elements of \overline{E} in the form \overline{f(x)} for some polynomial
f(x) of degree ≤ 2: \overline{p(x)}, \overline{q(x)}, \overline{p(x) + q(x)} and \overline{p(x)q(x)}.

(b) Prove that \overline{E} is not an integral domain.

(c) Prove that \overline{x} is a unit in \overline{E}.

18. Prove that if R is an integral domain and R[[x]] is the ring of formal power series in the
indeterminate x then the principal ideal generated by x is a prime ideal (cf. Exercise 3,
Section 2). Prove that the principal ideal generated by x is a maximal ideal if and only if
R is a field.

19. Let R be a finite commutative ring with identity. Prove that every prime ideal of R is a 
maximal ideal. 

20. Prove that a nonzero finite commutative ring that has no zero divisors is a field (if the ring 
has an identity, this is Corollary 3, so do not assume the ring has a 1). 

21. Prove that a finite ring with identity 1 ≠ 0 that has no zero divisors is a field (you may
quote Wedderburn’s Theorem).

22. Let p ∈ Z^+ be a prime and let the F_p Quaternions be defined by

a + bi + cj + dk,    a, b, c, d ∈ Z/pZ

where addition is componentwise and multiplication is defined using the same relations
on i, j, k as for the real Quaternions.

(a) Prove that the F_p Quaternions are a homomorphic image of the integral Quaternions
(cf. Section 1).

(b) Prove that the F_p Quaternions contain zero divisors (and so they cannot be a division
ring). [Use the preceding exercise.]

23. Prove that in a Boolean ring (cf. Exercise 15, Section 1) every prime ideal is a maximal 
ideal. 

24. Prove that in a Boolean ring every finitely generated ideal is principal. 

25. Assume R is commutative and for each a ∈ R there is an integer n > 1 (depending on a)
such that a^n = a. Prove that every prime ideal of R is a maximal ideal.

26. Prove that a prime ideal in a commutative ring R contains every nilpotent element (cf. 
Exercise 13, Section 1). Deduce that the nilradical of R (cf. Exercise 29, Section 3) is 
contained in the intersection of all the prime ideals of R. (It is shown in Section 15.2 that 
the nilradical of R is equal to the intersection of all prime ideals of R.) 

27. Let R be a commutative ring with 1 ≠ 0. Prove that if a is a nilpotent element of R then
1 − ab is a unit for all b ∈ R.

28. Prove that if R is a commutative ring and N = (a_1, a_2, ..., a_m) where each a_i is a nilpotent
element, then N is a nilpotent ideal (cf. Exercise 37, Section 3). Deduce that if the nilradical
of R is finitely generated then it is a nilpotent ideal.

29. Let p be a prime and let G be a finite group of order a power of p (i.e., a p-group). Prove 
that the augmentation ideal in the group ring Z/pZG is a nilpotent ideal. (Note that this 
ring may be noncommutative.) [Use Exercise 2.] 

30. Let I be an ideal of the commutative ring R and define

rad I = {r ∈ R | r^n ∈ I for some n ∈ Z^+}

called the radical of I. Prove that rad I is an ideal containing I and that (rad I)/I is the
nilradical of the quotient ring R/I, i.e., (rad I)/I = 𝔑(R/I) (cf. Exercise 29, Section 3).

31. An ideal I of the commutative ring R is called a radical ideal if rad I = I.

(a) Prove that every prime ideal of R is a radical ideal.

(b) Let n > 1 be an integer. Prove that 0 is a radical ideal in Z/nZ if and only if n is a
product of distinct primes to the first power (i.e., n is square free). Deduce that (n) is
a radical ideal of Z if and only if n is a product of distinct primes in Z.

32. Let I be an ideal of the commutative ring R and define

Jac I to be the intersection of all maximal ideals of R that contain I

where the convention is that Jac R = R. (If I is the zero ideal, Jac 0 is called the Jacobson
radical of the ring R, so Jac I is the preimage in R of the Jacobson radical of R/I.)

(a) Prove that Jac I is an ideal of R containing I.

(b) Prove that rad I ⊆ Jac I, where rad I is the radical of I defined in Exercise 30.

(c) Let n > 1 be an integer. Describe Jac nZ in terms of the prime factorization of n.

33. Let R be the ring of all continuous functions from the closed interval [0, 1] to ℝ and for
each c ∈ [0, 1] let M_c = {f ∈ R | f(c) = 0} (recall that M_c was shown to be a maximal
ideal of R).

(a) Prove that if M is any maximal ideal of R then there is a real number c ∈ [0, 1] such
that M = M_c.

(b) Prove that if b and c are distinct points in [0, 1] then M_b ≠ M_c.

(c) Prove that M_c is not equal to the principal ideal generated by x − c.

(d) Prove that M_c is not a finitely generated ideal.

The preceding exercise shows that there is a bijection between the points of the closed interval
[0, 1] and the set of maximal ideals in the ring R of all continuous functions on [0, 1] given
by c ↔ M_c. For any subset X of ℝ or, more generally, for any completely regular topological
space X, the map c ↦ M_c is an injection from X to the set of maximal ideals of R, where
R is the ring of all bounded continuous real valued functions on X and M_c is the maximal
ideal of functions that vanish at c. Let β(X) be the set of maximal ideals of R. One can put
a topology on β(X) in such a way that if we identify X with its image in β(X) then X (in its
given topology) becomes a subspace of β(X). Moreover, β(X) is a compact space under this
topology and is called the Stone-Čech compactification of X.

34. Let R be the ring of all continuous functions from ℝ to ℝ and for each c ∈ ℝ let M_c be
the maximal ideal {f ∈ R | f(c) = 0}.

(a) Let I be the collection of functions f(x) in R with compact support (i.e., f(x) = 0
for |x| sufficiently large). Prove that I is an ideal of R that is not a prime ideal.

(b) Let M be a maximal ideal of R containing I (properly, by (a)). Prove that M ≠ M_c
for any c ∈ ℝ (cf. the preceding exercise).

35. Let A = (a_1, a_2, ..., a_n) be a nonzero finitely generated ideal of R. Prove that there is
an ideal B which is maximal with respect to the property that it does not contain A. [Use
Zorn’s Lemma.]

36. Assume R is commutative. Prove that the set of prime ideals in R has a minimal element
with respect to inclusion (possibly the zero ideal). [Use Zorn’s Lemma.]

37. A commutative ring R is called a local ring if it has a unique maximal ideal. Prove that
if R is a local ring with maximal ideal M then every element of R − M is a unit. Prove
conversely that if R is a commutative ring with 1 in which the set of nonunits forms an
ideal M, then R is a local ring with unique maximal ideal M.

38. Prove that the ring of all rational numbers whose denominators are odd is a local ring whose
unique maximal ideal is the principal ideal generated by 2.

39. Following the notation of Exercise 26 in Section 1, let K be a field, let ν be a discrete
valuation on K and let R be the valuation ring of ν. For each integer k ≥ 0 define
A_k = {r ∈ R | ν(r) ≥ k} ∪ {0}.

(a) Prove that A_k is a principal ideal and that A_0 ⊇ A_1 ⊇ A_2 ⊇ ···.

(b) Prove that if I is any nonzero ideal of R, then I = A_k for some k ≥ 0. Deduce that
R is a local ring with unique maximal ideal A_1.

40. Assume R is commutative. Prove that the following are equivalent (see also Exercises
13 and 14 in Section 1):

(i) R has exactly one prime ideal

(ii) every element of R is either nilpotent or a unit

(iii) R/𝔑(R) is a field (cf. Exercise 29, Section 3).

41. A proper ideal Q of the commutative ring R is called primary if whenever ab ∈ Q and
a ∉ Q then b^n ∈ Q for some positive integer n. (Note that the symmetry between a and
b in this definition implies that if Q is a primary ideal and ab ∈ Q with neither a nor b
in Q, then a positive power of a and a positive power of b both lie in Q.) Establish the
following facts about primary ideals.

(a) The primary ideals of Z are 0 and (p^n), where p is a prime and n is a positive integer.

(b) Every prime ideal of R is a primary ideal.

(c) An ideal Q of R is primary if and only if every zero divisor in R/Q is a nilpotent
element of R/Q.

(d) If Q is a primary ideal then rad(Q) is a prime ideal (cf. Exercise 30).


7.5 RINGS OF FRACTIONS 


Throughout this section R is a commutative ring. Proposition 2 shows that if a is not 
zero nor a zero divisor and ab = ac in R then b = c. Thus a nonzero element that is not 
a zero divisor enjoys some of the properties of a unit without necessarily possessing a 
multiplicative inverse in R. On the other hand, we saw in Section 1 that a zero divisor 
a cannot be a unit in R and, by definition, if a is a zero divisor we cannot always cancel 
the a's in the equation ab = ac to obtain b = c (take c = 0 for example). The aim of
this section is to prove that a commutative ring R is always a subring of a larger ring 
Q in which every nonzero element of R that is not a zero divisor is a unit in Q. The 
principal application of this will be to integral domains, in which case this ring Q will 
be a field, called its field of fractions or quotient field. Indeed, the paradigm for the
construction of Q from R is the one offered by the construction of the field of rational 
numbers from the integral domain Z. 
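
The cancellation behaviour described above can be checked by brute force in a small ring. The following is a minimal sketch, assuming Z/12Z as a test case; the function names zero_divisors and can_cancel are illustrative and are not notation from the text.

```python
def zero_divisors(n):
    """Nonzero a in Z/nZ with a*b = 0 (mod n) for some nonzero b."""
    return [a for a in range(1, n)
            if any((a * b) % n == 0 for b in range(1, n))]


def can_cancel(a, n):
    """True if a*b = a*c (mod n) always forces b = c (mod n)."""
    return all(b == c
               for b in range(n) for c in range(n)
               if (a * b) % n == (a * c) % n)


n = 12
zd = zero_divisors(n)
non_zd = [a for a in range(1, n) if a not in zd]

# Cancellation holds exactly for the nonzero elements that are not zero divisors.
print(zd, [can_cancel(a, n) for a in zd])
print(non_zd, [can_cancel(a, n) for a in non_zd])
```

In Z/12Z every nonzero element that is not a zero divisor is already a unit, so the check says nothing new about inverses there; it only illustrates the cancellation law that motivates the construction of Q.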

In order to see the essential features of the construction of the field Q from the
integral domain Z we review the basic properties of fractions. Each rational number
may be represented in many different ways as the quotient of two integers (for example,
1/2 = 2/4 = 3/6 = ..., etc.). These representations are related by

a/b = c/d if and only if ad = bc.

In more precise terms, the fraction a/b is the equivalence class of ordered pairs (a, b)
of integers with b ≠ 0 under the equivalence relation: (a, b) ~ (c, d) if and only if
ad = bc. The arithmetic operations on fractions are given by

a/b + c/d = (ad + bc)/(bd)    and    a/b × c/d = ac/(bd).

These are well defined (independent of choice of representatives of the equivalence
classes) and make the set of fractions into a commutative ring (in fact, a field), Q. The
integers Z are identified with the subring {a/1 | a ∈ Z} of Q and every nonzero integer
a has an inverse 1/a in Q.


It seems reasonable to attempt to follow the same steps for any commutative ring 
R, allowing arbitrary denominators. If, however, b is zero or a zero divisor in R, say
bd = 0, and if we allow b as a denominator, then we should expect to have

d = d/1 = bd/b = 0/b = 0

in the “ring of fractions” (where, for convenience, we have assumed R has a 1). Thus
if we allow zero or zero divisors as denominators there must be some collapsing in 
the sense that we cannot expect R to appear naturally as a subring of this “ring of 
fractions.” A second restriction is more obviously imposed by the laws of addition and 
multiplication: if ring elements b and d are allowed as denominators, then bd must 
also be a denominator, i.e., the set of denominators must be closed under multiplication 
in R. The main result of this section shows that these two restrictions are sufficient to 
construct a ring of fractions for R. Note that this theorem includes the construction of 
Q from Z as a special case. 


Theorem 15. Let R be a commutative ring. Let D be any nonempty subset of R that
does not contain 0, does not contain any zero divisors and is closed under multiplication
(i.e., ab ∈ D for all a, b ∈ D). Then there is a commutative ring Q with 1 such that
Q contains R as a subring and every element of D is a unit in Q. The ring Q has the
following additional properties.

(1) Every element of Q is of the form rd^{-1} for some r ∈ R and d ∈ D. In particular,
if D = R − {0} then Q is a field.

(2) (uniqueness of Q) The ring Q is the “smallest” ring containing R in which all
elements of D become units, in the following sense. Let S be any commutative
ring with identity and let φ : R → S be any injective ring homomorphism
such that φ(d) is a unit in S for every d ∈ D. Then there is an injective
homomorphism Φ : Q → S such that Φ|_R = φ. In other words, any ring
containing an isomorphic copy of R in which all the elements of D become
units must also contain an isomorphic copy of Q.


Remark: In Section 15.4 a more general construction is given. The proof of the general 
result is more technical but relies on the same basic rationale and steps as the proof
of Theorem 15. Readers wishing greater generality may read the proof below and the 
beginning of Section 15.4 in concert. 

Proof: Let T = {(r, d) | r ∈ R, d ∈ D} and define the relation ~ on T by
(r, d) ~ (s, e) if and only if re = sd.
It is immediate that this relation is reflexive and symmetric. Suppose (r, d) ~ (s, e)
and (s, e) ~ (t, f). Then re − sd = 0 and sf − te = 0. Multiplying the first of these
equations by f and the second by d and adding them gives (rf − td)e = 0. Since
e ∈ D is neither zero nor a zero divisor we must have rf − td = 0, i.e., (r, d) ~ (t, f).
This proves ~ is transitive, hence an equivalence relation. Denote the equivalence class
of (r, d) by r/d:

r/d = {(a, b) | a ∈ R, b ∈ D and rb = ad}.

Let Q be the set of equivalence classes under ~. Note that r/d = re/(de) in Q for all e ∈ D,
since D is closed under multiplication.

We now define an additive and multiplicative structure on Q:

a/b + c/d = (ad + bc)/(bd)    and    a/b × c/d = ac/(bd).

In order to prove that Q is a commutative ring with identity there are a number of things
to check:

(1) these operations are well defined (i.e., do not depend on the choice of representatives
for the equivalence classes),

(2) Q is an abelian group under addition, where the additive identity is 0/d for any d ∈ D
and the additive inverse of a/d is −a/d,

(3) multiplication is associative, distributive and commutative, and

(4) Q has an identity (= d/d for any d ∈ D).

These are all completely straightforward calculations involving only arithmetic in
R and the definition of ~. Again we need D to be closed under multiplication for
addition and multiplication to be defined.

For example, to check that addition is well defined assume a/b = a′/b′ (i.e., ab′ = a′b)
and c/d = c′/d′ (i.e., cd′ = c′d). We must show that (ad + bc)/(bd) = (a′d′ + b′c′)/(b′d′), i.e.,

(ad + bc)(b′d′) = (a′d′ + b′c′)(bd).

The left hand side of this equation is ab′dd′ + cd′bb′; substituting a′b for ab′ and c′d
for cd′ gives a′bdd′ + c′dbb′, which is the right hand side. Hence addition of fractions
is well defined. Checking the details in the other parts of (1) to (4) involves even easier
manipulations and so is left as an exercise.

Next we embed R into Q by defining

ι : R → Q by ι : r ↦ rd/d where d is any element of D.

Since rd/d = re/e for all d, e ∈ D, ι(r) does not depend on the choice of d ∈ D. Since
D is closed under multiplication, one checks directly that ι is a ring homomorphism.
Furthermore, ι is injective because

ι(r) = 0 ⇔ rd/d = 0/d ⇔ rd^2 = 0 ⇔ r = 0

because d (hence also d^2) is neither zero nor a zero divisor. The subring ι(R) of Q is
therefore isomorphic to R. We henceforth identify each r ∈ R with ι(r) and so consider
R as a subring of Q.

Next note that each d ∈ D has a multiplicative inverse in Q: namely, if d is
represented by the fraction de/e then its multiplicative inverse is e/(de). One then sees that
every element of Q may be written as r · d^{-1} for some r ∈ R and some d ∈ D. In
particular, if D = R − {0}, every nonzero element of Q has a multiplicative inverse and
Q is a field.

It remains to establish the uniqueness property of Q. Assume φ : R → S is an
injective ring homomorphism such that φ(d) is a unit in S for all d ∈ D. Extend φ to a
map Φ : Q → S by defining Φ(rd^{-1}) = φ(r)φ(d)^{-1} for all r ∈ R, d ∈ D. This map
is well defined, since rd^{-1} = se^{-1} implies re = sd, so φ(r)φ(e) = φ(s)φ(d), and then

Φ(rd^{-1}) = φ(r)φ(d)^{-1} = φ(s)φ(e)^{-1} = Φ(se^{-1}).

It is straightforward to check that Φ is a ring homomorphism; the details are left as an
exercise. Finally, Φ is injective because rd^{-1} ∈ ker Φ implies r ∈ ker Φ ∩ R = ker φ;
since φ is injective this forces r and hence also rd^{-1} to be zero. This completes the
proof.
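
The construction in the proof can be sketched concretely. The code below is an illustration only, assuming R = Z and D = {1, base, base^2, ...} for an integer base ≥ 2; the names in_D, Frac, iota and the parameter base are hypothetical and do not appear in the text.

```python
def in_D(d, base):
    """Return True if d lies in D = {1, base, base**2, ...} (base >= 2)."""
    if d < 1:
        return False
    while d % base == 0:
        d //= base
    return d == 1


class Frac:
    """The class r/d of a pair (r, d) with r in Z and d in D."""

    def __init__(self, r, d, base=2):
        if not in_D(d, base):
            raise ValueError("denominator must lie in D")
        self.r, self.d, self.base = r, d, base

    def __eq__(self, other):
        # (r, d) ~ (s, e) if and only if re = sd.
        return self.r * other.d == other.r * self.d

    def __add__(self, other):
        # r/d + s/e = (re + sd)/(de); de lies in D because D is closed.
        return Frac(self.r * other.d + other.r * self.d,
                    self.d * other.d, self.base)

    def __mul__(self, other):
        # (r/d)(s/e) = rs/(de).
        return Frac(self.r * other.r, self.d * other.d, self.base)

    def __repr__(self):
        return f"{self.r}/{self.d}"


def iota(r, d, base=2):
    """The embedding r -> rd/d of the proof; the class does not depend on d."""
    return Frac(r * d, d, base)
```

Equality is tested by cross multiplication rather than by reducing to lowest terms, which mirrors the definition of ~ and would still make sense in a ring without unique factorization. For instance, Frac(1, 2) == Frac(2, 4) holds, while Frac(1, 3) is rejected because 3 is not in D when base = 2.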

Definition. Let R, D and Q be as in Theorem 15.

(1) The ring Q is called the ring of fractions of D with respect to R and is denoted
D^{-1}R.

(2) If R is an integral domain and D = R − {0}, Q is called the field of fractions
or quotient field of R.

If A is a subset of a field F (for example, if A is a subring of F), then the intersection 
of all the subfields of F containing A is a subfield of F and is called the subfield
generated by A. This subfield is the smallest subfield of F containing A (namely, any 
subfield of F containing A contains the subfield generated by A). 

The next corollary shows that the smallest field containing an integral domain R is 
its field of fractions. 

Corollary 16. Let R be an integral domain and let Q be the field of fractions of R. If
a field F contains a subring R′ isomorphic to R then the subfield of F generated by R′
is isomorphic to Q.

Proof: Let φ : R ≅ R′ ⊆ F be a (ring) isomorphism of R to R′. In particular,
φ : R → F is an injective homomorphism from R into the field F. Let Φ : Q → F be
the extension of φ to Q as in the theorem. By Theorem 15, Φ is injective, so Φ(Q) is an
isomorphic copy of Q in F containing φ(R) = R′. Now, any subfield of F containing
R′ = φ(R) contains the elements φ(r_1)φ(r_2)^{-1} = Φ(r_1 r_2^{-1}) for all r_1, r_2 ∈ R with r_2 ≠ 0. Since
every element of Q is of the form r_1 r_2^{-1} for some r_1, r_2 ∈ R with r_2 ≠ 0, it follows that any subfield
of F containing R′ contains the field Φ(Q), so that Φ(Q) is the subfield of F generated
by R′, proving the corollary.


Examples 

(1) If R is a field then its field of fractions is just R itself.

(2) The integers Z are an integral domain whose field of fractions is the field Q of rational
numbers. The quadratic integer ring O of Section 1 is an integral domain whose field
of fractions is the quadratic field Q(√D).

(3) The subring 2Z of Z also has no zero divisors (but has no identity). Its field of fractions 
is also Q. Note how an identity “appears” in the field of fractions. 

(4) If R is any integral domain, then the polynomial ring R[x] is also an integral domain.
The associated field of fractions is the field of rational functions in the variable x
over R. The elements of this field are of the form p(x)/q(x), where p(x) and q(x) are
polynomials with coefficients in R with q(x) not the zero polynomial. In particular,
p(x) and q(x) may both be constant polynomials, so the field of rational functions
contains the field of fractions of R: elements of the form a/b such that a, b ∈ R and
b ≠ 0. If F is a field, we shall denote the field of rational functions by F(x). Thus if
F is the field of fractions of the integral domain R then the field of rational functions
over R is the same as the field of rational functions over F, namely F(x).

For example, suppose R = Z, so F = Q. If p(x), q(x) are polynomials in
Q[x] then for some integer N, Np(x), Nq(x) have integer coefficients (let N be a
common denominator for all the coefficients in p(x) and q(x), for example). Then
p(x)/q(x) = Np(x)/Nq(x) can be written as the quotient of two polynomials with integer
coefficients, so the field of fractions of Q[x] is the same as the field of fractions of
Z[x].

(5) If R is any commutative ring with identity and d is neither zero nor a zero divisor in R
we may form the ring R[1/d] by setting D = {1, d, d^2, d^3, ...} and defining R[1/d]
to be the ring of fractions D^{-1}R. Note that R is the subring of elements of the form
r/1. In this way any nonzero element of R that is not a zero divisor can be inverted in
a larger ring containing R. Note that the elements of R[1/d] look like polynomials in
1/d with coefficients in R, which explains the notation.
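
The clearing of denominators used in Example (4) for R = Z can be sketched as follows. This is an illustration under stated assumptions: polynomials are represented as lists of Fraction coefficients (constant term first), and the function name clear_denominators is hypothetical. It relies on math.lcm accepting several arguments, which requires Python 3.9 or later.

```python
from fractions import Fraction
from math import lcm  # multi-argument lcm needs Python 3.9+


def clear_denominators(p, q):
    """Return (N, N*p, N*q) where N is a common denominator of all coefficients.

    p and q are nonempty lists of Fraction coefficients of polynomials over Q.
    """
    N = lcm(*(c.denominator for c in p + q))
    scale = lambda poly: [int(N * c) for c in poly]  # N*c is an exact integer
    return N, scale(p), scale(q)


# p(x) = 1/2 + (2/3)x and q(x) = 1 + (5/4)x^2
p = [Fraction(1, 2), Fraction(2, 3)]
q = [Fraction(1), Fraction(0), Fraction(5, 4)]
N, Np, Nq = clear_denominators(p, q)
print(N, Np, Nq)
```

Since p(x)/q(x) = Np(x)/Nq(x), the same rational function is now written with integer coefficients, which is the step that identifies the field of fractions of Q[x] with that of Z[x].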


EXERCISES 

Let R be a commutative ring with identity 1 ≠ 0.

1. Fill in all the details in the proof of Theorem 15.

2. Let R be an integral domain and let D be a nonempty subset of R that is closed under
multiplication. Prove that the ring of fractions D^{-1}R is isomorphic to a subring of the
quotient field of R (hence is also an integral domain).

3. Let F be a field. Prove that F contains a unique smallest subfield F_0 and that F_0 is
isomorphic to either Q or Z/pZ for some prime p (F_0 is called the prime subfield of F).
[See Exercise 26, Section 3.]

4. Prove that any subfield of ℝ must contain Q.

5. If F is a field, prove that the field of fractions of F[[x]] (the ring of formal power series in
the indeterminate x with coefficients in F) is the ring F((x)) of formal Laurent series (cf.
Exercises 3 and 5 of Section 2). Show the field of fractions of the power series ring Z[[x]]
is properly contained in the field of Laurent series Q((x)). [Consider the series for e^x.]

6. Prove that the real numbers, ℝ, contain a subring A with 1 ∈ A and A maximal (under
inclusion) with respect to the property that 1/2 ∉ A. [Use Zorn’s Lemma.] (Exercise 13
in Section 15.3 shows ℝ is the quotient field of A, so ℝ is the quotient field of a proper
subring.)


7.6 THE CHINESE REMAINDER THEOREM 

Throughout this section we shall assume unless otherwise stated that all rings are
commutative with an identity 1 ≠ 0.

Given an arbitrary collection of rings (not necessarily satisfying the conventions
above), their (ring) direct product is defined to be their direct product as (abelian) groups
made into a ring by defining multiplication componentwise. In particular, if R_1 and R_2
are two rings, we shall denote by R_1 × R_2 their direct product (as rings), that is, the set
of ordered pairs (r_1, r_2) with r_1 ∈ R_1 and r_2 ∈ R_2 where addition and multiplication
are performed componentwise:

(r_1, r_2) + (s_1, s_2) = (r_1 + s_1, r_2 + s_2) and (r_1, r_2)(s_1, s_2) = (r_1 s_1, r_2 s_2).

We note that a map φ from a ring R into a direct product ring is a homomorphism if
and only if the induced maps into each of the components are homomorphisms.

There is a generalization to arbitrary rings of the notion in Z of two integers n and
m being relatively prime (even to rings where the notion of greatest common divisor is
not defined). In Z this is equivalent to being able to solve the equation nx + my = 1
in integers x and y (this fact was stated in Chapter 0 and will be proved in Chapter 8).
This in turn is equivalent to nZ + mZ = Z as ideals (in general, nZ + mZ = (m, n)Z).
This motivates the following definition:

Definition. The ideals A and B of the ring R are said to be comaximal if A + B = R. 

Recall that the product, AB, of the ideals A and B of R is the ideal consisting of all
finite sums of elements of the form xy, x ∈ A and y ∈ B (cf. Exercise 34, Section 3).
If A = (a) and B = (b), then AB = (ab). More generally, the product of the ideals
A_1, A_2, ..., A_k is the ideal of all finite sums of elements of the form x_1 x_2 ··· x_k such
that x_i ∈ A_i for all i. If A_i = (a_i), then A_1 ··· A_k = (a_1 ··· a_k).
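
For ideals of Z these notions reduce to integer arithmetic, which gives a quick way to experiment with comaximality. The sketch below assumes positive generators and uses the standard facts (not proved in this section) that aZ + bZ = gcd(a, b)Z and aZ ∩ bZ = lcm(a, b)Z; the function names are illustrative only, and math.lcm requires Python 3.9 or later.

```python
from math import gcd, lcm  # math.lcm needs Python 3.9+


def ideal_sum(a, b):
    """Positive generator of aZ + bZ."""
    return gcd(a, b)


def ideal_intersection(a, b):
    """Positive generator of the intersection of aZ and bZ."""
    return lcm(a, b)


def ideal_product(a, b):
    """Generator of the product ideal (aZ)(bZ) = (ab)Z."""
    return a * b


def comaximal(a, b):
    """aZ and bZ are comaximal exactly when aZ + bZ = Z."""
    return ideal_sum(a, b) == 1


# 4Z and 9Z are comaximal, and their intersection equals their product.
print(comaximal(4, 9), ideal_intersection(4, 9), ideal_product(4, 9))
# 4Z and 6Z are not comaximal, and the product 24Z is strictly smaller than 12Z.
print(comaximal(4, 6), ideal_intersection(4, 6), ideal_product(4, 6))
```

The second case shows why some hypothesis is needed before an intersection of ideals can be replaced by their product.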


Theorem 17. (Chinese Remainder Theorem) Let A_1, A_2, ..., A_k be ideals in R. The
map

R → R/A_1 × R/A_2 × ··· × R/A_k defined by r ↦ (r + A_1, r + A_2, ..., r + A_k)

is a ring homomorphism with kernel A_1 ∩ A_2 ∩ ··· ∩ A_k. If for each i, j ∈ {1, 2, ..., k}
with i ≠ j the ideals A_i and A_j are comaximal, then this map is surjective and
A_1 ∩ A_2 ∩ ··· ∩ A_k = A_1 A_2 ··· A_k, so

R/(A_1 A_2 ··· A_k) = R/(A_1 ∩ A_2 ∩ ··· ∩ A_k) ≅ R/A_1 × R/A_2 × ··· × R/A_k.

Proof: We first prove this for k = 2; the general case will follow by induction.
Let A = A_1 and B = A_2. Consider the map φ : R → R/A × R/B defined by
φ(r) = (r mod A, r mod B), where mod A means the class in R/A containing r (that
is, r + A). This map is a ring homomorphism because φ is just the natural projection
of R into R/A and R/B for the two components. The kernel of φ consists of all the
elements r ∈ R that are in A and in B, i.e., A ∩ B. To complete the proof in this case it
remains to show that when A and B are comaximal, φ is surjective and A ∩ B = AB.
Since A + B = R, there are elements x ∈ A and y ∈ B such that x + y = 1. This
equation shows that φ(x) = (0, 1) and φ(y) = (1, 0) since, for example, x is an element
of A and x = 1 − y ∈ 1 + B. If now (r_1 mod A, r_2 mod B) is an arbitrary element in
R/A × R/B, then the element r_2 x + r_1 y maps to this element since

φ(r_2 x + r_1 y) = φ(r_2)φ(x) + φ(r_1)φ(y)
                 = (r_2 mod A, r_2 mod B)(0, 1) + (r_1 mod A, r_1 mod B)(1, 0)
                 = (0, r_2 mod B) + (r_1 mod A, 0)
                 = (r_1 mod A, r_2 mod B).

This shows that φ is indeed surjective. Finally, the ideal AB is always contained in
A ∩ B. If A and B are comaximal and x and y are as above, then for any c ∈ A ∩ B,
c = c1 = cx + cy ∈ AB. This establishes the reverse inclusion A ∩ B ⊆ AB and
completes the proof when k = 2.

The general case follows easily by induction from the case of two ideals using
A = A_1 and B = A_2 ··· A_k once we show that A_1 and A_2 ··· A_k are comaximal. By
hypothesis, for each i ∈ {2, 3, ..., k} there are elements x_i ∈ A_1 and y_i ∈ A_i such that
x_i + y_i = 1. Since x_i + y_i ≡ y_i mod A_1, it follows that 1 = (x_2 + y_2) ··· (x_k + y_k) is
an element in A_1 + (A_2 ··· A_k). This completes the proof.
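
For R = Z the proof is constructive, and the element r_2 x + r_1 y can be computed directly. The following is a minimal sketch, assuming A = mZ and B = nZ with m, n positive and relatively prime; the helper names ext_gcd, crt2 and crt are illustrative. The function crt folds in one modulus at a time, which is a slightly different bookkeeping of the same induction used above.

```python
def ext_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) = s*a + t*b, for a, b >= 0."""
    if b == 0:
        return (a, 1, 0)
    g, s, t = ext_gcd(b, a % b)
    return (g, t, s - (a // b) * t)


def crt2(r1, m, r2, n):
    """Return c with c = r1 (mod m) and c = r2 (mod n), reduced mod m*n."""
    g, s, t = ext_gcd(m, n)
    if g != 1:
        raise ValueError("mZ and nZ are not comaximal")
    x, y = s * m, t * n  # x in A = mZ, y in B = nZ, and x + y = 1
    return (r2 * x + r1 * y) % (m * n)


def crt(residues, moduli):
    """Solve c = residues[i] (mod moduli[i]) for pairwise coprime moduli."""
    c, m = 0, 1
    for r, n in zip(residues, moduli):
        c, m = crt2(c, m, r, n), m * n
    return c


print(crt2(2, 3, 3, 5))           # a solution of c = 2 mod 3, c = 3 mod 5
print(crt([2, 3, 2], [3, 5, 7]))  # three simultaneous congruences
```

In crt2 the integer x is 0 mod m and 1 mod n, so it plays the role of the element with φ(x) = (0, 1) in the proof, and y plays the role of the element with φ(y) = (1, 0).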

This theorem obtained its name from the special case Z/mnZ ≅ (Z/mZ) × (Z/nZ)
as rings when m and n are relatively prime integers. We proved this isomorphism just
for the additive groups earlier. This isomorphism, phrased in number-theoretic terms,
relates to simultaneously solving two congruences modulo relatively prime integers
(and states that such congruences can always be solved, and uniquely). Such problems
were considered by the ancient Chinese, hence the name. Some examples are provided
in the exercises.

Since the isomorphism in the Chinese Remainder Theorem is an isomorphism of
rings, in particular the groups of units on both sides must be isomorphic. It is easy to
see that the units in any direct product of rings are the elements that have units in each
of the coordinates. In the case of Z/mnZ the Chinese Remainder Theorem gives the
following isomorphism on the groups of units:

(Z/mnZ)^× ≅ (Z/mZ)^× × (Z/nZ)^×.

More generally we have the following result.

Corollary 18. Let n be a positive integer and let p_1^{α_1} p_2^{α_2} ··· p_k^{α_k} be its factorization
into powers of distinct primes. Then

Z/nZ ≅ (Z/p_1^{α_1}Z) × (Z/p_2^{α_2}Z) × ··· × (Z/p_k^{α_k}Z),

as rings, so in particular we have the following isomorphism of multiplicative groups:

(Z/nZ)^× ≅ (Z/p_1^{α_1}Z)^× × (Z/p_2^{α_2}Z)^× × ··· × (Z/p_k^{α_k}Z)^×.


If we compare orders on the two sides of this last isomorphism, we obtain the
formula

φ(n) = φ(p_1^{α_1}) φ(p_2^{α_2}) ··· φ(p_k^{α_k})

for the Euler φ-function. This in turn implies that φ is what in elementary number
theory is termed a multiplicative function, namely that φ(ab) = φ(a)φ(b) whenever a
and b are relatively prime positive integers. The value of φ on prime powers p^α is easily
seen to be φ(p^α) = p^{α−1}(p − 1) (cf. Chapter 0). From this and the multiplicativity of
φ we obtain its value on all positive integers.
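
These statements about units and the Euler φ-function are easy to check numerically for small moduli. The sketch below is an illustration only: it computes φ(n) by counting units directly rather than from the prime factorization, and the names units, phi and unit_pairs are hypothetical.

```python
from math import gcd


def units(n):
    """Representatives in 1..n of the units of Z/nZ."""
    return [a for a in range(1, n + 1) if gcd(a, n) == 1]


def phi(n):
    """Euler phi-function, computed as the order of the unit group of Z/nZ."""
    return len(units(n))


def unit_pairs(m, n):
    """Image of the units of Z/mnZ under a -> (a mod m, a mod n)."""
    return {(a % m, a % n) for a in units(m * n)}


# For relatively prime m = 8 and n = 9 the units of Z/72Z correspond exactly
# to pairs (unit mod 8, unit mod 9), so phi(72) = phi(8) * phi(9).
expected = {(a, b) for a in units(8) for b in units(9)}
print(unit_pairs(8, 9) == expected, phi(72), phi(8) * phi(9))
```

Counting units this way takes time proportional to n, so it is suitable only for experimentation; the prime power formula together with multiplicativity is the practical way to evaluate φ.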

Corollary 18 is also a step toward a determination of the decomposition of the
abelian group (Z/nZ)^× into a direct product of cyclic groups. The complete structure
is derived at the end of Section 9.5.

EXERCISES 


Let R be a ring with identity 1^0. 

1. An element e e R is called an Idempotent if e 2 = e. Assume e is an idem potent in R and 
er = re for all r e R. Prove that Re and 7?(1 — e) are two-sided ideals of R and that 
R = Re x R( 1 — e). Show that e and 1 — e are identities for the subrings Re and R{ \ — e) 
respectively. 

2. Let R be a finite Boolean ring with identity 1^0 (cf. Exercise 15 of Section 1). Prove 
that R = Z/2Z x - - ■ x Z/2Z. [Use the preceding exercise.] 

3. Let R and S be rings with identities. Prove that every ideal of R x S is of the form 7x7 
where 7 is an ideal of R and 7 is an ideal of S. 

4. Prove that if R and S arc nonzero rings then 7? x S is never a field. 

5. Let $n_1, n_2, \ldots, n_k$ be integers which are relatively prime in pairs: $(n_i, n_j) = 1$ for all $i \neq j$. 

(a) Show that the Chinese Remainder Theorem implies that for any $a_1, \ldots, a_k \in \mathbb{Z}$ there 
is a solution $x \in \mathbb{Z}$ to the simultaneous congruences 

$x \equiv a_1 \bmod n_1, \quad x \equiv a_2 \bmod n_2, \quad \ldots, \quad x \equiv a_k \bmod n_k$

and that the solution x is unique mod $n = n_1 n_2 \cdots n_k$. 

(b) Let $n_i' = n/n_i$ be the quotient of n by $n_i$, which is relatively prime to $n_i$ by assumption. 
Let $t_i$ be the inverse of $n_i'$ mod $n_i$. Prove that the solution x in (a) is given by 

$x = a_1 t_1 n_1' + a_2 t_2 n_2' + \cdots + a_k t_k n_k' \bmod n.$

Note that the elements $t_i$ can be quickly found by the Euclidean Algorithm as described 
in Section 2 of the Preliminaries chapter (writing $a n_i + b n_i' = (n_i, n_i') = 1$ gives 
$t_i = b$) and that these then quickly give the solutions to the system of congruences 
above for any choice of $a_1, a_2, \ldots, a_k$. 

(c) Solve the simultaneous system of congruences 

$x \equiv 1 \bmod 8, \quad x \equiv 2 \bmod 25, \quad \text{and} \quad x \equiv 3 \bmod 81$

and the simultaneous system 

$y \equiv 5 \bmod 8, \quad y \equiv 12 \bmod 25, \quad \text{and} \quad y \equiv 47 \bmod 81.$

6. Let $f_1(x), f_2(x), \ldots, f_k(x)$ be polynomials with integer coefficients of the same degree 
d. Let $n_1, n_2, \ldots, n_k$ be integers which are relatively prime in pairs (i.e., $(n_i, n_j) = 1$ for 
all $i \neq j$). Use the Chinese Remainder Theorem to prove there exists a polynomial $f(x)$ 
with integer coefficients and of degree d with 

$f(x) \equiv f_1(x) \bmod n_1, \quad f(x) \equiv f_2(x) \bmod n_2, \quad \ldots, \quad f(x) \equiv f_k(x) \bmod n_k$

i.e., the coefficients of $f(x)$ agree with the coefficients of $f_i(x)$ mod $n_i$. Show that if all 
the $f_i(x)$ are monic, then $f(x)$ may also be chosen monic. [Apply the Chinese Remainder 
Theorem in $\mathbb{Z}$ to each of the coefficients separately.] 

7. Let m and n be positive integers with n dividing m. Prove that the natural surjective ring 
projection $\mathbb{Z}/m\mathbb{Z} \to \mathbb{Z}/n\mathbb{Z}$ is also surjective on the units: $(\mathbb{Z}/m\mathbb{Z})^\times \to (\mathbb{Z}/n\mathbb{Z})^\times$. 

The next four exercises develop the concept of direct limits and the “dual” notion of inverse 
limits. In these exercises I is a nonempty index set with a partial order $\le$ (cf. Appendix I). For 
each $i \in I$ let $A_i$ be an additive abelian group. In Exercise 8 assume also that I is a directed 
set: for every $i, j \in I$ there is some $k \in I$ with $i \le k$ and $j \le k$. 

8. Suppose for every pair of indices i, j with $i \le j$ there is a map $\rho_{ij} : A_i \to A_j$ such that 
the following hold: 

i. $\rho_{jk} \circ \rho_{ij} = \rho_{ik}$ whenever $i \le j \le k$, and 
ii. $\rho_{ii} = 1$ for all $i \in I$. 

Let B be the disjoint union of all the $A_i$. Define a relation $\sim$ on B by 

$a \sim b$ if and only if there exists k with $i, j \le k$ and $\rho_{ik}(a) = \rho_{jk}(b)$, 
for $a \in A_i$ and $b \in A_j$. 

(a) Show that $\sim$ is an equivalence relation on B. (The set of equivalence classes is called 
the direct or inductive limit of the directed system $\{A_i\}$, and is denoted $\varinjlim A_i$. In the 
remaining parts of this exercise let $A = \varinjlim A_i$.) 

(b) Let $\bar{x}$ denote the class of x in A and define $\rho_i : A_i \to A$ by $\rho_i(a) = \bar{a}$. Show that 
if each $\rho_{ij}$ is injective, then so is $\rho_i$ for all i (so we may then identify each $A_i$ as a 
subset of A). 

(c) Assume all $\rho_{ij}$ are group homomorphisms. For $a \in A_i$, $b \in A_j$ show that the operation 

$\bar{a} + \bar{b} = \overline{\rho_{ik}(a) + \rho_{jk}(b)}$

where k is any index with $i, j \le k$, is well defined and makes A into an abelian group. 
Deduce that the maps $\rho_i$ in (b) are group homomorphisms from $A_i$ to A. 

(d) Show that if all $A_i$ are commutative rings with 1 and all $\rho_{ij}$ are ring homomorphisms 
that send 1 to 1, then A may likewise be given the structure of a commutative ring 
with 1 such that all $\rho_i$ are ring homomorphisms. 

(e) Under the hypotheses in (c) prove that the direct limit has the following universal 
property: if C is any abelian group such that for each $i \in I$ there is a homomorphism 
$\varphi_i : A_i \to C$ with $\varphi_i = \varphi_j \circ \rho_{ij}$ whenever $i \le j$, then there is a unique homomorphism 
$\varphi : A \to C$ such that $\varphi \circ \rho_i = \varphi_i$ for all i. 

9. Let I be the collection of open intervals U = (a, b) on the real line containing a fixed real 
number p. Order these by reverse inclusion: $U \le V$ if $V \subseteq U$ (note that I is a directed set). 
For each U let $A_U$ be the ring of continuous real valued functions on U. For $V \subseteq U$ define 
the restriction maps $\rho_{UV} : A_U \to A_V$ by $f \mapsto f|_V$, the usual restriction of a function on 
U to a function on the subset V (which is easily seen to be a ring homomorphism). Let 
$A = \varinjlim A_U$ be the direct limit. In the notation of the preceding exercise, show that the 
maps $\rho_U : A_U \to A$ are not injective but are all surjective (A is called the ring of germs 
of continuous functions at p). 

We now develop the notion of inverse limits. Continue to assume I is a partially ordered set 
(but not necessarily directed), and $A_i$ is a group for all $i \in I$. 

10. Suppose for every pair of indices i, j with $i \le j$ there is a map $\mu_{ji} : A_j \to A_i$ such that 
the following hold: 

i. $\mu_{ji} \circ \mu_{kj} = \mu_{ki}$ whenever $i \le j \le k$, and 

ii. $\mu_{ii} = 1$ for all $i \in I$. 

Let P be the subset of elements $(a_i)_{i \in I}$ in the direct product $\prod_{i \in I} A_i$ such that $\mu_{ji}(a_j) = a_i$ 
whenever $i \le j$ (here $a_i$ and $a_j$ are the i-th and j-th components respectively of the element 
in the direct product). The set P is called the inverse or projective limit of the system $\{A_i\}$, 
and is denoted $\varprojlim A_i$. 

(a) Assume all $\mu_{ji}$ are group homomorphisms. Show that P is a subgroup of the direct 
product group (cf. Exercise 15, Section 5.1). 

(b) Assume the hypotheses in (a), and let $I = \mathbb{Z}^+$ (usual ordering). For each $i \in I$ let 
$\mu_i : P \to A_i$ be the projection of P onto its i-th component. Show that if each $\mu_{ji}$ is 
surjective, then so is $\mu_i$ for all i (so each $A_i$ is a quotient group of P). 

(c) Show that if all $A_i$ are commutative rings with 1 and all $\mu_{ji}$ are ring homomorphisms 
that send 1 to 1, then P may likewise be given the structure of a commutative ring 
with 1 such that all $\mu_i$ are ring homomorphisms. 

(d) Under the hypotheses in (a) prove that the inverse limit has the following universal 
property: if D is any group such that for each $i \in I$ there is a homomorphism 
$\pi_i : D \to A_i$ with $\pi_i = \mu_{ji} \circ \pi_j$ whenever $i \le j$, then there is a unique homomorphism 
$\pi : D \to P$ such that $\mu_i \circ \pi = \pi_i$ for all i. 

11. Let p be a prime, let $I = \mathbb{Z}^+$, let $A_i = \mathbb{Z}/p^i\mathbb{Z}$ and let $\mu_{ji}$ be the natural projection maps 

$\mu_{ji} : a \pmod{p^j} \mapsto a \pmod{p^i}.$

The inverse limit $\varprojlim \mathbb{Z}/p^i\mathbb{Z}$ is called the ring of p-adic integers, and is denoted by $\mathbb{Z}_p$. 

(a) Show that every element of $\mathbb{Z}_p$ may be written uniquely as an infinite formal sum 
$b_0 + b_1 p + b_2 p^2 + b_3 p^3 + \cdots$ with each $b_i \in \{0, 1, \ldots, p - 1\}$. Describe the rules for 
adding and multiplying such formal sums corresponding to addition and multiplication 
in the ring $\mathbb{Z}_p$. [Write a least residue in each $\mathbb{Z}/p^i\mathbb{Z}$ in its base p expansion and then 
describe the maps $\mu_{ji}$.] (Note in particular that $\mathbb{Z}_p$ is uncountable.) 

(b) Prove that $\mathbb{Z}_p$ is an integral domain that contains a copy of the integers. 

(c) Prove that $b_0 + b_1 p + b_2 p^2 + b_3 p^3 + \cdots$ as in (a) is a unit in $\mathbb{Z}_p$ if and only if $b_0 \neq 0$. 

(d) Prove that $p\mathbb{Z}_p$ is the unique maximal ideal of $\mathbb{Z}_p$ and $\mathbb{Z}_p/p\mathbb{Z}_p \cong \mathbb{Z}/p\mathbb{Z}$ (where 
$p = 0 + 1p + 0p^2 + 0p^3 + \cdots$). Prove that every ideal of $\mathbb{Z}_p$ is of the form $p^n\mathbb{Z}_p$ 
for some integer $n \ge 0$. 

(e) Show that if $a_1 \not\equiv 0 \pmod{p}$ then there is an element $a = (a_i)$ in the inverse limit $\mathbb{Z}_p$ 
satisfying $a_j^{p-1} \equiv 1 \pmod{p^j}$ and $\mu_{j1}(a_j) = a_1$ for all j. Deduce that $\mathbb{Z}_p$ contains 
p - 1 distinct (p - 1)st roots of 1. 
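
The explicit solution formula in Exercise 5(b) above translates directly into a short computation. The sketch below is one possible rendering in Python, not a prescribed method: the function name `crt` is an illustrative choice, and the inverses $t_i$ are obtained with Python's built-in modular inverse `pow(x, -1, m)` (Python 3.8 or later) rather than by writing out the Euclidean Algorithm.

```python
from math import prod

def crt(residues, moduli):
    """Solve x = a_i (mod n_i) for pairwise relatively prime moduli n_i.

    Returns the unique solution x with 0 <= x < n, where n is the product
    of the moduli, using x = sum of a_i * t_i * n_i' with n_i' = n / n_i
    and t_i the inverse of n_i' modulo n_i.
    """
    n = prod(moduli)
    x = 0
    for a_i, n_i in zip(residues, moduli):
        n_i_prime = n // n_i
        t_i = pow(n_i_prime, -1, n_i)  # raises ValueError if not invertible
        x += a_i * t_i * n_i_prime
    return x % n
```

For instance, `crt([2, 3, 2], [3, 5, 7])` returns 23, the least nonnegative integer congruent to 2 mod 3, 3 mod 5 and 2 mod 7.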


Sec. 7.6 


The Chinese Remainder Theorem 




CHAPTER 8 


Euclidean Domains, 
Principal Ideal Domains, 
and Unique Factorization Domains 


There are a number of classes of rings with more algebraic structure than generic 
rings. Those considered in this chapter are rings with a division algorithm (Euclidean 
Domains), rings in which every ideal is principal (Principal Ideal Domains) and rings in 
which elements have factorizations into primes (Unique Factorization Domains). The 
principal examples of such rings are the ring Z of integers and polynomial rings F[x] 
with coefficients in some field F. We prove here all the theorems on the integers Z 
stated in the Preliminaries chapter as special cases of results valid for more general 
rings. These results will be applied to the special case of the ring F[x] in the next 
chapter. 

All rings in this chapter are commutative. 

8.1 EUCLIDEAN DOMAINS 

We first define the notion of a norm on an integral domain R. This is essentially no 
more than a measure of “size” in R. 

Definition. Any function $N : R \to \mathbb{Z}^+ \cup \{0\}$ with N(0) = 0 is called a norm on the 
integral domain R. If N(a) > 0 for $a \neq 0$ define N to be a positive norm. 

We observe that this notion of a norm is fairly weak and that it is possible for the 
same integral domain R to possess several different norms. 

Definition. The integral domain R is said to be a Euclidean Domain (or possess a 
Division Algorithm) if there is a norm N on R such that for any two elements a and b 
of R with $b \neq 0$ there exist elements q and r in R with 

a = qb + r with r = 0 or N(r) < N(b). 

The element q is called the quotient and the element r the remainder of the division. 





The importance of the existence of a Division Algorithm on an integral domain R 
is that it allows a Euclidean Algorithm for two elements a and b of R: by successive 
“divisions” (these actually are divisions in the field of fractions of R) we can write 

$a = q_0 b + r_0$                      (0) 
$b = q_1 r_0 + r_1$                    (1) 
$r_0 = q_2 r_1 + r_2$                  (2) 
$\vdots$
$r_{n-2} = q_n r_{n-1} + r_n$          (n) 
$r_{n-1} = q_{n+1} r_n$                (n + 1) 

where $r_n$ is the last nonzero remainder. Such an $r_n$ exists since $N(b) > N(r_0) > 
N(r_1) > \cdots > N(r_n)$ is a decreasing sequence of nonnegative integers if the remainders 
are nonzero, and such a sequence cannot continue indefinitely. Note also that there is 
no guarantee that these elements are unique. 
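
The chain of equations (0) through (n + 1) can be generated mechanically once a division-with-remainder routine is available. The sketch below is an illustration only: the name `euclidean_chain` and the `divide` parameter are assumptions introduced here, and by default the routine uses Python's `divmod`, which gives the Division Algorithm in Z with nonnegative remainder when the divisor is positive.

```python
def euclidean_chain(a, b, divide=divmod):
    """Run the Euclidean Algorithm on a and b (with b nonzero).

    `divide(x, y)` must return a pair (q, r) with x = q*y + r and r either
    zero or of smaller norm than y.  Returns the list of rows
    (dividend, quotient, divisor, remainder), one per equation, together
    with the last nonzero remainder.
    """
    rows = []
    while b != 0:
        q, r = divide(a, b)
        rows.append((a, q, b, r))
        a, b = b, r
    # when the loop stops, `a` holds the divisor of the final (exact) division
    return rows, a
```

Passing a different `divide` function is the only change needed to run the same loop in another Euclidean Domain whose elements support comparison with 0.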

Examples 

(0) Fields are trivial examples of Euclidean Domains where any norm will satisfy the 
defining condition (e.g., N(a) = 0 for all a). This is because for every a, b with $b \neq 0$ 
we have a = qb + 0, where $q = ab^{-1}$. 

(1) The integers Z are a Euclidean Domain with norm given by N(a) = |a|, the usual 
absolute value. The existence of a Division Algorithm in Z (the familiar “long division” 
of elementary arithmetic) is verified as follows. Let a and b be two nonzero integers 
and suppose first that b > 0. The half open intervals $[nb, (n+1)b)$, $n \in \mathbb{Z}$ partition 
the real line and so a is in one of them, say $a \in [kb, (k+1)b)$. For q = k we have 
$a - qb = r \in [0, |b|)$ as needed. If b < 0 (so -b > 0), by what we have just seen 
there is an integer q such that a = q(-b) + r with either r = 0 or |r| < |-b|; then 
a = (-q)b + r satisfies the requirements of the Division Algorithm for a and b. This 
argument can be made more formal by using induction on |a|. 

Note that if a is not a multiple of b there are always two possibilities for the 
pair q, r: the proof above always produced a positive remainder r. If for example 
b > 0 and q, r are as above with r > 0, then a = q'b + r' with q' = q + 1 and 
r' = r - b also satisfy the conditions of the Division Algorithm applied to a, b. Thus 
$5 = 2 \cdot 2 + 1 = 3 \cdot 2 - 1$ are the two ways of applying the Division Algorithm in Z to 
a = 5 and b = 2. The quotient and remainder are unique if we require the remainder 
to be nonnegative. 

(2) If F is a field, then the polynomial ring F[x] is a Euclidean Domain with norm 
given by N(p(x)) = the degree of p(x). The Division Algorithm for polynomials is 
simply “long division” of polynomials which may be familiar for polynomials with 
real coefficients. The proof is very similar to that for Z and is given in the next chapter 
(although for polynomials the quotient and remainder are shown to be unique). In 
order for a polynomial ring to be a Euclidean Domain the coefficients must come from 
a field since the division algorithm ultimately rests on being able to divide arbitrary 
nonzero coefficients. We shall prove in Section 2 that R[x] is not a Euclidean Domain 
if R is not a field. 

(3) The quadratic integer rings $\mathcal{O}$ in Section 7.1 are integral domains with a norm defined 
by the absolute value of the field norm (to ensure the values taken are nonnegative; 
when D < 0 the field norm is itself a norm), but in general $\mathcal{O}$ is not Euclidean with 
respect to this norm (or any other norm). The Gaussian integers $\mathbb{Z}[i]$ (where D = -1), 
however, are a Euclidean Domain with respect to the norm $N(a + bi) = a^2 + b^2$, as 
we now show (cf. also the end of Section 3). 

Let $\alpha = a + bi$, $\beta = c + di$ be two elements of $\mathbb{Z}[i]$ with $\beta \neq 0$. Then in the field 
$\mathbb{Q}(i)$ we have $\alpha/\beta = r + si$ where $r = (ac + bd)/(c^2 + d^2)$ and $s = (bc - ad)/(c^2 + d^2)$ 
are rational numbers. Let p be an integer closest to the rational number r and let q be 
an integer closest to the rational number s, so that both |r - p| and |s - q| are at most 
1/2. The Division Algorithm follows immediately once we show 

$\alpha = (p + qi)\beta + \gamma$ for some $\gamma \in \mathbb{Z}[i]$ with $N(\gamma) \le \tfrac{1}{2} N(\beta)$

which is even stronger than necessary. Let $\theta = (r - p) + (s - q)i$ and set $\gamma = \beta\theta$. 
Then $\gamma = \alpha - (p + qi)\beta$, so that $\gamma \in \mathbb{Z}[i]$ is a Gaussian integer and $\alpha = (p + qi)\beta + \gamma$. 
Since $N(\theta) = (r - p)^2 + (s - q)^2$ is at most 1/4 + 1/4 = 1/2, the multiplicativity of 
the norm N implies that $N(\gamma) = N(\theta)N(\beta) \le \tfrac{1}{2} N(\beta)$ as claimed. 

Note that the algorithm is quite explicit since a quotient p + qi is quickly determined 
from the rational numbers r and s, and then the remainder $\gamma = \alpha - (p + qi)\beta$ 
is easily computed. Note also that the quotient need not be unique: if r (or s) is half 
of an odd integer then there are two choices for p (or for q, respectively). 

This proof that $\mathbb{Z}[i]$ is a Euclidean Domain can also be used to show that $\mathcal{O}$ 
is a Euclidean Domain (with respect to the field norm defined in Section 7.1) for 
D = -2, -3, -7, -11 (cf. the exercises). We shall see shortly that $\mathbb{Z}[\sqrt{-5}]$ is not 
Euclidean with respect to any norm, and a proof that $\mathbb{Z}[(1 + \sqrt{-19})/2]$ is not a 
Euclidean Domain with respect to any norm appears at the end of this section. 

(4) Recall (cf. Exercise 26 in Section 7.1) that a discrete valuation ring is obtained as 
follows. Let K be a field. A discrete valuation on K is a function $\nu : K^\times \to \mathbb{Z}$ 
satisfying 

(i) $\nu(ab) = \nu(a) + \nu(b)$ (i.e., $\nu$ is a homomorphism from the multiplicative group of 
nonzero elements of K to Z), 

(ii) $\nu$ is surjective, and 

(iii) $\nu(x + y) \ge \min\{\nu(x), \nu(y)\}$ for all $x, y \in K^\times$ with $x + y \neq 0$. 

The set $\{x \in K^\times \mid \nu(x) \ge 0\} \cup \{0\}$ is a subring of K called the valuation ring of $\nu$. 
An integral domain R is called a discrete valuation ring if there is a valuation $\nu$ on its 
field of fractions such that R is the valuation ring of $\nu$. 

For example the ring R of all rational numbers whose denominators are relatively 
prime to the fixed prime $p \in \mathbb{Z}$ is a discrete valuation ring contained in Q. 

A discrete valuation ring is easily seen to be a Euclidean Domain with respect 
to the norm defined by N(0) = 0 and $N = \nu$ on the nonzero elements of R. This is 
because for $a, b \in R$ with $b \neq 0$ 

(a) if N(a) < N(b) then $a = 0 \cdot b + a$, and 

(b) if $N(a) \ge N(b)$ then it follows from property (i) of a discrete valuation that 
$q = ab^{-1} \in R$, so a = qb + 0. 
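
The division step for the Gaussian integers in Example (3) above is explicit enough to be carried out by machine. The following Python sketch is one way to do so, under stated assumptions: a Gaussian integer a + bi is represented as a pair `(a, b)`, the names `norm` and `gauss_divmod` are illustrative, and ties in "an integer closest to" are broken by rounding half up, which is one of the permitted choices.

```python
from fractions import Fraction
from math import floor

def norm(z):
    """Norm N(a + bi) = a^2 + b^2 of the Gaussian integer z = (a, b)."""
    a, b = z
    return a * a + b * b

def gauss_divmod(alpha, beta):
    """Divide alpha by beta != 0 in Z[i]; return (quotient, remainder).

    The quotient p + qi is obtained by rounding the exact quotient r + si
    in Q(i) coordinatewise, so N(remainder) <= N(beta) / 2.
    """
    a, b = alpha
    c, d = beta
    n = c * c + d * d
    r = Fraction(a * c + b * d, n)
    s = Fraction(b * c - a * d, n)
    p = floor(r + Fraction(1, 2))  # an integer closest to r
    q = floor(s + Fraction(1, 2))  # an integer closest to s
    # gamma = alpha - (p + qi)(c + di)
    gamma = (a - (p * c - q * d), b - (p * d + q * c))
    return (p, q), gamma
```

For example, dividing 11 + 3i by 1 + 8i with this routine gives quotient 1 - i and remainder 2 - 4i, whose norm 20 is at most half of N(1 + 8i) = 65.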


The first implication of a Division Algorithm for the integral domain R is that it 
forces every ideal of R to be principal. 

Proposition 1. Every ideal in a Euclidean Domain is principal. More precisely, if I is 
any nonzero ideal in the Euclidean Domain R then I = (d), where d is any nonzero 
element of I of minimum norm. 


Proof: If I is the zero ideal, there is nothing to prove. Otherwise let d be any 
nonzero element of I of minimum norm (such a d exists since the set $\{N(a) \mid a \in I\}$ 
has a minimum element by the Well Ordering of Z). Clearly $(d) \subseteq I$ since d is an 
element of I. To show the reverse inclusion let a be any element of I and use the 
Division Algorithm to write a = qd + r with r = 0 or N(r) < N(d). Then r = a - qd 
and both a and qd are in I, so r is also an element of I. By the minimality of the norm 
of d, we see that r must be 0. Thus $a = qd \in (d)$ showing I = (d). 


Proposition 1 shows that every ideal of Z is principal. This fundamental property 
of Z was previously determined (in Section 7.3) from the (additive) group structure of 
Z, using the classification of the subgroups of cyclic groups in Section 2.3. Note that 
these are really the same proof, since the results in Section 2.3 ultimately relied on the 
Euclidean Algorithm in Z. 

Proposition 1 can also be used to prove that some integral domains R are not 
Euclidean Domains (with respect to any norm) by proving the existence of ideals of R 
that are not principal. 

Examples 

(1) Let R = Z[x]. Since the ideal (2, x) is not principal (cf. Example 3 at the beginning 
of Section 7.4), it follows that the ring Z[x] of polynomials with integer coefficients 
is not a Euclidean Domain (for any choice of norm), even though the ring Q[x] of 
polynomials with rational coefficients is a Euclidean Domain. 

(2) Let R be the quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$, let N be the associated field norm 
$N(a + b\sqrt{-5}) = a^2 + 5b^2$ and consider the ideal $I = (3, 2 + \sqrt{-5})$ generated by 3 and 
$2 + \sqrt{-5}$. Suppose $I = (a + b\sqrt{-5})$, $a, b \in \mathbb{Z}$, were principal, i.e., $3 = \alpha(a + b\sqrt{-5})$ 
and $2 + \sqrt{-5} = \beta(a + b\sqrt{-5})$ for some $\alpha, \beta \in R$. Taking norms in the first equation 
gives $9 = N(\alpha)(a^2 + 5b^2)$ and since $a^2 + 5b^2$ is a positive integer it must be 1, 3 or 9. 
If the value is 9 then $N(\alpha) = 1$ and $\alpha = \pm 1$, so $a + b\sqrt{-5} = \pm 3$, which is impossible 
by the second equation since the coefficients of $2 + \sqrt{-5}$ are not divisible by 3. The 
value cannot be 3 since there are no integer solutions to $a^2 + 5b^2 = 3$. If the value is 1, 
then $a + b\sqrt{-5} = \pm 1$ and the ideal I would be the entire ring R. But then 1 would be 
an element of I, so $3\gamma + (2 + \sqrt{-5})\delta = 1$ for some $\gamma, \delta \in R$. Multiplying both sides 
by $2 - \sqrt{-5}$ would then imply that $2 - \sqrt{-5}$ is a multiple of 3 in R, a contradiction. It 
follows that I is not a principal ideal and so R is not a Euclidean Domain (with respect 
to any norm). 
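
The last step of the argument in Example (2), multiplying by $2 - \sqrt{-5}$, can be written out as follows. This is only a worked version of the step stated in the text, using the same symbols $\gamma$ and $\delta$.

```latex
\begin{align*}
3\gamma + (2+\sqrt{-5})\,\delta &= 1 \\
3\gamma\,(2-\sqrt{-5}) + (2+\sqrt{-5})(2-\sqrt{-5})\,\delta &= 2-\sqrt{-5} \\
3\gamma\,(2-\sqrt{-5}) + 9\,\delta &= 2-\sqrt{-5} \\
3\bigl(\gamma\,(2-\sqrt{-5}) + 3\,\delta\bigr) &= 2-\sqrt{-5}
\end{align*}
```

Every multiple of 3 in R has both coefficients divisible by 3, whereas the coefficients 2 and -1 of $2 - \sqrt{-5}$ are not, which is the contradiction.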


One of the fundamental consequences of the Euclidean Algorithm in Z is that it 
produces a greatest common divisor of two nonzero elements. This is true in any 
Euclidean Domain. The notion of a greatest common divisor of two elements (if it 
exists) can be made precise in general rings. 

Definition. Let R be a commutative ring and let $a, b \in R$ with $b \neq 0$. 

(1) a is said to be a multiple of b if there exists an element $x \in R$ with a = bx. In 
this case b is said to divide a or be a divisor of a, written $b \mid a$. 

(2) A greatest common divisor of a and b is a nonzero element d such that 

(i) $d \mid a$ and $d \mid b$, and 

(ii) if $d' \mid a$ and $d' \mid b$ then $d' \mid d$. 

A greatest common divisor of a and b will be denoted by g.c.d.(a, b), or (abusing 
the notation) simply (a, b). 

Note that $b \mid a$ in a ring R if and only if $a \in (b)$ if and only if $(a) \subseteq (b)$. In 
particular, if d is any divisor of both a and b then (d) must contain both a and b and 
hence must contain the ideal generated by a and b. The defining properties (i) and (ii) 
of a greatest common divisor of a and b translated into the language of ideals therefore 
become (respectively): 

if I is the ideal of R generated by a and b, then d is a greatest common divisor of 
a and b if 

(i) I is contained in the principal ideal (d), and 

(ii) if (d') is any principal ideal containing I then $(d) \subseteq (d')$. 

Thus a greatest common divisor of a and b (if such exists) is a generator for the unique 
smallest principal ideal containing a and b. There are rings in which greatest common 
divisors do not exist. 

This discussion immediately gives the following sufficient condition for the exis- 
tence of a greatest common divisor. 

Proposition 2. If a and b are nonzero elements in the commutative ring R such that the 
ideal generated by a and b is a principal ideal (d), then d is a greatest common divisor 
of a and b. 

This explains why the symbol (a, b) is often used to denote both the ideal generated 
by a and b and a greatest common divisor of a and b. An integral domain in which 
every ideal (a, b) generated by two elements is principal is called a Bezout Domain. 
The exercises in this and subsequent sections explore these rings and show that there 
are Bezout Domains containing nonprincipal (necessarily infinitely generated) ideals. 

Note that the condition in Proposition 2 is not a necessary condition. For example, 
in the ring R = Z[x] the elements 2 and x generate a maximal, nonprincipal ideal (cf. 
the examples in Section 7.4). Thus R = (1) is the unique principal ideal containing 
both 2 and x, so 1 is a greatest common divisor of 2 and x. We shall see other examples 
along these lines in Section 3. 

Before returning to Euclidean Domains we examine the uniqueness of greatest 
common divisors. 

Proposition 3. Let R be an integral domain. If two elements d and d' of R generate the 
same principal ideal, i.e., (d) = (d'), then d' = ud for some unit u in R. In particular, 
if d and d' are both greatest common divisors of a and b, then d' = ud for some unit u. 

Proof: This is clear if either d or d' is zero so we may assume d and d' are nonzero. 
Since $d \in (d')$ there is some $x \in R$ such that d = xd'. Since $d' \in (d)$ there is some 
$y \in R$ such that d' = yd. Thus d = xyd and so d(1 - xy) = 0. Since $d \neq 0$, xy = 1, 
that is, both x and y are units. This proves the first assertion. The second assertion 
follows from the first since any two greatest common divisors of a and b generate the 
same principal ideal (they divide each other). 

One of the most important properties of Euclidean Domains is that greatest common 
divisors always exist and can be computed algorithmically. 

Theorem 4. Let R be a Euclidean Domain and let a and b be nonzero elements of 
R. Let $d = r_n$ be the last nonzero remainder in the Euclidean Algorithm for a and b 
described at the beginning of this chapter. Then 

(1) d is a greatest common divisor of a and b, and 

(2) the principal ideal (d) is the ideal generated by a and b. In particular, d can be 
written as an R-linear combination of a and b, i.e., there are elements x and y 
in R such that 

d = ax + by. 

Proof: By Proposition 1, the ideal generated by a and b is principal so a, b do have 
a greatest common divisor, namely any element which generates the (principal) ideal 
(a, b). Both parts of the theorem will follow therefore once we show $d = r_n$ generates 
this ideal, i.e., once we show that 

(i) $d \mid a$ and $d \mid b$ (so $(a, b) \subseteq (d)$) 

(ii) d is an R-linear combination of a and b (so $(d) \subseteq (a, b)$). 

To prove that d divides both a and b simply keep track of the divisibilities in the 
Euclidean Algorithm. Starting from the (n+1)st equation, $r_{n-1} = q_{n+1} r_n$, we see that 
$r_n \mid r_{n-1}$. Clearly $r_n \mid r_n$. By induction (proceeding from index n downwards to index 
0) assume $r_n$ divides $r_{k+1}$ and $r_k$. By the (k+1)st equation, $r_{k-1} = q_{k+1} r_k + r_{k+1}$, and 
since $r_n$ divides both terms on the right hand side we see that $r_n$ also divides $r_{k-1}$. From 
the 1st equation in the Euclidean Algorithm we obtain that $r_n$ divides b and then from 
the 0th equation we get that $r_n$ divides a. Thus (i) holds. 

To prove that $r_n$ is in the ideal (a, b) generated by a and b proceed similarly by 
induction proceeding from equation (0) to equation (n). It follows from equation (0) 
that $r_0 \in (a, b)$ and by equation (1) that $r_1 = b - q_1 r_0 \in (b, r_0) \subseteq (a, b)$. By induction 
assume $r_{k-1}, r_k \in (a, b)$. Then by the (k+1)st equation 

$r_{k+1} = r_{k-1} - q_{k+1} r_k \in (r_{k-1}, r_k) \subseteq (a, b).$

This induction shows $r_n \in (a, b)$, which completes the proof. 

Much of the material above may be familiar from elementary arithmetic in the case 
of the integers Z, except possibly for the translation into the language of ideals. For 
example, if a = 2210 and b = 1131 then the smallest ideal of Z that contains both a 
and b (the ideal generated by a and b) is 13Z, since 13 is the greatest common divisor 
of 2210 and 1131. This fact follows quickly from the Euclidean Algorithm: 

2210 = 1 · 1131 + 1079 
1131 = 1 · 1079 + 52 
1079 = 20 · 52 + 39 
52 = 1 · 39 + 13 
39 = 3 · 13 

so that 13 = (2210, 1131) is the last nonzero remainder. Using the procedure of 
Theorem 4 we can also write 13 as a linear combination of 2210 and 1131 by first 
solving the next to last equation above for 13 = 52 - 1 · 39, then using previous 
equations to solve for 39 and 52, etc., finally writing 13 entirely in terms of 2210 and 
1131. The answer in this case is 

13 = (-22) · 2210 + 43 · 1131. 
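
The back-substitution just described can be folded into the Euclidean Algorithm itself by carrying the coefficients of a and b along with each remainder. The sketch below shows this standard "extended" variant in Python; the function name `extended_gcd` is an illustrative choice, and the text itself describes the equivalent procedure of solving the equations backwards by hand.

```python
def extended_gcd(a, b):
    """Return (d, x, y) with d = a*x + b*y a greatest common divisor of a and b.

    Invariant: at every step old_r = a*old_x + b*old_y and r = a*x + b*y,
    so when the remainder r reaches 0 the last nonzero remainder old_r is
    already expressed as a linear combination of a and b.
    """
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y
```

Applied to the example above, `extended_gcd(2210, 1131)` returns `(13, -22, 43)`, matching the linear combination in the text.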


The Euclidean Algorithm in the integers Z is extremely fast. It is a theorem that 
the number of steps required to determine the greatest common divisor of two integers 
a and b is at worst 5 times the number of digits of the smaller of the two numbers. 
Put another way, this algorithm is logarithmic in the size of the integers. To obtain an 
appreciation of the speed implied here, notice that for the example above we would 
have expected at worst 5 · 4 = 20 divisions (the example required far fewer). If we had 
started with integers on the order of $10^{100}$ (large numbers by physical standards), we 
would have expected at worst only 500 divisions. 
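
The bound on the number of divisions is easy to observe experimentally. The short Python sketch below counts the divisions performed and compares the count with 5 times the number of decimal digits of the smaller integer; the names `euclid_steps` and `digit_bound` are illustrative, and the inputs are assumed to satisfy a ≥ b > 0.

```python
def euclid_steps(a, b):
    """Return (gcd, number of divisions) for integers a >= b > 0."""
    steps = 0
    while b != 0:
        a, b = b, a % b
        steps += 1
    return a, steps

def digit_bound(b):
    """Five times the number of decimal digits of the positive integer b."""
    return 5 * len(str(b))
```

For the pair 2210, 1131 the count is 5 divisions against a bound of 20. Consecutive Fibonacci numbers such as 13 and 8 come closest to the bound for their size.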

There is no uniqueness statement for the integers x and y in (a, b) = ax + by. 
Indeed, x' = x + b and y' = y - a satisfy (a, b) = ax' + by'. This is essentially 
the only possibility: one can prove that if $x_0$ and $y_0$ are solutions to the equation 
ax + by = N, then any other solutions x and y to this equation are of the form 

$x = x_0 + m\,\dfrac{b}{(a, b)}$

$y = y_0 - m\,\dfrac{a}{(a, b)}$

for some integer m (positive or negative). 

This latter theorem (a proof of which is outlined in the exercises) provides a complete 
solution of the First Order Diophantine Equation ax + by = N provided we know 
there is at least one solution to this equation. But the equation ax + by = N is simply 
another way of stating that N is an element of the ideal generated by a and b. Since we 
know this ideal is just (d), the principal ideal generated by the greatest common divisor 
d of a and b, this is the same as saying $N \in (d)$, i.e., N is divisible by d. Hence, the 
equation ax + by = N is solvable in integers x and y if and only if N is divisible by 
the g.c.d. of a and b (and then the result quoted above gives a full set of solutions of 
this equation). 
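
The solvability criterion and the description of all solutions combine into a small solver for ax + by = N. The Python sketch below is an illustration under stated assumptions: `solve_linear_diophantine` and its return convention are hypothetical names introduced here, and the helper `extended_gcd` is the usual extended Euclidean Algorithm, repeated so that the block is self-contained.

```python
def extended_gcd(a, b):
    """Return (d, x, y) with d = a*x + b*y a greatest common divisor of a and b."""
    old_r, r, old_x, x, old_y, y = a, b, 1, 0, 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y

def solve_linear_diophantine(a, b, N):
    """Solve a*x + b*y = N in integers, for nonzero integers a and b.

    Returns None if N is not divisible by the g.c.d. d of a and b.
    Otherwise returns (x0, y0, dx, dy) with dx = b/d and dy = a/d, so that
    the solutions are exactly x = x0 + m*dx, y = y0 - m*dy for integers m.
    """
    d, x, y = extended_gcd(a, b)
    if N % d != 0:
        return None
    scale = N // d
    return x * scale, y * scale, b // d, a // d
```

For example, `solve_linear_diophantine(6, 10, 8)` yields a particular solution together with the steps 5 and 3, while `solve_linear_diophantine(6, 10, 7)` returns `None` because 7 is not divisible by 2.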


We end this section with another criterion that can sometimes be used to prove 
that a given integral domain is not a Euclidean Domain.¹ For any integral domain let 
$\tilde{R} = R^\times \cup \{0\}$ denote the collection of units of R together with 0. An element $u \in R - \tilde{R}$ 
is called a universal side divisor if for every $x \in R$ there is some $z \in \tilde{R}$ such that u 
divides x - z in R, i.e., there is a type of “division algorithm” for u: every x may be 
written x = qu + z where z is either zero or a unit. The existence of universal side 
divisors is a weakening of the Euclidean condition: 

¹The material here and in some of the following section follows the exposition by J.C. Wilson in 
A principal ideal ring that is not a Euclidean ring, Math. Mag., 46(1973), pp. 34-38, of ideas of Th. 
Motzkin, and use a simplification by Kenneth S. Williams in Note on non-Euclidean Principal Ideal 
Domains, Math. Mag., 48(1975), pp. 176-177. 

Proposition 5. Let R be an integral domain that is not a field. If R is a Euclidean 
Domain then there are universal side divisors in R. 

Proof: Suppose R is Euclidean with respect to some norm N and let u be an element 
of $R - \tilde{R}$ (which is nonempty since R is not a field) of minimal norm. For any $x \in R$, 
write x = qu + r where r is either 0 or N(r) < N(u). In either case the minimality of 
u implies $r \in \tilde{R}$. Hence u is a universal side divisor in R. 

Example 

We can use Proposition 5 to prove that the quadratic integer ring $R = \mathbb{Z}[(1 + \sqrt{-19})/2]$ is 
not a Euclidean Domain with respect to any norm by showing that R contains no universal 
side divisors (we shall see in the next section that all of the ideals in R are principal, 
so the technique in the examples following Proposition 1 does not apply to this ring). We 
have already determined that ±1 are the only units in R and so $\tilde{R} = \{0, \pm 1\}$. Suppose 
$u \in R$ is a universal side divisor and let $N(a + b(1 + \sqrt{-19})/2) = a^2 + ab + 5b^2$ 
denote the field norm on R as in Section 7.1. Note that if $a, b \in \mathbb{Z}$ and $b \neq 0$ then 
$a^2 + ab + 5b^2 = (a + b/2)^2 + \tfrac{19}{4} b^2 \ge 5$ and so the smallest nonzero values of N on R 
are 1 (for the units ±1) and 4 (for ±2). Taking x = 2 in the definition of a universal side 
divisor it follows that u must divide one of 2 - 0 or 2 ± 1 in R, i.e., u is a nonunit divisor 
of 2 or 3 in R. If $2 = \alpha\beta$ then $4 = N(\alpha)N(\beta)$ and by the remark above it follows that 
one of $\alpha$ or $\beta$ has norm 1, i.e., equals ±1. Hence the only divisors of 2 in R are {±1, ±2}. 
Similarly, the only divisors of 3 in R are {±1, ±3}, so the only possible values for u are 
±2 or ±3. Taking $x = (1 + \sqrt{-19})/2$ it is easy to check that none of x, x ± 1 are divisible 
by ±2 or ±3 in R, so none of these is a universal side divisor. 
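
The two numerical facts used in this example can be checked by a short computation. In the Python sketch below an element $a + b\omega$ of R, with $\omega = (1 + \sqrt{-19})/2$, is represented by the integer pair `(a, b)`; this representation and the names `norm` and `divisible_by_integer` are assumptions made for illustration. Since 1 and $\omega$ form a Z-basis of R, an element is divisible in R by an ordinary integer m exactly when m divides both coordinates.

```python
def norm(a, b):
    """Field norm of a + b*omega, omega = (1 + sqrt(-19))/2."""
    return a * a + a * b + 5 * b * b

def divisible_by_integer(elem, m):
    """True if the element (a, b) = a + b*omega is a multiple of the integer m in R."""
    a, b = elem
    return a % m == 0 and b % m == 0

# Nonzero norm values for small coordinates; values below 10 only arise
# from |a| <= 3 and |b| <= 1, so this box is large enough to list them all.
small_norms = sorted({norm(a, b) for a in range(-5, 6) for b in range(-5, 6)} - {0})

# x = omega, x + 1 and x - 1, none of which should be divisible by 2 or 3
candidates = [(0, 1), (1, 1), (-1, 1)]
no_side_divisor = all(not divisible_by_integer(e, m)
                      for e in candidates for m in (2, 3))
```

The list `small_norms` begins 1, 4, 5, 7, so the norms 2 and 3 do not occur, and `no_side_divisor` evaluates to `True`, in agreement with the argument above.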


EXERCISES 

1. For each of the following five pairs of integers a and b, determine their greatest common 
divisor d and write d as a linear combination ax + by of a and b. 

(a) a = 20, b = 13. 

(b) a = 69, b = 312. 

(c) a = 11391, b = 5673. 

(d) a = 507885, b = 60808. 

(e) a = 91442056588823, b = 779086434385541 (the Euclidean Algorithm requires 
only 7 steps for these integers). 

2. For each of the following pairs of integers a and n, show that a is relatively prime to n and 
determine the inverse of a mod n (cf. Section 3 of the Preliminaries chapter). 

(a) a = 13, n = 20. 

(b) a = 69, n = 89. 

(c) a = 1891, n = 3797. 

(d) a = 6003722857, n = 77695236973 (the Euclidean Algorithm requires only 3 steps 
for these integers). 

3. Let R be a Euclidean Domain. Let m be the minimum integer in the set of norms of nonzero 
elements of R. Prove that every nonzero element of R of norm m is a unit. Deduce that a 
nonzero element of norm zero (if such an element exists) is a unit. 

4. Let R be a Euclidean Domain. 

(a) Prove that if (a, b) = 1 and a divides bc, then a divides c. More generally, show that 
if a divides bc with nonzero a, b then $\dfrac{a}{(a, b)}$ divides c. 

(b) Consider the Diophantine Equation ax + by = N where a, b and N are integers and 
a, b are nonzero. Suppose $x_0, y_0$ is a solution: $ax_0 + by_0 = N$. Prove that the full set 
of solutions to this equation is given by 

$x = x_0 + m\,\dfrac{b}{(a, b)}, \qquad y = y_0 - m\,\dfrac{a}{(a, b)}$

as m ranges over the integers. [If x, y is a solution to ax + by = N, show that 
$a(x - x_0) = b(y_0 - y)$ and use (a).] 


5. Determine all integer solutions of the following equations: 

(a) 2x + 4y = 5 

(b) 17x + 29y = 31 

(c) 85x + 145y = 505. 


6. (The Postage Stamp Problem) Let a and b be two relatively prime positive integers. Prove 
that every sufficiently large positive integer N can be written as a linear combination 
ax + by of a and b where x and y are both nonnegative, i.e., there is an integer $N_0$ such that 
for all $N \ge N_0$ the equation ax + by = N can be solved with both x and y nonnegative 
integers. Prove in fact that the integer ab - a - b cannot be written as a positive linear 
combination of a and b but that every integer greater than ab - a - b is a positive linear 
combination of a and b (so every “postage” greater than ab - a - b can be obtained using 
only stamps in denominations a and b). 

7. Find a generator for the ideal (85, 1 + 13i) in Z[i], i.e., a greatest common divisor for 85 
and 1 + 13i, by the Euclidean Algorithm. Do the same for the ideal (47 - 13i, 53 + 56i). 


It is known (but not so easy to prove) that D = -1, -2, -3, -7, -11, -19, -43, -67, and 
-163 are the only negative values of D for which every ideal in $\mathcal{O}$ is principal (i.e., $\mathcal{O}$ is a P.I.D. 
in the terminology of the next section). The results of the next exercise determine precisely 
which quadratic integer rings with D < 0 are Euclidean. 

8. Let $F = \mathbb{Q}(\sqrt{D})$ be a quadratic field with associated quadratic integer ring O and field
norm N as in Section 7.1.

(a) Suppose D is -1, -2, -3, -7 or -11. Prove that O is a Euclidean Domain with
respect to N. [Modify the proof for Z[i] (D = -1) in the text. For D = -3, -7, -11
prove that every element of F differs from an element in O by an element whose norm
is at most $(1 + |D|)^2/(16|D|)$, which is less than 1 for these values of D. Plotting the
points of O in C may be helpful.]

(b) Suppose that D = -43, -67, or -163. Prove that O is not a Euclidean Domain with
respect to any norm. [Apply the same proof as for D = -19 in the text.]

9. Prove that the ring of integers O in the quadratic field $\mathbb{Q}(\sqrt{2})$ is a Euclidean Domain
with respect to the norm given by the absolute value of the field norm N in Section 7.1.

10. Prove that the quotient ring Z[i]/I is finite for any nonzero ideal I of Z[i]. [Use the fact
that $I = (\alpha)$ for some nonzero $\alpha$ and then use the Division Algorithm in this Euclidean
Domain to see that every coset of I is represented by an element of norm less than $N(\alpha)$.]

11. Let R be a commutative ring with 1 and let a and b be nonzero elements of R. A least
common multiple of a and b is an element e of R such that

(i) a | e and b | e, and

(ii) if a | e' and b | e' then e | e'.

(a) Prove that a least common multiple of a and b (if such exists) is a generator for the unique 
largest principal ideal contained in (a) n (b). 

(b) Deduce that any two nonzero elements in a Euclidean Domain have a least common 
multiple which is unique up to multiplication by a unit. 

(c) Prove that in a Euclidean Domain the least common multiple of a and b is $\frac{ab}{(a,b)}$, where
(a, b) is the greatest common divisor of a and b.

12. (A Public Key Code) Let N be a positive integer. Let M be an integer relatively prime to
N and let d be an integer relatively prime to $\varphi(N)$, where $\varphi$ denotes Euler’s $\varphi$-function.
Prove that if $M_1 \equiv M^d \pmod{N}$ then $M \equiv M_1^{d'} \pmod{N}$ where d' is the inverse of d
mod $\varphi(N)$: $dd' \equiv 1 \pmod{\varphi(N)}$.

Remark: This result is the basis for a standard Public Key Code. Suppose N = pq is the product
of two distinct large primes (each on the order of 100 digits, for example). If M is a message,
then $M_1 \equiv M^d \pmod{N}$ is a scrambled (encoded) version of M, which can be unscrambled
(decoded) by computing $M_1^{d'} \pmod{N}$ (these powers can be computed quite easily even for
large values of M and N by successive squarings). The values of N and d (but not p and q)
are made publicly known (hence the name) and then anyone with a message M can send their
encoded message $M^d \pmod{N}$. To decode the message it seems necessary to determine d',
which requires the determination of the value $\varphi(N) = \varphi(pq) = (p - 1)(q - 1)$ (no one has
as yet proved that there is no other decoding scheme, however). The success of this method
as a code rests on the necessity of determining the factorization of N into primes, for which
no sufficiently efficient algorithm exists (for example, the most naive method of checking all
factors up to $\sqrt{N}$ would here require on the order of $10^{100}$ computations, or approximately
$3 \times 10^{82}$ years even at 10 billion computations per second, and of course one can always
increase the size of p and q).
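The encoding and decoding steps of the Remark can be tried on a toy example. The sketch below is illustrative only: the primes 61 and 53 and the exponent 17 are arbitrary small choices (far too small for any real use), and the modular inverse is computed with Python's built-in pow, which accepts the exponent -1 in Python 3.8 and later.

```python
from math import gcd

p, q = 61, 53                # toy primes; real ones have on the order of 100 digits
N = p * q                    # public modulus
phi = (p - 1) * (q - 1)      # phi(N) = (p - 1)(q - 1), kept secret
d = 17                       # public exponent, relatively prime to phi(N)
assert gcd(d, phi) == 1
d_inv = pow(d, -1, phi)      # d' with d * d' = 1 (mod phi(N))


def encode(M):
    """M_1 = M^d (mod N), computed by successive squarings inside pow."""
    return pow(M, d, N)


def decode(M1):
    """Recover M = M_1^{d'} (mod N)."""
    return pow(M1, d_inv, N)


M = 65                       # a message relatively prime to N
M1 = encode(M)
print(M, M1, decode(M1))
```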


8.2 PRINCIPAL IDEAL DOMAINS (P.I.D.s)

Definition. A Principal Ideal Domain (P.I.D.) is an integral domain in which every 
ideal is principal. 

Proposition 1 proved that every Euclidean Domain is a Principal Ideal Domain 
so that every result about Principal Ideal Domains automatically holds for Euclidean 
Domains. 

Examples 

(1) As mentioned after Proposition 1, the integers Z are a P.I.D. We saw in Section 7.4 
that the polynomial ring Z[x] contains nonprincipal ideals, hence is not a P.I.D. 

(2) Example 2 following Proposition 1 showed that the quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$
is not a P.I.D., in fact the ideal $(3, 1 + \sqrt{-5})$ is a nonprincipal ideal. It is possible
for the product IJ of two nonprincipal ideals I and J to be principal, for example the
ideals $(3, 1 + \sqrt{-5})$ and $(3, 1 - \sqrt{-5})$ are both nonprincipal and their product is the
principal ideal generated by 3, i.e., $(3, 1 + \sqrt{-5})(3, 1 - \sqrt{-5}) = (3)$ (cf. Exercise 5
and the example preceding Proposition 12 below).

It is not true that every Principal Ideal Domain is a Euclidean Domain. We shall
prove below that the quadratic integer ring $\mathbb{Z}[(1 + \sqrt{-19})/2]$, which was shown not
to be a Euclidean Domain in the previous section, nevertheless is a P.I.D.

From an ideal-theoretic point of view Principal Ideal Domains are a natural class 
of rings to study beyond rings which are fields (where the ideals are just the trivial 
ones: (0) and (1)). Many of the properties enjoyed by Euclidean Domains are also 
satisfied by Principal Ideal Domains. A significant advantage of Euclidean Domains
over Principal Ideal Domains, however, is that although greatest common divisors exist
in both settings, in Euclidean Domains one has an algorithm for computing them. Thus
(as we shall see in Chapter 12 in particular) results which depend on the existence
of greatest common divisors may often be proved in the larger class of Principal Ideal
Domains although computation of examples (i.e., concrete applications of these results)
is more effectively carried out using a Euclidean Algorithm (if one is available).

We collect some facts about greatest common divisors proved in the preceding
section.


Proposition 6. Let R be a Principal Ideal Domain and let a and b be nonzero elements 
of R. Let d be a generator for the principal ideal generated by a and b. Then 

(1) d is a greatest common divisor of a and b 

(2) d can be written as an R-linear combination of a and b, i.e., there are elements
x and y in R with


d = ax + by 


(3) d is unique up to multiplication by a unit of R. 


Proof: This is just Propositions 2 and 3. 

Recall that maximal ideals are always prime ideals but the converse is not true in 
general. We observed in Section 7.4, however, that every nonzero prime ideal of Z is 
a maximal ideal. This useful fact is true in an arbitrary Principal Ideal Domain, as the 
following proposition shows. 


Proposition 7. Every nonzero prime ideal in a Principal Ideal Domain is a maximal 
ideal. 


Proof: Let (p) be a nonzero prime ideal in the Principal Ideal Domain R and let
I = (m) be any ideal containing (p). We must show that I = (p) or I = R. Now
$p \in (m)$ so p = rm for some $r \in R$. Since (p) is a prime ideal and $rm \in (p)$, either r
or m must lie in (p). If $m \in (p)$ then (p) = (m) = I. If, on the other hand, $r \in (p)$
write r = ps. In this case p = rm = psm, so sm = 1 (recall that R is an integral
domain) and m is a unit so I = R.

As we have already mentioned, if F is a field, then the polynomial ring F[x] is a
Euclidean Domain, hence also a Principal Ideal Domain (this will be proved in the next
chapter). The converse to this is also true. Intuitively, if I is an ideal in R (such as the
ideal (2) in Z) then the ideal (I, x) in R[x] (such as the ideal (2, x) in Z[x]) requires
one more generator than does I, hence in general is not principal.

Corollary 8. If R is any commutative ring such that the polynomial ring R[x] is a 
Principal Ideal Domain (or a Euclidean Domain), then R is necessarily a field. 


Proof: Assume R[x] is a Principal Ideal Domain. Since R is a subring of R[x] then
R must be an integral domain (recall that R[x] has an identity if and only if R does).
The ideal (x) is a nonzero prime ideal in R[x] because R[x]/(x) is isomorphic to the
integral domain R. By Proposition 7, (x) is a maximal ideal, hence the quotient R is a
field by Proposition 12 in Section 7.4.


The last result in this section will be used to prove that not every P.I.D. is a Euclidean 
Domain and relates the principal ideal property with another weakening of the Euclidean 
condition. 


Definition. Define N to be a Dedekind-Hasse norm if N is a positive norm and for
every nonzero $a, b \in R$ either a is an element of the ideal (b) or there is a nonzero
element in the ideal (a, b) of norm strictly smaller than the norm of b (i.e., either b
divides a in R or there exist $s, t \in R$ with $0 < N(sa - tb) < N(b)$).

Note that R is Euclidean with respect to a positive norm N if it is always possible
to satisfy the Dedekind-Hasse condition with s = 1, so this is indeed a weakening of
the Euclidean condition.


Proposition 9. The integral domain R is a P.I.D. if and only if R has a Dedekind-Hasse
norm.²

Proof: Let I be any nonzero ideal in R and let b be a nonzero element of I with N(b)
minimal. Suppose a is any nonzero element in I, so that the ideal (a, b) is contained
in I. Then the Dedekind-Hasse condition on N and the minimality of b implies that
$a \in (b)$, so I = (b) is principal. The converse will be proved in the next section
(Corollary 16).


² That a Dedekind-Hasse norm on R implies that R is a P.I.D. (and is equivalent when R is a ring
of algebraic integers) is the classical Criterion of Dedekind and Hasse, cf. Über eindeutige Zerlegung in
Primelemente oder in Primhauptideale in Integritätsbereichen, Jour. für die Reine und Angew. Math.,
159(1928), pp. 3-12. The observation that the converse holds generally is more recent and due to
John Greene, Principal Ideal Domains are almost Euclidean, Amer. Math. Monthly, 104(1997), pp.
154-156.

Example

Let $R = \mathbb{Z}[(1 + \sqrt{-19})/2]$ be the quadratic integer ring considered at the end of the previous
section. We show that the positive field norm $N(a + b(1 + \sqrt{-19})/2) = a^2 + ab + 5b^2$
defined on R is a Dedekind-Hasse norm, which by Proposition 9 and the results of the
previous section will prove that R is a P.I.D. but not a Euclidean Domain.

Suppose $\alpha, \beta$ are nonzero elements of R and $\alpha/\beta \notin R$. We must show that there are
elements $s, t \in R$ with $0 < N(s\alpha - t\beta) < N(\beta)$, which by the multiplicativity of the field
norm is equivalent to

$$0 < N\!\left(\frac{\alpha}{\beta}s - t\right) < 1. \qquad (*)$$


Write $\frac{\alpha}{\beta} = \frac{a + b\sqrt{-19}}{c} \in \mathbb{Q}[\sqrt{-19}]$ with integers a, b, c having no common divisor and
with c > 1 (since $\beta$ is assumed not to divide $\alpha$). Since a, b, c have no common divisor
there are integers x, y, z with ax + by + cz = 1. Write ay - 19bx = cq + r for some
quotient q and remainder r with $|r| \le c/2$ and let $s = y + x\sqrt{-19}$ and $t = q - z\sqrt{-19}$.
Then a quick computation shows that

$$0 < N\!\left(\frac{\alpha}{\beta}s - t\right) = \frac{(ay - 19bx - cq)^2 + 19(ax + by + cz)^2}{c^2} \le \frac{1}{4} + \frac{19}{c^2}$$

and so (*) is satisfied with this s and t provided $c \ge 5$ (for c = 5 note that $|r| \le 2$, so
the norm is at most 23/25).

Suppose that c = 2. Then one of a, b is even and the other is odd (otherwise $\alpha/\beta \in R$),
and then a quick check shows that s = 1 and $t = \frac{(a - 1) + b\sqrt{-19}}{2}$ are elements of R
satisfying (*).

Suppose that c = 3. The integer $a^2 + 19b^2$ is not divisible by 3 (modulo 3 this is
$a^2 + b^2$ which is easily seen to be 0 modulo 3 if and only if a and b are both 0 modulo 3;
but then a, b, c have a common factor). Write $a^2 + 19b^2 = 3q + r$ with r = 1 or 2. Then
again a quick check shows that $s = a - b\sqrt{-19}$, t = q are elements of R satisfying (*).

Finally, suppose that c = 4, so a and b are not both even. If one of a, b is even and the
other odd, then $a^2 + 19b^2$ is odd, so we can write $a^2 + 19b^2 = 4q + r$ for some $q, r \in \mathbb{Z}$
and 0 < r < 4. Then $s = a - b\sqrt{-19}$ and t = q satisfy (*). If a and b are both odd then
$a^2 + 19b^2 \equiv 1 + 3 \bmod 8$, so we can write $a^2 + 19b^2 = 8q + 4$ for some $q \in \mathbb{Z}$. Then
$s = \frac{a - b\sqrt{-19}}{2}$ and t = q are elements of R that satisfy (*).
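The case analysis above can be checked by machine for small values of a, b, c. The following Python sketch is an illustration under stated assumptions, not part of the text: an element $u + v\sqrt{-19}$ of $\mathbb{Q}(\sqrt{-19})$ is stored as a pair (u, v) of exact fractions, and the helper names (el, in_R, find_s_t, verify) are hypothetical. The function find_s_t simply follows the recipe of the Example for each value of c.

```python
from fractions import Fraction
from math import gcd


def el(u, v):
    """The element u + v*sqrt(-19), stored as a pair of exact fractions."""
    return (Fraction(u), Fraction(v))


def mul(p, q):
    return (p[0] * q[0] - 19 * p[1] * q[1], p[0] * q[1] + p[1] * q[0])


def norm(p):
    return p[0] ** 2 + 19 * p[1] ** 2


def in_R(p):
    """u + v*sqrt(-19) lies in R iff 2u and 2v are integers of the same parity."""
    m, n = 2 * p[0], 2 * p[1]
    return m.denominator == 1 and n.denominator == 1 and (m - n) % 2 == 0


def egcd(a, b):
    """Return (g, x, y) with a*x + b*y == g = gcd(a, b) >= 0."""
    x0, y0, x1, y1 = 1, 0, 0, 1
    while b:
        q = a // b
        a, b = b, a - q * b
        x0, x1 = x1, x0 - q * x1
        y0, y1 = y1, y0 - q * y1
    return (a, x0, y0) if a >= 0 else (-a, -x0, -y0)


def find_s_t(a, b, c):
    """Return (s, t) as chosen in the Example for alpha/beta = (a + b*sqrt(-19))/c."""
    m = a * a + 19 * b * b
    if c == 2:
        return el(1, 0), el(Fraction(a - 1, 2), Fraction(b, 2))
    if c == 3:
        return el(a, -b), el(m // 3, 0)
    if c == 4 and (a - b) % 2 == 1:           # one of a, b even, the other odd
        return el(a, -b), el(m // 4, 0)
    if c == 4:                                # a and b both odd
        return el(Fraction(a, 2), Fraction(-b, 2)), el(m // 8, 0)
    g, x1, y1 = egcd(a, b)                    # c >= 5: solve a*x + b*y + c*z = 1
    _, k, z = egcd(g, c)
    x, y = x1 * k, y1 * k
    n = a * y - 19 * b * x
    q = (2 * n + c) // (2 * c)                # nearest integer, so |n - c*q| <= c/2
    return el(y, x), el(q, -z)


def check(a, b, c):
    """True if s, t lie in R and 0 < N((alpha/beta)*s - t) < 1."""
    s, t = find_s_t(a, b, c)
    d = mul(el(Fraction(a, c), Fraction(b, c)), s)
    return in_R(s) and in_R(t) and 0 < norm((d[0] - t[0], d[1] - t[1])) < 1


def verify(bound, max_c):
    """Check every admissible (a, b, c) with |a|, |b| <= bound and 2 <= c <= max_c."""
    return all(check(a, b, c)
               for c in range(2, max_c + 1)
               for a in range(-bound, bound + 1)
               for b in range(-bound, bound + 1)
               if gcd(gcd(a, b), c) == 1
               and not in_R(el(Fraction(a, c), Fraction(b, c))))


print(verify(5, 9))
```

Such a check covers only finitely many cases and is no substitute for the argument in the text, but it is a quick way to catch an arithmetic slip in any one of the cases.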


EXERCISES 

1. Prove that in a Principal Ideal Domain two ideals (a) and (b) are comaximal (cf. Section 
7.6) if and only if a greatest common divisor of a and b is 1 (in which case a and b are 
said to be coprime or relatively prime). 

2. Prove that any two nonzero elements of a P.I.D. have a least common multiple (cf. Exercise
11, Section 1).

3. Prove that a quotient of a P.I.D. by a prime ideal is again a P.I.D. 

4. Let R be an integral domain. Prove that if the following two conditions hold then R is a 
Principal Ideal Domain: 

(i) any two nonzero elements a and b in R have a greatest common divisor which can be
written in the form ra + sb for some $r, s \in R$, and

(ii) if $a_1, a_2, a_3, \ldots$ are nonzero elements of R such that $a_{i+1} \mid a_i$ for all i, then there is
a positive integer N such that $a_n$ is a unit times $a_N$ for all $n \ge N$.

5. Let R be the quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$. Define the ideals $I_2 = (2, 1 + \sqrt{-5})$,
$I_3 = (3, 2 + \sqrt{-5})$, and $I_3' = (3, 2 - \sqrt{-5})$.

(a) Prove that $I_2$, $I_3$, and $I_3'$ are nonprincipal ideals in R. [Note that Example 2 following
Proposition 1 proves this for $I_3$.]

(b) Prove that the product of two nonprincipal ideals can be principal by showing that $I_2^2$
is the principal ideal generated by 2, i.e., $I_2^2 = (2)$.

(c) Prove similarly that $I_2 I_3 = (1 - \sqrt{-5})$ and $I_2 I_3' = (1 + \sqrt{-5})$ are principal. Conclude
that the principal ideal (6) is the product of 4 ideals: $(6) = I_2^2 I_3 I_3'$.

6. Let R be an integral domain and suppose that every prime ideal in R is principal. This
exercise proves that every ideal of R is principal, i.e., R is a P.I.D.

(a) Assume that the set of ideals of R that are not principal is nonempty and prove that
this set has a maximal element under inclusion (which, by hypothesis, is not prime).
[Use Zorn's Lemma.]

(b) Let I be an ideal which is maximal with respect to being nonprincipal, and let $a, b \in R$
with $ab \in I$ but $a \notin I$ and $b \notin I$. Let $I_a = (I, a)$ be the ideal generated by I and a,
let $I_b = (I, b)$ be the ideal generated by I and b, and define $J = \{r \in R \mid rI_a \subseteq I\}$.
Prove that $I_a = (\alpha)$ and $J = (\beta)$ are principal ideals in R with $I \subsetneq I_b \subseteq J$ and
$I_a J = (\alpha\beta) \subseteq I$.

(c) If $x \in I$ show that $x = s\alpha$ for some $s \in J$. Deduce that $I = I_a J$ is principal, a
contradiction, and conclude that R is a P.I.D.

7. An integral domain R in which every ideal generated by two elements is principal (i.e.,
for every $a, b \in R$, (a, b) = (d) for some $d \in R$) is called a Bezout Domain. [cf. also
Exercise 11 in Section 3.]

(a) Prove that the integral domain R is a Bezout Domain if and only if every pair of
elements a, b of R has a g.c.d. d in R that can be written as an R-linear combination
of a and b, i.e., d = ax + by for some $x, y \in R$.

(b) Prove that every finitely generated ideal of a Bezout Domain is principal. [cf. the
exercises in Sections 9.2 and 9.3 for Bezout Domains in which not every ideal is
principal.]

(c) Let F be the fraction field of the Bezout Domain R. Prove that every element of F can
be written in the form a/b with $a, b \in R$ and a and b relatively prime (cf. Exercise 1).

8. Prove that if R is a Principal Ideal Domain and D is a multiplicatively closed subset of R,
then $D^{-1}R$ is also a P.I.D. (cf. Section 7.5).


8.3 UNIQUE FACTORIZATION DOMAINS (U.F.D.s)

In the case of the integers Z, there is another method for determining the greatest
common divisor of two elements a and b familiar from elementary arithmetic, namely
the notion of “factorization into primes” for a and b, from which the greatest common
divisor can easily be determined. This can also be extended to a larger class of rings
called Unique Factorization Domains (U.F.D.s); these will be defined shortly. We
shall then prove that

every Principal Ideal Domain is a Unique Factorization Domain

so that every result about Unique Factorization Domains will automatically hold for
both Euclidean Domains and Principal Ideal Domains.

We first introduce some terminology. 

Definition. Let R be an integral domain. 

(1) Suppose $r \in R$ is nonzero and is not a unit. Then r is called irreducible in R
if whenever r = ab with $a, b \in R$, at least one of a or b must be a unit in R.
Otherwise r is said to be reducible.

(2) The nonzero element $p \in R$ is called prime in R if the ideal (p) generated by
p is a prime ideal. In other words, a nonzero element p is a prime if it is not a
unit and whenever $p \mid ab$ for any $a, b \in R$, then either $p \mid a$ or $p \mid b$.

(3) Two elements a and b of R differing by a unit are said to be associate in R (i.e.,
a = ub for some unit u in R).

Proposition 10. In an integral domain a prime element is always irreducible. 

Proof: Suppose (p) is a nonzero prime ideal and p = ab. Then $ab = p \in (p)$, so
by definition of prime ideal one of a or b, say a, is in (p). Thus a = pr for some r.
This implies p = ab = prb so rb = 1 and b is a unit. This shows that p is irreducible.

It is not true in general that an irreducible element is necessarily prime. For
example, consider the element 3 in the quadratic integer ring $R = \mathbb{Z}[\sqrt{-5}]$. The
computations in Section 1 show that 3 is irreducible in R, but 3 is not a prime since
$(2 + \sqrt{-5})(2 - \sqrt{-5}) = 3^2$ is divisible by 3, but neither $2 + \sqrt{-5}$ nor $2 - \sqrt{-5}$ is
divisible by 3 in R.
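The arithmetic behind this example can be verified directly. In the following Python sketch (illustrative names, with $a + b\sqrt{-5}$ stored as the pair (a, b)), the irreducibility of 3 is reduced to the observation that a proper factor of 3 would have norm 3, and the equation $a^2 + 5b^2 = 3$ has no integer solutions.

```python
def norm(a, b):
    """Field norm of a + b*sqrt(-5)."""
    return a * a + 5 * b * b


def mul(x, y):
    (a, b), (c, d) = x, y
    return (a * c - 5 * b * d, a * d + b * c)


def divides(x, y):
    """True if x divides y in Z[sqrt(-5)] (x nonzero)."""
    (a, b), (c, d) = x, y
    n = norm(a, b)
    # y / x = y * conj(x) / N(x) = ((c*a + 5*d*b) + (d*a - c*b) sqrt(-5)) / n
    re, im = c * a + 5 * d * b, d * a - c * b
    return re % n == 0 and im % n == 0


# N(3) = 9, so a factorization of 3 into two nonunits needs an element of norm 3.
# Any such element has |a| <= 1 and b = 0, so the small search below is exhaustive.
norm_three = [(a, b) for a in range(-2, 3) for b in range(-1, 2) if norm(a, b) == 3]
print(norm_three)                      # an empty list means 3 is irreducible

print(mul((2, 1), (2, -1)))            # the product (2 + sqrt(-5))(2 - sqrt(-5))
print(divides((3, 0), (2, 1)), divides((3, 0), (2, -1)))
```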

If R is a Principal Ideal Domain however, the notions of prime and irreducible 
elements are the same. In particular these notions coincide in Z and in F[x] (where F 
is a field). 

Proposition 11. In a Principal Ideal Domain a nonzero element is a prime if and only 
if it is irreducible. 

Proof: We have shown above that prime implies irreducible. We must show conversely
that if p is irreducible, then p is a prime, i.e., the ideal (p) is a prime ideal. If
M is any ideal containing (p) then by hypothesis M = (m) is a principal ideal. Since
$p \in (m)$, p = rm for some r. But p is irreducible so by definition either r or m is a
unit. This means either (p) = (m) or (m) = (1), respectively. Thus the only ideals
containing (p) are (p) or (1), i.e., (p) is a maximal ideal. Since maximal ideals are
prime ideals, the proof is complete.

Example 

Proposition 11 gives another proof that the quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$ is not a P.I.D.
since 3 is irreducible but not prime in this ring.

The irreducible elements in the integers Z are the prime numbers (and their negatives)
familiar from elementary arithmetic, and two integers a and b are associates of
one another if and only if a = ±b.

In the integers Z any integer n can be written as a product of primes (not necessarily
distinct), as follows. If n is not itself a prime then by definition it is possible to write
$n = n_1 n_2$ for two other integers $n_1$ and $n_2$ neither of which is a unit, i.e., neither of
which is ±1. Both $n_1$ and $n_2$ must be smaller in absolute value than n itself. If they are
both primes, we have already written n as a product of primes. If one of $n_1$ or $n_2$ is not
prime, then it in turn can be factored into two (smaller) integers. Since integers cannot
decrease in absolute value indefinitely, we must at some point be left only with prime
integer factors, and so we have written n as a product of primes.

For example, if n = 2210, the algorithm above proceeds as follows: n is not
itself prime, since we can write n = 2 · 1105. The integer 2 is a prime, but 1105 is not:
1105 = 5 · 221. The integer 5 is prime, but 221 is not: 221 = 13 · 17. Here the algorithm
terminates, since both 13 and 17 are primes. This gives the prime factorization of 2210
as 2210 = 2 · 5 · 13 · 17. Similarly, we find 1131 = 3 · 13 · 29. In these examples each
prime occurs only to the first power, but of course this need not be the case generally.
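The factoring procedure just described can be sketched in a few lines of Python. This is a minimal illustration using trial division (always splitting off the smallest prime factor first), not an efficient method; the function name is illustrative.

```python
def prime_factorization(n):
    """Return the prime factors of n > 1 in nondecreasing order, with repetition."""
    factors = []
    p = 2
    while p * p <= n:
        while n % p == 0:      # p is the smallest factor of n, hence prime
            factors.append(p)
            n //= p
        p += 1
    if n > 1:                  # whatever remains has no factor up to its square root
        factors.append(n)
    return factors


print(prime_factorization(2210))
print(prime_factorization(1131))
```

The loop terminates for the same reason given in the text: each division replaces n by a strictly smaller positive integer.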

In the ring Z not only is it true that every integer n can be written as a product of
primes, but in fact this decomposition is unique in the sense that any two prime factorizations
of the same positive integer n differ only in the order in which the positive
prime factors are written. The restriction to positive integers is to avoid considering
the factorizations (3)(5) and (-3)(-5) of 15 as essentially distinct. This unique factorization
property of Z (which we shall prove very shortly) is extremely useful for the
arithmetic of the integers. General rings with the analogous property are given a name.

Definition. A Unique Factorization Domain (U.F.D.) is an integral domain R in which
every nonzero element $r \in R$ which is not a unit has the following two properties:

(i) r can be written as a finite product of irreducibles $p_i$ of R (not necessarily
distinct): $r = p_1 p_2 \cdots p_n$ and

(ii) the decomposition in (i) is unique up to associates: namely, if $r = q_1 q_2 \cdots q_m$
is another factorization of r into irreducibles, then m = n and there is some
renumbering of the factors so that $p_i$ is associate to $q_i$ for i = 1, 2, ..., n.

Examples

(1) A field F is trivially a Unique Factorization Domain since every nonzero element is a 
unit, so there are no elements for which properties (i) and (ii) must be verified. 

(2) As indicated above, we shall prove shortly that every Principal Ideal Domain is a 
Unique Factorization Domain (so, in particular, Z and F[x] where F is a field are both 
Unique Factorization Domains). 

(3) We shall also prove in the next chapter that the ring R[x] of polynomials is a Unique
Factorization Domain whenever R itself is a Unique Factorization Domain (in contrast
to the properties of being a Principal Ideal Domain or being a Euclidean Domain, which
do not carry over from a ring R to the polynomial ring R[x]). This result together with
the preceding example will show that Z[x] is a Unique Factorization Domain.

(4) The subring of the Gaussian integers $R = \mathbb{Z}[2i] = \{a + 2bi \mid a, b \in \mathbb{Z}\}$, where
$i^2 = -1$, is an integral domain but not a Unique Factorization Domain (rings of this
nature were introduced in Exercise 23 of Section 7.1). The elements 2 and 2i are
irreducibles which are not associates in R since $i \notin R$, and $4 = 2 \cdot 2 = (-2i) \cdot (2i)$
has two distinct factorizations in R. One may also check directly that 2i is irreducible
but not prime in R (since $R/(2i) \cong \mathbb{Z}/4\mathbb{Z}$). In the larger ring of Gaussian integers,
Z[i], (which is a Unique Factorization Domain) 2 and 2i are associates since i is a unit
in this larger ring. We shall give a slightly different proof that Z[2i] is not a Unique
Factorization Domain at the end of Section 9.3 (one in which we do not have to check
that 2 and 2i are irreducibles).

(5) The quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$ is another example of an integral domain that is
not a Unique Factorization Domain, since $6 = 2 \cdot 3 = (1 + \sqrt{-5})(1 - \sqrt{-5})$ gives
two distinct factorizations of 6 into irreducibles. The principal ideal (6) in $\mathbb{Z}[\sqrt{-5}]$
can be written as a product of 4 nonprincipal prime ideals: $(6) = P_2^2 P_3 P_3'$ and the
two distinct factorizations of the element 6 in $\mathbb{Z}[\sqrt{-5}]$ can be interpreted as arising
from two rearrangements of this product of ideals into products of principal ideals:
the product of $P_2^2 = (2)$ with $P_3 P_3' = (3)$, and the product of $P_2 P_3 = (1 + \sqrt{-5})$
with $P_2 P_3' = (1 - \sqrt{-5})$ (cf. Exercise 8).

While the elements of the quadratic integer ring O need not have unique factorization,
it is a theorem (Corollary 16.16) that every ideal in O can be written uniquely
as a product of prime ideals. The unique factorization of ideals into the product of
prime ideals holds in general for rings of integers of algebraic number fields (examples
of which are the quadratic integer rings) and leads to the notion of a Dedekind
Domain considered in Chapter 16. It was the failure to have unique factorization into
irreducibles for elements in algebraic integer rings in number theory that originally
led to the definition of an ideal. The resulting uniqueness of the decomposition into
prime ideals in these rings gave the elements of the ideals an “ideal” (in the sense of
“perfect” or “desirable”) behavior that is the basis for the choice of terminology for
these (now fundamental) algebraic objects.

The first property of irreducible elements in a Unique Factorization Domain is
that they are also primes. One might think that we could deduce Proposition 11 from
this proposition together with the previously mentioned theorem (that we shall prove
shortly) that every Principal Ideal Domain is a Unique Factorization Domain; however,
Proposition 11 will be used in the proof of the latter theorem.

Proposition 12. In a Unique Factorization Domain a nonzero element is a prime if and 
only if it is irreducible. 

Proof: Let R be a Unique Factorization Domain. Since by Proposition 10, primes
of R are irreducible it remains to prove that each irreducible element is a prime. Let
p be an irreducible in R and assume $p \mid ab$ for some $a, b \in R$; we must show that
p divides either a or b. To say that p divides ab is to say ab = pc for some c in R.
Writing a and b as a product of irreducibles, we see from this last equation and from the
uniqueness of the decomposition into irreducibles of ab that the irreducible element p
must be associate to one of the irreducibles occurring either in the factorization of a or
in the factorization of b. We may assume that p is associate to one of the irreducibles
in the factorization of a, i.e., that a can be written as a product $a = (up)p_2 \cdots p_n$ for
u a unit and some (possibly empty set of) irreducibles $p_2, \ldots, p_n$. But then p divides
a, since a = pd with $d = up_2 \cdots p_n$, completing the proof.

In a Unique Factorization Domain we shall now use the terms “prime” and “irreducible”
interchangeably although we shall usually refer to the “primes” in Z and the
“irreducibles” in F[x].

We shall use the preceding proposition to show that in a Unique Factorization 
Domain any two nonzero elements a and b have a greatest common divisor: 

Proposition 13. Let a and b be two nonzero elements of the Unique Factorization
Domain R and suppose

$$a = u\,p_1^{e_1} p_2^{e_2} \cdots p_n^{e_n} \quad\text{and}\quad b = v\,p_1^{f_1} p_2^{f_2} \cdots p_n^{f_n}$$

are prime factorizations for a and b, where u and v are units, the primes $p_1, p_2, \ldots, p_n$
are distinct and the exponents $e_i$ and $f_i$ are $\ge 0$. Then the element

$$d = p_1^{\min(e_1, f_1)} p_2^{\min(e_2, f_2)} \cdots p_n^{\min(e_n, f_n)}$$

(where d = 1 if all the exponents are 0) is a greatest common divisor of a and b.

Proof: Since the exponents of each of the primes occurring in d are no larger than
the exponents occurring in the factorizations of both a and b, d divides both a and
b. To show that d is a greatest common divisor, let c be any common divisor of a
and b and let $c = q_1^{g_1} q_2^{g_2} \cdots q_m^{g_m}$ be the prime factorization of c. Since each $q_i$
divides c, hence divides a and b, we see from the preceding proposition that $q_i$ must
divide one of the primes $p_j$. In particular, up to associates (so up to multiplication
by a unit) the primes occurring in c must be a subset of the primes occurring in a
and b: $\{q_1, q_2, \ldots, q_m\} \subseteq \{p_1, p_2, \ldots, p_n\}$. Similarly, the exponents for the primes
occurring in c must be no larger than those occurring in d. This implies that c divides
d, completing the proof.

Example 

In the example above, where a = 2210 and b = 1131, we find immediately from their 
prime factorizations that (a, b) = 13. Note that if the prime factorizations for a and b are 
known, the proposition above gives their greatest common divisor instantly, but that finding 
these prime factorizations is extremely time-consuming computationally. The Euclidean 
Algorithm is the fastest method for determining the g.c.d. of two integers but unfortunately
it gives almost no information on the prime factorizations of the integers. 
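The recipe of Proposition 13 (take each prime to the minimum of its two exponents) can be compared with the Euclidean Algorithm on this example. The sketch below is illustrative Python for positive integers greater than 1: the factorization routine is plain trial division, and math.gcd serves as the Euclidean reference.

```python
from collections import Counter
from math import gcd, prod


def prime_factorization(n):
    """Prime factors of n > 1 with repetition, by trial division."""
    factors, p = [], 2
    while p * p <= n:
        while n % p == 0:
            factors.append(p)
            n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors


def gcd_from_factorizations(a, b):
    """g.c.d. of a, b > 1 as the product of p^min(e_p, f_p) over the primes p of a."""
    e, f = Counter(prime_factorization(a)), Counter(prime_factorization(b))
    return prod(p ** min(e[p], f[p]) for p in e)   # a missing prime counts as exponent 0


print(gcd_from_factorizations(2210, 1131), gcd(2210, 1131))
```

The two answers agree, but the first route needs the full factorizations of a and b while the second needs only a few divisions with remainder.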

We now come to one of the principal results relating some of the rings introduced 
in this chapter. 

Theorem 14. Every Principal Ideal Domain is a Unique Factorization Domain. In
particular, every Euclidean Domain is a Unique Factorization Domain.

Proof: Note that the second assertion follows from the first since Euclidean Domains
are Principal Ideal Domains. To prove the first assertion let R be a Principal
Ideal Domain and let r be a nonzero element of R which is not a unit. We must show
first that r can be written as a finite product of irreducible elements of R and then we
must verify that this decomposition is unique up to units.

The method of proof of the first part is precisely analogous to the determination
of the prime factor decomposition of an integer. Assume r is nonzero and is not a
unit. If r is itself irreducible, then we are done. If not, then by definition r can be
written as a product $r = r_1 r_2$ where neither $r_1$ nor $r_2$ is a unit. If both these elements
are irreducibles, then again we are done, having written r as a product of irreducible
elements. Otherwise, at least one of the two elements, say $r_1$, is reducible, hence can be
written as a product of two nonunit elements $r_1 = r_{11} r_{12}$, and so forth. What we must
verify is that this process terminates, i.e., that we must necessarily reach a point where
all of the elements obtained as factors of r are irreducible. Suppose this is not the case.
From the factorization $r = r_1 r_2$ we obtain a proper inclusion of ideals: $(r) \subset (r_1) \subset R$.
The first inclusion is proper since $r_2$ is not a unit, and the last inclusion is proper since $r_1$
is not a unit. From the factorization of $r_1$ we similarly obtain $(r) \subset (r_1) \subset (r_{11}) \subset R$.
If this process of factorization did not terminate after a finite number of steps, then we
would obtain an infinite ascending chain of ideals:

$$(r) \subset (r_1) \subset (r_{11}) \subset \cdots \subset R$$

where all containments are proper, and the Axiom of Choice ensures that an infinite
chain exists (cf. Appendix I).

We now show that any ascending chain $I_1 \subseteq I_2 \subseteq \cdots \subseteq R$ of ideals in a Principal
Ideal Domain eventually becomes stationary, i.e., there is some positive integer n such
that $I_k = I_n$ for all $k \ge n$.³ In particular, it is not possible to have an infinite ascending
chain of ideals where all containments are proper. Let $I = \bigcup_{i=1}^{\infty} I_i$. It follows easily (as
in the proof of Proposition 11 in Section 7.4) that I is an ideal. Since R is a Principal
Ideal Domain it is principally generated, say I = (a). Since I is the union of the ideals
above, a must be an element of one of the ideals in the chain, say $a \in I_n$. But then we
have $I_n \subseteq I = (a) \subseteq I_n$ and so $I = I_n$ and the chain becomes stationary at $I_n$. This
proves that every nonzero element of R which is not a unit has some factorization into
irreducibles in R.

It remains to prove that the above decomposition is essentially unique. We proceed
by induction on the number, n, of irreducible factors in some factorization of the element
r. If n = 0, then r is a unit. If we had r = qc (some other factorization) for some
irreducible q, then q would divide a unit, hence would itself be a unit, a contradiction.
Suppose now that n is at least 1 and that we have two products

$$r = p_1 p_2 \cdots p_n = q_1 q_2 \cdots q_m, \qquad m \ge n$$

for r where the $p_i$ and $q_j$ are (not necessarily distinct) irreducibles. Since then $p_1$
divides the product on the right, we see by Proposition 11 that $p_1$ must divide one of the
factors. Renumbering if necessary, we may assume $p_1$ divides $q_1$. But then $q_1 = p_1 u$
for some element u of R which must in fact be a unit since $q_1$ is irreducible. Thus $p_1$
and $q_1$ are associates. Cancelling $p_1$ (recall we are in an integral domain, so this is
legitimate), we obtain the equation

$$p_2 \cdots p_n = u q_2 q_3 \cdots q_m = q_2' q_3 \cdots q_m, \qquad m \ge n$$

where $q_2' = u q_2$ is again an irreducible (associate to $q_2$). By induction on n, we
conclude that each of the factors on the left matches bijectively (up to associates) with
the factors on the far right, hence with the factors in the middle (which are the same, up
to associates). Since $p_1$ and $q_1$ (after the initial renumbering) have already been shown
to be associate, this completes the induction step and the proof of the theorem.

³ This same argument can be used to prove the more general statement: an ascending chain of ideals
becomes stationary in any commutative ring where all the ideals are finitely generated. This result will
be needed in Chapter 12 where the details will be repeated.

Corollary 15. (Fundamental Theorem of Arithmetic) The integers $\mathbb{Z}$ are a Unique
Factorization Domain.

Proof: The integers $\mathbb{Z}$ are a Euclidean Domain, hence are a Unique Factorization
Domain by the theorem.

We can now complete the equivalence (Proposition 9) between the existence of a
Dedekind-Hasse norm on the integral domain $R$ and whether $R$ is a P.I.D.

Corollary 16. Let R be a P.I.D. Then there exists a multiplicative Dedekind-Hasse 
norm on R. 

Proof: If $R$ is a P.I.D. then $R$ is a U.F.D. Define the norm $N$ by setting $N(0) = 0$,
$N(u) = 1$ if $u$ is a unit, and $N(a) = 2^n$ if $a = p_1 p_2 \cdots p_n$ where the $p_i$ are
irreducibles in $R$ (well defined since the number of irreducible factors of $a$ is unique).
Clearly $N(ab) = N(a)N(b)$ so $N$ is positive and multiplicative. To show that $N$ is a
Dedekind-Hasse norm, suppose that $a, b$ are nonzero elements of $R$. Then the ideal
generated by $a$ and $b$ is principal by assumption, say $(a, b) = (r)$. If $a$ is not contained in
the ideal $(b)$ then also $r$ is not contained in $(b)$, i.e., $r$ is not divisible by $b$. Since $b = xr$
for some $x \in R$, it follows that $x$ is not a unit in $R$ and so $N(b) = N(x)N(r) > N(r)$.
Hence $(a, b)$ contains a nonzero element with norm strictly smaller than the norm of $b$,
completing the proof.
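
As an illustration only, the norm constructed in this proof can be made concrete in the special case $R = \mathbb{Z}$. The sketch below is our own (the function names `num_prime_factors`, `dh_norm` and `check_dedekind_hasse` are hypothetical, and trial division is chosen purely for simplicity); it checks the Dedekind-Hasse condition on a small range by using $r = \gcd(a, b)$ as the generator of $(a, b)$.

```python
from math import gcd

def num_prime_factors(a: int) -> int:
    """Number of prime factors of a nonzero integer a, counted with multiplicity."""
    a = abs(a)
    count, d = 0, 2
    while d * d <= a:
        while a % d == 0:
            a //= d
            count += 1
        d += 1
    if a > 1:          # whatever is left over is a single prime factor
        count += 1
    return count

def dh_norm(a: int) -> int:
    """N(0) = 0, N(unit) = 1, and N(p_1 ... p_n) = 2**n, specialized to Z."""
    return 0 if a == 0 else 2 ** num_prime_factors(a)

def check_dedekind_hasse(bound: int) -> bool:
    """For 0 < a, b < bound with b not dividing a, check 0 < N(gcd(a, b)) < N(b)."""
    for a in range(1, bound):
        for b in range(1, bound):
            if a % b == 0:
                continue            # a lies in (b): nothing to check
            r = gcd(a, b)           # (a, b) = (r) in Z
            if not (0 < dh_norm(r) < dh_norm(b)):
                return False
    return True
```

For instance, `dh_norm(12)` is $2^3 = 8$ since $12 = 2 \cdot 2 \cdot 3$, and `check_dedekind_hasse(60)` confirms the inequality $N(r) < N(b)$ for all pairs in that range.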

Factorization in the Gaussian Integers 

We end our discussion of Unique Factorization Domains by describing the irreducible
elements in the Gaussian integers $\mathbb{Z}[i]$ and the corresponding application to a famous
theorem of Fermat in elementary number theory. This is particularly appropriate since
the classical study of $\mathbb{Z}[i]$ initiated the algebraic study of rings.

In general, let $\mathcal{O}$ be a quadratic integer ring and let $N$ be the associated field norm
introduced in Section 7.1. Suppose $\alpha \in \mathcal{O}$ is an element whose norm is a prime $p$ in
$\mathbb{Z}$. If $\alpha = \beta\gamma$ for some $\beta, \gamma \in \mathcal{O}$ then $p = N(\alpha) = N(\beta)N(\gamma)$ so that one of $N(\beta)$
or $N(\gamma)$ is $\pm 1$ and the other is $\pm p$. Since we have seen that an element of $\mathcal{O}$ has norm
$\pm 1$ if and only if it is a unit in $\mathcal{O}$, one of the factors of $\alpha$ is a unit. It follows that

if $N(\alpha)$ is $\pm$ a prime (in $\mathbb{Z}$), then $\alpha$ is irreducible in $\mathcal{O}$.

Suppose that $\pi$ is a prime element in $\mathcal{O}$ and let $(\pi)$ be the ideal generated by $\pi$ in
$\mathcal{O}$. Since $(\pi)$ is a prime ideal in $\mathcal{O}$ it is easy to check that $(\pi) \cap \mathbb{Z}$ is a prime ideal in
$\mathbb{Z}$ (if $a$ and $b$ are integers with $ab \in (\pi)$ then either $a$ or $b$ is an element of $(\pi)$, so $a$
or $b$ is in $(\pi) \cap \mathbb{Z}$). Since $N(\pi)$ is a nonzero integer in $(\pi)$ we have $(\pi) \cap \mathbb{Z} = p\mathbb{Z}$
for some integer prime $p$. It follows from $p \in (\pi)$ that $\pi$ is a divisor in $\mathcal{O}$ of the
integer prime $p$, and so the prime elements in $\mathcal{O}$ can be found by determining how
the primes in $\mathbb{Z}$ factor in the larger ring $\mathcal{O}$. Suppose $\pi$ divides the prime $p$ in $\mathcal{O}$, say
$p = \pi\pi'$. Then $N(\pi)N(\pi') = N(p) = p^2$, so since $\pi$ is not a unit there are only two
possibilities: either $N(\pi) = \pm p^2$ or $N(\pi) = \pm p$. In the former case $N(\pi') = \pm 1$,
hence $\pi'$ is a unit and $p = \pi$ (up to associates) is irreducible in $\mathcal{O}$. In the latter case
$N(\pi) = N(\pi') = \pm p$, hence $\pi'$ is also irreducible and $p = \pi\pi'$ is the product of
precisely two irreducibles.

Consider now the special case $D = -1$ of the Gaussian integers $\mathbb{Z}[i]$. We have seen
that the units in $\mathbb{Z}[i]$ are the elements $\pm 1$ and $\pm i$. We proved in Section 1 that $\mathbb{Z}[i]$ is a
Euclidean Domain, hence is also a Principal Ideal Domain and a Unique Factorization
Domain, so the irreducible elements are the same as the prime elements, and can be
determined by seeing how the primes in $\mathbb{Z}$ factor in the larger ring $\mathbb{Z}[i]$.

In this case $\alpha = a + bi$ has $N(\alpha) = \alpha\bar{\alpha} = a^2 + b^2$, where $\bar{\alpha} = a - bi$ is the complex
conjugate of $\alpha$. It follows by what we just saw that $p$ factors in $\mathbb{Z}[i]$ into precisely two
irreducibles if and only if $p = a^2 + b^2$ is the sum of two integer squares (otherwise
$p$ remains irreducible in $\mathbb{Z}[i]$). If $p = a^2 + b^2$ then the corresponding irreducible
elements in $\mathbb{Z}[i]$ are $a \pm bi$.

Clearly $2 = 1^2 + 1^2$ is the sum of two squares, giving the factorization $2 =
(1 + i)(1 - i) = -i(1 + i)^2$. The irreducibles $1 + i$ and $1 - i = -i(1 + i)$ are associates
and it is easy to check that this is the only situation in which conjugate irreducibles
$a + bi$ and $a - bi$ can be associates.

Since the square of any integer is congruent to either 0 or 1 modulo 4, an odd prime
in $\mathbb{Z}$ that is the sum of two squares must be congruent to 1 modulo 4. Thus if $p$ is
a prime of $\mathbb{Z}$ with $p \equiv 3 \bmod 4$ then $p$ is not the sum of two squares and $p$ remains
irreducible in $\mathbb{Z}[i]$.

Suppose now that $p$ is a prime of $\mathbb{Z}$ with $p \equiv 1 \bmod 4$. We shall prove that $p$ cannot
be irreducible in $\mathbb{Z}[i]$ which will show that $p = (a + bi)(a - bi)$ factors as the product
of two distinct irreducibles in $\mathbb{Z}[i]$ or, equivalently, that $p = a^2 + b^2$ is the sum of two
squares. We first prove the following result from elementary number theory:

Lemma 17. The prime number $p \in \mathbb{Z}$ divides an integer of the form $n^2 + 1$ if and only
if $p$ is either 2 or is an odd prime congruent to 1 modulo 4.

Proof: The statement for $p = 2$ is trivial since $2 \mid 1^2 + 1$. If $p$ is an odd prime,
note that $p \mid n^2 + 1$ is equivalent to $n^2 = -1$ in $\mathbb{Z}/p\mathbb{Z}$. This in turn is equivalent to
saying the residue class of $n$ is of order 4 in the multiplicative group $(\mathbb{Z}/p\mathbb{Z})^\times$. Thus
$p$ divides an integer of the form $n^2 + 1$ if and only if $(\mathbb{Z}/p\mathbb{Z})^\times$ contains an element
of order 4. By Lagrange's Theorem, if $(\mathbb{Z}/p\mathbb{Z})^\times$ contains an element of order 4 then
$|(\mathbb{Z}/p\mathbb{Z})^\times| = p - 1$ is divisible by 4, i.e., $p$ is congruent to 1 modulo 4.

Conversely, suppose $p - 1$ is divisible by 4. We first argue that $(\mathbb{Z}/p\mathbb{Z})^\times$ contains
a unique element of order 2. If $m^2 \equiv 1 \bmod p$ then $p$ divides $m^2 - 1 = (m - 1)(m + 1)$.
Thus $p$ divides either $m - 1$ (i.e., $m \equiv 1 \bmod p$) or $m + 1$ (i.e., $m \equiv -1 \bmod p$), so $-1$
is the unique residue class of order 2 in $(\mathbb{Z}/p\mathbb{Z})^\times$. Now the abelian group $(\mathbb{Z}/p\mathbb{Z})^\times$
contains a subgroup $H$ of order 4 (for example, the quotient by the subgroup $\{\pm 1\}$
contains a subgroup of order 2 whose preimage is a subgroup of order 4 in $(\mathbb{Z}/p\mathbb{Z})^\times$).
Since the Klein 4-group has three elements of order 2 whereas $(\mathbb{Z}/p\mathbb{Z})^\times$, hence also
$H$, has a unique element of order 2, $H$ must be the cyclic group of order 4. Thus
$(\mathbb{Z}/p\mathbb{Z})^\times$ contains an element of order 4, namely a generator for $H$.

Remark: We shall prove later (Corollary 19 in Section 9.5) that $(\mathbb{Z}/p\mathbb{Z})^\times$ is a cyclic
group, from which it is immediate that there is an element of order 4 if and only if $p - 1$
is divisible by 4.
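
Lemma 17 is easy to test numerically for small primes. The following brute-force sketch is ours (the helper names `is_prime` and `smallest_root_of_minus_one` are hypothetical); it searches for the least $n$ with $p \mid n^2 + 1$ and compares the outcome with the congruence class of $p$ modulo 4. It is a sanity check, not a substitute for the proof.

```python
def is_prime(p: int) -> bool:
    """Trial-division primality test, adequate for small p."""
    if p < 2:
        return False
    d = 2
    while d * d <= p:
        if p % d == 0:
            return False
        d += 1
    return True

def smallest_root_of_minus_one(p: int):
    """Least n in 1..p-1 with n*n + 1 divisible by p, or None if there is none."""
    for n in range(1, p):
        if (n * n + 1) % p == 0:
            return n
    return None

def lemma_17_holds_up_to(bound: int) -> bool:
    """Check: p | n^2 + 1 for some n  <=>  p == 2 or p = 1 mod 4, for primes p < bound."""
    for p in range(2, bound):
        if is_prime(p):
            found = smallest_root_of_minus_one(p) is not None
            if found != (p == 2 or p % 4 == 1):
                return False
    return True
```

For example, $13 \mid 5^2 + 1 = 26$, while no $n$ works for $p = 7$.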

By Lemma 17, if $p \equiv 1 \bmod 4$ is a prime then $p$ divides $n^2 + 1$ in $\mathbb{Z}$ for some
$n \in \mathbb{Z}$, so certainly $p$ divides $n^2 + 1 = (n + i)(n - i)$ in $\mathbb{Z}[i]$. If $p$ were irreducible
in $\mathbb{Z}[i]$ then $p$ would divide either $n + i$ or $n - i$ in $\mathbb{Z}[i]$. In this situation, since $p$ is a
real number, it would follow that $p$ divides both $n + i$ and its complex conjugate $n - i$;
hence $p$ would divide their difference, $2i$. This is clearly not the case. We have proved
the following result:

Proposition 18. 

(1) (Fermat's Theorem on sums of squares) The prime $p$ is the sum of two integer
squares, $p = a^2 + b^2$, $a, b \in \mathbb{Z}$, if and only if $p = 2$ or $p \equiv 1 \bmod 4$. Except
for interchanging $a$ and $b$ or changing the signs of $a$ and $b$, the representation
of $p$ as a sum of two squares is unique.

(2) The irreducible elements in the Gaussian integers $\mathbb{Z}[i]$ are as follows:

(a) $1 + i$ (which has norm 2),

(b) the primes $p \in \mathbb{Z}$ with $p \equiv 3 \bmod 4$ (which have norm $p^2$), and

(c) $a + bi$, $a - bi$, the distinct irreducible factors of $p = a^2 + b^2 =
(a + bi)(a - bi)$ for the primes $p \in \mathbb{Z}$ with $p \equiv 1 \bmod 4$ (both of which
have norm $p$).


The first part of Proposition 18 is a famous theorem of Fermat in elementary number 
theory, for which a number of alternate proofs can be given. 
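
The argument preceding Proposition 18 is effectively constructive: if $p \equiv 1 \bmod 4$ and $p \mid n^2 + 1$, then a greatest common divisor of $p$ and $n + i$ in $\mathbb{Z}[i]$ is an irreducible $a + bi$ of norm $p$. The sketch below is one possible way to carry this out, under our own conventions: Gaussian integers are stored as pairs `(re, im)`, division rounds each coordinate of the exact quotient to a nearest integer (so the remainder has norm at most half that of the divisor), and all function names are hypothetical.

```python
def g_mul(z, w):
    """Product of Gaussian integers given as (re, im) pairs."""
    return (z[0] * w[0] - z[1] * w[1], z[0] * w[1] + z[1] * w[0])

def g_norm(z):
    """Field norm N(a + bi) = a^2 + b^2."""
    return z[0] * z[0] + z[1] * z[1]

def g_divmod(z, w):
    """Quotient and remainder in Z[i] with N(remainder) <= N(w)/2."""
    n = g_norm(w)
    # z / w = z * conj(w) / N(w); (x, y) is the numerator z * conj(w)
    x = z[0] * w[0] + z[1] * w[1]
    y = z[1] * w[0] - z[0] * w[1]
    # round each coordinate of (x/n, y/n) to a nearest integer
    q = ((2 * x + n) // (2 * n), (2 * y + n) // (2 * n))
    qw = g_mul(q, w)
    return q, (z[0] - qw[0], z[1] - qw[1])

def g_gcd(z, w):
    """Euclidean algorithm in Z[i]; the result is a gcd up to a unit."""
    while w != (0, 0):
        _, r = g_divmod(z, w)
        z, w = w, r
    return z

def two_squares(p):
    """For a prime p = 1 mod 4, return (a, b) with a <= b and a^2 + b^2 = p."""
    n = next(n for n in range(1, p) if (n * n + 1) % p == 0)  # exists by Lemma 17
    a, b = g_gcd((p, 0), (n, 1))
    return tuple(sorted((abs(a), abs(b))))
```

For example, `two_squares(13)` returns `(2, 3)`, matching $13 = 2^2 + 3^2$. The linear search for $n$ is adequate only for small primes.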

More generally, the question of whether the integer $n \in \mathbb{Z}$ can be written as a sum
of two integer squares, $n = A^2 + B^2$, is equivalent to the question of whether $n$ is the
norm of an element $A + Bi$ in the Gaussian integers, i.e., $n = A^2 + B^2 = N(A + Bi)$.
Writing $A + Bi = \pi_1\pi_2 \cdots \pi_k$ as a product of irreducibles (uniquely up to units) it
follows from the explicit description of the irreducibles in $\mathbb{Z}[i]$ in Proposition 18 that $n$
is a norm if and only if the prime divisors of $n$ that are congruent to 3 mod 4 occur to
even exponents. Further, if this condition on $n$ is satisfied, then the uniqueness of the
factorization of $A + Bi$ in $\mathbb{Z}[i]$ allows us to count the number of representations of $n$
as a sum of two squares, as in the following corollary.


Corollary 19. Let $n$ be a positive integer and write

$$n = 2^k p_1^{a_1} \cdots p_r^{a_r} q_1^{b_1} \cdots q_s^{b_s}$$

where $p_1, \dots, p_r$ are distinct primes congruent to 1 modulo 4 and $q_1, \dots, q_s$ are distinct
primes congruent to 3 modulo 4. Then $n$ can be written as a sum of two squares in $\mathbb{Z}$,
i.e., $n = A^2 + B^2$ with $A, B \in \mathbb{Z}$, if and only if each $b_i$ is even. Further, if this condition
on $n$ is satisfied, then the number of representations of $n$ as a sum of two squares is
$4(a_1 + 1)(a_2 + 1) \cdots (a_r + 1)$.

Proof: The first statement in the corollary was proved above. Assume now that
$b_1, \dots, b_s$ are all even. For each prime $p_i$ congruent to 1 modulo 4 write $p_i = \pi_i\bar{\pi}_i$
for $i = 1, 2, \dots, r$, where $\pi_i$ and $\bar{\pi}_i$ are irreducibles as in (2)(c) of Proposition 18. If
$N(A + Bi) = n$ then examining norms we see that, up to units, the factorization of
$A + Bi$ into irreducibles in $\mathbb{Z}[i]$ is given by

$$A + Bi = (1 + i)^k \left(\pi_1^{a_{1,1}} \bar{\pi}_1^{a_{1,2}}\right) \cdots \left(\pi_r^{a_{r,1}} \bar{\pi}_r^{a_{r,2}}\right) q_1^{b_1/2} \cdots q_s^{b_s/2}$$

with nonnegative integers $a_{i,1}, a_{i,2}$ satisfying $a_{i,1} + a_{i,2} = a_i$ for $i = 1, 2, \dots, r$. Since
$a_{i,1}$ can have the values $0, 1, \dots, a_i$ (and then $a_{i,2}$ is determined), there are a total of
$(a_1 + 1)(a_2 + 1) \cdots (a_r + 1)$ distinct elements $A + Bi$ in $\mathbb{Z}[i]$ of norm $n$, up to units.
Finally, since there are four units in $\mathbb{Z}[i]$, the second statement in the corollary follows.

Example 

Since $493 = 17 \cdot 29$ and both primes are congruent to 1 modulo 4, $493 = A^2 + B^2$ is
the sum of two integer squares. Since $17 = (4 + i)(4 - i)$ and $29 = (5 + 2i)(5 - 2i)$
the possible factorizations of $A + Bi$ in $\mathbb{Z}[i]$ up to units are $(4 + i)(5 + 2i) = 18 + 13i$,
$(4 + i)(5 - 2i) = 22 - 3i$, $(4 - i)(5 + 2i) = 22 + 3i$, and $(4 - i)(5 - 2i) = 18 - 13i$.
Multiplying by $-1$ reverses both signs and multiplication by $i$ interchanges the $A$ and $B$
and introduces one sign change. Then $493 = (\pm 18)^2 + (\pm 13)^2 = (\pm 22)^2 + (\pm 3)^2$ with
all possible choices of signs gives 8 of the 16 possible representations of 493 as the sum of
two squares; the remaining 8 are obtained by interchanging the two summands.

Similarly, the integer $58000957 = 7^6 \cdot 17 \cdot 29$ can be written as a sum of two squares
in precisely 16 ways, obtained by multiplying each of the integers $A$, $B$ in $493 = A^2 + B^2$
above by $7^3$.
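
The counts in this example can be checked by direct enumeration. The following sketch is ours (function names hypothetical): `representations` lists all ordered pairs $(A, B)$ of integers with $A^2 + B^2 = n$, and `predicted_count` evaluates the formula of Corollary 19 using a naive trial-division factorization, which is an assumption made for brevity and is only practical for small $n$.

```python
from math import isqrt

def representations(n: int):
    """All ordered pairs (A, B) of integers with A*A + B*B == n, for n >= 1."""
    reps = []
    for a in range(-isqrt(n), isqrt(n) + 1):
        b2 = n - a * a
        b = isqrt(b2)
        if b * b == b2:
            reps.append((a, b))
            if b != 0:
                reps.append((a, -b))
    return reps

def predicted_count(n: int) -> int:
    """4(a_1 + 1)...(a_r + 1) if every prime = 3 mod 4 occurs to an even power, else 0."""
    count, d = 4, 2
    while d * d <= n:
        e = 0
        while n % d == 0:
            n //= d
            e += 1
        if d % 4 == 1:
            count *= e + 1          # e = 0 when d does not divide n
        elif d % 4 == 3 and e % 2 == 1:
            return 0
        d += 1
    if n > 1:                       # one remaining prime factor, exponent 1
        if n % 4 == 1:
            count *= 2
        elif n % 4 == 3:
            return 0
    return count
```

Here `len(representations(493))` and `predicted_count(493)` both give 16, and the list contains $(18, 13)$ and $(22, 3)$ together with their sign changes and interchanges.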


Summary 

In summary, we have the following inclusions among classes of commutative rings with
identity:

fields $\subset$ Euclidean Domains $\subset$ P.I.D.s $\subset$ U.F.D.s $\subset$ integral domains

with all containments being proper. Recall that $\mathbb{Z}$ is a Euclidean Domain that is not a
field, the quadratic integer ring $\mathbb{Z}[(1 + \sqrt{-19})/2]$ is a Principal Ideal Domain that is
not a Euclidean Domain, $\mathbb{Z}[x]$ is a Unique Factorization Domain (Theorem 7 in Chapter
9) that is not a Principal Ideal Domain and $\mathbb{Z}[\sqrt{-5}]$ is an integral domain that is not a
Unique Factorization Domain.


EXERCISES 

1. Let $G = \mathbb{Q}^\times$ be the multiplicative group of nonzero rational numbers. If $\alpha = p/q \in G$,
where $p$ and $q$ are relatively prime integers, let $\varphi : G \to G$ be the map which
interchanges the primes 2 and 3 in the prime power factorizations of $p$ and $q$ (so, for example,
$\varphi(2^4 3^{11} 5^1 13^2) = 3^4 2^{11} 5^1 13^2$, $\varphi(3/16) = \varphi(3/2^4) = 2/3^4 = 2/81$, and $\varphi$ is the identity
on all rational numbers with numerators and denominators relatively prime to 2 and to 3).

(a) Prove that $\varphi$ is a group isomorphism.

(b) Prove that there are infinitely many isomorphisms of the group $G$ to itself.

(c) Prove that none of the isomorphisms above can be extended to an isomorphism of the
ring $\mathbb{Q}$ to itself. In fact prove that the identity map is the only ring isomorphism of $\mathbb{Q}$.

2. Let $a$ and $b$ be nonzero elements of the Unique Factorization Domain $R$. Prove that $a$ and
$b$ have a least common multiple (cf. Exercise 11 of Section 1) and describe it in terms of
the prime factorizations of $a$ and $b$ in the same fashion that Proposition 13 describes their
greatest common divisor.

3. Determine all the representations of the integer $2130797 = 17^2 \cdot 73 \cdot 101$ as a sum of two
squares.

4. Prove that if an integer is the sum of two rational squares, then it is the sum of two integer
squares (for example, $13 = (1/5)^2 + (18/5)^2 = 2^2 + 3^2$).

5. Let $R = \mathbb{Z}[\sqrt{-n}]$ where $n$ is a squarefree integer greater than 3.

(a) Prove that 2, $\sqrt{-n}$ and $1 + \sqrt{-n}$ are irreducibles in $R$.

(b) Prove that $R$ is not a U.F.D. Conclude that the quadratic integer ring $\mathcal{O}$ is not a U.F.D.
for $D \equiv 2, 3 \bmod 4$, $D < -3$ (so also not Euclidean and not a P.I.D.). [Show that
either $\sqrt{-n}$ or $1 + \sqrt{-n}$ is not prime.]

(c) Give an explicit ideal in $R$ that is not principal. [Using (b) consider a maximal ideal
containing the nonprime ideal $(\sqrt{-n})$ or $(1 + \sqrt{-n})$.]

6. (a) Prove that the quotient ring $\mathbb{Z}[i]/(1 + i)$ is a field of order 2.

(b) Let $q \in \mathbb{Z}$ be a prime with $q \equiv 3 \bmod 4$. Prove that the quotient ring $\mathbb{Z}[i]/(q)$ is a
field with $q^2$ elements.

(c) Let $p \in \mathbb{Z}$ be a prime with $p \equiv 1 \bmod 4$ and write $p = \pi\bar{\pi}$ as in Proposition 18. Show
that the hypotheses for the Chinese Remainder Theorem (Theorem 17 in Section 7.6)
are satisfied and that $\mathbb{Z}[i]/(p) \cong \mathbb{Z}[i]/(\pi) \times \mathbb{Z}[i]/(\bar{\pi})$ as rings. Show that the quotient
ring $\mathbb{Z}[i]/(p)$ has order $p^2$ and conclude that $\mathbb{Z}[i]/(\pi)$ and $\mathbb{Z}[i]/(\bar{\pi})$ are both fields
of order $p$.

7. Let $\pi$ be an irreducible element in $\mathbb{Z}[i]$.

(a) For any integer $n \ge 0$, prove that $(\pi^{n+1}) = \pi^{n+1}\mathbb{Z}[i]$ is an ideal in $(\pi^n) = \pi^n\mathbb{Z}[i]$
and that multiplication by $\pi^n$ induces an isomorphism $\mathbb{Z}[i]/(\pi) \cong (\pi^n)/(\pi^{n+1})$ as
additive abelian groups.

(b) Prove that $|\mathbb{Z}[i]/(\pi^n)| = |\mathbb{Z}[i]/(\pi)|^n$.

(c) Prove for any nonzero $\alpha$ in $\mathbb{Z}[i]$ that the quotient ring $\mathbb{Z}[i]/(\alpha)$ has order equal to
$N(\alpha)$. [Use (b) together with the Chinese Remainder Theorem and the results of the
previous exercise.]

8. Let $R$ be the quadratic integer ring $\mathbb{Z}[\sqrt{-5}]$ and define the ideals $I_2 = (2, 1 + \sqrt{-5})$,
$I_3 = (3, 2 + \sqrt{-5})$, and $I_3' = (3, 2 - \sqrt{-5})$.

(a) Prove that 2, 3, $1 + \sqrt{-5}$ and $1 - \sqrt{-5}$ are irreducibles in $R$, no two of which
are associate in $R$, and that $6 = 2 \cdot 3 = (1 + \sqrt{-5}) \cdot (1 - \sqrt{-5})$ are two distinct
factorizations of 6 into irreducibles in $R$.

(b) Prove that $I_2$, $I_3$, and $I_3'$ are prime ideals in $R$. [One approach: for $I_3$, observe
that $R/I_3 \cong (R/(3))/(I_3/(3))$ by the Third Isomorphism Theorem for Rings. Show
that $R/(3)$ has 9 elements, $I_3/(3)$ has 3 elements, and that $R/I_3 \cong \mathbb{Z}/3\mathbb{Z}$ as an
additive abelian group. Conclude that $I_3$ is a maximal (hence prime) ideal and that
$R/I_3 \cong \mathbb{Z}/3\mathbb{Z}$ as rings.]

(c) Show that the factorizations in (a) imply the equality of ideals $(6) = (2)(3)$ and
$(6) = (1 + \sqrt{-5})(1 - \sqrt{-5})$. Show that these two ideal factorizations give the
same factorization of the ideal (6) as the product of prime ideals (cf. Exercise 5 in
Section 2).

9. Suppose that the quadratic integer ring $\mathcal{O}$ is a P.I.D. Prove that the absolute value of
the field norm $N$ on $\mathcal{O}$ (cf. Section 7.1) is a Dedekind-Hasse norm on $\mathcal{O}$. Conclude
that if the quadratic integer ring $\mathcal{O}$ possesses any Dedekind-Hasse norm, then in fact the
absolute value of the field norm on $\mathcal{O}$ already provides a Dedekind-Hasse norm on $\mathcal{O}$.
[If $\alpha, \beta \in \mathcal{O}$ then $(\alpha, \beta) = (\gamma)$ for some $\gamma \in \mathcal{O}$. Show that if $\beta$ does not divide $\alpha$ then
$0 < |N(\gamma)| < |N(\beta)|$; use the fact that the units in $\mathcal{O}$ are precisely the elements whose
norm is $\pm 1$.]

Remark : If O is a Euclidean Domain with respect to some norm it is not necessarily true that 
it is a Euclidean Domain with respect to the absolute value of the field norm (although this is 
true for D < 0, cf. Exercise 8 in Section 1). An example is D = 69 (cf. D. Clark, A quadratic 
field which is Euclidean but not norm-Euclidean, Manuscripta Math., 83(1994), pp. 327-330). 

10. (k-stage Euclidean Domains) Let $R$ be an integral domain and let $N : R \to \mathbb{Z}^+ \cup \{0\}$ be
a norm on $R$. The ring $R$ is Euclidean with respect to $N$ if for any $a, b \in R$ with $b \ne 0$,
there exist elements $q$ and $r$ in $R$ with

$$a = qb + r \quad \text{with } r = 0 \text{ or } N(r) < N(b).$$

Suppose now that this condition is weakened, namely that for any $a, b \in R$ with $b \ne 0$,
there exist elements $q, q'$ and $r, r'$ in $R$ with

$$a = qb + r, \quad b = q'r + r' \quad \text{with } r' = 0 \text{ or } N(r') < N(b),$$

i.e., the remainder after two divisions is smaller. Call such a domain a 2-stage Euclidean
Domain.

(a) Prove that iterating the divisions in a 2-stage Euclidean Domain produces a greatest
common divisor of $a$ and $b$ which is a linear combination of $a$ and $b$. Conclude that
every finitely generated ideal of a 2-stage Euclidean Domain is principal. (There
are 2-stage Euclidean Domains that are not P.I.D.s, however.) [Imitate the proof of
Theorem 4.]

(b) Prove that a 2-stage Euclidean Domain in which every nonzero nonunit can be factored
into a finite number of irreducibles is a Unique Factorization Domain. [Prove first
that irreducible elements are prime, as follows. If $p$ is irreducible and $p \mid ab$ with
$p$ not dividing $a$, use part (a) to write $px + ay = 1$ for some $x, y$. Multiply through
by $b$ to conclude that $p \mid b$, so $p$ is prime. Now follow the proof of uniqueness in
Theorem 14.]

(c) Make the obvious generalization to define the notion of a k-stage Euclidean Domain
for any integer $k \ge 1$. Prove that statements (a) and (b) remain valid if “2-stage
Euclidean” is replaced by “k-stage Euclidean.”

Remarks: There are examples of rings which are 2-stage Euclidean but are not Euclidean.
There are also examples of rings which are not Euclidean with respect to a given norm but
which are k-stage Euclidean with respect to the norm (for example, the ring $\mathbb{Z}[\sqrt{14}]$ is not
Euclidean with respect to the usual norm $N(a + b\sqrt{14}) = |a^2 - 14b^2|$, but is 2-stage Euclidean
with respect to this norm). The k-stage Euclidean condition is also related to the question of
whether the group $GL_n(R)$ of invertible $n \times n$ matrices with entries from $R$ is generated by
elementary matrices (matrices with 1's along the main diagonal, a single 1 somewhere off the
main diagonal, and 0's elsewhere).

11. (Characterization of P.I.D.s) Prove that $R$ is a P.I.D. if and only if $R$ is a U.F.D. that is
also a Bezout Domain (cf. Exercise 7 in Section 2). [One direction is given by Theorem
14. For the converse, let $a$ be a nonzero element of the ideal $I$ with a minimal number of
irreducible factors. Prove that $I = (a)$ by showing that if there is an element $b \in I$ that is
not in $(a)$ then $(a, b) = (d)$ leads to a contradiction.]

CHAPTER 9


Polynomial Rings 


We begin this chapter on polynomial rings with a summary of facts from the preceding
two chapters (with references where needed). The basic definitions were given in
slightly greater detail in Section 7.2. For convenience, the ring $R$ will always be a
commutative ring with identity $1 \ne 0$.

9.1 DEFINITIONS AND BASIC PROPERTIES 

The polynomial ring $R[x]$ in the indeterminate $x$ with coefficients from $R$ is the set
of all formal sums $a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$ with $n \ge 0$ and each $a_i \in R$.
If $a_n \ne 0$ then the polynomial is of degree $n$, $a_n x^n$ is the leading term, and $a_n$ is the
leading coefficient (where the leading coefficient of the zero polynomial is defined to be
0). The polynomial is monic if $a_n = 1$. Addition of polynomials is “componentwise”:

$$\sum_{i=0}^{n} a_i x^i + \sum_{i=0}^{n} b_i x^i = \sum_{i=0}^{n} (a_i + b_i) x^i$$

(here $a_n$ or $b_n$ may be zero in order for addition of polynomials of different degrees
to be defined). Multiplication is performed by first defining $(ax^i)(bx^j) = abx^{i+j}$ and
then extending to all polynomials by the distributive laws so that in general

$$\Big(\sum_{i=0}^{n} a_i x^i\Big) \times \Big(\sum_{i=0}^{m} b_i x^i\Big) = \sum_{k=0}^{n+m} \Big(\sum_{i=0}^{k} a_i b_{k-i}\Big) x^k.$$

In this way $R[x]$ is a commutative ring with identity (the identity 1 from $R$) in which
we identify $R$ with the subring of constant polynomials.
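
The two operations just defined translate directly into code. In the following sketch, which is ours and not part of the formal development, a polynomial $a_0 + a_1 x + \cdots + a_n x^n$ is stored as the list `[a_0, a_1, ..., a_n]` of its coefficients (lowest degree first); the names `poly_add` and `poly_mul` are hypothetical, and the coefficients are taken to be Python integers purely for concreteness.

```python
def poly_add(a, b):
    """Componentwise sum; the shorter list is padded with zero coefficients."""
    n = max(len(a), len(b))
    a = a + [0] * (n - len(a))
    b = b + [0] * (n - len(b))
    return [x + y for x, y in zip(a, b)]

def poly_mul(a, b):
    """Product: the coefficient of x^k is the sum of a_i * b_(k-i) over i."""
    if not a or not b:
        return []                       # the empty list stands for the zero polynomial
    c = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            c[i + j] += ai * bj         # (a_i x^i)(b_j x^j) = a_i b_j x^(i+j)
    return c
```

For example, `poly_mul([1, 2], [3, -1, 1])` computes $(1 + 2x)(3 - x + x^2) = 3 + 5x - x^2 + 2x^3$ and returns `[3, 5, -1, 2]`.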

We have already noted that if R is an integral domain then the leading term of a 
product of polynomials is the product of the leading terms of the factors. The following 
is Proposition 4 of Section 7.2 which we record here for completeness. 

Proposition 1. Let $R$ be an integral domain. Then

(1) degree $p(x)q(x)$ = degree $p(x)$ + degree $q(x)$ if $p(x)$, $q(x)$ are nonzero

(2) the units of $R[x]$ are just the units of $R$

(3) $R[x]$ is an integral domain.

Recall also that if $R$ is an integral domain, the quotient field of $R[x]$ consists of all
quotients $\dfrac{p(x)}{q(x)}$ where $q(x)$ is not the zero polynomial (and is called the field of rational
functions in $x$ with coefficients in $R$).

The next result describes a relation between the ideals of $R$ and those of $R[x]$.


Proposition 2. Let $I$ be an ideal of the ring $R$ and let $(I) = I[x]$ denote the ideal of
$R[x]$ generated by $I$ (the set of polynomials with coefficients in $I$). Then

$$R[x]/(I) \cong (R/I)[x].$$

In particular, if $I$ is a prime ideal of $R$ then $(I)$ is a prime ideal of $R[x]$.


Proof: There is a natural map $\varphi : R[x] \to (R/I)[x]$ given by reducing each of the
coefficients of a polynomial modulo $I$. The definition of addition and multiplication
in these two rings shows that $\varphi$ is a ring homomorphism. The kernel is precisely the
set of polynomials each of whose coefficients is an element of $I$, which is to say that
$\ker \varphi = I[x] = (I)$, proving the first part of the proposition. The last statement follows
from Proposition 1, since if $I$ is a prime ideal in $R$, then $R/I$ is an integral domain,
hence also $(R/I)[x]$ is an integral domain. This shows if $I$ is a prime ideal of $R$, then
$(I)$ is a prime ideal of $R[x]$.

Note that it is not true that if $I$ is a maximal ideal of $R$ then $(I)$ is a maximal ideal
of $R[x]$. However, if $I$ is maximal in $R$ then the ideal of $R[x]$ generated by $I$ and $x$ is
maximal in $R[x]$.

We now give an example of the “reduction homomorphism” of Proposition 2 which 
will be useful on a number of occasions later (“reduction homomorphisms” were also 
discussed at the end of Section 7.3 with reference to reducing the integers mod n) . 

Example 

Let $R = \mathbb{Z}$ and consider the ideal $n\mathbb{Z}$ of $\mathbb{Z}$. Then the isomorphism above can be written

$$\mathbb{Z}[x]/n\mathbb{Z}[x] \cong \mathbb{Z}/n\mathbb{Z}[x]$$

and the natural projection map of $\mathbb{Z}[x]$ to $\mathbb{Z}/n\mathbb{Z}[x]$ by reducing the coefficients modulo $n$ is
a ring homomorphism. If $n$ is composite, then the quotient ring is not an integral domain.
If, however, $n$ is a prime $p$, then $\mathbb{Z}/p\mathbb{Z}$ is a field and so $\mathbb{Z}/p\mathbb{Z}[x]$ is an integral domain (in
fact, a Euclidean Domain, as we shall see shortly). We also see that the set of polynomials
whose coefficients are divisible by $p$ is a prime ideal in $\mathbb{Z}[x]$.
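
The reduction homomorphism of this example is simple to experiment with. In the sketch below (ours; the names `poly_mul`, `reduce_mod` and `mul_mod` are hypothetical, and polynomials are coefficient lists with lowest degree first), reducing a product agrees with multiplying the reductions, and the composite modulus $n = 4$ exhibits zero divisors in $\mathbb{Z}/4\mathbb{Z}[x]$.

```python
def poly_mul(a, b):
    """Product of integer polynomials given as coefficient lists."""
    if not a or not b:
        return []
    c = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            c[i + j] += ai * bj
    return c

def reduce_mod(poly, n):
    """Image of a polynomial under Z[x] -> Z/nZ[x]: reduce every coefficient mod n."""
    return [c % n for c in poly]

def mul_mod(f, g, n):
    """Multiply in Z/nZ[x]: reduce, multiply, reduce again."""
    return reduce_mod(poly_mul(reduce_mod(f, n), reduce_mod(g, n)), n)

# Compatibility with multiplication for one sample pair, modulo 5
f, g = [3, -5, 7], [4, 6]
same = reduce_mod(poly_mul(f, g), 5) == mul_mod(f, g, 5)

# Zero divisors modulo 4: (2x)(2x) = 4x^2 reduces to the zero polynomial
zero = reduce_mod(poly_mul([0, 2], [0, 2]), 4)
```

The list `zero` has all coefficients equal to 0 even though $2x$ is nonzero in $\mathbb{Z}/4\mathbb{Z}[x]$; no such example exists when $n$ is prime.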

We close this section with a description of the natural extension to polynomial rings 
in several variables. 

Definition. The polynomial ring in the variables $x_1, x_2, \dots, x_n$ with coefficients in $R$,
denoted $R[x_1, x_2, \dots, x_n]$, is defined inductively by

$$R[x_1, x_2, \dots, x_n] = R[x_1, x_2, \dots, x_{n-1}][x_n].$$


This definition means that we can consider polynomials in $n$ variables with coefficients
in $R$ simply as polynomials in one variable (say $x_n$) but now with coefficients that
are themselves polynomials in $n - 1$ variables. In a slightly more concrete formulation,
a nonzero polynomial in $x_1, x_2, \dots, x_n$ with coefficients in $R$ is a finite sum of nonzero
monomial terms, i.e., a finite sum of elements of the form

$$a x_1^{d_1} x_2^{d_2} \cdots x_n^{d_n}$$

where $a \in R$ (the coefficient of the term) and the $d_i$ are nonnegative integers. A monic
term $x_1^{d_1} x_2^{d_2} \cdots x_n^{d_n}$ is called simply a monomial and is the monomial part of the term
$a x_1^{d_1} x_2^{d_2} \cdots x_n^{d_n}$. The exponent $d_i$ is called the degree in $x_i$ of the term and the sum

$$d = d_1 + d_2 + \cdots + d_n$$

is called the degree of the term. The ordered $n$-tuple $(d_1, d_2, \dots, d_n)$ is the multidegree
of the term. The degree of a nonzero polynomial is the largest degree of any of its
monomial terms. A polynomial is called homogeneous or a form if all its terms have the
same degree. If $f$ is a nonzero polynomial in $n$ variables, the sum of all the monomial
terms in $f$ of degree $k$ is called the homogeneous component of $f$ of degree $k$. If $f$ has
degree $d$ then $f$ may be written uniquely as the sum $f_0 + f_1 + \cdots + f_d$ where $f_k$ is
the homogeneous component of $f$ of degree $k$, for $0 \le k \le d$ (where some $f_k$ may be
zero).

Finally, to define a polynomial ring in an arbitrary number of variables with coefficients
in $R$ we take finite sums of monomial terms of the type above (but where the
variables are not restricted to just $x_1, \dots, x_n$), with the natural addition and multiplication.
Alternatively, we could define this ring as the union of all the polynomial rings in
a finite number of the variables being considered.


Example 

The polynomial ring $\mathbb{Z}[x, y]$ in two variables $x$ and $y$ with integer coefficients consists of
all finite sums of monomial terms of the form $ax^iy^j$ (of degree $i + j$). For example,

$$p(x, y) = 2x^3 + xy - y^2$$

and

$$q(x, y) = -3xy + 2y^2 + x^2y^3$$

are both elements of $\mathbb{Z}[x, y]$, of degrees 3 and 5, respectively. We have

$$p(x, y) + q(x, y) = 2x^3 - 2xy + y^2 + x^2y^3$$

and

$$p(x, y)q(x, y) = -6x^4y + 4x^3y^2 + 2x^5y^3 - 3x^2y^2 + 5xy^3 + x^3y^4 - 2y^4 - x^2y^5,$$

a polynomial of degree 8. To view this last polynomial, say, as a polynomial in $y$ with
coefficients in $\mathbb{Z}[x]$ as in the definition of several variable polynomial rings above, we
would write the polynomial in the form

$$(-6x^4)y + (4x^3 - 3x^2)y^2 + (2x^5 + 5x)y^3 + (x^3 - 2)y^4 - (x^2)y^5.$$

The nonzero homogeneous components of $f = f(x, y) = p(x, y)q(x, y)$ are the polynomials
$f_4 = -3x^2y^2 + 5xy^3 - 2y^4$ (degree 4), $f_5 = -6x^4y + 4x^3y^2$ (degree 5),
$f_7 = x^3y^4 - x^2y^5$ (degree 7), and $f_8 = 2x^5y^3$ (degree 8).

Each of the statements in Proposition 1 is true for polynomial rings with an arbitrary
number of variables. This follows by induction for finitely many variables and from
the definition in terms of unions in the case of polynomial rings in arbitrarily many
variables.
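
The computations in the example in $\mathbb{Z}[x, y]$ above can be reproduced mechanically. The sketch below is ours: a polynomial in $x, y$ is stored as a dictionary sending each multidegree $(i, j)$ to the coefficient of $x^iy^j$, and the names `mv_mul`, `degree` and `homogeneous_components` are hypothetical. We take $p = 2x^3 + xy - y^2$ and $q = -3xy + 2y^2 + x^2y^3$, the reading of $q$ that is consistent with the stated degrees 3 and 5.

```python
from collections import defaultdict

def mv_mul(p, q):
    """Product of two polynomials in x, y stored as {(i, j): coefficient}."""
    out = defaultdict(int)
    for (i1, j1), c1 in p.items():
        for (i2, j2), c2 in q.items():
            out[(i1 + i2, j1 + j2)] += c1 * c2   # multidegrees add
    return {m: c for m, c in out.items() if c != 0}

def degree(p):
    """Largest total degree i + j of a term of a nonzero polynomial."""
    return max(i + j for (i, j) in p)

def homogeneous_components(p):
    """Group the terms of p by total degree: {k: component of degree k}."""
    comps = defaultdict(dict)
    for (i, j), c in p.items():
        comps[i + j][(i, j)] = c
    return dict(comps)

p = {(3, 0): 2, (1, 1): 1, (0, 2): -1}      # 2x^3 + xy - y^2
q = {(1, 1): -3, (0, 2): 2, (2, 3): 1}      # -3xy + 2y^2 + x^2 y^3
f = mv_mul(p, q)
components = homogeneous_components(f)
```

With these inputs `degree(f)` is 8 and the keys of `components` are 4, 5, 7 and 8, in agreement with $f_4$, $f_5$, $f_7$ and $f_8$ in the example.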


EXERCISES 

1. Let $p(x, y, z) = 2x^2y - 3xy^3z + 4y^2z^5$ and $q(x, y, z) = 7x^2 + 5x^2y^3z^4 - 3x^2z^3$ be
polynomials in $\mathbb{Z}[x, y, z]$.

(a) Write each of $p$ and $q$ as a polynomial in $x$ with coefficients in $\mathbb{Z}[y, z]$.

(b) Find the degree of each of $p$ and $q$.

(c) Find the degree of $p$ and $q$ in each of the three variables $x$, $y$ and $z$.

(d) Compute $pq$ and find the degree of $pq$ in each of the three variables $x$, $y$ and $z$.

(e) Write $pq$ as a polynomial in the variable $z$ with coefficients in $\mathbb{Z}[x, y]$.

2. Repeat the preceding exercise under the assumption that the coefficients of p and q are in 
Z/3Z. 

3. If $R$ is a commutative ring and $x_1, x_2, \dots, x_n$ are independent variables over $R$, prove
that $R[x_{\pi(1)}, x_{\pi(2)}, \dots, x_{\pi(n)}]$ is isomorphic to $R[x_1, x_2, \dots, x_n]$ for any permutation $\pi$
of $\{1, 2, \dots, n\}$.

4. Prove that the ideals $(x)$ and $(x, y)$ are prime ideals in $\mathbb{Q}[x, y]$ but only the latter ideal is
a maximal ideal.

5. Prove that $(x, y)$ and $(2, x, y)$ are prime ideals in $\mathbb{Z}[x, y]$ but only the latter ideal is a
maximal ideal.

6. Prove that (x, y) is not a principal ideal in Q[x, y]. 

7. Let R be a commutative ring with 1 . Prove that a polynomial ring in more than one variable 
over R is not a Principal Ideal Domain. 

8. Let $F$ be a field and let $R = F[x, x^2y, x^3y^2, \dots, x^ny^{n-1}, \dots]$ be a subring of the
polynomial ring $F[x, y]$.

(a) Prove that the fields of fractions of R and F[x, y] are the same. 

(b) Prove that R contains an ideal that is not finitely generated. 

9. Prove that a polynomial ring in infinitely many variables with coefficients in any commu- 
tative ring contains ideals that are not finitely generated. 

10. Prove that the ring $\mathbb{Z}[x_1, x_2, x_3, \dots]/(x_1x_2, x_3x_4, x_5x_6, \dots)$ contains infinitely many
minimal prime ideals (cf. Exercise 36 of Section 7.4).

11. Show that the radical of the ideal $I = (x, y^2)$ in $\mathbb{Q}[x, y]$ is $(x, y)$ (cf. Exercise 30, Section
7.4). Deduce that $I$ is a primary ideal that is not a power of a prime ideal (cf. Exercise 41,
Section 7.4).

12. Let $R = \mathbb{Q}[x, y, z]$ and let bars denote passage to $\mathbb{Q}[x, y, z]/(xy - z^2)$. Prove that
$\bar{P} = (\bar{x}, \bar{z})$ is a prime ideal. Show that $\bar{x}\bar{y} \in \bar{P}^2$ but that no power of $\bar{y}$ lies in $\bar{P}^2$. (This
shows $\bar{P}$ is a prime ideal whose square is not a primary ideal; cf. Exercise 41, Section
7.4).

13. Prove that the rings $F[x, y]/(y^2 - x)$ and $F[x, y]/(y^2 - x^2)$ are not isomorphic for any
field $F$.

14. Let $R$ be an integral domain and let $i, j$ be relatively prime integers. Prove that the ideal
$(x^i - y^j)$ is a prime ideal in $R[x, y]$. [Consider the ring homomorphism $\varphi$ from $R[x, y]$
to $R[t]$ defined by mapping $x$ to $t^j$ and mapping $y$ to $t^i$. Show that an element of $R[x, y]$
differs from an element in $(x^i - y^j)$ by a polynomial $f(x)$ of degree at most $j - 1$ in $y$
and observe that the exponents of $\varphi(x^ry^s)$ are distinct for $0 \le s < j$.]

15. Let $p(x_1, x_2, \dots, x_n)$ be a homogeneous polynomial of degree $k$ in $R[x_1, \dots, x_n]$. Prove
that for all $\lambda \in R$ we have $p(\lambda x_1, \lambda x_2, \dots, \lambda x_n) = \lambda^k p(x_1, x_2, \dots, x_n)$.

16. Prove that the product of two homogeneous polynomials is again homogeneous.

17. An ideal $I$ in $R[x_1, \dots, x_n]$ is called a homogeneous ideal if whenever $p \in I$ then each
homogeneous component of $p$ is also in $I$. Prove that an ideal is a homogeneous ideal if
and only if it may be generated by homogeneous polynomials. [Use induction on degrees
to show the “if” implication.]

The following exercise shows that some care must be taken when working with polynomials
over noncommutative rings $R$ (the ring operations in $R[x]$ are defined in the same way as for
commutative rings $R$), in particular when considering polynomials as functions.

18. Let $R$ be an arbitrary ring and let Func($R$) be the ring of all functions from $R$ to itself.
If $p(x) \in R[x]$ is a polynomial, let $f_p \in$ Func($R$) be the function on $R$ defined by
$f_p(r) = p(r)$ (the usual way of viewing a polynomial in $R[x]$ as defining a function on $R$
by “evaluating at $r$”).

(a) For fixed $a \in R$, prove that “evaluation at $a$” is a ring homomorphism from Func($R$)
to $R$ (cf. Example 4 following Theorem 7 in Section 7.3).

(b) Prove that the map $\varphi : R[x] \to$ Func($R$) defined by $\varphi(p(x)) = f_p$ is not a ring
homomorphism in general. Deduce that polynomial identities need not give corresponding
identities when the polynomials are viewed as functions. [If $R = \mathbb{H}$ is the ring of real
Hamilton Quaternions show that $p(x) = x^2 + 1$ factors as $(x + i)(x - i)$, but that
$p(j) = 0$ while $(j + i)(j - i) = 2k \ne 0$.]

(c) For fixed $a \in R$, prove that the composite “evaluation at $a$” of the maps in (a) and (b)
mapping $R[x]$ to $R$ is a ring homomorphism if and only if $a$ is in the center of $R$.

9.2 POLYNOMIAL RINGS OVER FIELDS I 

We now consider more carefully the situation where the coefficient ring is a field $F$.
We can define a norm on $F[x]$ by defining $N(p(x))$ = degree of $p(x)$ (where we set
$N(0) = 0$). From elementary algebra we know that we can divide one polynomial with,
say, rational coefficients by another (nonzero) polynomial with rational coefficients to
obtain a quotient and remainder. The same is true over any field.

Theorem 3. Let $F$ be a field. The polynomial ring $F[x]$ is a Euclidean Domain.
Specifically, if $a(x)$ and $b(x)$ are two polynomials in $F[x]$ with $b(x)$ nonzero, then
there are unique $q(x)$ and $r(x)$ in $F[x]$ such that

$$a(x) = q(x)b(x) + r(x) \quad \text{with } r(x) = 0 \text{ or degree } r(x) < \text{degree } b(x).$$

Proof: If a(x) is the zero polynomial then take q(x) = r(x) = 0. We may
therefore assume a(x) ≠ 0 and prove the existence of q(x) and r(x) by induction on
n = degree a(x). Let b(x) have degree m. If n < m take q(x) = 0 and r(x) = a(x).
Otherwise n ≥ m. Write

a(x) = a_n x^n + a_{n-1} x^{n-1} + ... + a_1 x + a_0

and

b(x) = b_m x^m + b_{m-1} x^{m-1} + ... + b_1 x + b_0.

Then the polynomial a'(x) = a(x) - (a_n/b_m) x^{n-m} b(x) is of degree less than n (we have
arranged to subtract the leading term from a(x)). Note that this polynomial is well
defined because the coefficients are taken from a field and b_m ≠ 0. By induction then,
there exist polynomials q'(x) and r(x) with

a'(x) = q'(x)b(x) + r(x) with r(x) = 0 or degree r(x) < degree b(x).

Then, letting q(x) = q'(x) + (a_n/b_m) x^{n-m} we have

a(x) = q(x)b(x) + r(x) with r(x) = 0 or degree r(x) < degree b(x)
completing the induction step. 

As for the uniqueness, suppose q_1(x) and r_1(x) also satisfied the conditions of the
theorem. Then both a(x) - q(x)b(x) and a(x) - q_1(x)b(x) are of degree less than
m = degree b(x). The difference of these two polynomials, i.e., b(x)(q(x) - q_1(x)) is
also of degree less than m. But the degree of the product of two nonzero polynomials
is the sum of their degrees (since F is an integral domain), hence q(x) - q_1(x) must be
0, that is, q(x) = q_1(x). This implies r(x) = r_1(x), completing the proof.
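
The existence half of this proof is an algorithm: repeatedly subtract (a_n/b_m) x^{n-m} b(x) until the degree drops below m. The following is a minimal sketch over F = Q, assuming a polynomial is stored as a list of coefficients with the constant term first and the zero polynomial is the empty list; the function name poly_divmod is illustrative only.

```python
from fractions import Fraction

def poly_divmod(a, b):
    """Divide a(x) by b(x) over Q; coefficient lists run from the constant term up.

    Returns (q, r) with a = q*b + r and either r == [] (zero) or deg r < deg b.
    """
    a = [Fraction(c) for c in a]
    b = [Fraction(c) for c in b]
    while b and b[-1] == 0:              # drop zero leading coefficients of b
        b.pop()
    if not b:
        raise ZeroDivisionError("b(x) must be nonzero")
    r = a[:]
    while r and r[-1] == 0:
        r.pop()
    q = [Fraction(0)] * max(len(r) - len(b) + 1, 0)
    while len(r) >= len(b):
        shift = len(r) - len(b)          # n - m
        coeff = r[-1] / b[-1]            # a_n / b_m, defined since b_m != 0 in a field
        q[shift] = coeff
        for i, c in enumerate(b):        # subtract coeff * x^(n-m) * b(x)
            r[i + shift] -= coeff * c
        while r and r[-1] == 0:          # the leading term has cancelled
            r.pop()
    return q, r

# (2x^3 + x - 1) divided by (2x + 1)
q, r = poly_divmod([-1, 1, 0, 2], [1, 2])
print(q, r)
```

Here the quotient is x^2 - (1/2)x + 3/4 and the remainder is -7/4; the fractional coefficients show why the coefficient ring needs to be a field in general.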

Corollary 4. If F is a field, then F[x] is a Principal Ideal Domain and a Unique 
Factorization Domain. 

Proof: This is immediate from the results of the last chapter. 

Recall also from Corollary 8 in Section 8.2 that if R is any commutative ring such
that R[x] is a Principal Ideal Domain (or Euclidean Domain) then R must be a field.
We shall see in the next section, however, that R[x] is a Unique Factorization Domain
whenever R itself is a Unique Factorization Domain.

Examples 

(1) By the above remarks the ring Z[x] is not a Principal Ideal Domain. As we have
already seen (Example 3 beginning of Section 7.4) the ideal (2, x) is not principal in
this ring.

(2) Q[x] is a Principal Ideal Domain since the coefficients lie in the field Q. The ideal
generated in Z[x] by 2 and x is not principal in the subring Z[x] of Q[x]. However, the
ideal generated in Q[x] is principal; in fact it is the entire ring (so has 1 as a generator)
since 2 is a unit in Q[x].

(3) If p is a prime, the ring Z/pZ[x] obtained by reducing Z[x] modulo the prime ideal
(p) is a Principal Ideal Domain, since the coefficients lie in the field Z/pZ. This
example shows that the quotient of a ring which is not a Principal Ideal Domain may
be a Principal Ideal Domain. To follow the ideal (2, x) above in this example, note
that if p = 2, then the ideal (2, x) reduces to the ideal (x) in the quotient Z/2Z[x],
which is a proper (maximal) ideal. If p ≠ 2, then 2 is a unit in the quotient, so the
ideal (2, x) reduces to the entire ring Z/pZ[x].

(4) Q[x, y], the ring of polynomials in two variables with rational coefficients, is not a
Principal Ideal Domain since this ring is Q[x][y] and Q[x] is not a field (any element
of positive degree is not invertible). It is an exercise to see that the ideal (x, y) is not a
principal ideal in this ring. We shall see shortly that Q[x, y] is a Unique Factorization
Domain.

We note that the quotient and remainder in the Division Algorithm applied to
a(x), b(x) ∈ F[x] are independent of field extensions in the following sense. Suppose
the field F is contained in the field E and a(x) = Q(x)b(x) + R(x) for some Q(x),
R(x) satisfying the conditions of Theorem 3 in E[x]. Write a(x) = q(x)b(x) + r(x) for
some q(x), r(x) ∈ F[x] and apply the uniqueness condition of Theorem 3 in the ring
E[x] to deduce that Q(x) = q(x) and R(x) = r(x). In particular, b(x) divides a(x)
in the ring E[x] if and only if b(x) divides a(x) in F[x]. Also, the greatest common
divisor of a(x) and b(x) (which can be obtained from the Euclidean Algorithm) is the
same, once we make it unique by specifying it to be monic, whether these elements are
viewed in F[x] or in E[x].
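
The monic greatest common divisor mentioned here can be computed by iterating the Division Algorithm. The following is a minimal sketch over Q, assuming coefficient lists with the constant term first and the empty list for the zero polynomial; the names trim, poly_rem and monic_gcd are illustrative only.

```python
from fractions import Fraction

def trim(p):
    """Copy of p with zero leading coefficients removed."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def poly_rem(a, b):
    """Remainder of a(x) on division by a nonzero b(x) over Q."""
    r, b = trim(a), trim(b)
    while len(r) >= len(b):
        shift, coeff = len(r) - len(b), r[-1] / b[-1]
        for i, c in enumerate(b):        # subtract coeff * x^shift * b(x)
            r[i + shift] -= coeff * c
        r = trim(r)
    return r

def monic_gcd(a, b):
    """Monic g.c.d. of a(x) and b(x) in Q[x] via the Euclidean Algorithm."""
    a = trim([Fraction(c) for c in a])
    b = trim([Fraction(c) for c in b])
    while b:
        a, b = b, poly_rem(a, b)         # last nonzero remainder is a g.c.d.
    return [c / a[-1] for c in a] if a else []

print(monic_gcd([-1, 0, 1], [2, -3, 1]))  # x^2 - 1 and x^2 - 3x + 2 share the factor x - 1
print(monic_gcd([2], [0, 1]))             # 2 and x generate all of Q[x]
```

The second call mirrors Example 2: in Q[x] the ideal generated by 2 and x is the whole ring, so the monic generator is 1. Every step stays inside Q, which illustrates why enlarging the field does not change the answer.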


EXERCISES 

Let F be a field and let x be an indeterminate over F. 

1. Let f(x) ∈ F[x] be a polynomial of degree n ≥ 1 and let bars denote passage to the
quotient F[x]/(f(x)). Prove that for each \overline{g(x)} there is a unique polynomial g_0(x) of
degree ≤ n - 1 such that \overline{g(x)} = \overline{g_0(x)} (equivalently, the elements
\overline{1}, \overline{x}, ..., \overline{x^{n-1}} are a basis of the vector space F[x]/(f(x)) over F; in
particular, the dimension of this space is n). [Use the Division Algorithm.]

2. Let F be a finite field of order q and let f(x) be a polynomial in F[x] of degree n ≥ 1.
Prove that F[x]/(f(x)) has q^n elements. [Use the preceding exercise.]

3. Let f(x) be a polynomial in F[x]. Prove that F[x]/(f(x)) is a field if and only if f(x) is
irreducible. [Use Proposition 7, Section 8.2.]

4. Let F be a finite field. Prove that F[x] contains infinitely many primes. (Note that over
an infinite field the polynomials of degree 1 are an infinite set of primes in the ring of
polynomials).

5. Exhibit all the ideals in the ring F[x]/(p(x)), where F is a field and p(x) is a polynomial
in F[x] (describe them in terms of the factorization of p(x)).

6. Describe (briefly) the ring structure of the following rings:

(a) Z[x]/(2), (b) Z[x]/(x), (c) Z[x]/(x^2), (d) Z[x, y]/(x^2, y^2, 2).

Show that a^2 = 0 or 1 for every a in the last ring and determine those elements with
a^2 = 0. Determine the characteristics of each of these rings (cf. Exercise 26, Section 7.3).

7. Determine all the ideals of the ring Z[x]/(2, x^3 + 1).

8. Determine the greatest common divisor of a(x) = x^3 - 2 and b(x) = x + 1 in Q[x] and
write it as a linear combination (in Q[x]) of a(x) and b(x).

9. Determine the greatest common divisor of a(x) = x^5 + 2x^3 + x^2 + x + 1 and the polynomial
b(x) = x^5 + x^4 + 2x^3 + 2x^2 + 2x + 1 in Q[x] and write it as a linear combination (in
Q[x]) of a(x) and b(x).

10. Determine the greatest common divisor of a(x) = x^3 + 4x^2 + x - 6 and b(x) = x^5 - 6x + 5
in Q[x] and write it as a linear combination (in Q[x]) of a(x) and b(x).

11. Suppose f(x) and g(x) are two nonzero polynomials in Q[x] with greatest common divisor
d(x).

(a) Given h(x) ∈ Q[x], show that there are polynomials a(x), b(x) ∈ Q[x] satisfying the
equation a(x)f(x) + b(x)g(x) = h(x) if and only if h(x) is divisible by d(x).

(b) If a_0(x), b_0(x) ∈ Q[x] are particular solutions to the equation in (a), show that the
full set of solutions to this equation is given by

a(x) = a_0(x) + m(x) g(x)/d(x)

b(x) = b_0(x) - m(x) f(x)/d(x)

as m(x) ranges over the polynomials in Q[x]. [cf. Exercise 4 in Section 8.1]

12. Let F[x, y_1, y_2, ...] be the polynomial ring in the infinite set of variables x, y_1, y_2, ...
over the field F, and let I be the ideal (x - y_1^2, y_1 - y_2^2, ..., y_i - y_{i+1}^2, ...) in this ring.
Define R to be the ring F[x, y_1, y_2, ...]/I, so that in R the square of each y_{i+1} is y_i and
y_1^2 = x modulo I, i.e., x has a 2^i-th root, for every i. Denote the image of y_i in R as x^{1/2^i}.
Let R_n be the subring of R generated by F and x^{1/2^n}.

(a) Prove that R_1 ⊆ R_2 ⊆ ... and that R is the union of all R_n, i.e., R = ∪_{n=1}^∞ R_n.

(b) Prove that R_n is isomorphic to a polynomial ring in one variable over F, so that R_n is
a P.I.D. Deduce that R is a Bezout Domain (cf. Exercise 7 in Section 8.2). [First show
that the ring S_n = F[x, y_1, ..., y_n]/(x - y_1^2, y_1 - y_2^2, ..., y_{n-1} - y_n^2) is isomorphic
to the polynomial ring F[y_n]. Then show any polynomial relation y_n satisfies in R_n
gives a corresponding relation in S_N for some N ≥ n.]

(c) Prove that the ideal generated by x, x^{1/2}, x^{1/4}, ... in R is not finitely generated (so
R is not a P.I.D.).

13. This exercise introduces a noncommutative ring which is a “right” Euclidean Domain (and
a “left” Principal Ideal Domain) but is not a “left” Euclidean Domain (and not a “right”
Principal Ideal Domain). Let F be a field of characteristic p in which not every element is
a p-th power: F ≠ F^p (for example the field F = F_p(t) of rational functions in the variable
t with coefficients in F_p is such a field). Let R = F{x} be the “twisted” polynomial ring
of polynomials Σ_{i=0}^{n} a_i x^i in x with coefficients in F with the usual (termwise) addition

Σ_{i=0}^{n} a_i x^i + Σ_{i=0}^{n} b_i x^i = Σ_{i=0}^{n} (a_i + b_i) x^i

but with a noncommutative multiplication defined by

(Σ_{i=0}^{n} a_i x^i)(Σ_{j=0}^{m} b_j x^j) = Σ_{k=0}^{n+m} (Σ_{i+j=k} a_i b_j^{p^i}) x^k.

This multiplication arises from defining xa = a^p x for every a ∈ F (so the powers of x
do not commute with the coefficients) and extending in a natural way. Let N be the norm
defined by taking the degree of a polynomial in R: N(f) = deg(f).

(a) Show that x^k a = a^{p^k} x^k for every a ∈ F and every integer k ≥ 0 and that R is a
ring with this definition of multiplication. [Use the fact that (a + b)^p = a^p + b^p for
every a, b ∈ F since F has characteristic p, so also (a + b)^{p^k} = a^{p^k} + b^{p^k} for every
a, b ∈ F.]

(b) Prove that the degree of a product of two elements of R is the sum of the degrees of
the elements. Prove that R has no zero divisors.

(c) Prove that R is “right Euclidean” with respect to N, i.e., for any polynomials f, g ∈ R
with g ≠ 0, there exist polynomials q and r in R with

f = qg + r with r = 0 or deg(r) < deg(g).

Use this to prove that every left ideal of R is principal.

(d) Let f = θx for some θ ∈ F, θ ∉ F^p and let g = x. Prove that there are no
polynomials q and r in R with

f = gq + r with r = 0 or deg(r) < deg(g),

so in particular R is not “left Euclidean” with respect to N. Prove that the right ideal
of R generated by x and θx is not principal. Conclude that R is not “left Euclidean”
with respect to any norm.


9.3 POLYNOMIAL RINGS THAT ARE UNIQUE 
FACTORIZATION DOMAINS 

We have seen in Proposition 1 that if R is an integral domain then R[x] is also an integral
domain. Also, such an R can be embedded in its field of fractions F (Theorem 15,
Section 7.5), so that R[x] ⊆ F[x] is a subring, and F[x] is a Euclidean Domain (hence
a Principal Ideal Domain and a Unique Factorization Domain). Many computations for
R[x] may be accomplished in F[x] at the expense of allowing fractional coefficients.
This raises the immediate question of how computations (such as factorizations of
polynomials) in F[x] can be used to give information in R[x].

For instance, suppose p(x) is a polynomial in R[x]. Since F[x] is a Unique
Factorization Domain we can factor p(x) uniquely into a product of irreducibles in
F[x]. It is natural to ask whether we can do the same in R[x], i.e., is R[x] a Unique
Factorization Domain? In general the answer is no because if R[x] were a Unique
Factorization Domain, the constant polynomials would have to be uniquely factored
into irreducible elements of R[x], necessarily of degree 0 since the degrees of products
add, that is, R would itself have to be a Unique Factorization Domain. Thus if R
is an integral domain which is not a Unique Factorization Domain, R[x] cannot be a
Unique Factorization Domain. On the other hand, it turns out that if R is a Unique
Factorization Domain, then R[x] is also a Unique Factorization Domain. The method
of proving this is to first factor uniquely in F[x] and then “clear denominators” to obtain
a unique factorization in R[x]. The first step in making this precise is to compare the
factorization of a polynomial in F[x] to a factorization in R[x].

Proposition 5. (Gauss’ Lemma) Let R be a Unique Factorization Domain with field of
fractions F and let p(x) ∈ R[x]. If p(x) is reducible in F[x] then p(x) is reducible
in R[x]. More precisely, if p(x) = A(x)B(x) for some nonconstant polynomials
A(x), B(x) ∈ F[x], then there are nonzero elements r, s ∈ F such that rA(x) = a(x)
and sB(x) = b(x) both lie in R[x] and p(x) = a(x)b(x) is a factorization in R[x].

Proof: The coefficients of the polynomials on the right hand side of the equation
p(x) = A(x)B(x) are elements in the field F, hence are quotients of elements from
the Unique Factorization Domain R. Multiplying through by a common denominator
for all these coefficients, we obtain an equation dp(x) = a'(x)b'(x) where now a'(x)
and b'(x) are elements of R[x] and d is a nonzero element of R. If d is a unit in R, the
proposition is true with a(x) = d^{-1} a'(x) and b(x) = b'(x). Assume d is not a unit and
write d as a product of irreducibles in R, say d = p_1 ... p_n. Since p_1 is irreducible in
R, the ideal (p_1) is prime (cf. Proposition 12, Section 8.3), so by Proposition 2 above,
the ideal p_1 R[x] is prime in R[x] and (R/p_1 R)[x] is an integral domain. Reducing the
equation dp(x) = a'(x)b'(x) modulo p_1, we obtain the equation 0 = \overline{a'(x)} \overline{b'(x)} in this
integral domain (the bars denote the images of these polynomials in the quotient ring),
hence one of the two factors, say \overline{a'(x)}, must be 0. But this means all the coefficients of
a'(x) are divisible by p_1, so that (1/p_1) a'(x) also has coefficients in R. In other words, in
the equation dp(x) = a'(x)b'(x) we can cancel a factor of p_1 from d (on the left) and
from either a'(x) or b'(x) (on the right) and still have an equation in R[x]. But now the
factor d on the left hand side has one fewer irreducible factors. Proceeding in the same
fashion with each of the remaining factors of d, we can cancel all of the factors of d into
the two polynomials on the right hand side, leaving an equation p(x) = a(x)b(x) with
a(x), b(x) ∈ R[x] and with a(x), b(x) being F-multiples of A(x), B(x), respectively.
This completes the proof.

Note that we cannot prove that a(x) and b(x) are necessarily R-multiples of A(x),
B(x), respectively, because, for example, we could factor x^2 in Q[x] with A(x) = 2x
and B(x) = (1/2)x but no integer multiples of A(x) and B(x) give a factorization of x^2 in
Z[x].
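
For R = Z the scalars r and s of Gauss’ Lemma can be produced directly from contents. The following is a minimal sketch, assuming the "content" of a nonzero polynomial over Q is defined as the g.c.d. of the numerators of its coefficients divided by the l.c.m. of their denominators (a standard convention, not notation from this section); the function names are illustrative only. It takes a different route from the proof, which cancels one irreducible factor of d at a time.

```python
from fractions import Fraction
from functools import reduce
from math import gcd

def content(poly):
    """Content of a nonzero polynomial over Q (assumed convention, see lead-in)."""
    coeffs = [Fraction(c) for c in poly if c != 0]
    num = reduce(gcd, (abs(c.numerator) for c in coeffs))
    den = reduce(lambda u, v: u * v // gcd(u, v), (c.denominator for c in coeffs))
    return Fraction(num, den)

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists, constant term first."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, s in enumerate(a):
        for j, t in enumerate(b):
            out[i + j] += Fraction(s) * Fraction(t)
    return out

def integral_factorization(A, B):
    """Given A, B over Q with A*B in Z[x], return (r, s, a, b) where
    a = r*A and b = s*B have integer coefficients and a*b = A*B."""
    cA, cB = content(A), content(B)
    assert (cA * cB).denominator == 1    # content is multiplicative, and A*B is integral
    r, s = cB, 1 / cB                    # r*s = 1, so the product is unchanged
    a = [r * Fraction(c) for c in A]
    b = [s * Fraction(c) for c in B]
    return r, s, a, b

# The example from the text: x^2 = (2x)((1/2)x)
r, s, a, b = integral_factorization([0, 2], [0, Fraction(1, 2)])
print(r, s, a, b)
```

For the factorization x^2 = (2x)((1/2)x) the sketch returns r = 1/2 and s = 2, which are elements of Q but not both integers, in line with the remark above.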

The elements of the ring R become units in the Unique Factorization Domain
F[x] (the units in F[x] being the nonzero elements of F). For example, 7x factors
in Z[x] into a product of two irreducibles: 7 and x (so 7x is not irreducible in Z[x]),
whereas 7x is the unit 7 times the irreducible x in Q[x] (so 7x is irreducible in Q[x]).
The following corollary shows that this is essentially the only difference between the
irreducible elements in R[x] and those in F[x].

Corollary 6. Let R be a Unique Factorization Domain, let F be its field of fractions and
let p(x) ∈ R[x]. Suppose the greatest common divisor of the coefficients of p(x) is 1.
Then p(x) is irreducible in R[x] if and only if it is irreducible in F[x]. In particular, if
p(x) is a monic polynomial that is irreducible in R[x], then p(x) is irreducible in F[x].

Proof: By Gauss’ Lemma above, if p(x) is reducible in F[x], then it is reducible
in R[x]. Conversely, the assumption on the greatest common divisor of the coefficients
of p(x) implies that if it is reducible in R[x], then p(x) = a(x)b(x) where neither a(x)
nor b(x) are constant polynomials in R[x]. This same factorization shows that p(x) is
reducible in F[x], completing the proof.

Theorem 7. R is a Unique Factorization Domain if and only if R[x] is a Unique
Factorization Domain.

Proof: We have indicated above that R[x] a Unique Factorization Domain forces R
to be a Unique Factorization Domain. Suppose conversely that R is a Unique Factorization
Domain, F is its field of fractions and p(x) is a nonzero element of R[x]. Let d be
the greatest common divisor of the coefficients of p(x), so that p(x) = dp'(x), where
the g.c.d. of the coefficients of p'(x) is 1. Such a factorization of p(x) is unique up to a
change in d (so up to a unit in R), and since d can be factored uniquely into irreducibles
in R (and these are also irreducibles in the larger ring R[x]), it suffices to prove that
p'(x) can be factored uniquely into irreducibles in R[x]. Thus we may assume that the
greatest common divisor of the coefficients of p(x) is 1. We may further assume p(x)
is not a unit in R[x], i.e., degree p(x) > 0.

Since F[x] is a Unique Factorization Domain, p(x) can be factored uniquely into
irreducibles in F[x]. By Gauss’ Lemma, such a factorization implies there is a factorization
of p(x) in R[x] whose factors are F-multiples of the factors in F[x]. Since the
greatest common divisor of the coefficients of p(x) is 1, the g.c.d. of the coefficients in
each of these factors in R[x] must be 1. By Corollary 6, each of these factors is an
irreducible in R[x]. This shows that p(x) can be written as a finite product of irreducibles
in R[x].

The uniqueness of the factorization of p(x) follows from the uniqueness in F[x].
Suppose

p(x) = q_1(x) ... q_r(x) = q'_1(x) ... q'_s(x)

are two factorizations of p(x) into irreducibles in R[x]. Since the g.c.d. of the
coefficients of p(x) is 1, the same is true for each of the irreducible factors above;
in particular, each has positive degree. By Corollary 6, each q_i(x) and q'_j(x) is an
irreducible in F[x]. By unique factorization in F[x], r = s and, possibly after
rearrangement, q_i(x) and q'_i(x) are associates in F[x] for all i ∈ {1, ..., r}. It remains to
show they are associates in R[x]. Since the units of F[x] are precisely the elements of
F^× we need to consider when q(x) = (a/b) q'(x) for some q(x), q'(x) ∈ R[x] and nonzero
elements a, b of R, where the greatest common divisor of the coefficients of each of
q(x) and q'(x) is 1. In this case bq(x) = aq'(x); the g.c.d. of the coefficients on the left
hand side is b and on the right hand side is a. Since in a Unique Factorization Domain
the g.c.d. of the coefficients of a nonzero polynomial is unique up to units, a = ub for
some unit u in R. Thus q(x) = uq'(x) and so q(x) and q'(x) are associates in R as
well. This completes the proof.

Corollary 8. If R is a Unique Factorization Domain, then a polynomial ring in an 
arbitrary number of variables with coefficients in R is also a Unique Factorization 
Domain. 

Proof: For finitely many variables, this follows by induction from Theorem 7, since 
a polynomial ring in n variables can be considered as a polynomial ring in one variable 
with coefficients in a polynomial ring in n — 1 variables. The general case follows from 
the definition of a polynomial ring in an arbitrary number of variables as the union of 
polynomial rings in finitely many variables. 

Examples 

(1) Z[x], Z[x, y], etc. are Unique Factorization Domains. The ring Z[x] gives an example
of a Unique Factorization Domain that is not a Principal Ideal Domain.

(2) Similarly, Q[x], Q[x, y], etc. are Unique Factorization Domains.

We saw earlier that if R is a Unique Factorization Domain with field of fractions
F and p(x) ∈ R[x], then we can factor out the greatest common divisor d of the
coefficients of p(x) to obtain p(x) = dp'(x), where p'(x) is irreducible in both R[x]
and F[x]. Suppose now that R is an arbitrary integral domain with field of fractions F.
In R the notion of greatest common divisor may not make sense, however one might
still ask if, say, a monic polynomial which is irreducible in R[x] is still irreducible in
F[x] (i.e., whether the last statement in Corollary 6 is true).

Note first that if a monic polynomial p(x) is reducible, it must have a factorization
p(x) = a(x)b(x) in R[x] with both a(x) and b(x) monic, nonconstant polynomials
(recall that the leading term of p(x) is the product of the leading terms of the factors, so
the leading coefficients of both a(x) and b(x) are units, and we can thus arrange these to
be 1). In other words, a nonconstant monic polynomial p(x) is irreducible if and only
if it cannot be factored as a product of two monic polynomials of smaller degree.

We now see that it is not true that if R is an arbitrary integral domain and p(x) is a
monic irreducible polynomial in R[x], then p(x) is irreducible in F[x]. For example,
let R = Z[2i] = {a + 2bi | a, b ∈ Z} (a subring of the complex numbers) and let
p(x) = x^2 + 1. Then the fraction field of R is F = {a + bi | a, b ∈ Q}. The polynomial
p(x) factors uniquely into a product of two linear factors in F[x]: x^2 + 1 = (x - i)(x + i)
so in particular, p(x) is reducible in F[x]. Neither of these factors lies in R[x] (because
i ∉ R) so p(x) is irreducible in R[x]. In particular, by Corollary 6, Z[2i] is not a
Unique Factorization Domain.


EXERCISES 

1. Let R be an integral domain with quotient field F and let p(x) be a monic polynomial in
R[x]. Assume that p(x) = a(x)b(x) where a(x) and b(x) are monic polynomials in F[x]
of smaller degree than p(x). Prove that if a(x) ∉ R[x] then R is not a Unique Factorization
Domain. Deduce that Z[2√2] is not a U.F.D.

2. Prove that if f(x) and g(x) are polynomials with rational coefficients whose product
f(x)g(x) has integer coefficients, then the product of any coefficient of g(x) with any
coefficient of f(x) is an integer.

3. Let F be a field. Prove that the set R of polynomials in F[x] whose coefficient of x is
equal to 0 is a subring of F[x] and that R is not a U.F.D. [Show that x^6 = (x^2)^3 = (x^3)^2
gives two distinct factorizations of x^6 into irreducibles.]

4. Let R = Z + xQ[x] ⊂ Q[x] be the set of polynomials in x with rational coefficients whose
constant term is an integer.

(a) Prove that R is an integral domain and its units are ±1.

(b) Show that the irreducibles in R are ±p where p is a prime in Z and the polynomials
f(x) that are irreducible in Q[x] and have constant term ±1. Prove that these
irreducibles are prime in R.

(c) Show that x cannot be written as the product of irreducibles in R (in particular, x is
not irreducible) and conclude that R is not a U.F.D.

(d) Show that x is not a prime in R and describe the quotient ring R/(x).

5. Let R = Z + xQ[x] ⊂ Q[x] be the ring considered in the previous exercise.

(a) Suppose that f(x), g(x) ∈ Q[x] are two nonzero polynomials with rational coefficients
and that x^r is the largest power of x dividing both f(x) and g(x) in Q[x] (i.e.,
r is the degree of the lowest order term appearing in either f(x) or g(x)). Let f_r and
g_r be the coefficients of x^r in f(x) and g(x), respectively (one of which is nonzero
by definition of r). Then Z f_r + Z g_r = Z d_r for some nonzero d_r ∈ Q (cf. Exercise 14
in Section 2.4). Prove that there is a polynomial d(x) ∈ Q[x] that is a g.c.d. of f(x)
and g(x) in Q[x] and whose term of minimal degree is d_r x^r.

(b) Prove that f(x) = d(x)q_1(x) and g(x) = d(x)q_2(x) where q_1(x) and q_2(x) are
elements of the subring R of Q[x].

(c) Prove that d(x) = a(x)f(x) + b(x)g(x) for polynomials a(x), b(x) in R. [The
existence of a(x), b(x) in the Euclidean Domain Q[x] is immediate. Use Exercise 11
in Section 2 to show that a(x) and b(x) can be chosen to lie in R.]

(d) Conclude from (a) and (b) that Rf(x) + Rg(x) = Rd(x) in Q[x] and use this to prove
that R is a Bezout Domain (cf. Exercise 7 in Section 8.2).

(e) Show that (d), the results of the previous exercise, and Exercise 11 of Section 8.3
imply that R must contain ideals that are not principal (hence not finitely generated).
Prove that in fact I = xQ[x] is an ideal of R that is not finitely generated.


9.4 IRREDUCIBILITY CRITERIA 

If R is a Unique Factorization Domain, then by Corollary 8 a polynomial ring in any
number of variables with coefficients in R is also a Unique Factorization Domain. It
is of interest then to determine the irreducible elements in such a polynomial ring,
particularly in the ring R[x]. In the one-variable case, a nonconstant monic polynomial
is irreducible in R[x] if it cannot be factored as the product of two other polynomials of
smaller degrees. Determining whether a polynomial has factors is frequently difficult to
check, particularly for polynomials of large degree in several variables. The purpose of
irreducibility criteria is to give an easier mechanism for determining when some types
of polynomials are irreducible.

For the most part we restrict attention to polynomials in one variable where the 
coefficient ring is a Unique Factorization Domain. By Gauss’ Lemma it suffices to 
consider factorizations in F[x] where F is the field of fractions of R (although we 
shall occasionally consider questions of irreducibility when the coefficient ring is just 
an integral domain). The next proposition considers when there is a factor of degree 
one (a linear factor). 


Proposition 9. Let F be a field and let p(x) ∈ F[x]. Then p(x) has a factor of degree
one if and only if p(x) has a root in F, i.e., there is an α ∈ F with p(α) = 0.

Proof: If p(x) has a factor of degree one, then since F is a field, we may assume
the factor is monic, i.e., is of the form (x - α) for some α ∈ F. But then p(α) = 0.
Conversely, suppose p(α) = 0. By the Division Algorithm in F[x] we may write

p(x) = q(x)(x - α) + r

where r is a constant. Since p(α) = 0, r must be 0, hence p(x) has (x - α) as a factor.

Proposition 9 gives a criterion for irreducibility for polynomials of small degree:

Proposition 10. A polynomial of degree two or three over a field F is reducible if and
only if it has a root in F.

Proof: This follows immediately from the previous proposition, since a polynomial 
of degree two or three is reducible if and only if it has at least one linear factor. 

The next result limits the possibilities for roots of polynomials with integer coefficients
(it is stated for Z[x] for convenience although it clearly generalizes to R[x],
where R is any Unique Factorization Domain).

Proposition 11. Let p(x) = a_n x^n + a_{n-1} x^{n-1} + ... + a_0 be a polynomial of degree
n with integer coefficients. If r/s ∈ Q is in lowest terms (i.e., r and s are relatively
prime integers) and r/s is a root of p(x), then r divides the constant term and s divides
the leading coefficient of p(x): r | a_0 and s | a_n. In particular, if p(x) is a monic
polynomial with integer coefficients and p(d) ≠ 0 for all integers d dividing the constant
term of p(x), then p(x) has no roots in Q.

Proof: By hypothesis, p(r/s) = 0 = a_n (r/s)^n + a_{n-1} (r/s)^{n-1} + ... + a_0. Multiplying
through by s^n gives

0 = a_n r^n + a_{n-1} r^{n-1} s + ... + a_0 s^n.

Thus a_n r^n = s(-a_{n-1} r^{n-1} - ... - a_0 s^{n-1}), so s divides a_n r^n. By assumption, s is
relatively prime to r and it follows that s | a_n. Similarly, solving the equation for a_0 s^n
shows that r | a_0. The last assertion of the proposition follows from the previous ones.
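
Proposition 11 reduces the search for rational roots to a finite list of candidates ±r/s with r | a_0 and s | a_n. The following is a minimal sketch, assuming integer coefficients listed with the constant term first and both a_0 and a_n nonzero; the function names divisors and rational_roots are illustrative only.

```python
from fractions import Fraction

def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]

def rational_roots(coeffs):
    """Rational roots of a_0 + a_1 x + ... + a_n x^n with integer coefficients.

    coeffs = [a_0, a_1, ..., a_n]; a_0 and a_n are assumed nonzero.
    """
    a0, an = coeffs[0], coeffs[-1]
    # Candidates allowed by Proposition 11: r divides a_0, s divides a_n.
    candidates = {Fraction(sign * r, s)
                  for r in divisors(a0) for s in divisors(an) for sign in (1, -1)}

    def value(x):
        return sum(c * x**i for i, c in enumerate(coeffs))

    return sorted(x for x in candidates if value(x) == 0)

print(rational_roots([-1, -3, 0, 1]))  # x^3 - 3x - 1
print(rational_roots([1, -3, 2]))      # 2x^2 - 3x + 1 = (2x - 1)(x - 1)
```

The first call finds no rational roots, so by Proposition 10 the cubic x^3 - 3x - 1 is irreducible over Q; the second finds the roots 1/2 and 1.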

Examples 

(1) The polynomial x^3 - 3x - 1 is irreducible in Z[x]. To prove this, by Gauss’ Lemma
and Proposition 10 it suffices to show it has no rational roots. By Proposition 11 the
only candidates for rational roots are integers which divide the constant term 1, namely
±1. Substituting both 1 and -1 into the polynomial shows that these are not roots.

(2) For p any prime the polynomials x^2 - p and x^3 - p are irreducible in Q[x]. This is
because they have degrees ≤ 3 so it suffices to show they have no rational roots. By
Proposition 11 the only candidates for roots are ±1 and ±p, but none of these give 0
when they are substituted into the polynomial.

(3) The polynomial x^2 + 1 is reducible in Z/2Z[x] since it has 1 as a root, and it factors
as (x + 1)^2.

(4) The polynomial x^2 + x + 1 is irreducible in Z/2Z[x] since it does not have a root in
Z/2Z: 0^2 + 0 + 1 = 1 and 1^2 + 1 + 1 = 1.

(5) Similarly, the polynomial x^3 + x + 1 is irreducible in Z/2Z[x].
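
Over a finite field Z/pZ the root test of Proposition 10 is a finite check, since every element of the field can be substituted in turn. The following is a minimal sketch, assuming p is prime and coefficients are listed with the constant term first; the function names are illustrative only.

```python
def roots_mod_p(coeffs, p):
    """Roots in Z/pZ of a_0 + a_1 x + ... + a_n x^n, with coeffs = [a_0, ..., a_n]."""
    return [x for x in range(p)
            if sum(c * pow(x, i, p) for i, c in enumerate(coeffs)) % p == 0]

def is_irreducible_low_degree(coeffs, p):
    """Irreducibility over Z/pZ, valid only in degree 2 or 3 (Proposition 10)."""
    degree = len(coeffs) - 1
    if degree not in (2, 3) or coeffs[-1] % p == 0:
        raise ValueError("test applies only to polynomials of degree 2 or 3")
    return not roots_mod_p(coeffs, p)

print(roots_mod_p([1, 0, 1], 2))                   # x^2 + 1 has the root 1 mod 2
print(is_irreducible_low_degree([1, 1, 1], 2))     # x^2 + x + 1 over Z/2Z
print(is_irreducible_low_degree([1, 1, 0, 1], 2))  # x^3 + x + 1 over Z/2Z
```

The degree restriction is enforced deliberately: in degree 4 or higher a polynomial may have no roots and still be reducible.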

This technique is limited to polynomials of low degree because it relies on the
presence of a factor of degree one. A polynomial of degree 4, for example, may be
the product of two irreducible quadratics, hence be reducible but have no linear factor.
One fairly general technique for checking irreducibility uses Proposition 2 above and
consists of reducing the coefficients modulo some ideal.

Proposition 12. Let I be a proper ideal in the integral domain R and let p(x) be a
nonconstant monic polynomial in R[x]. If the image of p(x) in (R/I)[x] cannot be
factored in (R/I)[x] into two polynomials of smaller degree, then p(x) is irreducible
in R[x].

Proof: Suppose p(x) cannot be factored in (R/I)[x] but that p(x) is reducible
in R[x]. As noted at the end of the preceding section this means there are monic,
nonconstant polynomials a(x) and b(x) in R[x] such that p(x) = a(x)b(x). By
Proposition 2, reducing the coefficients modulo I gives a factorization in (R/I)[x]
with nonconstant factors, a contradiction.

This proposition indicates that if it is possible to find a proper ideal I such that
the reduced polynomial cannot be factored, then the polynomial is itself irreducible.
Unfortunately, there are examples of polynomials even in Z[x] which are irreducible
but whose reductions modulo every ideal are reducible (so their irreducibility is not
detectable by this technique). For example, the polynomial x^4 + 1 is irreducible in
Z[x] but is reducible modulo every prime (we shall verify this in Chapter 14) and the
polynomial x^4 - 72x^2 + 4 is irreducible in Z[x] but is reducible modulo every integer.
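
For small primes the behaviour of x^4 + 1 can be observed by exhaustive search. The following is a minimal brute-force sketch, assuming p is prime and the input polynomial is monic with coefficients listed constant term first; it tries every monic polynomial of degree at most half the degree as a divisor, which is feasible only for small p and small degree. The function names are illustrative only, and a finite search of this kind is evidence for particular primes, not a proof for all primes.

```python
from itertools import product

def poly_rem_mod(a, b, p):
    """Remainder of a(x) on division by a monic b(x) over Z/pZ."""
    r = [c % p for c in a]
    while len(r) >= len(b):
        lead, shift = r[-1], len(r) - len(b)
        for i, c in enumerate(b):
            r[i + shift] = (r[i + shift] - lead * c) % p
        r.pop()                       # the leading coefficient is now 0
    return r

def find_factor_mod_p(coeffs, p):
    """Return a monic factor of degree between 1 and deg/2 over Z/pZ, or None."""
    n = len(coeffs) - 1
    for d in range(1, n // 2 + 1):
        for tail in product(range(p), repeat=d):
            b = list(tail) + [1]      # monic polynomial of degree d
            if not any(poly_rem_mod(coeffs, b, p)):
                return b
    return None

x4_plus_1 = [1, 0, 0, 0, 1]
for p in (2, 3, 5, 7, 11, 13):
    print(p, find_factor_mod_p(x4_plus_1, p))
```

Each prime in the loop yields a proper monic factor, so Proposition 12 cannot be applied to x^4 + 1 with any of these primes.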

Examples 

(1) Consider the polynomial p(x) = x^2 + x + 1 in Z[x]. Reducing modulo 2, we see from
Example 4 above that p(x) is irreducible in Z[x]. Similarly, x^3 + x + 1 is irreducible
in Z[x] because it is irreducible in Z/2Z[x].

(2) The polynomial x^2 + 1 is irreducible in Z[x] since it is irreducible in Z/3Z[x] (no
root in Z/3Z), but is reducible mod 2. This shows that the converse to Proposition 12
does not hold.

(3) The idea of reducing modulo an ideal to determine irreducibility can be used also
in several variables, but some care must be exercised. For example, the polynomial
x^2 + xy + 1 in Z[x, y] is irreducible since modulo the ideal (y) it is x^2 + 1 in Z[x],
which is irreducible and of the same degree. In this sort of argument it is necessary to
be careful about “collapsing.” For example, the polynomial xy + x + y + 1 (which
is (x + 1)(y + 1)) is reducible, but appears irreducible modulo both (x) and (y). The
reason for this is that nonunit polynomials in Z[x, y] can reduce to units in the quotient.
To take account of this it is necessary to determine which elements in the original ring
become units in the quotient. The elements in Z[x, y] which are units modulo (y), for
example, are the polynomials in Z[x, y] with constant term ±1 and all nonconstant
terms divisible by y. The fact that x^2 + xy + 1 and its reduction mod (y) have the
same degree therefore eliminates the possibility of a factor which is a unit modulo (y),
but not a unit in Z[x, y] and gives the irreducibility of this polynomial.

A special case of reducing modulo an ideal to test for irreducibility which is frequently
useful is known as Eisenstein’s Criterion (although originally proved earlier by
Schönemann, so more properly known as the Eisenstein-Schönemann Criterion):

Proposition 13. (Eisenstein’s Criterion) Let P be a prime ideal of the integral domain
R and let f(x) = x^n + a_{n-1} x^{n-1} + ... + a_1 x + a_0 be a polynomial in R[x] (here n ≥ 1).
Suppose a_{n-1}, ..., a_1, a_0 are all elements of P and suppose a_0 is not an element of P^2.
Then f(x) is irreducible in R[x].

Proof: Suppose f(x) were reducible, say f(x) = a(x)b(x) in R[x], where a(x)
and b(x) are nonconstant polynomials. Reducing this equation modulo P and using
the assumptions on the coefficients of f(x) we obtain the equation x^n = \overline{a(x)} \overline{b(x)} in
(R/P)[x], where the bar denotes the polynomials with coefficients reduced mod P.
Since P is a prime ideal, R/P is an integral domain, and it follows that both \overline{a(x)} and
\overline{b(x)} have 0 constant term, i.e., the constant terms of both a(x) and b(x) are elements
of P. But then the constant term a_0 of f(x) as the product of these two would be an
element of P^2, a contradiction.

Eisenstein’s Criterion is most frequently applied to Z[x] so we state the result 
explicitly for this case: 

Corollary 14. (Eisenstein’s Criterion for $\mathbb{Z}[x]$) Let $p$ be a prime in $\mathbb{Z}$ and let
$f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0 \in \mathbb{Z}[x]$, $n \geq 1$. Suppose $p$ divides $a_i$
for all $i \in \{0, 1, \ldots, n-1\}$ but that $p^2$ does not divide $a_0$. Then $f(x)$ is irreducible in
both $\mathbb{Z}[x]$ and $\mathbb{Q}[x]$.

Proof: This is simply a restatement of Proposition 13 in the case of the prime ideal
$(p)$ in $\mathbb{Z}$ together with Corollary 6.


Examples 

(1) The polynomial $x^4 + 10x + 5$ in $\mathbb{Z}[x]$ is irreducible by Eisenstein’s Criterion applied
for the prime 5.

(2) If $a$ is any integer which is divisible by some prime $p$ but not divisible by $p^2$, then $x^n - a$
is irreducible in $\mathbb{Z}[x]$ by Eisenstein’s Criterion. In particular, $x^n - p$ is irreducible for
all positive integers $n$ and so for $n \geq 2$ the $n$-th roots of $p$ are not rational numbers (i.e.,
this polynomial has no root in $\mathbb{Q}$).

(3) Consider the polynomial $f(x) = x^4 + 1$ mentioned previously. Eisenstein’s Criterion
does not apply directly to $f(x)$. The polynomial $g(x) = f(x + 1)$ is $(x + 1)^4 + 1$, i.e.,
$x^4 + 4x^3 + 6x^2 + 4x + 2$, and Eisenstein’s Criterion for the prime 2 shows that this
polynomial is irreducible. It follows then that $f(x)$ must also be irreducible, since any
factorization for $f(x)$ would provide a factorization for $g(x)$ (just replace $x$ by $x + 1$
in each of the factors). This example shows that Eisenstein’s Criterion can sometimes
be used to verify the irreducibility of a polynomial to which it does not immediately
apply.

(4) As another example of this, let $p$ be a prime and consider the polynomial

$$\Phi_p(x) = \frac{x^p - 1}{x - 1} = x^{p-1} + x^{p-2} + \cdots + x + 1,$$

an example of a cyclotomic polynomial which we shall consider more thoroughly in
Part IV. Again, Eisenstein’s Criterion does not immediately apply, but it does apply
for the prime $p$ to the polynomial

$$\Phi_p(x + 1) = \frac{(x + 1)^p - 1}{x} = x^{p-1} + p x^{p-2} + \cdots + \frac{p(p-1)}{2} x + p \in \mathbb{Z}[x]$$

since all the coefficients except the first are divisible by $p$ by the Binomial Theorem.
As before, this shows $\Phi_p(x)$ is irreducible in $\mathbb{Z}[x]$.

(5) As an example of the use of the more general Eisenstein’s Criterion in Proposition 13
we mimic Example 2 above. Let $R = \mathbb{Q}[x]$ and let $n$ be any positive integer. Consider
the polynomial $X^n - x$ in the ring $R[X]$. The ideal $(x)$ is prime in the coefficient
ring $R$ since $R/(x) = \mathbb{Q}[x]/(x)$ is the integral domain $\mathbb{Q}$. Eisenstein’s Criterion for
the ideal $(x)$ of $R$ applies directly to show that $X^n - x$ is irreducible in $R[X]$. Note
that this construction works with $\mathbb{Q}$ replaced by any field or, indeed, by any integral
domain.
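
The hypotheses of Corollary 14 are simple divisibility conditions, so Examples (1) and (3) can be checked mechanically. The sketch below is illustrative only: the helper names `eisenstein_applies` and `shift` are hypothetical, coefficients are listed from the constant term upward, and the code assumes (without checking) that $p$ is prime.

```python
def eisenstein_applies(coeffs, p):
    """Check the hypotheses of Eisenstein's Criterion for a monic integer polynomial.

    coeffs = [a_0, a_1, ..., a_{n-1}, 1], lowest degree first; p is assumed prime.
    """
    if len(coeffs) < 2 or coeffs[-1] != 1:
        raise ValueError("expected a monic polynomial of degree at least 1")
    lower = coeffs[:-1]
    # p must divide every non-leading coefficient, and p^2 must not divide a_0
    return all(a % p == 0 for a in lower) and lower[0] % (p * p) != 0


def shift(coeffs, c):
    """Return the coefficients of f(x + c), lowest degree first (Horner's rule)."""
    result = []
    for a in reversed(coeffs):
        # result <- result * (x + c) + a
        new = [0] * (len(result) + 1)
        for i, r in enumerate(result):
            new[i] += r * c
            new[i + 1] += r
        new[0] += a
        result = new
    return result


# Example (1): x^4 + 10x + 5 with the prime 5
print(eisenstein_applies([5, 10, 0, 0, 1], 5))
# Example (3): x^4 + 1 fails directly at 2, but f(x + 1) satisfies the criterion
f = [1, 0, 0, 0, 1]
print(eisenstein_applies(f, 2))
print(shift(f, 1), eisenstein_applies(shift(f, 1), 2))
```

A result of `False` only means the criterion does not apply for that prime; it says nothing about reducibility.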

There are now efficient algorithms for factoring polynomials over certain fields.
For polynomials with integer coefficients these algorithms have been implemented in a
number of computer packages. An efficient algorithm for factoring polynomials over
$\mathbb{F}_p$, called the Berlekamp Algorithm, is described in detail in the exercises at the end of
Section 14.3.


EXERCISES 


1. Determine whether the following polynomials are irreducible in the rings indicated. For
those that are reducible, determine their factorization into irreducibles. The notation $\mathbb{F}_p$
denotes the finite field $\mathbb{Z}/p\mathbb{Z}$, $p$ a prime.

(a) $x^2 + x + 1$ in $\mathbb{F}_2[x]$.

(b) $x^3 + x + 1$ in $\mathbb{F}_3[x]$.

(c) $x^4 + 1$ in $\mathbb{F}_5[x]$.

(d) $x^4 + 10x^2 + 1$ in $\mathbb{Z}[x]$.


2. Prove that the following polynomials are irreducible in $\mathbb{Z}[x]$:

(a) $x^4 - 4x^3 + 6$

(b) $x^6 + 30x^5 - 15x^3 + 6x - 120$

(c) $x^4 + 4x^3 + 6x^2 + 2x + 1$ [Substitute $x - 1$ for $x$.]

(d) $\dfrac{(x + 2)^p - 2^p}{x}$, where $p$ is an odd prime.

3. Show that the polynomial $(x - 1)(x - 2) \cdots (x - n) - 1$ is irreducible over $\mathbb{Z}$ for all $n \geq 1$.
[If the polynomial factors consider the values of the factors at $x = 1, 2, \ldots, n$.]

4. Show that the polynomial $(x - 1)(x - 2) \cdots (x - n) + 1$ is irreducible over $\mathbb{Z}$ for all $n \geq 1$,
$n \neq 4$.


5. Find all the monic irreducible polynomials of degree $\leq 3$ in $\mathbb{F}_2[x]$, and the same in $\mathbb{F}_3[x]$.

6. Construct fields of each of the following orders: (a) 9, (b) 49, (c) 8, (d) 81 (you
may exhibit these as $F[x]/(f(x))$ for some $F$ and $f$). [Use Exercises 2 and 3 in Section 2.]

7. Prove that $\mathbb{R}[x]/(x^2 + 1)$ is a field which is isomorphic to the complex numbers.

8. Prove that $K_1 = \mathbb{F}_{11}[x]/(x^2 + 1)$ and $K_2 = \mathbb{F}_{11}[y]/(y^2 + 2y + 2)$ are both fields with 121
elements. Prove that the map which sends the element $p(x)$ of $K_1$ to the element $p(y + 1)$
of $K_2$ (where $p$ is any polynomial with coefficients in $\mathbb{F}_{11}$) is well defined and gives a ring
(hence field) isomorphism from $K_1$ to $K_2$.

9. Prove that the polynomial $x^2 - \sqrt{2}$ is irreducible over $\mathbb{Z}[\sqrt{2}]$ (you may use the fact that
$\mathbb{Z}[\sqrt{2}]$ is a U.F.D.; cf. Exercise 9 of Section 8.1).

10. Prove that the polynomial $p(x) = x^4 - 4x^2 + 8x + 2$ is irreducible over the quadratic
field $F = \mathbb{Q}(\sqrt{-2}) = \{a + b\sqrt{-2} \mid a, b \in \mathbb{Q}\}$. [First use the method of Proposition 11
for the Unique Factorization Domain $\mathbb{Z}[\sqrt{-2}]$ (cf. Exercise 8, Section 8.1) to show that if
$\alpha \in \mathbb{Z}[\sqrt{-2}]$ is a root of $p(x)$ then $\alpha$ is a divisor of 2 in $\mathbb{Z}[\sqrt{-2}]$. Conclude that $\alpha$ must
be $\pm 1$, $\pm\sqrt{-2}$ or $\pm 2$, and hence show $p(x)$ has no linear factor over $F$. Show similarly
that $p(x)$ is not the product of two quadratics with coefficients in $F$.]

11. Prove that $x^2 + y^2 - 1$ is irreducible in $\mathbb{Q}[x, y]$.

12. Prove that $x^{n-1} + x^{n-2} + \cdots + x + 1$ is irreducible over $\mathbb{Z}$ if and only if $n$ is a prime.

13. Prove that $x^3 + nx + 2$ is irreducible over $\mathbb{Z}$ for all integers $n \neq 1, -3, -5$.

14. Factor each of the two polynomials $x^8 - 1$ and $x^6 - 1$ into irreducibles over each of the
following rings: (a) $\mathbb{Z}$, (b) $\mathbb{Z}/2\mathbb{Z}$, (c) $\mathbb{Z}/3\mathbb{Z}$.

15. Prove that if $F$ is a field then the polynomial $X^n - x$, which has coefficients in the ring
$F[[x]]$ of formal power series (cf. Exercise 3 of Section 7.2), is irreducible over $F[[x]]$.
[Recall that $F[[x]]$ is a Euclidean Domain; cf. Exercise 5, Section 7.2 and Example 4,
Section 8.1.]

16. Let $F$ be a field and let $f(x)$ be a polynomial of degree $n$ in $F[x]$. The polynomial
$g(x) = x^n f(1/x)$ is called the reverse of $f(x)$.

(a) Describe the coefficients of $g$ in terms of the coefficients of $f$.

(b) Prove that $f$ is irreducible if and only if $g$ is irreducible.

17. Prove the following variant of Eisenstein’s Criterion: let $P$ be a prime ideal in the Unique
Factorization Domain $R$ and let $f(x) = a_n x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$ be a polynomial
in $R[x]$, $n \geq 1$. Suppose $a_n \notin P$, $a_{n-1}, \ldots, a_0 \in P$ and $a_0 \notin P^2$. Prove that $f(x)$ is
irreducible in $F[x]$, where $F$ is the quotient field of $R$.

18. Show that $6x^5 + 14x^3 - 21x + 35$ and $18x^5 - 30x^2 + 120x + 360$ are irreducible in $\mathbb{Q}[x]$.

19. Let $F$ be a field and let $f(x) = a_n x^n + a_{n-1}x^{n-1} + \cdots + a_0 \in F[x]$. The derivative,
$D_x(f(x))$, of $f(x)$ is defined by

$$D_x(f(x)) = n a_n x^{n-1} + (n-1) a_{n-1} x^{n-2} + \cdots + a_1$$

where, as usual, $na = a + a + \cdots + a$ ($n$ times). Note that $D_x(f(x))$ is again a polynomial
with coefficients in $F$.

The polynomial $f(x)$ is said to have a multiple root if there is some field $E$ containing $F$
and some $\alpha \in E$ such that $(x - \alpha)^2$ divides $f(x)$ in $E[x]$. For example, the polynomial
$f(x) = (x - 1)^2(x - 2) \in \mathbb{Q}[x]$ has $\alpha = 1$ as a multiple root and the polynomial
$f(x) = x^4 + 2x^2 + 1 = (x^2 + 1)^2 \in \mathbb{R}[x]$ has $\alpha = \pm i \in \mathbb{C}$ as multiple roots. We shall
prove in Section 13.5 that a nonconstant polynomial $f(x)$ has a multiple root if and only
if $f(x)$ is not relatively prime to its derivative (which can be detected by the Euclidean
Algorithm in $F[x]$). Use this criterion to determine whether the following polynomials
have multiple roots:

(a) $x^3 - 3x - 2 \in \mathbb{Q}[x]$

(b) $x^3 + 3x + 2 \in \mathbb{Q}[x]$

(c) $x^6 - 4x^4 + 6x^3 + 4x^2 - 12x + 9 \in \mathbb{Q}[x]$

(d) Show for any prime $p$ and any $a \in \mathbb{F}_p$ that the polynomial $x^p - a$ has a multiple root.

20. Show that the polynomial $f(x) = x$ in $\mathbb{Z}/6\mathbb{Z}[x]$ factors as $(3x + 4)(4x + 3)$, hence is not
an irreducible polynomial.

(a) Show that the reduction of $f(x)$ modulo both of the nontrivial ideals $(2)$ and $(3)$ of
$\mathbb{Z}/6\mathbb{Z}$ is an irreducible polynomial, showing that the condition that $R$ be an integral
domain in Proposition 12 is necessary.

(b) Show that in any factorization $f(x) = g(x)h(x)$ in $\mathbb{Z}/6\mathbb{Z}[x]$ the reduction of $g(x)$
modulo $(2)$ is either 1 or $x$ and the reduction of $h(x)$ modulo $(2)$ is then either $x$ or 1,
and similarly for the reductions modulo $(3)$. Determine all the factorizations of $f(x)$
in $\mathbb{Z}/6\mathbb{Z}[x]$. [Use the Chinese Remainder Theorem.]

(c) Show that the ideal $(3, x)$ is a principal ideal in $\mathbb{Z}/6\mathbb{Z}[x]$.

(d) Show that over the ring $\mathbb{Z}/30\mathbb{Z}[x]$ the polynomial $f(x) = x$ has the factorization
$f(x) = (10x + 21)(15x + 16)(6x + 25)$. Prove that the product of any of these factors
is again of the same degree. Prove that the reduction of $f(x)$ modulo any prime
in $\mathbb{Z}/30\mathbb{Z}$ is an irreducible polynomial. Determine all the factorizations of $f(x)$ in
$\mathbb{Z}/30\mathbb{Z}[x]$. [Consider the reductions modulo $(2)$, $(3)$ and $(5)$ and use the Chinese
Remainder Theorem.]

(e) Generalize part (d) to $\mathbb{Z}/n\mathbb{Z}[x]$ where $n$ is the product of $k$ distinct primes.


9.5 POLYNOMIAL RINGS OVER FIELDS II 

Let $F$ be a field. We prove here some additional results for the one-variable polynomial
ring $F[x]$. The first is a restatement of results obtained earlier.

Proposition 15. The maximal ideals in $F[x]$ are the ideals $(f(x))$ generated by
irreducible polynomials $f(x)$. In particular, $F[x]/(f(x))$ is a field if and only if $f(x)$ is
irreducible.

Proof: This follows from Proposition 7 of Section 8.2 applied to the Principal Ideal
Domain $F[x]$.

Proposition 16. Let $g(x)$ be a nonconstant element of $F[x]$ and let

$$g(x) = f_1(x)^{n_1} f_2(x)^{n_2} \cdots f_k(x)^{n_k}$$

be its factorization into irreducibles, where the $f_i(x)$ are distinct. Then we have the
following isomorphism of rings:

$$F[x]/(g(x)) \cong F[x]/(f_1(x)^{n_1}) \times F[x]/(f_2(x)^{n_2}) \times \cdots \times F[x]/(f_k(x)^{n_k}).$$

Proof: This follows from the Chinese Remainder Theorem (Theorem 7.17), since
the ideals $(f_i(x)^{n_i})$ and $(f_j(x)^{n_j})$ are comaximal if $f_i(x)$ and $f_j(x)$ are distinct (they
are relatively prime in the Euclidean Domain $F[x]$, hence the ideal generated by them
is $F[x]$).

The next result concerns the number of roots of a polynomial over a field $F$. By
Proposition 9, a root $\alpha$ corresponds to a linear factor $(x - \alpha)$ of $f(x)$. If $f(x)$ is divisible
by $(x - \alpha)^m$ but not by $(x - \alpha)^{m+1}$, then $\alpha$ is said to be a root of multiplicity $m$.

Proposition 17. If the polynomial $f(x)$ has roots $\alpha_1, \alpha_2, \ldots, \alpha_k$ in $F$ (not necessarily
distinct), then $f(x)$ has $(x - \alpha_1) \cdots (x - \alpha_k)$ as a factor. In particular, a polynomial
of degree $n$ in one variable over a field $F$ has at most $n$ roots in $F$, even counted with
multiplicity.

Proof: The first statement follows easily by induction from Proposition 9. Since
linear factors are irreducible, the second statement follows since $F[x]$ is a Unique
Factorization Domain.

This last result has the following interesting consequence.

Proposition 18. A finite subgroup of the multiplicative group of a field is cyclic. In
particular, if $F$ is a finite field, then the multiplicative group $F^\times$ of nonzero elements
of $F$ is a cyclic group.

Proof: We give a proof of this result using the Fundamental Theorem of Finitely
Generated Abelian Groups (Theorem 3 in Section 5.2). A more number-theoretic proof
is outlined in the exercises, or Proposition 5 in Section 6.1 may be used in place of
the Fundamental Theorem. By the Fundamental Theorem, the finite subgroup can be
written as the direct product of cyclic groups

$$\mathbb{Z}/n_1\mathbb{Z} \times \mathbb{Z}/n_2\mathbb{Z} \times \cdots \times \mathbb{Z}/n_k\mathbb{Z}$$

where $n_k \mid n_{k-1} \mid \cdots \mid n_2 \mid n_1$. In general, if $G$ is a cyclic group and $d \mid |G|$ then $G$
contains precisely $d$ elements of order dividing $d$. Since $n_k$ divides the order of each
of the cyclic groups in the direct product, it follows that each direct factor contains
$n_k$ elements of order dividing $n_k$. If $k$ were greater than 1, there would therefore be a
total of more than $n_k$ such elements. But then there would be more than $n_k$ roots of the
polynomial $x^{n_k} - 1$ in the field $F$, contradicting Proposition 17. Hence $k = 1$ and the
group is cyclic.

Corollary 19. Let $p$ be a prime. The multiplicative group $(\mathbb{Z}/p\mathbb{Z})^\times$ of nonzero residue
classes mod $p$ is cyclic.

Proof: This is the multiplicative group of the finite field $\mathbb{Z}/p\mathbb{Z}$.
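
Corollary 19 guarantees that a generator of $(\mathbb{Z}/p\mathbb{Z})^\times$ exists but does not produce one. For small primes a generator can be found by exhaustive search, as in the following sketch. The function names are hypothetical, $p$ is assumed to be prime, and the search is far too slow for large $p$.

```python
def multiplicative_order(a, p):
    """Order of a in (Z/pZ)^x; assumes p is prime and p does not divide a."""
    order, power = 1, a % p
    while power != 1:
        power = (power * a) % p
        order += 1
    return order


def find_generator(p):
    """Return the smallest generator of the cyclic group (Z/pZ)^x by brute force."""
    for a in range(1, p):
        if multiplicative_order(a, p) == p - 1:
            return a
    raise ValueError("no generator found; is p prime?")


for p in (7, 11, 13):
    print(p, find_generator(p))
```

For $p = 7$ the powers of 2 are $2, 4, 1$, so 2 has order 3 and is not a generator, while 3 has order 6.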

Corollary 20. Let $n \geq 2$ be an integer with factorization $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_r^{\alpha_r}$ in $\mathbb{Z}$, where
$p_1, \ldots, p_r$ are distinct primes. We have the following isomorphisms of (multiplicative)
groups:

(1) $(\mathbb{Z}/n\mathbb{Z})^\times \cong (\mathbb{Z}/p_1^{\alpha_1}\mathbb{Z})^\times \times (\mathbb{Z}/p_2^{\alpha_2}\mathbb{Z})^\times \times \cdots \times (\mathbb{Z}/p_r^{\alpha_r}\mathbb{Z})^\times$

(2) $(\mathbb{Z}/2^{\alpha}\mathbb{Z})^\times$ is the direct product of a cyclic group of order 2 and a cyclic group
of order $2^{\alpha - 2}$, for all $\alpha \geq 2$

(3) $(\mathbb{Z}/p^{\alpha}\mathbb{Z})^\times$ is a cyclic group of order $p^{\alpha - 1}(p - 1)$, for all odd primes $p$.

Remark: These isomorphisms describe the group-theoretic structure of the
automorphism group of the cyclic group, $Z_n$, of order $n$ since $\operatorname{Aut}(Z_n) \cong (\mathbb{Z}/n\mathbb{Z})^\times$ (cf.
Proposition 16 in Section 4.4). In particular, for $p$ a prime the automorphism group of the
cyclic group of order $p$ is cyclic of order $p - 1$.

Proof: This is mainly a matter of collecting previous results. The isomorphism in
(1) follows from the Chinese Remainder Theorem (see Corollary 18, Section 7.6). The
isomorphism in (2) follows directly from Exercises 22 and 23 of Section 2.3.

For $p$ an odd prime, $(\mathbb{Z}/p^{\alpha}\mathbb{Z})^\times$ is an abelian group of order $p^{\alpha - 1}(p - 1)$. By
Exercise 21 of Section 2.3 the Sylow $p$-subgroup of this group is cyclic. The map

$$\mathbb{Z}/p^{\alpha}\mathbb{Z} \to \mathbb{Z}/p\mathbb{Z} \quad \text{defined by} \quad a + (p^{\alpha}) \mapsto a + (p)$$

is a ring homomorphism (reduction mod $p$) which gives a surjective group
homomorphism from $(\mathbb{Z}/p^{\alpha}\mathbb{Z})^\times$ onto $(\mathbb{Z}/p\mathbb{Z})^\times$. The latter group is cyclic of order $p - 1$
(Corollary 19). The kernel of this map is of order $p^{\alpha - 1}$, hence for all primes $q \neq p$, the
Sylow $q$-subgroup of $(\mathbb{Z}/p^{\alpha}\mathbb{Z})^\times$ maps isomorphically into the cyclic group $(\mathbb{Z}/p\mathbb{Z})^\times$.
All Sylow subgroups of $(\mathbb{Z}/p^{\alpha}\mathbb{Z})^\times$ are therefore cyclic, so (3) holds, completing the
proof.
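
Parts (2) and (3) of Corollary 20 can be spot-checked numerically for small moduli. The sketch below uses hypothetical helper names and a brute-force search: it lists the units of $\mathbb{Z}/n\mathbb{Z}$ and tests whether some unit has order equal to the size of the group. It is an illustration, not a proof.

```python
from math import gcd


def units(n):
    """Representatives of the units of Z/nZ, for n >= 2."""
    return [a for a in range(1, n) if gcd(a, n) == 1]


def order_mod(a, n):
    """Multiplicative order of the unit a modulo n."""
    order, power = 1, a % n
    while power != 1:
        power = (power * a) % n
        order += 1
    return order


def is_cyclic_unit_group(n):
    """True if (Z/nZ)^x has an element whose order equals the group order."""
    group = units(n)
    return any(order_mod(a, n) == len(group) for a in group)


# Odd prime powers give cyclic groups; 2^a with a >= 3 does not.
for n in (9, 25, 27, 4, 8, 16, 15):
    print(n, len(units(n)), is_cyclic_unit_group(n))
```

For $n = 2^{\alpha}$ the largest element order found this way is $2^{\alpha - 2}$, in line with the decomposition in part (2).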


EXERCISES 

1. Let $F$ be a field and let $f(x)$ be a nonconstant polynomial in $F[x]$. Describe the nilradical
of $F[x]/(f(x))$ in terms of the factorization of $f(x)$ (cf. Exercise 29, Section 7.3).

2. For each of the fields constructed in Exercise 6 of Section 4 exhibit a generator for the
(cyclic) multiplicative group of nonzero elements.

3. Let $p$ be an odd prime in $\mathbb{Z}$ and let $n$ be a positive integer. Prove that $x^n - p$ is irreducible
over $\mathbb{Z}[i]$. [Use Proposition 18 in Chapter 8 and Eisenstein’s Criterion.]

4. Prove that $x^3 + 12x^2 + 18x + 6$ is irreducible over $\mathbb{Z}[i]$. [Use Proposition 8.18 and
Eisenstein’s Criterion.]

5. Let $\varphi$ denote Euler’s $\varphi$-function. Prove the identity $\sum_{d \mid n} \varphi(d) = n$, where the sum is
extended over all the divisors $d$ of $n$. [First observe that the identity is valid when $n = p^m$
is the power of a prime $p$ since the sum telescopes. Write $n = p^m n'$ where $p$ does not
divide $n'$. Prove that $\sum_{d \mid n} \varphi(d) = \sum_{d' \mid p^m} \varphi(d') \sum_{d'' \mid n'} \varphi(d'')$ by multiplying out the right
hand side and using the multiplicativity $\varphi(ab) = \varphi(a)\varphi(b)$ when $a$ and $b$ are relatively
prime. Use induction to complete the proof. This problem may be done alternatively
by letting $Z$ be the cyclic group of order $n$ and showing that since $Z$ contains a unique
subgroup of order $d$ for each $d$ dividing $n$, the number of elements of $Z$ of order $d$ is $\varphi(d)$.
Then $|Z|$ is the sum of $\varphi(d)$ as $d$ runs over all divisors of $n$.]

6. Let $G$ be a finite subgroup of order $n$ of the multiplicative group $F^\times$ of nonzero elements
of the field $F$. Let $\varphi$ denote Euler’s $\varphi$-function and let $\psi(d)$ denote the number of elements
of $G$ of order $d$. Prove that $\psi(d) = \varphi(d)$ for every divisor $d$ of $n$. In particular conclude
that $\psi(n) \geq 1$, so that $G$ is a cyclic group. [Observe that for any integer $N \geq 1$ the
polynomial $x^N - 1$ has at most $N$ roots in $F$. Conclude that for any integer $N$ we have
$\sum_{d \mid N} \psi(d) \leq N$. Since $\sum_{d \mid N} \varphi(d) = N$ by the previous exercise, show by induction that
$\psi(d) \leq \varphi(d)$ for every divisor $d$ of $n$. Since $\sum_{d \mid n} \psi(d) = n = \sum_{d \mid n} \varphi(d)$ show that this
implies $\psi(d) = \varphi(d)$ for every divisor $d$ of $n$.]

7. Prove that the additive and multiplicative groups of a field are never isomorphic. [Consider
three cases: when $|F|$ is finite, when $-1 \neq 1$ in $F$, and when $-1 = 1$ in $F$.]


9.6 POLYNOMIALS IN SEVERAL VARIABLES OVER A FIELD AND GRÖBNER BASES

In this section we consider polynomials in many variables, present some basic computa- 
tional tools, and indicate some applications. The results of this section are not required 
in Chapters 10 through 14. Additional applications will be given in Chapter 15. 

We proved in Section 2 that a polynomial ring $F[x]$ in a variable $x$ over a field $F$
is a Euclidean Domain, and Corollary 8 showed that the polynomial ring $F[x_1, \ldots, x_n]$
is a U.F.D. However it follows from Corollary 8 in Section 8.2 that the latter ring is
not a P.I.D. unless $n = 1$. Our first result below shows that ideals in such polynomial
rings, although not necessarily principal, are always finitely generated. General rings
with this property are given a special name:

Definition. A commutative ring $R$ with 1 is called Noetherian if every ideal of $R$ is
finitely generated.

Noetherian rings will be studied in greater detail in Chapters 15 and 16. In this
section we develop some of the basic theory and resulting algorithms for working with
(finitely generated) ideals in $F[x_1, \ldots, x_n]$.

As we saw in Section 1, a polynomial ring in $n$ variables can be considered as a
polynomial ring in one variable with coefficients in a polynomial ring in $n - 1$ variables.
By following this inductive approach, as we did in Theorem 7 and Corollary 8, we
can deduce that $F[x_1, x_2, \ldots, x_n]$ is Noetherian from the following more general result.

Theorem 21. (Hilbert’s Basis Theorem) If $R$ is a Noetherian ring then so is the
polynomial ring $R[x]$.

Proof: Let $I$ be an ideal in $R[x]$ and let $L$ be the set of all leading coefficients of
the elements in $I$. We first show that $L$ is an ideal of $R$, as follows. Since $I$ contains
the zero polynomial, $0 \in L$. Let $f = ax^d + \cdots$ and $g = bx^e + \cdots$ be polynomials in
$I$ of degrees $d$, $e$ and leading coefficients $a, b \in R$. Then for any $r \in R$ either $ra - b$
is zero or it is the leading coefficient of the polynomial $rx^e f - x^d g$. Since the latter
polynomial is in $I$ we have $ra - b \in L$, which shows $L$ is an ideal of $R$. Since $R$ is
assumed Noetherian, the ideal $L$ in $R$ is finitely generated, say by $a_1, a_2, \ldots, a_n \in R$.
For each $i = 1, \ldots, n$ let $f_i$ be an element of $I$ whose leading coefficient is $a_i$. Let $e_i$
denote the degree of $f_i$, and let $N$ be the maximum of $e_1, e_2, \ldots, e_n$.

For each $d \in \{0, 1, \ldots, N - 1\}$, let $L_d$ be the set of all leading coefficients of
polynomials in $I$ of degree $d$ together with 0. A similar argument as that for $L$ shows
each $L_d$ is also an ideal of $R$, again finitely generated since $R$ is Noetherian. For each
nonzero ideal $L_d$ let $b_{d,1}, b_{d,2}, \ldots, b_{d,n_d} \in R$ be a set of generators for $L_d$, and let $f_{d,i}$
be a polynomial in $I$ of degree $d$ with leading coefficient $b_{d,i}$.

We show that the polynomials $f_1, \ldots, f_n$ together with all the polynomials $f_{d,i}$ for
all the nonzero ideals $L_d$ are a set of generators for $I$, i.e., that

$$I = \big( \{f_1, \ldots, f_n\} \cup \{ f_{d,i} \mid 0 \leq d < N,\ 1 \leq i \leq n_d \} \big).$$

By construction, the ideal $I'$ on the right above is contained in $I$ since all the generators
were chosen in $I$. If $I' \neq I$, there exists a nonzero polynomial $f \in I$ of minimum
degree with $f \notin I'$. Let $d = \deg f$ and let $a$ be the leading coefficient of $f$.

Suppose first that $d \geq N$. Since $a \in L$ we may write $a$ as an $R$-linear combination
of the generators of $L$: $a = r_1 a_1 + \cdots + r_n a_n$. Then $g = r_1 x^{d - e_1} f_1 + \cdots + r_n x^{d - e_n} f_n$ is
an element of $I'$ with the same degree $d$ and the same leading coefficient $a$ as $f$. Then
$f - g \in I$ is a polynomial in $I$ of smaller degree than $f$. By the minimality of $f$, we
must have $f - g \in I'$, so $f = (f - g) + g \in I'$, a contradiction.

Suppose next that $d < N$. In this case $a \in L_d$ for some $d < N$, and so we may
write $a = r_1 b_{d,1} + \cdots + r_{n_d} b_{d,n_d}$ for some $r_i \in R$. Then $g = r_1 f_{d,1} + \cdots + r_{n_d} f_{d,n_d}$ is a
polynomial in $I'$ with the same degree $d$ and the same leading coefficient $a$ as $f$, and
we have a contradiction as before.

It follows that $I = I'$ is finitely generated, and since $I$ was arbitrary, this completes
the proof that $R[x]$ is Noetherian.

Since a field is clearly Noetherian, Hilbert’s Basis Theorem and induction
immediately give:

Corollary 22. Every ideal in the polynomial ring $F[x_1, x_2, \ldots, x_n]$ with coefficients
from a field $F$ is finitely generated.

If $I$ is an ideal in $F[x_1, \ldots, x_n]$ generated by a (possibly infinite) set $S$ of
polynomials, Corollary 22 shows that $I$ is finitely generated, and in fact $I$ is generated by a
finite number of the polynomials from the set $S$ (cf. Exercise 1).

As the proof of Hilbert’s Basis Theorem shows, the collection of leading coefficients
of the polynomials in an ideal $I$ in $R[x]$ forms an extremely useful ideal in $R$ that can
be used to understand $I$. This suggests studying “leading terms” in $F[x_1, x_2, \ldots, x_n]$
more generally (and somewhat more intrinsically). To do this we need to specify a
total ordering on the monomials, since without some sort of ordering we cannot in
general tell which is the “leading” term of a polynomial. We implicitly chose such an
ordering in the inductive proof of Corollary 22: we first viewed a polynomial $f$ as a
polynomial in $x_1$ with coefficients in $R = F[x_2, \ldots, x_n]$, say, then viewed its “leading
coefficient” in $F[x_2, \ldots, x_n]$ as a polynomial in $x_2$ with coefficients in $F[x_3, \ldots, x_n]$,
etc. This is an example of a lexicographic monomial ordering on the polynomial
ring $F[x_1, \ldots, x_n]$ which is defined by first declaring an ordering of the variables, for
example $x_1 > x_2 > \cdots > x_n$, and then declaring that the monomial term $A x_1^{a_1} x_2^{a_2} \cdots x_n^{a_n}$
with exponents $(a_1, a_2, \ldots, a_n)$ has higher order than the monomial term $B x_1^{b_1} x_2^{b_2} \cdots x_n^{b_n}$
with exponents $(b_1, b_2, \ldots, b_n)$ if the first component where the $n$-tuples differ has
$a_i > b_i$. This is analogous to the ordering used in a dictionary (hence the name),
where the letter “a” comes before “b” which in turn comes before “c”, etc., and then
“aardvark” comes before “abacus” (although the ‘word’ $a^2 = aa$ comes before $a$ in
the lexicographical order). Note that the ordering is only defined up to multiplication
by units (elements of $F^\times$) and that multiplying two monomials by the same nonzero
monomial does not change their ordering. This can be formalized in general.

Definition. A monomial ordering is a well ordering “$\geq$” on the set of monomials that
satisfies $m m_1 \geq m m_2$ whenever $m_1 \geq m_2$ for monomials $m, m_1, m_2$. Equivalently,
a monomial ordering may be specified by defining a well ordering on the $n$-tuples
$\alpha = (a_1, \ldots, a_n) \in \mathbb{Z}^n$ of multidegrees of monomials $A x_1^{a_1} \cdots x_n^{a_n}$ that satisfies
$\alpha + \gamma \geq \beta + \gamma$ if $\alpha \geq \beta$.

It is easy to show for any monomial ordering that $m \geq 1$ for every monomial $m$
(cf. Exercise 2). It is not difficult to show, using Hilbert’s Basis Theorem, that any total
ordering on monomials which for every monomial $m$ satisfies $m \geq 1$ and $m m_1 \geq m m_2$
whenever $m_1 \geq m_2$, is necessarily a well ordering (hence a monomial ordering); this
equivalent set of axioms for a monomial ordering may be easier to verify. For simplicity
we shall limit the examples to the particularly easy and intuitive lexicographic ordering,
but it is important to note that there are useful computational advantages to using other
monomial orderings in practice. Some additional commonly used monomial orderings
are introduced in the exercises.

As mentioned, once we have a monomial ordering we can define the leading term
of a polynomial:

Definition. Fix a monomial ordering on the polynomial ring $F[x_1, x_2, \ldots, x_n]$.

(1) The leading term of a nonzero polynomial $f$ in $F[x_1, x_2, \ldots, x_n]$, denoted
$LT(f)$, is the monomial term of maximal order in $f$ and the leading term of
$f = 0$ is 0. Define the multidegree of $f$, denoted $\partial(f)$, to be the multidegree
of the leading term of $f$.

(2) If $I$ is an ideal in $F[x_1, x_2, \ldots, x_n]$, the ideal of leading terms, denoted $LT(I)$,
is the ideal generated by the leading terms of all the elements in the ideal, i.e.,
$LT(I) = (LT(f) \mid f \in I)$.

The leading term and the multidegree of a polynomial clearly depend on the choice
of the ordering. For example $LT(2xy + y^3) = 2xy$ with multidegree $(1, 1)$ if $x > y$,
but $LT(2xy + y^3) = y^3$ with multidegree $(0, 3)$ if $y > x$. In particular, the leading
term of a polynomial need not be the term of largest total degree. Similarly, the ideal
of leading terms $LT(I)$ of an ideal $I$ in general depends on the ordering used. Note
also that the multidegree of a polynomial satisfies $\partial(fg) = \partial f + \partial g$ when $f$ and $g$ are
nonzero, and that in this case $LT(fg) = LT(f)\,LT(g)$ (cf. Exercise 2).
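
The dependence of the leading term on the ordering is easy to see computationally. The sketch below is an illustration under stated assumptions: a polynomial in $x, y$ is stored as a dictionary from exponent pairs $(a, b)$ (standing for $x^a y^b$) to nonzero coefficients, and the names `leading_term` and `multiply` are hypothetical. The lexicographic ordering is obtained from Python's built-in tuple comparison after listing the exponents in the declared variable order.

```python
def leading_term(poly, var_order):
    """Return (exponents, coefficient) of the lex-leading term of a nonzero polynomial.

    poly maps exponent tuples to nonzero coefficients.
    var_order lists variable indices from the largest variable to the smallest,
    e.g. (0, 1) for x > y and (1, 0) for y > x.
    """
    lead = max(poly, key=lambda exps: tuple(exps[i] for i in var_order))
    return lead, poly[lead]


def multiply(p, q):
    """Product of two polynomials in the dictionary representation."""
    product = {}
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            key = tuple(a + b for a, b in zip(e1, e2))
            product[key] = product.get(key, 0) + c1 * c2
    return {e: c for e, c in product.items() if c != 0}


f = {(1, 1): 2, (0, 3): 1}          # 2xy + y^3
print(leading_term(f, (0, 1)))      # ordering x > y
print(leading_term(f, (1, 0)))      # ordering y > x

# Multidegrees add under multiplication: compare LT(fg) with LT(f) and LT(g).
g = {(1, 0): 1, (0, 1): 1}          # x + y
print(leading_term(multiply(f, g), (0, 1)))
```

With $x > y$ the leading term of $2xy + y^3$ is $2xy$, and with $y > x$ it is $y^3$, matching the example in the text.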

The ideal $LT(I)$ is by definition generated by monomials. Such ideals are called
monomial ideals and are typically much easier to work with than generic ideals. For
example, a polynomial is contained in a monomial ideal if and only if each of its
monomial terms is a multiple of one of the generators for the ideal (cf. Exercise 10).

It was important in the proof of Hilbert’s Basis Theorem to have all of the leading
terms of the ideal $I$. If $I = (f_1, \ldots, f_m)$, then $LT(I)$ contains the leading terms
$LT(f_1), \ldots, LT(f_m)$ of the generators for $I$ by definition. Since $LT(I)$ is an ideal, it
contains the ideal generated by these leading terms:

$$(LT(f_1), \ldots, LT(f_m)) \subseteq LT(I).$$

The first of the following examples shows that the ideal $LT(I)$ of leading terms can
in general be strictly larger than the ideal generated just by the leading terms of some
generators for $I$.

Examples 

(1) Choose the lexicographic ordering $x > y$ on $F[x, y]$. The leading terms of the
polynomials $f_1 = x^3 y - x y^2 + 1$ and $f_2 = x^2 y^2 - y^3 - 1$ are $LT(f_1) = x^3 y$ (so
the multidegree of $f_1$ is $\partial(f_1) = (3, 1)$) and $LT(f_2) = x^2 y^2$ (so $\partial(f_2) = (2, 2)$). If
$I = (f_1, f_2)$ is the ideal generated by $f_1$ and $f_2$ then the leading term ideal $LT(I)$
contains $LT(f_1) = x^3 y$ and $LT(f_2) = x^2 y^2$, so $(x^3 y, x^2 y^2) \subseteq LT(I)$. Since

$$y f_1 - x f_2 = y(x^3 y - x y^2 + 1) - x(x^2 y^2 - y^3 - 1) = x + y$$

we see that $g = x + y$ is an element of $I$ and so the ideal $LT(I)$ also contains the leading
term $LT(g) = x$. This shows that $LT(I)$ is strictly larger than $(LT(f_1), LT(f_2))$,
since every element in $(LT(f_1), LT(f_2)) = (x^3 y, x^2 y^2)$ has total degree at least 4.
We shall see later that in this case $LT(I) = (x, y^4)$.

(2) With respect to the lexicographic ordering $y > x$, the leading terms of $f_1$ and $f_2$
in the previous example are $LT(f_1) = -x y^2$ (which one could write as $-y^2 x$ to
emphasize the chosen ordering) and $LT(f_2) = -y^3$. We shall see later that in this
ordering $LT(I) = (x^4, y)$, which is a different ideal than the ideal $LT(I)$ obtained
in the previous example using the ordering $x > y$, and is again strictly larger than
$(LT(f_1), LT(f_2))$.

(3) Choose any ordering on $F[x, y]$ and let $f = f(x, y)$ be any nonzero polynomial. The
leading term of every element of the principal ideal $I = (f)$ is then a multiple of the
leading term of $f$, so in this case $LT(I) = (LT(f))$.
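
The cancellation in Example (1) can be verified directly. The following sketch assumes the same illustrative representation as before (a dictionary from exponent pairs $(a, b)$ for $x^a y^b$ to coefficients); the helper names are hypothetical. It computes $y f_1 - x f_2$ and confirms that the leading term $x$ of the result is not divisible by either $x^3 y$ or $x^2 y^2$.

```python
def times_monomial(poly, exps):
    """Multiply a polynomial by the monomial with the given exponent tuple."""
    return {tuple(a + b for a, b in zip(e, exps)): c for e, c in poly.items()}


def subtract(p, q):
    """Return p - q, dropping terms whose coefficients cancel."""
    result = dict(p)
    for e, c in q.items():
        result[e] = result.get(e, 0) - c
    return {e: c for e, c in result.items() if c != 0}


def monomial_divides(d, m):
    """True if the monomial with exponents d divides the monomial with exponents m."""
    return all(a <= b for a, b in zip(d, m))


f1 = {(3, 1): 1, (1, 2): -1, (0, 0): 1}    # x^3 y - x y^2 + 1
f2 = {(2, 2): 1, (0, 3): -1, (0, 0): -1}   # x^2 y^2 - y^3 - 1

g = subtract(times_monomial(f1, (0, 1)), times_monomial(f2, (1, 0)))  # y*f1 - x*f2
print(g)

lead_g = max(g)  # lex order with x > y is plain tuple comparison
print(lead_g, monomial_divides((3, 1), lead_g), monomial_divides((2, 2), lead_g))
```

The two leading terms $x^3 y^2$ of $y f_1$ and $x f_2$ cancel, which is exactly why $LT(g)$ escapes the ideal $(x^3 y, x^2 y^2)$.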

In the case of one variable, leading terms are used in the Division Algorithm to
reduce one polynomial $g$ modulo another polynomial $f$ to get a unique remainder $r$, and
this remainder is 0 if and only if $g$ is contained in the ideal $(f)$. Since $F[x_1, x_2, \ldots, x_n]$
is not a Euclidean Domain if $n \geq 2$ (since it is not a P.I.D.), the situation is more
complicated for polynomials in more than one variable. In the first example above,
neither $f_1$ nor $f_2$ divides $g$ in $F[x, y]$ (by degree considerations, for example), so
attempting to first divide $g$ by one of $f_1$ or $f_2$ and then by the other to try to reduce $g$
modulo the ideal $I$ would produce a (nonzero) “remainder” of $g$ itself. In particular,
this would suggest that $g = y f_1 - x f_2$ is not an element of the ideal $I$ even though
it is. The reason the polynomial $g$ of degree 1 can be a linear combination of the two
polynomials $f_1$ and $f_2$ of degree 4 is that the leading terms in $y f_1$ and $x f_2$ cancel in the
difference, and this is reflected in the fact that $LT(f_1)$ and $LT(f_2)$ are not sufficient
to generate $LT(I)$. A set of generators for an ideal $I$ in $F[x_1, \ldots, x_n]$ whose leading
terms generate the leading terms of all the elements in $I$ is given a special name.

Definition. A Gröbner basis for an ideal $I$ in the polynomial ring $F[x_1, \ldots, x_n]$ is a
finite set of generators $\{g_1, \ldots, g_m\}$ for $I$ whose leading terms generate the ideal of all
leading terms in $I$, i.e.,

$$I = (g_1, \ldots, g_m) \quad \text{and} \quad LT(I) = (LT(g_1), \ldots, LT(g_m)).$$


Remark: Note that a Gröbner “basis” is in fact a set of generators for $I$ (that depends on
the choice of ordering), i.e., every element in $I$ is a linear combination of the generators,
and not a basis in the sense of vector spaces (where the linear combination would be
unique, cf. Sections 10.3 and 11.1). Although potentially misleading, the terminology
“Gröbner basis” has been so widely adopted that it would be hazardous to introduce a
different nomenclature.

One of the most important properties of a Gröbner basis (proved in Theorem 23
following) is that every polynomial $g$ can be written uniquely as the sum of an element
in $I$ and a remainder $r$ obtained by a general polynomial division. In particular, we
shall see that $g$ is an element of $I$ if and only if this remainder $r$ is 0. While there is
a similar decomposition in general, we shall see that if we do not use a Gröbner basis
the uniqueness is lost (and we cannot detect membership in $I$ by checking whether the
remainder is 0) because there are leading terms not accounted for by the leading terms
of the generators.

We first use the leading terms of polynomials defined by a monomial ordering
on $F[x_1, \ldots, x_n]$ to extend the one variable Division Algorithm to a noncanonical
polynomial division in several variables. Recall that for polynomials in one variable,
the usual Division Algorithm determines the quotient $q(x)$ and remainder $r(x)$ in the
equation $f(x) = q(x)g(x) + r(x)$ by successively testing whether the leading term of
the dividend $f(x)$ is divisible by the leading term of $g(x)$: if $LT(f) = a(x)LT(g)$,
the monomial term $a(x)$ is added to the quotient and the process is iterated with $f(x)$
replaced by the dividend $f(x) - a(x)g(x)$, which is of smaller degree since the leading
terms cancel (by the choice of $a(x)$). The process terminates when the leading term
of the divisor $g(x)$ no longer divides the leading term of the dividend, leaving the
remainder $r(x)$. We can extend this to division by a finite number of polynomials in
several variables simply by allowing successive divisions, resulting in a remainder and
several quotients, as follows.

General Polynomial Division 

Fix a monomial ordering on $F[x_1, \dots, x_n]$ and suppose $g_1, \dots, g_m$ is a set of nonzero
polynomials in $F[x_1, \dots, x_n]$. If $f$ is any polynomial in $F[x_1, \dots, x_n]$, start with a
set of quotients $q_1, \dots, q_m$ and a remainder $r$ initially all equal to 0 and successively
test whether the leading term of the dividend $f$ is divisible by the leading terms of the
divisors $g_1, \dots, g_m$, in that order. Then

i. If $LT(f)$ is divisible by $LT(g_i)$, say, $LT(f) = a_i LT(g_i)$, add $a_i$ to the quotient $q_i$,
replace $f$ by the dividend $f - a_i g_i$ (a polynomial with lower order leading term),
and reiterate the entire process.

ii. If the leading term of the dividend $f$ is not divisible by any of the leading terms
$LT(g_1), \dots, LT(g_m)$, add the leading term of $f$ to the remainder $r$, replace $f$ by
the dividend $f - LT(f)$ (i.e., remove the leading term of $f$), and reiterate the
entire process.

The process terminates (cf. Exercise 3) when the dividend is 0 and results in a set of
quotients $q_1, \dots, q_m$ and a remainder $r$ with

$$f = q_1 g_1 + \cdots + q_m g_m + r.$$

Each $q_i g_i$ has multidegree less than or equal to the multidegree of $f$ and the remainder
$r$ has the property that no nonzero term in $r$ is divisible by any of the leading terms
$LT(g_1), \dots, LT(g_m)$ (since only terms with this property are added to $r$ in (ii)).
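The two steps above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: polynomials in two variables are stored as dictionaries mapping exponent pairs `(i, j)` (for $x^i y^j$) to coefficients, the ordering is lexicographic with $x > y$ (which coincides with Python's tuple comparison), and the names `leading`, `sub_scaled` and `divide` are hypothetical helpers, not notation from the text.

```python
from fractions import Fraction

def leading(p):
    """Leading (monomial, coefficient) of a nonzero polynomial under lex order."""
    m = max(p)  # exponent tuples compare lexicographically, i.e. lex with x > y
    return m, p[m]

def sub_scaled(p, coeff, mono, g):
    """Return p - coeff * x^mono * g, dropping terms whose coefficient becomes 0."""
    out = dict(p)
    for e, c in g.items():
        key = tuple(a + b for a, b in zip(e, mono))
        val = out.get(key, 0) - coeff * c
        if val == 0:
            out.pop(key, None)
        else:
            out[key] = val
    return out

def divide(f, divisors):
    """General polynomial division: returns (list of quotients, remainder)."""
    f = dict(f)
    quotients = [dict() for _ in divisors]
    remainder = {}
    while f:
        m, c = leading(f)
        for q, g in zip(quotients, divisors):
            gm, gc = leading(g)
            if all(a >= b for a, b in zip(m, gm)):      # step (i): LT(g) divides LT(f)
                mono = tuple(a - b for a, b in zip(m, gm))
                coeff = Fraction(c) / gc
                q[mono] = q.get(mono, 0) + coeff
                f = sub_scaled(f, coeff, mono, g)
                break
        else:                                           # step (ii): move LT(f) to r
            remainder[m] = c
            del f[m]
    return quotients, remainder

# f = x^2 + x - y^2 + y divided by g1 = xy + 1 and g2 = x + y, in that order
f = {(2, 0): 1, (1, 0): 1, (0, 2): -1, (0, 1): 1}
g1 = {(1, 1): 1, (0, 0): 1}
g2 = {(1, 0): 1, (0, 1): 1}
(q1, q2), r = divide(f, [g1, g2])
```

Swapping the divisors, `divide(f, [g2, g1])`, runs the same loop with the tests made in the other order, so the quotients and remainder it returns can differ.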

Examples 

Fix the lexicographic ordering x > y on F[x, y]. 

(1) Suppose $f = x^3y^3 + 3x^2y^4$ and $g = xy^4$. The leading term of $f$ is $x^3y^3$, which is
not divisible by (the leading term of) $g$, so $x^3y^3$ is added to the remainder $r$ (so now
$r = x^3y^3$) and $f$ is replaced by $f - LT(f) = 3x^2y^4$ and we start over. Since $3x^2y^4$
is divisible by $LT(g) = xy^4$, with quotient $a = 3x$, we add $3x$ to the quotient $q$ (so
$q = 3x$), and replace $3x^2y^4$ by $3x^2y^4 - aLT(g) = 0$, at which point the process
terminates. The result is the quotient $q = 3x$ and remainder $r = x^3y^3$ and

$$x^3y^3 + 3x^2y^4 = f = qg + r = (3x)(xy^4) + x^3y^3.$$

Note that if we had terminated at the first step because the leading term of $f$ is
not divisible by the leading term of $g$ (which terminates the Division Algorithm for
polynomials in one variable), then we would have been left with a ‘remainder’ of $f$
itself, even though ‘more’ of $f$ is divisible by $g$. This is the reason for step (ii) in the
division process (which is not necessary for polynomials in one variable).

(2) Let $f = x^2 + x - y^2 + y$, and suppose $g_1 = xy + 1$ and $g_2 = x + y$. In the first iteration,
the leading term $x^2$ of $f$ is not divisible by the leading term of $g_1$, but is divisible by
the leading term of $g_2$, so the quotient $q_2$ is $x$ and the dividend $f$ is replaced by the
dividend $f - xg_2 = -xy + x - y^2 + y$. In the second iteration, the leading term
of $-xy + x - y^2 + y$ is divisible by $LT(g_1)$, with quotient $-1$, so $q_1 = -1$ and the
dividend is replaced by $(-xy + x - y^2 + y) - (-1)g_1 = x - y^2 + y + 1$. In the third
iteration, the leading term of $x - y^2 + y + 1$ is not divisible by the leading term of $g_1$,
but is divisible by the leading term of $g_2$, with quotient 1, so 1 is added to $q_2$ (which is
now $q_2 = x + 1$) and the dividend becomes $(x - y^2 + y + 1) - (1)(g_2) = -y^2 + 1$. The
leading term is now $-y^2$, which is not divisible by either $LT(g_1) = xy$ or $LT(g_2) = x$,
so $-y^2$ is added to the remainder $r$ (which is now $-y^2$) and the dividend becomes
simply 1. Finally, 1 is not divisible by either $LT(g_1)$ or $LT(g_2)$, so is added to the
remainder (so $r$ is now $-y^2 + 1$), and the process terminates. The result is

$q_1 = -1$, $q_2 = x + 1$, $r = -y^2 + 1$ and

$$f = x^2 + x - y^2 + y = (-1)(xy + 1) + (x + 1)(x + y) + (-y^2 + 1) = q_1 g_1 + q_2 g_2 + r.$$

(3) Let $f = x^2 + x - y^2 + y$ as in the previous example and interchange the divisors $g_1$
and $g_2$: $g_1 = x + y$ and $g_2 = xy + 1$. In this case an easy computation gives

$q_1 = x - y + 1$, $q_2 = 0$, $r = 0$ and

$$f = x^2 + x - y^2 + y = (x - y + 1)(x + y) = q_1 g_1 + q_2 g_2 + r,$$

showing that the quotients $q_i$ and the remainder $r$ are in general not unique and depend
on the order of the divisors $g_1, \dots, g_m$.

The computation in Example 3 shows that the polynomial $f = x^2 + x - y^2 + y$ is
an element of the ideal $I = (x + y, xy + 1)$ since the remainder obtained in this case
was 0 (in fact $f$ is just a multiple of the first generator). In Example 2, however, the
same polynomial resulted in a nonzero remainder $-y^2 + 1$ when divided by $xy + 1$ and
$x + y$, and it was not at all clear from that computation that $f$ was an element of $I$.

The next theorem shows that if we use a Gröbner basis for the ideal $I$ then these
difficulties do not arise: we obtain a unique remainder, which in turn can be used to
determine whether a polynomial $f$ is an element of the ideal $I$.

Theorem 23. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$ and suppose $\{g_1, \dots, g_m\}$
is a Gröbner basis for the nonzero ideal $I$ in $R$. Then

(1) Every polynomial $f \in R$ can be written uniquely in the form

$$f = f_I + r$$

where $f_I \in I$ and no nonzero monomial term of the ‘remainder’ $r$ is divisible
by any of the leading terms $LT(g_1), \dots, LT(g_m)$.

(2) Both $f_I$ and $r$ can be computed by general polynomial division by $g_1, \dots, g_m$
and are independent of the order in which these polynomials are used in the
division.

(3) The remainder $r$ provides a unique representative for the coset of $f$ in the
quotient ring $F[x_1, \dots, x_n]/I$. In particular, $f \in I$ if and only if $r = 0$.

Proof: Letting $f_I = \sum_{i=1}^{m} q_i g_i \in I$ in general polynomial division of $f$
by $g_1, \dots, g_m$ immediately gives a decomposition $f = f_I + r$ for any generators
$g_1, \dots, g_m$. Suppose now that $\{g_1, \dots, g_m\}$ is a Gröbner basis, and $f = f_I + r = f_I' + r'$.
Then $r - r' = f_I' - f_I \in I$, so its leading term $LT(r - r')$ is an element of $LT(I)$, which
is the ideal $(LT(g_1), \dots, LT(g_m))$ since $\{g_1, \dots, g_m\}$ is a Gröbner basis for $I$. Every
element in this ideal is a sum of multiples of the monomial terms $LT(g_1), \dots, LT(g_m)$,
so is a sum of terms each of which is divisible by one of the $LT(g_i)$. But both $r$
and $r'$, hence also $r - r'$, are sums of monomial terms none of which is divisible by
$LT(g_1), \dots, LT(g_m)$, which is a contradiction unless $r - r' = 0$. It follows that $r = r'$
is unique, hence so is $f_I = f - r$, which proves (1).

We have already seen that $f_I$ and $r$ can be computed algorithmically by polynomial
division, and the uniqueness in (1) implies that $r$ is independent of the order in which the
polynomials $g_1, \dots, g_m$ are used in the division. Similarly $f_I = \sum_{i=1}^{m} q_i g_i$ is uniquely
determined (even though the individual quotients $q_i$ are not in general unique), which
gives (2).

The first statement in (3) is immediate from the uniqueness in (1). If $r = 0$, then
$f = f_I \in I$. Conversely, if $f \in I$, then $f = f + 0$ together with the uniqueness of $r$
implies that $r = 0$, and the final statement of the theorem follows.

As previously mentioned, the importance of Theorem 23, and one of the principal
uses of Gröbner bases, is the uniqueness of the representative $r$, which allows effective
computation in the quotient ring $F[x_1, \dots, x_n]/I$.
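The order-independence in Theorem 23 can be observed numerically on the ideal $I = (x + y, xy + 1)$ of Examples 2 and 3. The sketch below rests on an assumption not stated in the text: that $\{x + y,\ y^2 - 1\}$ is a Gröbner basis for $I$ under the lexicographic ordering $x > y$ (note $y(x + y) - (xy + 1) = y^2 - 1$). The dictionary encoding of polynomials (exponent pairs to coefficients) and the function name `remainder` are likewise illustrative choices.

```python
from fractions import Fraction

def remainder(f, divisors):
    """Remainder of f on general polynomial division (lex order, x > y)."""
    f, r = dict(f), {}
    while f:
        m = max(f)                        # leading monomial under lex
        c = f.pop(m)                      # its coefficient; the term leaves f
        for g in divisors:
            gm = max(g)
            if all(a >= b for a, b in zip(m, gm)):
                shift = tuple(a - b for a, b in zip(m, gm))
                coeff = Fraction(c) / g[gm]
                for e, ge in g.items():   # subtract coeff * x^shift * (g - LT(g))
                    if e == gm:
                        continue
                    key = tuple(a + b for a, b in zip(e, shift))
                    val = f.get(key, 0) - coeff * ge
                    if val == 0:
                        f.pop(key, None)
                    else:
                        f[key] = val
                break
        else:
            r[m] = c                      # no LT(g) divides: term goes to r
    return r

f = {(2, 0): 1, (1, 0): 1, (0, 2): -1, (0, 1): 1}   # x^2 + x - y^2 + y
ga = {(1, 0): 1, (0, 1): 1}                          # x + y
gb = {(0, 2): 1, (0, 0): -1}                         # y^2 - 1 (assumed basis element)
same_remainder = remainder(f, [ga, gb]) == remainder(f, [gb, ga])
```

With the assumed Gröbner basis both orders return the zero remainder for $f$, whereas the original generators `[xy + 1, x + y]` of Example 2 leave $-y^2 + 1$.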

We next prove that a set of polynomials in an ideal whose leading terms generate
all the leading terms of an ideal is in fact a set of generators for the ideal itself (and so
is a Gröbner basis; in some works this is taken as the definition of a Gröbner basis),
and this shows in particular that a Gröbner basis always exists.

Proposition 24. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$ and let $I$ be a nonzero
ideal in $R$.

(1) If $g_1, \dots, g_m$ are any elements of $I$ such that $LT(I) = (LT(g_1), \dots, LT(g_m))$,
then $\{g_1, \dots, g_m\}$ is a Gröbner basis for $I$.

(2) The ideal $I$ has a Gröbner basis.

Proof: Suppose $g_1, \dots, g_m \in I$ with $LT(I) = (LT(g_1), \dots, LT(g_m))$. We need
to see that $g_1, \dots, g_m$ generate the ideal $I$. If $f \in I$, use general polynomial division
to write $f = \sum_{i=1}^{m} q_i g_i + r$ where no nonzero term in the remainder $r$ is divisible by
any $LT(g_i)$. Since $f \in I$, also $r \in I$, which means $LT(r)$ is in $LT(I)$. But then
$LT(r)$ would be divisible by one of $LT(g_1), \dots, LT(g_m)$, which is a contradiction
unless $r = 0$. Hence $f = \sum_{i=1}^{m} q_i g_i$ and $g_1, \dots, g_m$ generate $I$, so are a Gröbner basis
for $I$, which proves (1).

For (2), note that the ideal $LT(I)$ of leading terms of any ideal $I$ is a monomial ideal
generated by all the leading terms of the polynomials in $I$. By Exercise 1 a finite number
of those leading terms suffice to generate $LT(I)$, say $LT(I) = (LT(h_1), \dots, LT(h_k))$
for some $h_1, \dots, h_k \in I$. By (1), the polynomials $h_1, \dots, h_k$ are a Gröbner basis of $I$,
completing the proof.


Proposition 24 proves that Gröbner bases always exist. We next prove a criterion
that determines whether a given set of generators of an ideal $I$ is a Gröbner basis,
which we then use to provide an algorithm to find a Gröbner basis. The basic idea is
very simple: additional elements in $LT(I)$ can arise by taking linear combinations of
generators that cancel leading terms, as we saw in taking $yf_1 - xf_2$ in the first example
in this section. We shall see that obtaining new leading terms from generators in this
simple manner is the only obstruction to a set of generators being a Gröbner basis.

In general, if $f_1, f_2$ are two polynomials in $F[x_1, \dots, x_n]$ and $M$ is the monic least
common multiple of the monomial terms $LT(f_1)$ and $LT(f_2)$ then we can cancel the
leading terms by taking the difference

$$S(f_1, f_2) = \frac{M}{LT(f_1)}\, f_1 - \frac{M}{LT(f_2)}\, f_2. \tag{9.1}$$

The next lemma shows that these elementary linear combinations account for all
cancellation in leading terms of polynomials of the same multidegree.
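The difference in (9.1) is straightforward to compute once polynomials are stored by their exponents. The following is a minimal sketch, assuming two variables, the lexicographic ordering $x > y$, and a dictionary encoding (exponent pairs to coefficients); the function name `s_polynomial` is a hypothetical choice for illustration.

```python
from fractions import Fraction

def s_polynomial(f, g):
    """S(f, g) = (M / LT(f)) f - (M / LT(g)) g, M the monic lcm of the leading monomials."""
    fm, gm = max(f), max(g)                      # leading monomials under lex order
    lcm = tuple(max(a, b) for a, b in zip(fm, gm))
    result = {}
    for poly, lead, sign in ((f, fm, 1), (g, gm, -1)):
        shift = tuple(a - b for a, b in zip(lcm, lead))
        scale = Fraction(sign) / poly[lead]      # divide by the leading coefficient
        for e, c in poly.items():
            key = tuple(a + b for a, b in zip(e, shift))
            result[key] = result.get(key, 0) + scale * c
    return {e: c for e, c in result.items() if c != 0}

f1 = {(3, 1): 1, (1, 2): -1, (0, 0): 1}    # x^3 y - x y^2 + 1
f2 = {(2, 2): 1, (0, 3): -1, (0, 0): -1}   # x^2 y^2 - y^3 - 1
s12 = s_polynomial(f1, f2)                 # equals y*f1 - x*f2
```

For these two polynomials the leading terms $x^3y^2$ cancel and what is left is $x + y$, the combination $yf_1 - xf_2$ referred to above.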


Lemma 25. Suppose $f_1, \dots, f_m \in F[x_1, \dots, x_n]$ are polynomials with the same
multidegree $\alpha$ and that the linear combination $h = a_1 f_1 + \cdots + a_m f_m$ with constants
$a_i \in F$ has strictly smaller multidegree. Then

$$h = \sum_{i=2}^{m} b_i\, S(f_{i-1}, f_i), \quad \text{for some constants } b_i \in F.$$

Proof: Write $f_i = c_i f_i'$ where $c_i \in F$ and $f_i'$ is a monic polynomial of multidegree
$\alpha$. We have

$$h = \sum a_i c_i f_i' = a_1 c_1 (f_1' - f_2') + (a_1 c_1 + a_2 c_2)(f_2' - f_3') + \cdots$$
$$\qquad + (a_1 c_1 + \cdots + a_{m-1} c_{m-1})(f_{m-1}' - f_m') + (a_1 c_1 + \cdots + a_m c_m) f_m'.$$

Note that $f_{i-1}' - f_i' = S(f_{i-1}, f_i)$. Then since $h$ and each $f_{i-1}' - f_i'$ has multidegree
strictly smaller than $\alpha$, we have $a_1 c_1 + \cdots + a_m c_m = 0$, so the last term on the right
hand side is 0 and the lemma follows.

The next proposition shows that a set of generators $g_1, \dots, g_m$ is a Gröbner basis if
there are no new leading terms among the differences $S(g_i, g_j)$ not already accounted
for by the $g_i$. This result provides the principal ingredient in an algorithm to construct
a Gröbner basis.

For a fixed monomial ordering on $R = F[x_1, \dots, x_n]$ and ordered set of polynomials
$G = \{g_1, \dots, g_m\}$ in $R$, write $f \equiv r \bmod G$ if $r$ is the remainder obtained by
general polynomial division of $f \in R$ by $g_1, \dots, g_m$ (in that order).


Sec. 9.6 Polynomials in Several Variables over a Field and Gröbner Bases





Proposition 26. (Buchberger’s Criterion) Let $R = F[x_1, \dots, x_n]$ and fix a monomial
ordering on $R$. If $I = (g_1, \dots, g_m)$ is a nonzero ideal in $R$, then $G = \{g_1, \dots, g_m\}$ is
a Gröbner basis for $I$ if and only if $S(g_i, g_j) \equiv 0 \bmod G$ for $1 \le i < j \le m$.

Proof: If $\{g_1, \dots, g_m\}$ is a Gröbner basis for $I$, then $S(g_i, g_j) \equiv 0 \bmod G$ by
Theorem 23 since each $S(g_i, g_j)$ is an element of $I$.

Suppose now that $S(g_i, g_j) \equiv 0 \bmod G$ for $1 \le i < j \le m$ and take any element
$f \in I$. To see that $G$ is a Gröbner basis we need to see that $(LT(g_1), \dots, LT(g_m))$
contains $LT(f)$. Since $f \in I$, we can write $f = \sum_{i=1}^{m} h_i g_i$ for some polynomials
$h_1, \dots, h_m$. Such a representation is not unique. Among all such representations
choose one for which the largest multidegree of any summand (i.e., $\max_{i=1,\dots,m} \partial(h_i g_i)$)
is minimal, say $\alpha$. It is clear that the multidegree of $f$ is no worse than the largest
multidegree of all the summands $h_i g_i$, so $\partial(f) \le \alpha$. Write

$$f = \sum_{i=1}^{m} h_i g_i = \sum_{\partial(h_i g_i) = \alpha} h_i g_i + \sum_{\partial(h_i g_i) < \alpha} h_i g_i$$
$$\quad = \sum_{\partial(h_i g_i) = \alpha} LT(h_i)\, g_i + \sum_{\partial(h_i g_i) = \alpha} (h_i - LT(h_i))\, g_i + \sum_{\partial(h_i g_i) < \alpha} h_i g_i. \tag{9.2}$$

Suppose that $\partial(f) < \alpha$. Then since the multidegree of the second two sums is also
strictly smaller than $\alpha$ it follows that the multidegree of the first sum is strictly smaller
than $\alpha$. If $a_i \in F$ denotes the constant coefficient of the monomial term $LT(h_i)$ then
$LT(h_i) = a_i h_i'$ where $h_i'$ is a monomial. We can apply Lemma 25 to $\sum a_i (h_i' g_i)$ to
write the first sum above as $\sum b_i\, S(h_{i-1}' g_{i-1}, h_i' g_i)$ with $\partial(h_{i-1}' g_{i-1}) = \partial(h_i' g_i) = \alpha$.
Let $\beta_{i-1,i}$ be the multidegree of the monic least common multiple of $LT(g_{i-1})$ and
$LT(g_i)$. Then an easy computation shows that $S(h_{i-1}' g_{i-1}, h_i' g_i)$ is just $S(g_{i-1}, g_i)$
multiplied by the monomial of multidegree $\alpha - \beta_{i-1,i}$. The polynomial $S(g_{i-1}, g_i)$ has
multidegree less than $\beta_{i-1,i}$ and, by assumption, $S(g_{i-1}, g_i) \equiv 0 \bmod G$. This means
that after general polynomial division of $S(g_{i-1}, g_i)$ by $g_1, \dots, g_m$, each $S(g_{i-1}, g_i)$ can
be written as a sum $\sum q_j g_j$ with $\partial(q_j g_j) < \beta_{i-1,i}$. It follows that each $S(h_{i-1}' g_{i-1}, h_i' g_i)$
is a sum $\sum q_j g_j$ with $\partial(q_j g_j) < \alpha$. But then all the sums on the right hand side of
equation (9.2) can be written as a sum of terms of the form $p_i g_i$ with polynomials $p_i$
satisfying $\partial(p_i g_i) < \alpha$. This contradicts the minimality of $\alpha$ and shows that in fact
$\partial(f) = \alpha$, i.e., the leading term of $f$ has multidegree $\alpha$.

If we now take the terms in equation (9.2) of multidegree $\alpha$ we see that

$$LT(f) = \sum_{\partial(h_i g_i) = \alpha} LT(h_i)\, LT(g_i),$$

so indeed $LT(f) \in (LT(g_1), \dots, LT(g_m))$. It follows that $G = \{g_1, \dots, g_m\}$ is a
Gröbner basis.

Buchberger’s Algorithm 

Buchberger’s Criterion can be used to provide an algorithm to find a Gröbner basis
for an ideal $I$, as follows. If $I = (g_1, \dots, g_m)$ and each $S(g_i, g_j)$ leaves a remainder
of 0 when divided by $G = \{g_1, \dots, g_m\}$ using general polynomial division then $G$
is a Gröbner basis. Otherwise $S(g_i, g_j)$ has a nonzero remainder $r$. Increase $G$ by
appending the polynomial $g_{m+1} = r$: $G' = \{g_1, \dots, g_m, g_{m+1}\}$ and begin again (note
that this is again a set of generators for $I$ since $g_{m+1} \in I$). It is not hard to check
that this procedure terminates after a finite number of steps in a generating set $G$ that
satisfies Buchberger’s Criterion, hence is a Gröbner basis for $I$ (cf. Exercise 16). Note
that once an $S(g_i, g_j)$ yields a remainder of 0 after division by the polynomials in $G$ it
also yields a remainder of 0 when additional polynomials are appended to $G$.
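The procedure just described can be sketched as follows. This is an unoptimized illustration, not a production implementation: it assumes the lexicographic ordering with the first variable largest, stores polynomials as dictionaries from exponent tuples to coefficients, and keeps a queue of index pairs instead of literally restarting after each new element (the remark above about zero remainders persisting is what justifies this). All function names (`divides`, `add_scaled`, `remainder`, `s_polynomial`, `buchberger`, `minimal_basis`) are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def divides(a, b):
    """True if the monomial with exponents a divides the one with exponents b."""
    return all(x <= y for x, y in zip(a, b))

def add_scaled(p, coeff, shift, g):
    """Return p + coeff * x^shift * g with zero terms dropped."""
    out = dict(p)
    for e, c in g.items():
        key = tuple(a + b for a, b in zip(e, shift))
        val = out.get(key, 0) + coeff * c
        if val == 0:
            out.pop(key, None)
        else:
            out[key] = val
    return out

def remainder(f, G):
    """Remainder of f on general polynomial division by the ordered list G."""
    f, r = dict(f), {}
    while f:
        m = max(f)                               # leading monomial (lex order)
        for g in G:
            gm = max(g)
            if divides(gm, m):
                shift = tuple(a - b for a, b in zip(m, gm))
                f = add_scaled(f, -Fraction(f[m]) / g[gm], shift, g)
                break
        else:
            r[m] = f.pop(m)
    return r

def s_polynomial(f, g):
    fm, gm = max(f), max(g)
    lcm = tuple(max(a, b) for a, b in zip(fm, gm))
    left = add_scaled({}, Fraction(1) / f[fm], tuple(a - b for a, b in zip(lcm, fm)), f)
    return add_scaled(left, -Fraction(1) / g[gm], tuple(a - b for a, b in zip(lcm, gm)), g)

def buchberger(generators):
    """Append nonzero remainders of S-polynomials until none are left."""
    G = [dict(g) for g in generators]
    pairs = list(combinations(range(len(G)), 2))
    while pairs:
        i, j = pairs.pop(0)
        r = remainder(s_polynomial(G[i], G[j]), G)
        if r:
            pairs.extend((k, len(G)) for k in range(len(G)))
            G.append(r)
    return G

def minimal_basis(G):
    """Drop elements whose leading monomial is divisible by another's; make the rest monic."""
    kept = []
    for g in sorted(G, key=max):                 # divisors come first in lex order
        if not any(divides(max(h), max(g)) for h in kept):
            lead = g[max(g)]
            kept.append({e: Fraction(c) / lead for e, c in g.items()})
    return kept

f1 = {(3, 1): 1, (1, 2): -1, (0, 0): 1}    # x^3 y - x y^2 + 1
f2 = {(2, 2): 1, (0, 3): -1, (0, 0): -1}   # x^2 y^2 - y^3 - 1
G = buchberger([f1, f2])
M = minimal_basis(G)
```

On the ideal generated by $x^3y - xy^2 + 1$ and $x^2y^2 - y^3 - 1$ this sketch appends $x + y$ and then $y^4 - y^3 - 1$, and `minimal_basis` keeps only those two elements.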

If $\{g_1, \dots, g_m\}$ is a Gröbner basis for the ideal $I$ and $LT(g_j)$ is divisible by $LT(g_i)$
for some $j \ne i$, then $LT(g_j)$ is not needed as a generator for $LT(I)$. By Proposition 24
we may therefore delete $g_j$ and still retain a Gröbner basis for $I$. We may also assume
without loss that the leading term of each $g_i$ is monic. A Gröbner basis $\{g_1, \dots, g_m\}$
for $I$ where each $LT(g_i)$ is monic and where $LT(g_j)$ is not divisible by $LT(g_i)$ for
$i \ne j$ is called a minimal Gröbner basis. While a minimal Gröbner basis is not unique,
the number of elements and their leading terms are unique (cf. Exercise 15).

Examples 

(1) Choose the lexicographic ordering $x > y$ on $F[x, y]$ and consider the ideal $I$ generated
by $f_1 = x^3y - xy^2 + 1$ and $f_2 = x^2y^2 - y^3 - 1$ as in Example 1 at the beginning of
this section. To test whether $G = \{f_1, f_2\}$ is a Gröbner basis we compute $S(f_1, f_2) =
yf_1 - xf_2 = x + y$, which is its own remainder when divided by $\{f_1, f_2\}$, so $G$ is not a
Gröbner basis for $I$. Set $f_3 = x + y$, and increase the generating set: $G' = \{f_1, f_2, f_3\}$.
Now $S(f_1, f_2) \equiv 0 \bmod G'$, and a brief computation yields

$$S(f_1, f_3) = f_1 - x^2y f_3 = -x^2y^2 - xy^2 + 1 \equiv 0 \bmod G'$$
$$S(f_2, f_3) = f_2 - xy^2 f_3 = -xy^3 - y^3 - 1 \equiv y^4 - y^3 - 1 \bmod G'.$$

Let $f_4 = y^4 - y^3 - 1$ and increase the generating set to $G'' = \{f_1, f_2, f_3, f_4\}$. The
previous 0 remainder is still 0, and now $S(f_2, f_3) \equiv 0 \bmod G''$ by the choice of $f_4$.
Some additional computation yields

$$S(f_1, f_4) \equiv S(f_2, f_4) \equiv S(f_3, f_4) \equiv 0 \bmod G''$$

and so $\{x^3y - xy^2 + 1,\ x^2y^2 - y^3 - 1,\ x + y,\ y^4 - y^3 - 1\}$ is a Gröbner basis for
$I$. In particular, $LT(I)$ is generated by the leading terms of these four polynomials,
so $LT(I) = (x^3y, x^2y^2, x, y^4) = (x, y^4)$, as previously mentioned. Then $x + y$
and $y^4 - y^3 - 1$ in $I$ have leading terms generating $LT(I)$, so by Proposition 24,
$\{x + y,\ y^4 - y^3 - 1\}$ gives a minimal Gröbner basis for $I$:

$$I = (x + y,\ y^4 - y^3 - 1).$$

This description of $I$ is much simpler than $I = (x^3y - xy^2 + 1,\ x^2y^2 - y^3 - 1)$.

(2) Choose the lexicographic ordering $y > x$ on $F[x, y]$ and consider the ideal $I$ in the
previous example. In this case, $S(f_1, f_2)$ produces a remainder of $f_3 = -x - y$; then
$S(f_1, f_3)$ produces a remainder of $f_4 = -x^4 - x^3 + 1$, and then all remainders are 0
with respect to the Gröbner basis $\{x^3y - xy^2 + 1,\ x^2y^2 - y^3 - 1,\ -x - y,\ -x^4 - x^3 + 1\}$.
Here $LT(I) = (-xy^2, -y^3, -y, -x^4) = (y, x^4)$, as previously mentioned, and
$\{x + y,\ x^4 + x^3 - 1\}$ gives a minimal Gröbner basis for $I$ with respect to this ordering:

$$I = (x + y,\ x^4 + x^3 - 1),$$

a different simpler description of $I$.

In Example 1 above it is easy to check that $\{x + y^4 - y^3 + y - 1,\ y^4 - y^3 - 1\}$ is again
a minimal Gröbner basis for $I$ (this is just $\{f_3 + f_4, f_4\}$), so even with a fixed monomial
ordering on $F[x_1, \dots, x_n]$ a minimal Gröbner basis for an ideal $I$ is not unique. We can
obtain an important uniqueness property by strengthening the condition on divisibility
by the leading terms of the basis.

Definition. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$. A Gröbner basis
$\{g_1, \dots, g_m\}$ for the nonzero ideal $I$ in $R$ is called a reduced Gröbner basis if

(a) each $g_i$ has monic leading term, i.e., $LT(g_i)$ is monic, $i = 1, \dots, m$, and

(b) no term in $g_j$ is divisible by $LT(g_i)$ for $j \ne i$.

Note that a reduced Gröbner basis is, in particular, a minimal Gröbner basis. If
$G = \{g_1, \dots, g_m\}$ is a minimal Gröbner basis for $I$, then the leading term $LT(g_j)$ is not
divisible by $LT(g_i)$ for any $i \ne j$. As a result, if we use polynomial division to divide
$g_j$ by the other polynomials in $G$ we obtain a remainder $g_j'$ in the ideal $I$ with the same
leading term as $g_j$ (the remainder $g_j'$ does not depend on the order of the polynomials
used in the division by (2) of Theorem 23). By Proposition 24, replacing $g_j$ by $g_j'$ in $G$
again gives a minimal Gröbner basis for $I$, and in this basis no term of $g_j'$ is divisible
by $LT(g_i)$ for any $i \ne j$. Replacing each element in $G$ by its remainder after division
by the other elements in $G$ therefore results in a reduced Gröbner basis for $I$. The
importance of reduced Gröbner bases is that they are unique (for a given monomial
ordering), as the next result shows.
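As a small worked instance of this reduction, take the minimal Gröbner basis $\{x + y^4 - y^3 + y - 1,\ y^4 - y^3 - 1\}$ mentioned after Example 1 above, with the lexicographic ordering $x > y$. The labels $g_1, g_2$ below are introduced only for this illustration.

```latex
% Reduce g_1 = x + y^4 - y^3 + y - 1 by g_2 = y^4 - y^3 - 1 (lex order, x > y).
% The term y^4 of g_1 is divisible by LT(g_2) = y^4, so general polynomial
% division of g_1 by g_2 gives quotient 1 and remainder x + y:
\[
  x + y^4 - y^3 + y - 1 \;=\; 1\cdot\,(y^4 - y^3 - 1) \;+\; (x + y).
\]
% No term of x + y is divisible by y^4, and no term of g_2 is divisible by
% LT(x + y) = x, so g_2 is its own remainder. The resulting reduced basis is
\[
  \{\, x + y,\;\; y^4 - y^3 - 1 \,\}.
\]
```

The leading terms $x$ and $y^4$ are unchanged by the reduction, as the paragraph above predicts.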

Theorem 27. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$. Then there is a unique
reduced Gröbner basis for every nonzero ideal $I$ in $R$.

Proof: By Exercise 15, two reduced bases have the same number of elements and
the same leading terms since reduced bases are also minimal bases. If $G = \{g_1, \dots, g_m\}$
and $G' = \{g_1', \dots, g_m'\}$ are two reduced bases for the same nonzero ideal $I$, then after a
possible rearrangement we may assume $LT(g_i) = LT(g_i') = h_i$ for $i = 1, \dots, m$. For
any fixed $i$, consider the polynomial $f_i = g_i - g_i'$. If $f_i$ is nonzero, then since $f_i \in I$, its
leading term must be divisible by some $h_j$. By definition of a reduced basis, $h_j$ for $j \ne i$
does not divide any of the terms in either $g_i$ or $g_i'$, hence does not divide $LT(f_i)$. But $h_i$
also does not divide $LT(f_i)$ since all the terms in $f_i$ have strictly smaller multidegree.
This forces $f_i = 0$, i.e., $g_i = g_i'$ for every $i$, so $G = G'$.

One application of the uniqueness of the reduced Gröbner basis is a computational
method to determine when two ideals in a polynomial ring are equal.

Corollary 28. Let $I$ and $J$ be two ideals in $F[x_1, \dots, x_n]$. Then $I = J$ if and only
if $I$ and $J$ have the same reduced Gröbner basis with respect to any fixed monomial
ordering on $F[x_1, \dots, x_n]$.


Examples 

(1) Consider the ideal $I = (h_1, h_2, h_3)$ with $h_1 = x^2 + xy^5 + y^4$, $h_2 = xy^6 - xy^3 + y^5 - y^2$,
and $h_3 = xy^5 - xy^2$ in $F[x, y]$. Using the lexicographic ordering $x > y$ we find
$S(h_1, h_2) \equiv S(h_1, h_3) \equiv 0 \bmod \{h_1, h_2, h_3\}$ and $S(h_2, h_3) \equiv y^5 - y^2 \bmod \{h_1, h_2, h_3\}$.
Setting $h_4 = y^5 - y^2$ we find $S(h_i, h_j) \equiv 0 \bmod \{h_1, h_2, h_3, h_4\}$ for $1 \le i < j \le 4$,
so

$$x^2 + xy^5 + y^4,\quad xy^6 - xy^3 + y^5 - y^2,\quad xy^5 - xy^2,\quad y^5 - y^2$$

is a Gröbner basis for $I$. The leading terms of this basis are $x^2$, $xy^6$, $xy^5$, $y^5$. Since $y^5$
divides both $xy^6$ and $xy^5$, we may remove the second and third generators to obtain a
minimal Gröbner basis $\{x^2 + xy^5 + y^4,\ y^5 - y^2\}$ for $I$. The second term in the first
generator is divisible by the leading term $y^5$ of the second generator, so this is not a
reduced Gröbner basis. Replacing $x^2 + xy^5 + y^4$ by its remainder $x^2 + xy^2 + y^4$ after
division by the other polynomials in the basis (which in this case is only the polynomial
$y^5 - y^2$), we are left with the reduced Gröbner basis $\{x^2 + xy^2 + y^4,\ y^5 - y^2\}$ for $I$.

(2) Consider the ideal $J = (h_1, h_2, h_3)$ with $h_1 = xy^3 + y^3 + 1$, $h_2 = x^3y - x^3 + 1$, and
$h_3 = x + y$ in $F[x, y]$. Using the lexicographic monomial ordering $x > y$ we find
$S(h_1, h_2) \equiv 0 \bmod \{h_1, h_2, h_3\}$ and $S(h_1, h_3) \equiv y^4 - y^3 - 1 \bmod \{h_1, h_2, h_3\}$. Setting
$h_4 = y^4 - y^3 - 1$ we find $S(h_i, h_j) \equiv 0 \bmod \{h_1, h_2, h_3, h_4\}$ for $1 \le i < j \le 4$, so

$$xy^3 + y^3 + 1,\quad x^3y - x^3 + 1,\quad x + y,\quad y^4 - y^3 - 1$$

is a Gröbner basis for $J$. The leading terms of this basis are $xy^3$, $x^3y$, $x$, and $y^4$, so
$\{x + y,\ y^4 - y^3 - 1\}$ is a minimal Gröbner basis for $J$. In this case none of the terms
in $y^4 - y^3 - 1$ are divisible by the leading term of $x + y$ and none of the terms in
$x + y$ are divisible by the leading term in $y^4 - y^3 - 1$, so $\{x + y,\ y^4 - y^3 - 1\}$ is the
reduced Gröbner basis for $J$. This is the basis for the ideal $I$ in Example 1 following
Proposition 26, so these two ideals are equal:

$$(x^3y - xy^2 + 1,\ x^2y^2 - y^3 - 1) = (xy^3 + y^3 + 1,\ x^3y - x^3 + 1,\ x + y)$$

(and both are equal to the ideal $(x + y,\ y^4 - y^3 - 1)$).


Gröbner Bases and Solving Algebraic Equations: Elimination

The theory of Gröbner bases is very useful in explicitly solving systems of algebraic
equations, and is the basis by which computer algebra programs attempt to solve systems
of equations. Suppose $S = \{f_1, \dots, f_m\}$ is a collection of polynomials in $n$ variables
$x_1, \dots, x_n$ and we are trying to find the solutions of the system of equations $f_1 = 0$,
$f_2 = 0, \dots, f_m = 0$ (i.e., the common set of zeros of the polynomials in $S$). If
$(a_1, \dots, a_n)$ is any solution to this system, then every element $f$ of the ideal $I$ generated
by $S$ also satisfies $f(a_1, \dots, a_n) = 0$. Furthermore, it is an easy exercise to see that if
$S' = \{g_1, \dots, g_s\}$ is any set of generators for the ideal $I$ then the set of solutions to the
system $g_1 = 0, \dots, g_s = 0$ is the same as the original solution set.

In the situation where $f_1, \dots, f_m$ are linear polynomials, a solution to the system
of equations can be obtained by successively eliminating the variables $x_1, x_2, \dots$ by
elementary means: using linear combinations of the original equations to eliminate
the variable $x_1$, then using these equations to eliminate $x_2$, etc., producing a system of
equations that can be easily solved (this is “Gauss-Jordan elimination” in linear algebra,
cf. the exercises in Section 11.2).

The situation for polynomial equations that are nonlinear is naturally more
complicated, but the basic principle is the same. If there is a nonzero polynomial in the
ideal $I$ involving only one of the variables, say $p(x_n)$, then the last coordinate $a_n$ is
a solution of $p(x_n) = 0$. If now there is a polynomial in $I$ involving only $x_{n-1}$ and
$x_n$, say $q(x_{n-1}, x_n)$, then the coordinate $a_{n-1}$ would be a solution of $q(x_{n-1}, a_n) = 0$,
etc. If we can successively find polynomials in $I$ that eliminate the variables $x_1, x_2, \dots$
then we will be able to determine all the solutions $(a_1, \dots, a_n)$ to our original system
of equations explicitly.

Finding equations that follow from the system of equations in $S$, i.e., finding
elements of the ideal $I$ that do not involve some of the variables, is referred to as
elimination theory. The set of polynomials in $I$ that do not involve the variables $x_1, \dots, x_i$, i.e.,
$I \cap F[x_{i+1}, \dots, x_n]$, is easily seen to be an ideal in $F[x_{i+1}, \dots, x_n]$ and is given a name.

Definition. If $I$ is an ideal in $F[x_1, \dots, x_n]$ then $I_i = I \cap F[x_{i+1}, \dots, x_n]$ is called
the $i$th elimination ideal of $I$ with respect to the ordering $x_1 > \cdots > x_n$.

The success of using elimination to solve a system of equations depends on being
able to determine the elimination ideals (and, ultimately, on whether these elimination
ideals are nonzero).

The following fundamental proposition shows that if the lexicographic monomial
ordering $x_1 > \cdots > x_n$ is used to compute a Gröbner basis for $I$ then the elements
in the resulting basis not involving the variables $x_1, \dots, x_i$ not only determine the $i$th
elimination ideal, but in fact give a Gröbner basis for the $i$th elimination ideal of $I$.

Proposition 29. (Elimination) Suppose $G = \{g_1, \dots, g_m\}$ is a Gröbner basis for the
nonzero ideal $I$ in $F[x_1, \dots, x_n]$ with respect to the lexicographic monomial ordering
$x_1 > \cdots > x_n$. Then $G \cap F[x_{i+1}, \dots, x_n]$ is a Gröbner basis of the $i$th elimination
ideal $I_i = I \cap F[x_{i+1}, \dots, x_n]$ of $I$. In particular, $I \cap F[x_{i+1}, \dots, x_n] = 0$ if and only
if $G \cap F[x_{i+1}, \dots, x_n] = \emptyset$.

Proof: Denote $G_i = G \cap F[x_{i+1}, \dots, x_n]$. Then $G_i \subseteq I_i$, so by Proposition
24, to see that $G_i$ is a Gröbner basis of $I_i$ it suffices to see that $LT(G_i)$, the leading
terms of the elements in $G_i$, generate $LT(I_i)$ as an ideal in $F[x_{i+1}, \dots, x_n]$. Certainly
$(LT(G_i)) \subseteq LT(I_i)$ as ideals in $F[x_{i+1}, \dots, x_n]$. To show the reverse containment,
let $f$ be any element in $I_i$. Then $f \in I$ and since $G$ is a Gröbner basis for $I$ we have

$$LT(f) = a_1(x_1, \dots, x_n)\,LT(g_1) + \cdots + a_m(x_1, \dots, x_n)\,LT(g_m)$$

for some polynomials $a_1, \dots, a_m \in F[x_1, \dots, x_n]$. Writing each polynomial $a_j$ as a
sum of monomial terms we see that $LT(f)$ is a sum of monomial terms of the form
$a x_1^{s_1} \cdots x_n^{s_n} LT(g_j)$. Since $LT(f)$ involves only the variables $x_{i+1}, \dots, x_n$, the sum of
all such terms containing any of the variables $x_1, \dots, x_i$ must be 0, so $LT(f)$ is also the
sum of those monomial terms only involving $x_{i+1}, \dots, x_n$. It follows that $LT(f)$ can be
written as a $F[x_{i+1}, \dots, x_n]$-linear combination of some monomial terms $LT(g_t)$ where
$LT(g_t)$ does not involve the variables $x_1, \dots, x_i$. But by the choice of the ordering,
if $LT(g_t)$ does not involve $x_1, \dots, x_i$, then neither do any of the other terms in $g_t$,
i.e., $g_t \in G_i$. Hence $LT(f)$ can be written as a $F[x_{i+1}, \dots, x_n]$-linear combination of
elements of $LT(G_i)$, completing the proof.

Note also that Gröbner bases can be used to eliminate any variables simply by using
an appropriate monomial ordering.

Examples

(1) The ellipse $2x^2 + 2xy + y^2 - 2x - 2y = 0$ intersects the circle $x^2 + y^2 = 1$ in two
points. To find them we compute a Gröbner basis for the ideal $I = (2x^2 + 2xy + y^2 -
2x - 2y,\ x^2 + y^2 - 1) \subset \mathbb{R}[x, y]$ using the lexicographic monomial order $x > y$ to
eliminate $x$, obtaining $g_1 = 2x + y^2 + 5y^3 - 2$ and $g_2 = 5y^4 - 4y^3$. Hence $5y^4 = 4y^3$
and $y = 0$ or $y = 4/5$. Substituting these values into $g_1 = 0$ and solving for $x$ we find
the two intersection points are $(1, 0)$ and $(-3/5, 4/5)$.

Instead using the lexicographic monomial order $y > x$ to eliminate $y$ results in
the Gröbner basis $\{y^2 + x^2 - 1,\ 2yx - 2y + x^2 - 2x + 1,\ 5x^3 - 7x^2 - x + 3\}$. Then
$5x^3 - 7x^2 - x + 3 = (x - 1)^2(5x + 3)$ shows that $x$ is 1 or $-3/5$ and we obtain the
same solutions as before, although with more effort.

(2) In the previous example the solutions could also have been found by elementary means.
Consider now the solutions in $\mathbb{C}$ to the system of two equations

$$x^3 - 2xy + y^3 = 0 \quad\text{and}\quad x^5 - 2x^2y^2 + y^5 = 0.$$

Computing a Gröbner basis for the ideal generated by $f_1 = x^3 - 2xy + y^3$ and
$f_2 = x^5 - 2x^2y^2 + y^5$ with respect to the lexicographic monomial order $x > y$ we
obtain the basis

$$g_1 = x^3 - 2xy + y^3$$
$$g_2 = 200xy^2 + 193y^9 + 158y^8 - 45y^7 - 456y^6 + 50y^5 - 100y^4$$
$$g_3 = y^{10} - y^8 - 2y^7 + 2y^6.$$

Any solution to our original equations would satisfy $g_1 = g_2 = g_3 = 0$. Since
$g_3 = y^6(y - 1)^2(y^2 + 2y + 2)$, we have $y = 0$, $y = 1$ or $y = -1 \pm i$. Since
$g_1(x, 0) = x^3$ and $g_2(x, 0) = 0$, we see that $(0, 0)$ is the only solution with $y = 0$.
Since $g_1(x, 1) = x^3 - 2x + 1$ and $g_2(x, 1) = 200(x - 1)$ have only $x = 1$ as a common
zero, the only solution with $y = 1$ is $(1, 1)$. Finally,

$$g_1(x, -1 \pm i) = x^3 + (2 \mp 2i)x + (2 \pm 2i)$$
$$g_2(x, -1 \pm i) = \mp 400i\,(x + 1 \pm i),$$

and a quick check shows the common zero $x = -1 \mp i$ when $y = -1 \pm i$, respectively.
Hence, there are precisely four solutions to the original pair of equations, namely

$$(x, y) = (0, 0),\ (1, 1),\ (-1 + i, -1 - i),\ \text{or}\ (-1 - i, -1 + i).$$

(3) Consider the solutions in $\mathbb{C}$ to the system of equations

$$x + y + z = 1$$
$$x^2 + y^2 + z^2 = 2$$
$$x^3 + y^3 + z^3 = 3.$$

The reduced Gröbner basis with respect to the lexicographic ordering $x > y > z$ is

$$\{x + y + z - 1,\ \ y^2 + yz - y + z^2 - z - (1/2),\ \ z^3 - z^2 - (1/2)z - (1/6)\}$$

and so $z$ is a root of the polynomial $t^3 - t^2 - (1/2)t - (1/6)$ (by symmetry, also $x$ and $y$
are roots of this same polynomial). For each of the three roots of this polynomial, there
are two values of $y$ and one corresponding value of $x$ making the first two polynomials
in the Gröbner basis equal to 0. The resulting six solutions are quickly checked to be
the three distinct roots of the polynomial $t^3 - t^2 - (1/2)t - (1/6)$ (which is irreducible
over $\mathbb{Q}$) in some order.

As the previous examples show, the study of solutions to systems of polynomial
equations $f_1 = 0, f_2 = 0, \dots, f_m = 0$ is intimately related to the study of the ideal
$I = (f_1, f_2, \dots, f_m)$ the polynomials generate in $F[x_1, \dots, x_n]$. This fundamental
connection is the starting point for the important and active branch of mathematics
called “algebraic geometry”, introduced in Chapter 15, where additional applications
of Gröbner bases are given.
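The solutions found in Examples 1 and 2 above can be confirmed by direct substitution. The short check below is only a sanity test under stated assumptions: it uses exact rational arithmetic for Example 1 and Python complex numbers for Example 2 (exact here because all intermediate values are Gaussian integers of small size), and every function and variable name is chosen for illustration.

```python
from fractions import Fraction

# Example 1: ellipse, circle, and the two basis elements for lex order x > y
def ellipse(x, y):
    return 2*x*x + 2*x*y + y*y - 2*x - 2*y

def circle(x, y):
    return x*x + y*y - 1

def g1(x, y):
    return 2*x + y*y + 5*y*y*y - 2

def g2(y):
    return 5*y*y*y*y - 4*y*y*y

points = [(Fraction(1), Fraction(0)), (Fraction(-3, 5), Fraction(4, 5))]
example1_ok = all(
    ellipse(x, y) == 0 and circle(x, y) == 0 and g1(x, y) == 0 and g2(y) == 0
    for x, y in points
)

# Example 2: the pair of equations and the four claimed solutions in C
def f1(x, y):
    return x*x*x - 2*x*y + y*y*y

def f2(x, y):
    return x*x*x*x*x - 2*x*x*y*y + y*y*y*y*y

solutions = [
    (0, 0),
    (1, 1),
    (complex(-1, 1), complex(-1, -1)),
    (complex(-1, -1), complex(-1, 1)),
]
example2_ok = all(f1(x, y) == 0 and f2(x, y) == 0 for x, y in solutions)
```

Such a substitution only confirms that the listed points are solutions; that there are no others is what the elimination argument in the examples establishes.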

We close this section by showing how to compute the basic set-theoretic
operations of sums, products and intersections of ideals in polynomial rings.
Suppose $I = (f_1, \dots, f_s)$ and $J = (h_1, \dots, h_t)$ are two ideals in $F[x_1, \dots, x_n]$. Then
$I + J = (f_1, \dots, f_s, h_1, \dots, h_t)$ and $IJ = (f_1h_1, \dots, f_ih_j, \dots, f_sh_t)$. The following
proposition shows how to compute the intersection of any two ideals.

Proposition 30. If $I$ and $J$ are any two ideals in $F[x_1, \dots, x_n]$ then $tI + (1 - t)J$
is an ideal in $F[t, x_1, \dots, x_n]$ and $I \cap J = (tI + (1 - t)J) \cap F[x_1, \dots, x_n]$. In
particular, $I \cap J$ is the first elimination ideal of $tI + (1 - t)J$ with respect to the
ordering $t > x_1 > \cdots > x_n$.

Proof: First, $tI$ and $(1 - t)J$ are clearly ideals in $F[x_1, \dots, x_n, t]$, so also their sum
$tI + (1 - t)J$ is an ideal in $F[x_1, \dots, x_n, t]$. If $f \in I \cap J$, then $f = tf + (1 - t)f$ shows
$I \cap J \subseteq (tI + (1 - t)J) \cap F[x_1, \dots, x_n]$. Conversely, suppose $f = tf_1 + (1 - t)f_2$ is
an element of $F[x_1, \dots, x_n]$, where $f_1 \in I$ and $f_2 \in J$. Then $t(f_1 - f_2) = f - f_2 \in
F[x_1, \dots, x_n]$ shows that $f_1 - f_2 = 0$ and $f = f_2$, so $f = f_1 = f_2 \in I \cap J$.
Since $I \cap J = (tI + (1 - t)J) \cap F[x_1, \dots, x_n]$, $I \cap J$ is the first elimination ideal of
$tI + (1 - t)J$ with respect to the ordering $t > x_1 > \cdots > x_n$.

We have $tI + (1 - t)J = (tf_1, \dots, tf_s, (1 - t)h_1, \dots, (1 - t)h_t)$ if $I = (f_1, \dots, f_s)$
and $J = (h_1, \dots, h_t)$. By Proposition 29, the elements not involving $t$ in a Gröbner basis
for this ideal in $F[t, x_1, \dots, x_n]$, computed for the lexicographic monomial ordering
$t > x_1 > \cdots > x_n$, give a Gröbner basis for the ideal $I \cap J$ in $F[x_1, \dots, x_n]$.

Example 

Let $I = (x, y)^2 = (x^2, xy, y^2)$ and let $J = (x)$. For the lexicographic monomial ordering
$t > x > y$ the reduced Gröbner basis for $tI + (1 - t)J$ in $F[t, x, y]$ is $\{tx - x, ty^2, x^2, xy\}$
and so $I \cap J = (x^2, xy)$.
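
Because both ideals in this example are generated by monomials, the answer can be cross-checked without a Gröbner basis computation: the exercises of this section note that the intersection of two monomial ideals is generated by the pairwise least common multiples of their generators. The following is only a sketch of that check, not the elimination method of Proposition 30. Monomials are encoded as exponent tuples, and the function names are illustrative choices rather than anything from the text.

```python
from itertools import product

def lcm(m1, m2):
    """Least common multiple of two monomials given as exponent tuples."""
    return tuple(max(a, b) for a, b in zip(m1, m2))

def divides(m1, m2):
    """True if the monomial m1 divides the monomial m2."""
    return all(a <= b for a, b in zip(m1, m2))

def minimal_generators(monomials):
    """Discard every monomial that is a proper multiple of another one."""
    unique = set(monomials)
    return sorted(
        (m for m in unique
         if not any(d != m and divides(d, m) for d in unique)),
        reverse=True,
    )

def intersect(gens_i, gens_j):
    """Generators of the intersection of two monomial ideals (pairwise lcms)."""
    return minimal_generators(lcm(a, b) for a, b in product(gens_i, gens_j))

# The exponent tuple (a, b) stands for x^a y^b.
I = [(2, 0), (1, 1), (0, 2)]   # I = (x^2, xy, y^2)
J = [(1, 0)]                   # J = (x)
result = intersect(I, J)
print(result)                  # exponent tuples of x^2 and xy
```

The pairwise lcms are $x^2$, $xy$ and $xy^2$; the last is a multiple of $xy$ and is dropped, which agrees with $I \cap J = (x^2, xy)$.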


EXERCISES 

1. Suppose $I$ is an ideal in $F[x_1, \dots, x_n]$ generated by a (possibly infinite) set $S$ of
polynomials. Prove that a finite subset of the polynomials in $S$ suffices to generate $I$. [Use
Theorem 21 to write $I = (f_1, \dots, f_m)$ and then write each $f_i \in I$ using polynomials in $S$.]

2. Let $\ge$ be any monomial ordering.

(a) Prove that $LT(fg) = LT(f)LT(g)$ and $\partial(fg) = \partial(f) + \partial(g)$ for any nonzero
polynomials $f$ and $g$.

(b) Prove that $\partial(f + g) \le \max(\partial(f), \partial(g))$ with equality if $\partial(f) \ne \partial(g)$.

(c) Prove that $m \ge 1$ for every monomial $m$.

(d) Prove that if $m_1$ divides $m_2$ then $m_2 \ge m_1$. Deduce that the leading term of a
polynomial does not divide any of its lower order terms.

3. Prove that if $\ge$ is any total or partial ordering on a nonempty set then the following are
equivalent:

(i) Every nonempty subset contains a minimum element.

(ii) There is no infinite strictly decreasing sequence $a_1 > a_2 > a_3 > \cdots$ (this is called
the descending chain condition or D.C.C.).

Deduce that General Polynomial Division always terminates in finitely many steps.

4. Let $\ge$ be a monomial ordering, and for monomials $m_1, m_2$ define $m_1 \ge_g m_2$ if either
$\deg m_1 > \deg m_2$, or $\deg m_1 = \deg m_2$ and $m_1 \ge m_2$.

(a) Prove that $\ge_g$ is also a monomial ordering. (The relation $\ge_g$ is called the grading
of $\ge$. An ordering in which the most important criterion for comparison is degree is
sometimes called a graded or a degree ordering, so this exercise gives a method for
constructing graded orderings.)

(b) The grading of the lexicographic ordering $x_1 > \cdots > x_n$ is called the grlex monomial
ordering. Show that $x_2^4 > x_1^2x_2 > x_1x_2^2 > x_2^2 > x_1$ with respect to the grlex ordering
and $x_1^2x_2 > x_1x_2^2 > x_1 > x_2^4 > x_2^2$ with respect to the lexicographic ordering.

5. The grevlex monomial ordering is defined by first choosing an ordering of the variables
$\{x_1, x_2, \dots, x_n\}$, then defining $m_1 \ge m_2$ for monomials $m_1, m_2$ if either $\deg m_1 > \deg m_2$
or $\deg m_1 = \deg m_2$ and the first exponent of $x_n, x_{n-1}, \dots, x_1$ (in that order) where $m_1$
and $m_2$ differ is smaller in $m_1$.

(a) Prove that grevlex is a monomial ordering that satisfies $x_1 > x_2 > \cdots > x_n$.

(b) Prove that the grevlex ordering on $F[x_1, x_2]$ with respect to $\{x_1, x_2\}$ is the graded
lexicographic ordering with $x_1 > x_2$, but that the grevlex ordering on $F[x_1, x_2, x_3]$ is
not the grading of any lexicographic ordering.

(c) Show that x\x^x^ > x 2 x 2 > x|x| > x^x\ > X1X2 > x| > X1X3 > x| > xi > X2 for 
the grevlex monomial ordering with respect to $\{x_1, x_2, x_3\}$.

6. Show that $x^3y > x^3z^2 > x^3z > x^2y^2z > x^2y > xz^2 > y^2z^2 > y^2z$ with respect to
the lexicographic monomial ordering $x > y > z$. Show that for the corresponding grlex
monomial ordering $x^3z^2 > x^2y^2z > x^3y > x^3z > y^2z^2 > x^2y > xz^2 > y^2z$, and that
$x^2y^2z > x^3z^2 > x^3y > x^3z > y^2z^2 > x^2y > y^2z > xz^2$ for the grevlex monomial
ordering with respect to $\{x, y, z\}$.

7. Order the monomials $x^2z$, $x^2y^2z$, $xy^2z$, $x^3y$, $x^3z^2$, $x^2$, $x^2yz^2$, $x^2z^2$ for the lexicographic
monomial ordering $x > y > z$, for the corresponding grlex monomial order, and for the
grevlex monomial ordering with respect to $\{x, y, z\}$.

8. Show there are $n!$ distinct lexicographic monomial orderings on $F[x_1, \dots, x_n]$. Show
similarly that there are $n!$ distinct grlex and grevlex monomial orderings.

9. It can be shown that any monomial ordering on $F[x_1, \dots, x_n]$ may be obtained as follows.
For $k \le n$ let $v_1, v_2, \dots, v_k$ be nonzero vectors in Euclidean $n$-space, $\mathbb{R}^n$, that are pairwise
orthogonal: $v_i \cdot v_j = 0$ for all $i \ne j$, where $\cdot$ is the usual dot product, and suppose also that
all the coordinates of $v_1$ are nonnegative. Define an order, $\ge$, on monomials by $m_1 > m_2$
if and only if for some $t \le k$ we have $v_i \cdot \partial(m_1) = v_i \cdot \partial(m_2)$ for all $i \in \{1, 2, \dots, t - 1\}$
and $v_t \cdot \partial(m_1) > v_t \cdot \partial(m_2)$.

(a) Let $k = n$ and let $v_i = (0, \dots, 0, 1, 0, \dots, 0)$ with 1 in the $i$th position. Show that $\ge$
defines the lexicographic order with $x_1 > x_2 > \cdots > x_n$.

(b) Let $k = n$ and define $v_1 = (1, 1, \dots, 1)$ and $v_i = (1, 1, \dots, 1, -n + i - 1, 0, \dots, 0)$,
where there are $i - 2$ trailing zeros, $2 \le i \le n$. Show that $\ge$ defines the grevlex order
with respect to $\{x_1, \dots, x_n\}$.

10. Suppose $I$ is a monomial ideal generated by monomials $m_1, \dots, m_k$. Prove that the
polynomial $f \in F[x_1, \dots, x_n]$ is in $I$ if and only if every monomial term $f_i$ of $f$ is
a multiple of one of the $m_j$. [For polynomials $a_1, \dots, a_k \in F[x_1, \dots, x_n]$ expand the
polynomial $a_1m_1 + \cdots + a_km_k$ and note that every monomial term is a multiple of at least
one of the $m_j$.] Show that $x^2yz + 3xy^2$ is an element of the ideal $I = (xyz, y^2) \subset F[x, y, z]$
but is not an element of the ideal $I' = (xz^2, y^2)$.

11. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$ and suppose $\{g_1, \dots, g_m\}$ is a Gröbner
basis for the ideal $I$ in $R$. Prove that $h \in LT(I)$ if and only if $h$ is a sum of monomial
terms each divisible by some $LT(g_i)$, $1 \le i \le m$. [Use the previous exercise.]

12. Suppose $I$ is a monomial ideal with monomial generators $g_1, \dots, g_m$. Use the previous
exercise to prove directly that $\{g_1, \dots, g_m\}$ is a Gröbner basis for $I$.

13. Suppose $I$ is a monomial ideal with monomial generators $g_1, \dots, g_m$. Use Buchberger's
Criterion to prove that $\{g_1, \dots, g_m\}$ is a Gröbner basis for $I$.

14. Suppose $I$ is a monomial ideal in $R = F[x_1, \dots, x_n]$ and suppose $\{m_1, \dots, m_k\}$ is a
minimal set of monomials generating $I$, i.e., each $m_i$ is a monomial and no proper subset
of $\{m_1, \dots, m_k\}$ generates $I$. Prove that the $m_i$, $1 \le i \le k$, are unique. [Use Exercise 10.]

15. Fix a monomial ordering on $R = F[x_1, \dots, x_n]$.

(a) Prove that $\{g_1, \dots, g_m\}$ is a minimal Gröbner basis for the ideal $I$ in $R$ if and only if
$\{LT(g_1), \dots, LT(g_m)\}$ is a minimal generating set for $LT(I)$.

(b) Prove that the leading terms of a minimal Gröbner basis for $I$ are uniquely determined
and the number of elements in any two minimal Gröbner bases for $I$ is the same. [Use
(a) and the previous exercise.]

16. Fix a monomial ordering on $F[x_1, \dots, x_n]$ and suppose $G = \{g_1, \dots, g_m\}$ is a set of
generators for the nonzero ideal $I$. Show that if $S(g_i, g_j) \not\equiv 0 \bmod G$ then the ideal
$(LT(g_1), \dots, LT(g_m), LT(S(g_i, g_j)))$ is strictly larger than the ideal $(LT(g_1), \dots, LT(g_m))$.
Conclude that the algorithm for computing a Gröbner basis described following Proposition
26 terminates after a finite number of steps. [Use Exercise 1.]

17. Fix the lexicographic ordering $x > y$ on $F[x, y]$. Use Buchberger's Criterion to show that
$\{x^2y - y^2, x^3 - xy\}$ is a Gröbner basis for the ideal $I = (x^2y - y^2, x^3 - xy)$.

18. Show $\{x - y^3, y^5 - y^6\}$ is the reduced Gröbner basis for the ideal $I = (x - y^3, -x^2 + xy^2)$
with respect to the lexicographic ordering defined by $x > y$ in $F[x, y]$.

19. Fix the lexicographic ordering $x > y$ on $F[x, y]$.

(a) Show that $\{x^3 - y, x^2y - y^2, xy^2 - y^2, y^3 - y^2\}$ is the reduced Gröbner basis for the
ideal $I = (-x^3 + y, x^2y - y^2)$.

(b) Determine whether the polynomial $f = x^6 - x^5y$ is an element of the ideal $I$.

20. Fix the lexicographic ordering $x > y > z$ on $F[x, y, z]$. Show that $\{x^2 + xy + z, xyz +
z^2, xz^2, z^3\}$ is the reduced Gröbner basis for the ideal $I = (x^2 + xy + z, xyz + z^2)$ and in
particular conclude that the leading term ideal $LT(I)$ requires four generators.

21. Fix the lexicographic ordering $x > y$ on $F[x, y]$. Use Buchberger's Criterion to show that
$\{x^2y - y^2, x^3 - xy\}$ is a Gröbner basis for the ideal $I = (x^2y - y^2, x^3 - xy)$.

22. Let $I = (x^2 - y, x^2y - z)$ in $F[x, y, z]$.

(a) Show that $\{x^2 - y, y^2 - z\}$ is the reduced Gröbner basis for $I$ with respect to the
lexicographic ordering defined by $x > y > z$.

(b) Show that $\{x^2 - y, z - y^2\}$ is the reduced Gröbner basis for $I$ with respect to the
lexicographic ordering defined by $z > x > y$ (note these are essentially the same
polynomials as in (a)).

(c) Show that $\{y - x^2, z - x^4\}$ is the reduced Gröbner basis for $I$ with respect to the
lexicographic ordering defined by $z > y > x$.

23. Show that the ideals $I = (x^2y + xy^2 - 2y, x^2 + xy - x + y^2 - 2y, xy^2 - x - y + y^3)$ and
$J = (x - y^2, xy - y, x^2 - y)$ in $F[x, y]$ are equal.

24. Use reduced Gröbner bases to show that the ideal $I = (x^3 - yz, yz + y)$ and the ideal
$J = (x^3z + x^3, x^3 + y)$ in $F[x, y, z]$ are equal.

25. Show that the reduced Gröbner basis using the lexicographic ordering $x > y$ for the ideal
$I = (x^2 + xy^2, x^2 - y^3, y^3 - y^2)$ is $\{x^2 - y^2, y^3 - y^2, xy^2 + y^2\}$.

26. Show that the reduced Gröbner basis for the ideal $I = (xy + y^2, x^2y + xy^2 + x^2)$ is
$\{x^2, xy + y^2, y^3\}$ with respect to the lexicographic ordering $x > y$ and is $\{y^2 + yx, x^2\}$
with respect to the lexicographic ordering $y > x$.

There are generally substantial differences in computational complexity when using different
monomial orders. The grevlex monomial ordering often provides the most efficient computation
and produces simpler polynomials.
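
To make the comparison of monomial orders concrete, the three orderings used in these exercises can be written as sort keys on exponent tuples. The sketch below is only an illustration: the key functions and the dictionary `MONOMIALS` are names chosen here, and the data are the eight monomials of Exercise 6 with exponents listed in the order $(x, y, z)$.

```python
def lex_key(m):
    """Lexicographic order: compare exponents of x, then y, then z."""
    return m

def grlex_key(m):
    """Graded lex: compare total degree first, then break ties by lex."""
    return (sum(m), m)

def grevlex_key(m):
    """Graded reverse lex: compare total degree first; on a tie, read the
    exponents from the last variable backwards and prefer the SMALLER one."""
    return (sum(m), tuple(-e for e in reversed(m)))

# Exponent tuples (a, b, c) stand for x^a y^b z^c.
MONOMIALS = {
    "x^3y": (3, 1, 0), "x^3z^2": (3, 0, 2), "x^3z": (3, 0, 1),
    "x^2y^2z": (2, 2, 1), "x^2y": (2, 1, 0), "xz^2": (1, 0, 2),
    "y^2z^2": (0, 2, 2), "y^2z": (0, 2, 1),
}

def descending(key):
    """Names of the monomials from largest to smallest under the given key."""
    return sorted(MONOMIALS, key=lambda name: key(MONOMIALS[name]), reverse=True)

for label, key in [("lex", lex_key), ("grlex", grlex_key), ("grevlex", grevlex_key)]:
    print(label, " > ".join(descending(key)))
```

The three printed chains can be compared with the chains stated in Exercise 6. Note that grlex and grevlex already disagree in degree 3, where $xz^2 > y^2z$ for grlex but $y^2z > xz^2$ for grevlex.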

27. Show that $\{x^3 - y^3, x^2 + xy^2 + y^4, x^2y + xy^3 + y^2\}$ is a reduced Gröbner basis for the ideal $I$
in the example following Corollary 28 with respect to the grlex monomial ordering. (Note
that while this gives three generators for $I$ rather than two for the lexicographic ordering
as in the example, the degrees are smaller.)

28. Let $I = (x^4 - y^4 + z^3 - 1, x^3 + y^2 + z^2 - 1)$. Show that there are five elements in a
reduced Gröbner basis for $I$ with respect to the lexicographic ordering with $x > y > z$ (the
maximum degree among the five generators is 12 and the maximum number of monomial
terms among the five generators is 35), that there are two elements for the lexicographic
ordering $y > z > x$ (maximum degree is 6 and maximum number of terms is 8), and that
$\{x^3 + y^2 + z^2 - 1, xy^2 + xz^2 - x + y^4 - z^3 + 1\}$ is the reduced Gröbner basis for the
grevlex monomial ordering.

29. Solve the system of equations $x^2 - yz = 3$, $y^2 - xz = 4$, $z^2 - xy = 5$ over $\mathbb{C}$.

30. Find a Gröbner basis for the ideal $I = (x^2 + xy + y^2 - 1, x^2 + 4y^2 - 4)$ for the lexicographic
ordering $x > y$ and use it to find the four points of intersection of the ellipse $x^2 + xy + y^2 = 1$
with the ellipse $x^2 + 4y^2 = 4$ in $\mathbb{R}^2$.

31. Use Gröbner bases to find all six solutions to the system of equations $2x^3 + 2x^2y^2 + 3y^3 = 0$
and $3x^5 + 2x^3y^3 + 2y^5 = 0$ over $\mathbb{C}$.

32. Use Gröbner bases to show that $(x, z) \cap (y^2, x - yz) = (xy, x - yz)$ in $F[x, y, z]$.

33. Use Gröbner bases to compute the intersection of the ideals $(x^3y - xy^2 + 1, x^2y^2 - y^3 - 1)$
and $(x^2 - y^2, x^3 + y^3)$ in $F[x, y]$.

The following four exercises deal with the ideal quotient of two ideals $I$ and $J$ in a ring $R$.

Definition. The ideal quotient $(I : J)$ of two ideals $I$, $J$ in a ring $R$ is the ideal

$(I : J) = \{r \in R \mid rJ \subseteq I\}$.

34. (a) Suppose $R$ is an integral domain, $0 \ne f \in R$ and $I$ is an ideal in $R$. Show that if
$\{g_1, \dots, g_s\}$ are generators for the ideal $I \cap (f)$, then $\{g_1/f, \dots, g_s/f\}$ are generators
for the ideal quotient $(I : (f))$.

(b) If $I$ is an ideal in the commutative ring $R$ and $f_1, \dots, f_s \in R$, show that the ideal
quotient $(I : (f_1, \dots, f_s))$ is the ideal $\bigcap_{i=1}^{s} (I : (f_i))$.

35. If $I = (x^2y + z^3, x + y^3 - z, 2y^4z - yz^2 - z^3)$ and $J = (x^2y^5, x^3z^4, y^3z^7)$ in $\mathbb{Q}[x, y, z]$
show $(I : J)$ is the ideal $(z^2, y + z, x - z)$. [Use the previous exercise and Proposition 30.]

36. Suppose that $K$ is an ideal in $R$, that $I$ is an ideal containing $K$, and $J$ is any ideal. If $\bar{I}$
and $\bar{J}$ denote the images of $I$ and $J$ in the quotient ring $R/K$, show that $(\bar{I} : \bar{J}) = \overline{(I : J)}$
where $\overline{(I : J)}$ is the image in $R/K$ of the ideal quotient $(I : J)$.

37. Let $K$ be the ideal $(y^5 - z^4)$ in $R = \mathbb{Q}[y, z]$. For each of the following pairs of ideals
$I$ and $J$, use the previous two exercises together with Proposition 30 to verify the ideal
quotients $(\bar{I} : \bar{J})$ in the ring $R/K$:

i. $I = (y^3, y^5 - z^4)$, $J = (z)$, $(\bar{I} : \bar{J}) = (\bar{y}^3, \bar{z}^3)$.

ii. $I = (y^3, z, y^5 - z^4)$, $J = (y)$, $(\bar{I} : \bar{J}) = (\bar{y}^2, \bar{z})$.

iii. $I = (y, y^3, z, y^5 - z^4)$, $J = (1)$, $(\bar{I} : \bar{J}) = (\bar{y}, \bar{z})$.

Exercises 38 to 44 develop some additional elementary properties of monomial ideals in
$F[x_1, \dots, x_n]$. It follows from Hilbert's Basis Theorem that ideals are finitely generated;
however, one need not assume this in these exercises, since the arguments are the same for finitely
or infinitely generated ideals. These exercises may be used to give an independent proof of
Hilbert's Basis Theorem (Exercise 44). In these exercises, $M$ and $N$ are monomial ideals with
monomial generators $\{m_i \mid i \in I\}$ and $\{n_j \mid j \in J\}$ for some index sets $I$ and $J$ respectively.
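
Two of the descriptions used in this section are easy to test on small finitely generated cases: the membership criterion of Exercise 10 (every monomial term must be divisible by some generator) and the formula of Exercise 41 for the quotient of a monomial ideal by a single monomial. The sketch below is illustrative only; it encodes monomials as exponent tuples and polynomials as dictionaries from exponent tuples to coefficients, and all function names are choices made here.

```python
def divides(m1, m2):
    """True if the monomial m1 divides the monomial m2."""
    return all(a <= b for a, b in zip(m1, m2))

def in_monomial_ideal(poly, gens):
    """Exercise 10: f lies in the monomial ideal iff each monomial term of f
    is a multiple of some generator. `poly` maps exponent tuples to nonzero
    coefficients."""
    return all(any(divides(g, term) for g in gens) for term in poly)

def monomial_gcd(m1, m2):
    """Greatest common divisor of two monomials."""
    return tuple(min(a, b) for a, b in zip(m1, m2))

def quotient_by_monomial(gens, n):
    """Exercise 41: (M : (n)) is generated by the monomials m_i / gcd(m_i, n).
    The result is reduced to a minimal generating set."""
    quotients = {
        tuple(a - d for a, d in zip(m, monomial_gcd(m, n))) for m in gens
    }
    return sorted(
        (q for q in quotients
         if not any(d != q and divides(d, q) for d in quotients)),
        reverse=True,
    )

# Exponent tuples (a, b, c) stand for x^a y^b z^c.
f = {(2, 1, 1): 1, (1, 2, 0): 3}           # f = x^2yz + 3xy^2
print(in_monomial_ideal(f, [(1, 1, 1), (0, 2, 0)]))   # ideal (xyz, y^2)
print(in_monomial_ideal(f, [(1, 0, 2), (0, 2, 0)]))   # ideal (xz^2, y^2)

# ((x^2, xy, y^2) : (x)) in the same three-variable encoding
print(quotient_by_monomial([(2, 0, 0), (1, 1, 0), (0, 2, 0)], (1, 0, 0)))
```

The first two calls reproduce the membership claims at the end of Exercise 10, and the last call returns the exponent tuples of $x$ and $y$, i.e. $((x, y)^2 : (x)) = (x, y)$.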

38. Prove that the sum and product of two monomial ideals is a monomial ideal by showing
that $M + N = (m_i, n_j \mid i \in I, j \in J)$, and $MN = (m_in_j \mid i \in I, j \in J)$.

39. Show that if $\{M_s \mid s \in S\}$ is any nonempty collection of monomial ideals that is totally
ordered under inclusion then $\bigcup_{s \in S} M_s$ is a monomial ideal. (In particular, the union of any
increasing sequence of monomial ideals is a monomial ideal, cf. Exercise 19, Section 7.3.)

40. Prove that the intersection of two monomial ideals is a monomial ideal by showing that
$M \cap N = (e_{i,j} \mid i \in I, j \in J)$, where $e_{i,j}$ is the least common multiple of $m_i$ and $n_j$.
[Use Exercise 10.]

41. Prove that for any monomial $n$, the ideal quotient $(M : (n))$ is $(m_i/d_i \mid i \in I)$, where $d_i$
is the greatest common divisor of $m_i$ and $n$ (cf. Exercise 34). Show that if $N$ is finitely
generated, then the ideal quotient $(M : N)$ of two monomial ideals is a monomial ideal.

42. (a) Show that $M$ is a monomial prime ideal if and only if $M = (S)$ for some subset $S$ of
$\{x_1, x_2, \dots, x_n\}$. (In particular, there are only finitely many monomial prime ideals,
and each is finitely generated.)

(b) Show that $(x_1, \dots, x_n)$ is the only monomial maximal ideal.

43. (Dickson's Lemma, a special case of Hilbert's Basis Theorem) Prove that every monomial
ideal in $F[x_1, \dots, x_n]$ is finitely generated as follows.

Let $\mathcal{S} = \{N \mid N$ is a monomial ideal that is not finitely generated$\}$, and assume by way of
contradiction $\mathcal{S} \ne \emptyset$.

(a) Show that $\mathcal{S}$ contains a maximal element $M$. [Use Exercise 39 and Zorn's Lemma.]

(b) Show that there are monomials $x$, $y$ not in $M$ with $xy \in M$. [Use Exercise 42(a).]

(c) For $x$ as in (b), show that $M$ contains a finitely generated monomial ideal $M_0$ such
that $M_0 + (x) = M + (x)$ and $M = M_0 + (x)(M : (x))$, where $(M : (x))$ is the
(monomial) ideal defined in Exercise 41, and $(x)(M : (x))$ is the product of these
two ideals. Deduce that $M$ is finitely generated, a contradiction which proves $\mathcal{S} = \emptyset$.
[Use the maximality of $M$ and previous exercises.]

44. If $I$ is a nonzero ideal in $F[x_1, \dots, x_n]$, use Dickson's Lemma to prove that $LT(I)$ is finitely
generated. Conclude that $I$ has a Gröbner basis and deduce Hilbert's Basis Theorem. [cf.
Proposition 24.]

45. ($n$-colorings of graphs) A finite graph $\mathcal{G}$ of size $N$ is a set of vertices $i \in \{1, 2, \dots, N\}$
and a collection of edges $(i, j)$ connecting vertex $i$ with vertex $j$. An $n$-coloring of $\mathcal{G}$
is an assignment of one of $n$ colors to each vertex in such a way that vertices connected
by an edge have distinct colors. Let $F$ be any field containing at least $n$ elements. If
we introduce a variable $x_i$ for each vertex $i$ and represent the $n$ colors by choosing a set
$S$ of $n$ distinct elements from $F$, then an $n$-coloring of $\mathcal{G}$ is equivalent to assigning a
value $x_i = \alpha_i$ for each $i = 1, 2, \dots, N$ where $\alpha_i \in S$ and $\alpha_i \ne \alpha_j$ if $(i, j)$ is an edge
in $\mathcal{G}$. If $f(x) = \prod_{\alpha \in S} (x - \alpha)$ is the polynomial in $F[x]$ of degree $n$ whose roots are
the elements in $S$, then $x_i = \alpha_i$ for some $\alpha_i \in S$ is equivalent to the statement that $x_i$
is a solution to the equation $f(x_i) = 0$. The statement $\alpha_i \ne \alpha_j$ is then the statement
that $f(x_i) = f(x_j)$ but $x_i \ne x_j$, so $x_i$ and $x_j$ satisfy the equation $g(x_i, x_j) = 0$, where
$g(x_i, x_j)$ is the polynomial $(f(x_i) - f(x_j))/(x_i - x_j)$ in $F[x_i, x_j]$. It follows that finding
an $n$-coloring of $\mathcal{G}$ is equivalent to solving the system of equations

$$\begin{cases} f(x_i) = 0, & \text{for } i = 1, 2, \dots, N, \\ g(x_i, x_j) = 0, & \text{for all edges } (i, j) \text{ in } \mathcal{G} \end{cases}$$

(note also we may use any polynomial $g$ satisfying $\alpha_i \ne \alpha_j$ if $g(\alpha_i, \alpha_j) = 0$). It follows by
"Hilbert's Nullstellensatz" (cf. Corollary 33 in Section 15.3) that this system of equations
has a solution, hence $\mathcal{G}$ has an $n$-coloring, unless the ideal $I$ in $F[x_1, x_2, \dots, x_N]$ generated
by the polynomials $f(x_i)$ for $i = 1, 2, \dots, N$, together with the polynomials $g(x_i, x_j)$ for
all the edges $(i, j)$ in the graph $\mathcal{G}$, is not a proper ideal. This in turn is equivalent to the
statement that the reduced Gröbner basis for $I$ (with respect to any monomial ordering) is
simply $\{1\}$. Further, when an $n$-coloring does exist, solving this system of equations as in
the examples following Proposition 29 provides an explicit coloring for $\mathcal{G}$.

There are many possible choices of field $F$ and set $S$. For example, use any field $F$
containing a set $S$ of distinct $n$th roots of unity, in which case $f(x) = x^n - 1$ and we may
take $g(x_i, x_j) = (x_i^n - x_j^n)/(x_i - x_j) = x_i^{n-1} + x_i^{n-2}x_j + \cdots + x_ix_j^{n-2} + x_j^{n-1}$, or use
any subset $S$ of $F = \mathbb{F}_p$ with a prime $p \ge n$ (in the special case $n = p$, then, by Fermat's
Little Theorem, we have $f(x) = x^p - x$ and $g(x_i, x_j) = (x_i - x_j)^{p-1} - 1$).

(a) Consider a possible 3-coloring of the graph $\mathcal{G}$ with eight vertices and 14 edges (1, 3),
(1, 4), (1, 5), (2, 4), (2, 7), (2, 8), (3, 4), (3, 6), (3, 8), (4, 5), (5, 6), (6, 7), (6, 8), (7, 8).
Take $F = \mathbb{F}_3$ with 'colors' $0, 1, 2 \in \mathbb{F}_3$ and suppose vertex 1 is colored by 0. In this case
$f(x) = x(x - 1)(x - 2) = x^3 - x \in \mathbb{F}_3[x]$ and $g(x_i, x_j) = x_i^2 + x_ix_j + x_j^2 - 1$. If $I$ is
the ideal generated by $x_1$, $x_i^3 - x_i$, $2 \le i \le 8$ and $g(x_i, x_j)$ for the edges $(i, j)$ in $\mathcal{G}$, show
that the reduced Gröbner basis for $I$ with respect to the lexicographic monomial ordering
$x_1 > x_2 > \cdots > x_8$ is $\{x_1, x_2, x_3 + x_8, x_4 + 2x_8, x_5 + x_8, x_6, x_7 + x_8, x_8^2 + 2\}$. Deduce
that $\mathcal{G}$ has two distinct 3-colorings, determined by the coloring of vertex 8 (which must be
colored by a nonzero element in $\mathbb{F}_3$), and exhibit the colorings of $\mathcal{G}$.

Show that if the edge (3, 7) is added to $\mathcal{G}$ then the graph cannot be 3-colored.

(b) Take $F = \mathbb{F}_5$ with four 'colors' $1, 2, 3, 4 \in \mathbb{F}_5$, so $f(x) = x^4 - 1$ and we may use
$g(x_i, x_j) = x_i^3 + x_i^2x_j + x_ix_j^2 + x_j^3$. Show that the graph $\mathcal{G}$ with five vertices having 9
edges (1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5), (3, 4), (3, 5), (4, 5) (the "complete graph
on five vertices" with one edge removed) can be 4-colored but cannot be 3-colored.

(c) Use Gröbner bases to show that the graph $\mathcal{G}$ with nine vertices and 22 edges (1, 4), (1, 6),
(1, 7), (1, 8), (2, 3), (2, 4), (2, 6), (2, 7), (3, 5), (3, 7), (3, 9), (4, 5), (4, 6), (4, 7), (4, 9),
(5, 6), (5, 7), (5, 8), (5, 9), (6, 7), (6, 9), (7, 8) has precisely four 4-colorings up to a
permutation of the colors (so a total of 96 4-colorings). Show that if the edge (1, 5)
is added then $\mathcal{G}$ cannot be 4-colored.
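
The graphs in parts (a) and (b) of the last exercise are small enough that the polynomial system can be solved by exhaustive search, which gives an independent check on any Gröbner basis computation. The sketch below is not the Gröbner basis method the exercise asks for; it simply evaluates $g(x_i, x_j)$ in $\mathbb{F}_3$ on every assignment with $x_1 = 0$ for part (a), and counts proper colorings directly for part (b). The function and variable names are choices made here.

```python
from itertools import product

EDGES_A = [(1, 3), (1, 4), (1, 5), (2, 4), (2, 7), (2, 8), (3, 4),
           (3, 6), (3, 8), (4, 5), (5, 6), (6, 7), (6, 8), (7, 8)]
EDGES_B = [(1, 3), (1, 4), (1, 5), (2, 3), (2, 4), (2, 5),
           (3, 4), (3, 5), (4, 5)]

def g3(a, b):
    """g(x_i, x_j) = x_i^2 + x_i x_j + x_j^2 - 1 evaluated in F_3."""
    return (a * a + a * b + b * b - 1) % 3

def solutions_mod3(num_vertices, edges):
    """All points of F_3^N with x_1 = 0 where g vanishes on every edge.
    (f(x_i) = x_i^3 - x_i vanishes at every element of F_3 automatically.)"""
    found = []
    for rest in product(range(3), repeat=num_vertices - 1):
        x = (0,) + rest                      # x[0] is the color of vertex 1
        if all(g3(x[i - 1], x[j - 1]) == 0 for i, j in edges):
            found.append(x)
    return found

def count_colorings(num_vertices, edges, num_colors):
    """Number of assignments in which adjacent vertices get distinct colors."""
    return sum(
        all(c[i - 1] != c[j - 1] for i, j in edges)
        for c in product(range(num_colors), repeat=num_vertices)
    )

sols = solutions_mod3(8, EDGES_A)
print(sols)                                   # the colorings of part (a)
print(solutions_mod3(8, EDGES_A + [(3, 7)]))  # after adding the edge (3, 7)
print(count_colorings(5, EDGES_B, 3), count_colorings(5, EDGES_B, 4))
```

Each solution found for part (a) can be substituted into the stated basis $\{x_1, x_2, x_3 + x_8, \dots, x_8^2 + 2\}$ to confirm that every element vanishes in $\mathbb{F}_3$.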
Part III


MODULES AND VECTOR SPACES 


In Part III we study the mathematical objects called modules. The use of modules
was pioneered by one of the most prominent mathematicians of the first part of this
century, Emmy Noether, who led the way in demonstrating the power and elegance of
this structure. We shall see that vector spaces are just special types of modules which
arise when the underlying ring is a field. If $R$ is a ring, the definition of an $R$-module
$M$ is closely analogous to the definition of a group action where $R$ plays the role of
the group and $M$ the role of the set. The additional axioms for a module require that
$M$ itself have more structure (namely that $M$ be an abelian group). Modules are the
"representation objects" for rings, i.e., they are, by definition, algebraic objects on which
rings act. As the theory develops it will become apparent how the structure of the ring
$R$ (in particular, the structure and wealth of its ideals) is reflected by the structure of its
modules and vice versa in the same way that the structure of the collection of normal
subgroups of a group was reflected by its permutation representations.

CHAPTER 10


Introduction to Module Theory 


10.1 BASIC DEFINITIONS AND EXAMPLES 

We start with the definition of a module. 

Definition. Let $R$ be a ring (not necessarily commutative nor with 1). A left $R$-module
or a left module over $R$ is a set $M$ together with

(1) a binary operation $+$ on $M$ under which $M$ is an abelian group, and

(2) an action of $R$ on $M$ (that is, a map $R \times M \to M$) denoted by $rm$, for all $r \in R$
and for all $m \in M$ which satisfies

(a) $(r + s)m = rm + sm$, for all $r, s \in R$, $m \in M$,

(b) $(rs)m = r(sm)$, for all $r, s \in R$, $m \in M$, and

(c) $r(m + n) = rm + rn$, for all $r \in R$, $m, n \in M$.

If the ring $R$ has a 1 we impose the additional axiom:

(d) $1m = m$, for all $m \in M$.

The descriptor "left" in the above definition indicates that the ring elements appear
on the left; "right" $R$-modules can be defined analogously. If the ring $R$ is commutative
and $M$ is a left $R$-module we can make $M$ into a right $R$-module by defining $mr = rm$
for $m \in M$ and $r \in R$. If $R$ is not commutative, axiom 2(b) in general will not hold with
this definition (so not every left $R$-module is also a right $R$-module). Unless explicitly
mentioned otherwise the term "module" will always mean "left module." Modules
satisfying axiom 2(d) are called unital modules and in this book all our modules will be
unital (this is to avoid "pathologies" such as having $rm = 0$ for all $r \in R$ and $m \in M$).

When $R$ is a field $F$ the axioms for an $R$-module are precisely the same as those
for a vector space over $F$, so that

modules over a field $F$ and vector spaces over $F$ are the same.

Before giving other examples of $R$-modules we record the obvious definition of
submodules.

Definition. Let $R$ be a ring and let $M$ be an $R$-module. An $R$-submodule of $M$ is a
subgroup $N$ of $M$ which is closed under the action of ring elements, i.e., $rn \in N$, for
all $r \in R$, $n \in N$.

Submodules of $M$ are therefore just subsets of $M$ which are themselves modules
under the restricted operations. In particular, if $R = F$ is a field, submodules are the
same as subspaces. Every $R$-module $M$ has the two submodules $M$ and 0 (the latter is
called the trivial submodule).

Examples 

(1) Let $R$ be any ring. Then $M = R$ is a left $R$-module, where the action of a ring element
on a module element is just the usual multiplication in the ring $R$ (similarly, $R$ is a right
module over itself). In particular, every field can be considered as a (1-dimensional)
vector space over itself. When $R$ is considered as a left module over itself in this
fashion, the submodules of $R$ are precisely the left ideals of $R$ (and if $R$ is considered
as a right $R$-module over itself, its submodules are the right ideals). Thus if $R$ is not
commutative it has a left and right module structure over itself and these structures
may be different (e.g., the submodules may be different); Exercise 21 at the end of
this section gives a specific example of this.

(2) Let $R = F$ be a field. As noted above, every vector space over $F$ is an $F$-module and
vice versa. Let $n \in \mathbb{Z}^+$ and let

$F^n = \{(a_1, a_2, \dots, a_n) \mid a_i \in F, \text{ for all } i\}$

(called affine $n$-space over $F$). Make $F^n$ into a vector space by defining addition and
scalar multiplication componentwise:

$(a_1, a_2, \dots, a_n) + (b_1, b_2, \dots, b_n) = (a_1 + b_1, a_2 + b_2, \dots, a_n + b_n)$
$\alpha(a_1, \dots, a_n) = (\alpha a_1, \dots, \alpha a_n), \quad \alpha \in F.$

As in the case of Euclidean $n$-space (i.e., when $F = \mathbb{R}$), affine $n$-space is a vector space
of dimension $n$ over $F$ (we shall discuss the notion of dimension more thoroughly in
the next chapter).

(3) Let $R$ be a ring with 1 and let $n \in \mathbb{Z}^+$. Following Example 2 define

$R^n = \{(a_1, a_2, \dots, a_n) \mid a_i \in R, \text{ for all } i\}$.

Make $R^n$ into an $R$-module by componentwise addition and multiplication by elements
of $R$ in the same manner as when $R$ was a field. The module $R^n$ is called the free
module of rank $n$ over $R$. (We shall see shortly that free modules have the same
"universal property" in the context of $R$-modules that free groups were seen to have
in Section 6.3. We shall also soon discuss direct products of $R$-modules.) An obvious
submodule of $R^n$ is given by the $i$th component, namely the set of $n$-tuples with
arbitrary ring elements in the $i$th component and zeros in the $j$th component for all
$j \ne i$.

(4) The same abelian group may have the structure of an $R$-module for a number of
different rings $R$ and each of these module structures may carry useful information.
Specifically, if $M$ is an $R$-module and $S$ is a subring of $R$ with $1_S = 1_R$, then $M$
is automatically an $S$-module as well. For instance the field $\mathbb{R}$ is an $\mathbb{R}$-module, a
$\mathbb{Q}$-module and a $\mathbb{Z}$-module.

(5) If $M$ is an $R$-module and for some (2-sided) ideal $I$ of $R$, $am = 0$, for all $a \in I$ and
all $m \in M$, we say $M$ is annihilated by $I$. In this situation we can make $M$ into an
$(R/I)$-module by defining an action of the quotient ring $R/I$ on $M$ as follows: for
each $m \in M$ and coset $r + I$ in $R/I$ let

$(r + I)m = rm$.

Since $am = 0$ for all $a \in I$ and all $m \in M$ this is well defined and one easily checks
that it makes $M$ into an $(R/I)$-module. In particular, when $I$ is a maximal ideal in the
commutative ring $R$ and $IM = 0$, then $M$ is a vector space over the field $R/I$ (cf. the
following example).

The next example is of sufficient importance as to be singled out. It will form the 
basis for our proof of the Fundamental Theorem of Finitely Generated Abelian Groups 
in Chapter 12. 

Example: (Z-modules) 

Let $R = \mathbb{Z}$, let $A$ be any abelian group (finite or infinite) and write the operation of $A$ as $+$.
Make $A$ into a $\mathbb{Z}$-module as follows: for any $n \in \mathbb{Z}$ and $a \in A$ define

$$na = \begin{cases} a + a + \cdots + a \ (n \text{ times}) & \text{if } n > 0 \\ 0 & \text{if } n = 0 \\ -a - a - \cdots - a \ (-n \text{ times}) & \text{if } n < 0 \end{cases}$$

(here 0 is the identity of the additive group $A$). This definition of an action of the integers
on $A$ makes $A$ into a $\mathbb{Z}$-module, and the module axioms show that this is the only possible
action of $\mathbb{Z}$ on $A$ making it a (unital) $\mathbb{Z}$-module. Thus every abelian group is a $\mathbb{Z}$-module.
Conversely, if $M$ is any $\mathbb{Z}$-module, a fortiori $M$ is an abelian group, so

$\mathbb{Z}$-modules are the same as abelian groups.

Furthermore, it is immediate from the definition that

$\mathbb{Z}$-submodules are the same as subgroups.

Note that for the cyclic group $\langle a \rangle$ written multiplicatively the additive notation $na$ becomes
$a^n$, that is, we have all along been using the fact that $\langle a \rangle$ is a right $\mathbb{Z}$-module (checking that
this "exponential" notation satisfies the usual laws of exponents is equivalent to checking
the $\mathbb{Z}$-module axioms; this was given as an exercise at the end of Section 1.1). Note that
since $\mathbb{Z}$ is commutative these definitions of left and right actions by ring elements give the
same module structure.

If $A$ is an abelian group containing an element $x$ of finite order $n$ then $nx = 0$. Thus,
in contrast to vector spaces, a $\mathbb{Z}$-module may have nonzero elements $x$ such that $nx = 0$ for
some nonzero ring element $n$. In particular, if $A$ has order $m$, then by Lagrange's Theorem
(Corollary 9, Section 3.2) $mx = 0$, for all $x \in A$. Note that then $A$ is a module over
$\mathbb{Z}/m\mathbb{Z}$.

In particular, if $p$ is a prime and $A$ is an abelian group (written additively) such that
$px = 0$, for all $x \in A$, then (as noted in Example 5) $A$ is a $\mathbb{Z}/p\mathbb{Z}$-module, i.e., can be
considered as a vector space over the field $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$. For instance, the Klein 4-group is
a (2-dimensional) vector space over $\mathbb{F}_2$. These groups are the elementary abelian $p$-groups
discussed in Section 4.4 (see, in particular, Proposition 17(3)).
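
The definition of $na$ by repeated addition can be tried out directly on small groups. The following sketch is illustrative only: `z_action` and the helper functions are names chosen here, the group $\mathbb{Z}/6\mathbb{Z}$ is modelled by integers mod 6, and the Klein 4-group is modelled as pairs of integers mod 2.

```python
def z_action(n, a, add, neg, zero):
    """Return n*a in an additively written abelian group, following the
    three cases n > 0, n = 0, n < 0 of the definition."""
    if n < 0:
        return z_action(-n, neg(a), add, neg, zero)
    total = zero
    for _ in range(n):
        total = add(total, a)
    return total

M = 6  # the group Z/6Z

def add_mod(a, b):
    return (a + b) % M

def neg_mod(a):
    return (-a) % M

def add_klein(u, v):
    """Klein 4-group written additively as Z/2Z x Z/2Z."""
    return ((u[0] + v[0]) % 2, (u[1] + v[1]) % 2)

def neg_klein(u):
    return u  # every element is its own inverse

KLEIN = [(0, 0), (0, 1), (1, 0), (1, 1)]

# m*x = 0 for every x in a group of order m (here m = 6)
print([z_action(M, x, add_mod, neg_mod, 0) for x in range(M)])
# 2*v = 0 for every v in the Klein 4-group, so it is an F_2-vector space
print([z_action(2, v, add_klein, neg_klein, (0, 0)) for v in KLEIN])
```

Both printed lists consist entirely of the identity element, as the text predicts.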

The next example is also of fundamental importance and will form the basis for 
our study of canonical forms of matrices in Sections 12.2 and 12.3.

Example: ($F[x]$-modules)

Let $F$ be a field, let $x$ be an indeterminate and let $R$ be the polynomial ring $F[x]$. Let $V$ be
a vector space over $F$ and let $T$ be a linear transformation from $V$ to $V$ (we shall review
the theory of linear transformations in the next chapter; for the purposes of this example
one only needs to know the definition of a linear transformation). We have already seen
that $V$ is an $F$-module; the linear map $T$ will enable us to make $V$ into an $F[x]$-module.
First, for the nonnegative integer $n$, define

$T^0 = I$,

$T^n = T \circ T \circ \cdots \circ T$ ($n$ times)

where $I$ is the identity map from $V$ to $V$ and $\circ$ denotes function composition (which makes
sense because the domain and codomain of $T$ are the same). Also, for any two linear
transformations $A$, $B$ from $V$ to $V$ and elements $\alpha, \beta \in F$, let $\alpha A + \beta B$ be defined by

$(\alpha A + \beta B)(v) = \alpha(A(v)) + \beta(B(v))$

(i.e., addition and scalar multiplication of linear transformations are defined pointwise).
Then $\alpha A + \beta B$ is easily seen to be a linear transformation from $V$ to $V$, so that linear
combinations of linear transformations are again linear transformations.

We now define the action of any polynomial in $x$ on $V$. Let $p(x)$ be the polynomial

$p(x) = a_nx^n + a_{n-1}x^{n-1} + \cdots + a_1x + a_0$,

where $a_0, \dots, a_n \in F$. For each $v \in V$ define an action of the ring element $p(x)$ on the
module element $v$ by

$p(x)v = (a_nT^n + a_{n-1}T^{n-1} + \cdots + a_1T + a_0)(v)$
$\phantom{p(x)v} = a_nT^n(v) + a_{n-1}T^{n-1}(v) + \cdots + a_1T(v) + a_0v$

(i.e., $p(x)$ acts by substituting the linear transformation $T$ for $x$ in $p(x)$ and applying
the resulting linear transformation to $v$). Put another way, $x$ acts on $V$ as the linear
transformation $T$ and we extend this to an action of all of $F[x]$ on $V$ in a natural way. It is
easy to check that this definition of an action of $F[x]$ on $V$ satisfies all the module axioms
and makes $V$ into an $F[x]$-module.

The field $F$ is naturally a subring of $F[x]$ (the constant polynomials) and the action
of these field elements is by definition the same as their action when viewed as constant
polynomials. In other words, the definition of the $F[x]$ action on $V$ is consistent with the
given action of the field $F$ on the vector space $V$, i.e., the definition extends the action of
$F$ to an action of the larger ring $F[x]$.

The way $F[x]$ acts on $V$ depends on the choice of $T$ so that there are in general many
different $F[x]$-module structures on the same vector space $V$. For instance, if $T = 0$,
and $p(x)$, $v$ are as above, then $p(x)v = a_0v$, that is, the polynomial $p(x)$ acts on $v$ simply
by multiplying by the constant term of $p(x)$, so that the $F[x]$-module structure is just the
$F$-module structure. If, on the other hand, $T$ is the identity transformation (so $T^n(v) = v$,
for all $n$ and $v$), then $p(x)v = a_nv + a_{n-1}v + \cdots + a_0v = (a_n + \cdots + a_0)v$, so that now
$p(x)$ multiplies $v$ by the sum of the coefficients of $p(x)$.

To give another specific example, let $V$ be affine $n$-space $F^n$ and let $T$ be the "shift
operator"

$T(x_1, x_2, \dots, x_n) = (x_2, x_3, \dots, x_n, 0)$.

Let $e_i$ be the usual $i$th basis vector $(0, 0, \dots, 0, 1, 0, \dots, 0)$ where the 1 is in position $i$.
Then

$$T^k(e_i) = \begin{cases} e_{i-k} & \text{if } i > k \\ 0 & \text{if } i \le k \end{cases}$$

so for example, if $m < n$,

$(a_mx^m + a_{m-1}x^{m-1} + \cdots + a_0)e_n = (0, \dots, 0, a_m, a_{m-1}, \dots, a_0)$.

From this we can determine the action of any polynomial on any vector. 
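
The action just described can be tried out numerically. The following is a minimal sketch, assuming vectors are plain Python lists of integers and a polynomial is given by its coefficient list with the constant term first; the names `shift` and `poly_act` are illustrative and do not come from the text.

```python
def shift(v):
    """Shift operator T(x_1, ..., x_n) = (x_2, ..., x_n, 0)."""
    return v[1:] + [0]

def poly_act(coeffs, T, v):
    """Return p(x)v = a_0 v + a_1 T(v) + ... + a_m T^m(v).

    coeffs lists a_0, a_1, ..., a_m (constant term first); T is any
    function implementing a linear map on lists of numbers.
    """
    result = [0] * len(v)
    power = list(v)                      # holds T^k(v), starting with k = 0
    for a in coeffs:
        result = [r + a * p for r, p in zip(result, power)]
        power = T(power)
    return result

n = 5
e_n = [0] * (n - 1) + [1]                # the basis vector e_5
# p(x) = 7 + 3x + 2x^2, so m = 2 < n
print(poly_act([7, 3, 2], shift, e_n))   # [0, 0, 2, 3, 7]
```

Passing the zero map or the identity map as `T` reproduces the two earlier special cases: multiplication by the constant term, and multiplication by the sum of the coefficients.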

The construction of an F[x]-module from a vector space V over F and a linear
transformation T from V to V in fact describes all F[x]-modules; namely, an F[x]-module is a
vector space together with a linear transformation which specifies the action of x. This is
because if V is any F[x]-module, then V is an F-module and the action of the ring element
x on V is a linear transformation from V to V. The axioms for a module ensure that the
actions of F and x on V uniquely determine the action of any element of F[x] on V. Thus
there is a bijection between the collection of F[x]-modules and the collection of pairs V, T

$$\{\, V \text{ an } F[x]\text{-module} \,\} \;\longleftrightarrow\; \{\, V \text{ a vector space over } F \text{ and } T : V \to V \text{ a linear transformation} \,\}$$

given by

    the element x acts on V as the linear transformation T.


Now we consider F[x]-submodules of V where, as above, V is any F[x]-module and
T is the linear transformation from V to V given by the action of x. An F[x]-submodule W
of V must first be an F-submodule, i.e., W must be a vector subspace of V. Secondly, W
must be sent to itself under the action of the ring element x, i.e., we must have $T(w) \in W$
for all $w \in W$. Any vector subspace U of V such that $T(U) \subseteq U$ is called T-stable or
T-invariant. If U is any T-stable subspace of V it follows that $T^n(U) \subseteq U$ for all $n \in \mathbb{Z}^+$
(for example, $T(U) \subseteq U$ implies $T^2(U) = T(T(U)) \subseteq T(U) \subseteq U$). Moreover any linear
combination of powers of T then sends U into U so that U is also stable by the action of
any polynomial in T. Thus U is an F[x]-submodule of V. This shows that

    the F[x]-submodules of V are precisely the T-stable subspaces of V.

In terms of the bijection above,

$$\{\, W \text{ an } F[x]\text{-submodule} \,\} \;\longleftrightarrow\; \{\, W \text{ a subspace of } V \text{ and } W \text{ is } T\text{-stable} \,\}$$

which gives a complete dictionary between F[x]-modules V and vector spaces V together
with a given linear transformation T from V to V.

For instance, if T is the shift operator defined on affine n-space above and k is any
integer in the range $0 \le k \le n$, then the subspace

$$U_k = \{\, (x_1, x_2, \dots, x_k, 0, \dots, 0) \mid x_i \in F \,\}$$

is clearly T-stable so is an F[x]-submodule of V.
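
T-stability of a subspace given by a spanning set can be tested mechanically: span(B) is T-stable exactly when adjoining the images T(b) does not raise the rank. The sketch below assumes F = Q (via `fractions.Fraction`) and vectors as Python lists; `rank`, `is_stable`, `shift` and `U` are hypothetical helper names introduced only for this illustration.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a list of row vectors over Q, by Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    ncols = len(m[0]) if m else 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                factor = m[i][c] / m[r][c]
                m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r

def is_stable(T, basis):
    """True if T maps span(basis) into itself."""
    if not basis:                      # the zero subspace is always stable
        return True
    images = [T(b) for b in basis]
    return rank(basis + images) == rank(basis)

def shift(v):
    """Shift operator T(x_1, ..., x_n) = (x_2, ..., x_n, 0)."""
    return v[1:] + [0]

n = 4

def U(k):
    """Spanning set e_1, ..., e_k of the subspace U_k inside F^n."""
    return [[1 if j == i else 0 for j in range(n)] for i in range(k)]

print(all(is_stable(shift, U(k)) for k in range(n + 1)))   # True
print(is_stable(shift, [[0, 0, 0, 1]]))                    # False: T(e_4) = e_3
```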
We emphasize that an abelian group M may have many different R-module structures,
even if the ring R does not vary (in the same way that a given group G may act in
many ways as a permutation group on some fixed set $\Omega$). We shall see that the structure
of an R-module is reflected by the ideal structure of R. When R is a field (the subject
of the next chapter) all R-modules will be seen to be products of copies of R (as in
Example 3 above).

We shall see in Chapter 12 that the relatively simple ideal structure of the ring F[x]
(recall that F[x] is a Principal Ideal Domain) forces the F[x]-module structure of V to
be correspondingly uncomplicated, and this in turn provides a great deal of information
about the linear transformation T (in particular, gives some nice matrix representations
for T: its rational canonical form and its Jordan canonical form). Moreover, the same
arguments which classify finitely generated F[x]-modules apply to any Principal Ideal
Domain R, and when these are invoked for $R = \mathbb{Z}$, we obtain the Fundamental Theorem
of Finitely Generated Abelian Groups. These results generalize the theorem that every
finite dimensional vector space has a basis.

In Part VI of the book we shall study modules over certain noncommutative rings
(group rings) and see that this theory in some sense generalizes both the study of
F[x]-modules in Chapter 12 and the notion of a permutation representation of a finite group.

We establish a submodule criterion analogous to that for subgroups of a group in 
Section 2.1. 

Proposition 1. (The Submodule Criterion) Let R be a ring and let M be an R-module.
A subset N of M is a submodule of M if and only if

(1) $N \ne \emptyset$, and

(2) $x + ry \in N$ for all $r \in R$ and for all $x, y \in N$.

Proof: If N is a submodule, then $0 \in N$ so $N \ne \emptyset$. Also N is closed under addition
and is sent to itself under the action of elements of R. Conversely, suppose (1) and (2)
hold. Let r = -1 and apply the subgroup criterion (in additive form) to see that N is
a subgroup of M. In particular, $0 \in N$. Now let x = 0 and apply hypothesis (2) to see
that N is sent to itself under the action of R. This establishes the proposition.
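
For a finite module the criterion can be checked by brute force. The sketch below assumes the specific module $M = \mathbb{Z}/n\mathbb{Z}$ viewed as a $\mathbb{Z}$-module, with elements represented by residues 0, ..., n-1; since the action of an integer r on M depends only on r modulo n, it is enough to test r in `range(n)`. The function name `is_submodule` is illustrative.

```python
def is_submodule(N, n):
    """Apply the Submodule Criterion to a subset N of M = Z/nZ (a Z-module)."""
    N = {x % n for x in N}
    if not N:                          # condition (1): N is nonempty
        return False
    # condition (2): x + r*y lies in N; r only matters modulo n
    return all((x + r * y) % n in N
               for r in range(n) for x in N for y in N)

print(is_submodule({0, 4, 8}, 12))     # True
print(is_submodule({0, 1}, 12))        # False
```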

We end this section with an important definition and some examples. 

Definition. Let R be a commutative ring with identity. An R-algebra is a ring A with
identity together with a ring homomorphism $f : R \to A$ mapping $1_R$ to $1_A$ such that
the subring f(R) of A is contained in the center of A.

If A is an R-algebra then it is easy to check that A has a natural left and right
(unital) R-module structure defined by $r \cdot a = a \cdot r = f(r)a$ where f(r)a is just the
multiplication in the ring A (and this is the same as af(r) since by assumption f(r)
lies in the center of A). In general it is possible for an R-algebra A to have other left (or
right) R-module structures, but unless otherwise stated, this natural module structure
on an algebra will be assumed.
Definition. If A and B are two R-algebras, an R-algebra homomorphism (or isomorphism)
is a ring homomorphism (isomorphism, respectively) $\varphi : A \to B$ mapping $1_A$
to $1_B$ such that $\varphi(r \cdot a) = r \cdot \varphi(a)$ for all $r \in R$ and $a \in A$.

Examples 

Let R be a commutative ring with 1. 

(1) Any ring with identity is a Z-algebra. 

(2) For any ring A with identity, if R is a subring of the center of A containing the identity
of A then A is an R-algebra. In particular, a commutative ring A containing 1 is an
R-algebra for any subring R of A containing 1. For example, the polynomial ring
R[x] is an R-algebra, the polynomial ring over R in any number of variables is an
R-algebra, and the group ring RG for a finite group G is an R-algebra (cf. Section 7.2).

(3) If A is an R-algebra then the R-module structure of A depends only on the subring
f(R) contained in the center of A as in the previous example. If we replace R by its
image f(R) we see that "up to a ring homomorphism" every algebra A arises from a
subring of the center of A that contains $1_A$.

(4) A special case of the previous example occurs when R = F is a field. In this case
F is isomorphic to its image under f, so we can identify F itself as a subring of A.
Hence, saying that A is an algebra over a field F is the same as saying that the ring A
contains the field F in its center and the identity of A and of F are the same (this last
condition is necessary, cf. Exercise 23).

Suppose that A is an R-algebra. Then A is a ring with identity that is a (unital) left
R-module satisfying $r \cdot (ab) = (r \cdot a)b = a(r \cdot b)$ for all $r \in R$ and $a, b \in A$ (these
are all equal to the product f(r)ab in the ring A; recall that f(R) is contained in the
center of A). Conversely, these conditions on a ring A define an R-algebra, and are
sometimes used as the definition of an R-algebra (cf. Exercise 22).

EXERCISES 

In these exercises R is a ring with 1 and M is a left R-module. 

1. Prove that 0m = 0 and (-1)m = -m for all $m \in M$.

2. Prove that $R^\times$ and M satisfy the two axioms in Section 1.7 for a group action of the
multiplicative group $R^\times$ on the set M.

3. Assume that rm = 0 for some $r \in R$ and some $m \in M$ with $m \ne 0$. Prove that r does not
have a left inverse (i.e., there is no $s \in R$ such that sr = 1).

4. Let M be the module $R^n$ described in Example 3 and let $I_1, I_2, \dots, I_n$ be left ideals of R.
Prove that the following are submodules of M:

(a) $\{ (x_1, x_2, \dots, x_n) \mid x_i \in I_i \}$

(b) $\{ (x_1, x_2, \dots, x_n) \mid x_i \in R \text{ and } x_1 + x_2 + \cdots + x_n = 0 \}$.

5. For any left ideal I of R define

$$IM = \Big\{ \sum_{\text{finite}} a_i m_i \;\Big|\; a_i \in I,\ m_i \in M \Big\}$$

to be the collection of all finite sums of elements of the form am where $a \in I$ and $m \in M$.
Prove that IM is a submodule of M.

6. Show that the intersection of any nonempty collection of submodules of an R-module is
a submodule.
7. Let $N_1 \subseteq N_2 \subseteq \cdots$ be an ascending chain of submodules of M. Prove that
$\bigcup_{i=1}^{\infty} N_i$ is a submodule of M.

8. An element m of the R-module M is called a torsion element if rm = 0 for some nonzero
element $r \in R$. The set of torsion elements is denoted

$$\mathrm{Tor}(M) = \{ m \in M \mid rm = 0 \text{ for some nonzero } r \in R \}.$$

(a) Prove that if R is an integral domain then Tor(M) is a submodule of M (called the
torsion submodule of M).

(b) Give an example of a ring R and an R-module M such that Tor(M) is not a submodule.
[Consider the torsion elements in the R-module R.]

(c) If R has zero divisors show that every nonzero R-module has nonzero torsion elements.

9. If N is a submodule of M, the annihilator of N in R is defined to be
$\{ r \in R \mid rn = 0 \text{ for all } n \in N \}$. Prove that the annihilator of N in R is a 2-sided ideal of R.

10. If I is a right ideal of R, the annihilator of I in M is defined to be
$\{ m \in M \mid am = 0 \text{ for all } a \in I \}$. Prove that the annihilator of I in M is a submodule of M.

11. Let M be the abelian group (i.e., $\mathbb{Z}$-module) $\mathbb{Z}/24\mathbb{Z} \times \mathbb{Z}/15\mathbb{Z} \times \mathbb{Z}/50\mathbb{Z}$.

(a) Find the annihilator of M in $\mathbb{Z}$ (i.e., a generator for this principal ideal).

(b) Let $I = 2\mathbb{Z}$. Describe the annihilator of I in M as a direct product of cyclic groups.

12. In the notation of the preceding exercises prove the following facts about annihilators.

(a) Let N be a submodule of M and let I be its annihilator in R. Prove that the annihilator
of I in M contains N. Give an example where the annihilator of I in M does not
equal N.

(b) Let I be a right ideal of R and let N be its annihilator in M. Prove that the annihilator of
N in R contains I. Give an example where the annihilator of N in R does not equal I.

13. Let I be an ideal of R. Let M' be the subset of elements a of M that are annihilated by
some power, $I^k$, of the ideal I, where the power may depend on a. Prove that M' is a
submodule of M. [Use Exercise 7.]

14. Let z be an element of the center of R, i.e., zr = rz for all $r \in R$. Prove that zM is a
submodule of M, where $zM = \{ zm \mid m \in M \}$. Show that if R is the ring of 2 x 2 matrices
over a field and e is the matrix with a 1 in position 1,1 and zeros elsewhere then eR is not
a left R-submodule (where M = R is considered as a left R-module as in Example 1);
in this case the matrix e is not in the center of R.

15. If M is a finite abelian group then M is naturally a $\mathbb{Z}$-module. Can this action be extended
to make M into a $\mathbb{Q}$-module?

16. Prove that the submodules $U_k$ described in the example of F[x]-modules are all of the
F[x]-submodules for the shift operator.

17. Let T be the shift operator on the vector space V and let $e_1, \dots, e_n$ be the usual basis vectors
described in the example of F[x]-modules. If $m \ge n$ find $(a_m x^m + a_{m-1} x^{m-1} + \cdots + a_0) e_n$.

18. Let $F = \mathbb{R}$, let $V = \mathbb{R}^2$ and let T be the linear transformation from V to V which
is rotation clockwise about the origin by $\pi/2$ radians. Show that V and 0 are the only
F[x]-submodules for this T.

19. Let $F = \mathbb{R}$, let $V = \mathbb{R}^2$ and let T be the linear transformation from V to V which is
projection onto the y-axis. Show that V, 0, the x-axis and the y-axis are the only
F[x]-submodules for this T.

20. Let $F = \mathbb{R}$, let $V = \mathbb{R}^2$ and let T be the linear transformation from V to V which is
rotation clockwise about the origin by $\pi$ radians. Show that every subspace of V is an
F[x]-submodule for this T.

21. Let $n \in \mathbb{Z}^+$, n > 1 and let R be the ring of n x n matrices with entries from a field F. Let
M be the set of n x n matrices with arbitrary elements of F in the first column and zeros
elsewhere. Show that M is a submodule of R when R is considered as a left module over
itself, but M is not a submodule of R when R is considered as a right R-module.

22. Suppose that A is a ring with identity $1_A$ that is a (unital) left R-module satisfying
$r \cdot (ab) = (r \cdot a)b = a(r \cdot b)$ for all $r \in R$ and $a, b \in A$. Prove that the map $f : R \to A$ defined
by $f(r) = r \cdot 1_A$ is a ring homomorphism mapping $1_R$ to $1_A$ and that f(R) is contained
in the center of A. Conclude that A is an R-algebra and that the R-module structure on A
induced by its algebra structure is precisely the original R-module structure.

23. Let A be the direct product ring $\mathbb{C} \times \mathbb{C}$ (cf. Section 7.6). Let $\tau_1$ denote the identity map
on $\mathbb{C}$ and let $\tau_2$ denote complex conjugation. For any pair $p, q \in \{1, 2\}$ (not necessarily
distinct) define

$$f_{p,q} : \mathbb{C} \to \mathbb{C} \times \mathbb{C} \quad \text{by} \quad f_{p,q}(z) = (\tau_p(z), \tau_q(z)).$$

So, for example, $f_{2,1} : z \mapsto (\bar{z}, z)$, where $\bar{z}$ is the complex conjugate of z, i.e., $\tau_2(z)$.

(a) Prove that each $f_{p,q}$ is an injective ring homomorphism, and that they all agree on
the subfield $\mathbb{R}$ of $\mathbb{C}$. Deduce that A has four distinct $\mathbb{C}$-algebra structures. Explicitly
give the action $z \cdot (u, v)$ of a complex number z on an ordered pair in A in each case.

(b) Prove that if $f_{p,q} \ne f_{p',q'}$ then the identity map on A is not a $\mathbb{C}$-algebra homomorphism
from A considered as a $\mathbb{C}$-algebra via $f_{p,q}$ to A considered as a $\mathbb{C}$-algebra via $f_{p',q'}$
(although the identity is an $\mathbb{R}$-algebra isomorphism).

(c) Prove that for any pair p, q there is some ring isomorphism from A to itself such that
A is isomorphic as a $\mathbb{C}$-algebra via $f_{p,q}$ to A considered as a $\mathbb{C}$-algebra via $f_{1,1}$ (the
"natural" $\mathbb{C}$-algebra structure on A).

Remark: In the preceding exercise $A = \mathbb{C} \times \mathbb{C}$ is not a $\mathbb{C}$-algebra over either of the direct factor
component copies of $\mathbb{C}$ (for example the subring $\mathbb{C} \times 0 \cong \mathbb{C}$) since it is not a unital module
over these copies of $\mathbb{C}$ (the 1 of these subrings is not the same as the 1 of A).


10.2 QUOTIENT MODULES AND MODULE HOMOMORPHISMS 

This section contains the basic theory of quotient modules and module homomorphisms. 

Definition. Let R be a ring and let M and N be R-modules.

(1) A map $\varphi : M \to N$ is an R-module homomorphism if it respects the R-module
structures of M and N, i.e.,

(a) $\varphi(x + y) = \varphi(x) + \varphi(y)$, for all $x, y \in M$ and

(b) $\varphi(rx) = r\varphi(x)$, for all $r \in R$, $x \in M$.

(2) An R-module homomorphism is an isomorphism (of R-modules) if it is both
injective and surjective. The modules M and N are said to be isomorphic,
denoted $M \cong N$, if there is some R-module isomorphism $\varphi : M \to N$.

(3) If $\varphi : M \to N$ is an R-module homomorphism, let $\ker \varphi = \{ m \in M \mid \varphi(m) = 0 \}$
(the kernel of $\varphi$) and let $\varphi(M) = \{ n \in N \mid n = \varphi(m) \text{ for some } m \in M \}$ (the
image of $\varphi$, as usual).

(4) Let M and N be R-modules and define $\mathrm{Hom}_R(M, N)$ to be the set of all
R-module homomorphisms from M into N.
Any R-module homomorphism is also a homomorphism of the additive groups, but
not every group homomorphism need be a module homomorphism (because condition
(b) may not be satisfied). The unqualified term "isomorphism" when applied to
R-modules will always mean R-module isomorphism. When the symbol $\cong$ is used without
qualification it will denote an isomorphism of the respective structures (which will be
evident from the context).

It is an easy exercise using the submodule criterion (Proposition 1) to show that
kernels and images of R-module homomorphisms are submodules.

Examples 

(1) If R is a ring and M = R is a module over itself, then R-module homomorphisms (even
from R to itself) need not be ring homomorphisms and ring homomorphisms need not
be R-module homomorphisms. For example, when $R = \mathbb{Z}$ the $\mathbb{Z}$-module homomorphism
$x \mapsto 2x$ is not a ring homomorphism (1 does not map to 1). When R = F[x]
the ring homomorphism $\varphi : f(x) \mapsto f(x^2)$ is not an F[x]-module homomorphism
(if it were, we would have $x^2 = \varphi(x) = \varphi(x \cdot 1) = x\varphi(1) = x$).

(2) Let R be a ring, let $n \in \mathbb{Z}^+$ and let $M = R^n$. One easily checks that for each
$i \in \{1, \dots, n\}$ the projection map

$$\pi_i : R^n \to R \quad \text{by} \quad \pi_i(x_1, \dots, x_n) = x_i$$

is a surjective R-module homomorphism with kernel equal to the submodule of
n-tuples which have a zero in position i.

(3) If R is a field, R-module homomorphisms are called linear transformations. These
will be studied extensively in Chapter 11.

(4) For the ring $R = \mathbb{Z}$ the action of ring elements (integers) on any $\mathbb{Z}$-module amounts to
just adding and subtracting within the (additive) abelian group structure of the module
so that in this case condition (b) of a homomorphism is implied by condition (a). For
example, $\varphi(2x) = \varphi(x + x) = \varphi(x) + \varphi(x) = 2\varphi(x)$, etc. It follows that

    $\mathbb{Z}$-module homomorphisms are the same as abelian group homomorphisms.

(5) Let R be a ring, let I be a 2-sided ideal of R and suppose M and N are R-modules
annihilated by I (i.e., am = 0 and an = 0 for all $a \in I$, $n \in N$ and $m \in M$).
Any R-module homomorphism from N to M is then automatically a homomorphism
of (R/I)-modules (see Example 5 of Section 1). In particular, if A is an additive
abelian group such that for some prime p, px = 0 for all $x \in A$, then any group
homomorphism from A to itself is a $\mathbb{Z}/p\mathbb{Z}$-module homomorphism, i.e., is a linear
transformation over the field $\mathbb{F}_p$. In particular, the group of all (group) automorphisms
of A is the group of invertible linear transformations from A to itself: GL(A).
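
The two counterexamples in Example 1 can be spot-checked on small inputs. This is only a finite sanity check under stated assumptions, not a proof: integers are sampled from a small range, and polynomials in F[x] are modelled as coefficient lists with the constant term first. The helper names `double`, `phi` and `times_x` are illustrative.

```python
def double(x):
    """The Z-module homomorphism x -> 2x on Z."""
    return 2 * x

sample = range(-5, 6)
additive = all(double(x + y) == double(x) + double(y)
               for x in sample for y in sample)
homogeneous = all(double(r * x) == r * double(x)
                  for r in sample for x in sample)
multiplicative = all(double(x * y) == double(x) * double(y)
                     for x in sample for y in sample)
print(additive, homogeneous, multiplicative)   # True True False

def phi(f):
    """The ring homomorphism f(x) -> f(x^2) on coefficient lists."""
    out = []
    for a in f:
        out += [a, 0]                  # a_k x^k becomes a_k x^(2k)
    return out[:-1] if out else out

def times_x(f):
    """Multiplication by x, i.e. the module action of x on F[x]."""
    return [0] + f

one = [1]
print(phi(times_x(one)))               # [0, 0, 1], i.e. x^2
print(times_x(phi(one)))               # [0, 1],    i.e. x
```

The last two lines differ, which is exactly the failure of $\varphi(x \cdot 1) = x\varphi(1)$ noted in the example.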

Proposition 2. Let M, N and L be R-modules.

(1) A map $\varphi : M \to N$ is an R-module homomorphism if and only if
$\varphi(rx + y) = r\varphi(x) + \varphi(y)$ for all $x, y \in M$ and all $r \in R$.

(2) Let $\varphi, \psi$ be elements of $\mathrm{Hom}_R(M, N)$. Define $\varphi + \psi$ by

$$(\varphi + \psi)(m) = \varphi(m) + \psi(m) \quad \text{for all } m \in M.$$

Then $\varphi + \psi \in \mathrm{Hom}_R(M, N)$ and with this operation $\mathrm{Hom}_R(M, N)$ is an abelian
group. If R is a commutative ring then for $r \in R$ define $r\varphi$ by

$$(r\varphi)(m) = r(\varphi(m)) \quad \text{for all } m \in M.$$

Then $r\varphi \in \mathrm{Hom}_R(M, N)$ and with this action of the commutative ring R the
abelian group $\mathrm{Hom}_R(M, N)$ is an R-module.

(3) If $\varphi \in \mathrm{Hom}_R(L, M)$ and $\psi \in \mathrm{Hom}_R(M, N)$ then $\psi \circ \varphi \in \mathrm{Hom}_R(L, N)$.

(4) With addition as above and multiplication defined as function composition,
$\mathrm{Hom}_R(M, M)$ is a ring with 1. When R is commutative $\mathrm{Hom}_R(M, M)$ is an
R-algebra.


Proof: (1) Certainly $\varphi(rx + y) = r\varphi(x) + \varphi(y)$ if $\varphi$ is an R-module homomorphism.
Conversely, if $\varphi(rx + y) = r\varphi(x) + \varphi(y)$, take r = 1 to see that $\varphi$ is additive and take
y = 0 to see that $\varphi$ commutes with the action of R on M (i.e., is homogeneous).

(2) It is straightforward to check that all the abelian group and R-module axioms
hold with these definitions; the details are left as an exercise. We note that the
commutativity of R is used to show that $r\varphi$ satisfies the second axiom of an R-module
homomorphism, namely,

$$\begin{aligned}
(r_1\varphi)(r_2 m) &= r_1 \varphi(r_2 m) && \text{(by definition of } r_1\varphi\text{)} \\
&= r_1 r_2 (\varphi(m)) && \text{(since } \varphi \text{ is a homomorphism)} \\
&= r_2 r_1 (\varphi(m)) && \text{(since } R \text{ is commutative)} \\
&= r_2 (r_1\varphi)(m) && \text{(by definition of } r_1\varphi\text{).}
\end{aligned}$$

Verification of the axioms relies ultimately on the hypothesis that N is an R-module.
The domain M could in fact be any set; it does not have to be an R-module nor an
abelian group.

(3) Let $\varphi$ and $\psi$ be as given and let $r \in R$, $x, y \in L$. Then

$$\begin{aligned}
(\psi \circ \varphi)(rx + y) &= \psi(\varphi(rx + y)) \\
&= \psi(r\varphi(x) + \varphi(y)) && \text{(by (1) applied to } \varphi\text{)} \\
&= r\psi(\varphi(x)) + \psi(\varphi(y)) && \text{(by (1) applied to } \psi\text{)} \\
&= r(\psi \circ \varphi)(x) + (\psi \circ \varphi)(y)
\end{aligned}$$

so, by (1), $\psi \circ \varphi$ is an R-module homomorphism.

(4) Note that since the domain and codomain of the elements of $\mathrm{Hom}_R(M, M)$
are the same, function composition is defined. By (3), it is a binary operation on
$\mathrm{Hom}_R(M, M)$. As usual, function composition is associative. The remaining ring
axioms are straightforward to check; the details are left as an exercise. The identity
function, I, (as usual, I(x) = x, for all $x \in M$) is seen to be the multiplicative identity
of $\mathrm{Hom}_R(M, M)$. If R is commutative, then (2) shows that the ring $\mathrm{Hom}_R(M, M)$ is
a left R-module and defining $\varphi r = r\varphi$ for all $\varphi \in \mathrm{Hom}_R(M, M)$ and $r \in R$ makes
$\mathrm{Hom}_R(M, M)$ into an R-algebra.
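
The operations of Proposition 2 can be illustrated on a small concrete case. The sketch assumes $R = \mathbb{Z}$ and $M = N = L = \mathbb{Z}^2$, with homomorphisms given by 2 x 2 integer matrices and elements by pairs; the criterion of part (1) is tested on a finite sample only, so this is a sanity check rather than a proof. All function names are illustrative.

```python
from itertools import product

def hom(matrix):
    """Z-module homomorphism Z^2 -> Z^2 given by a 2x2 integer matrix."""
    (a, b), (c, d) = matrix
    return lambda v: (a * v[0] + b * v[1], c * v[0] + d * v[1])

def add(phi, psi):
    """Pointwise sum (phi + psi)(m) = phi(m) + psi(m)."""
    return lambda v: tuple(s + t for s, t in zip(phi(v), psi(v)))

def scale(r, phi):
    """Scalar multiple (r phi)(m) = r(phi(m)); R = Z is commutative."""
    return lambda v: tuple(r * s for s in phi(v))

def compose(psi, phi):
    """Composition (psi o phi)(m) = psi(phi(m))."""
    return lambda v: psi(phi(v))

def is_hom(f, sample=range(-2, 3)):
    """Check f(rx + y) = r f(x) + f(y) on a finite sample (Proposition 2(1))."""
    vectors = list(product(sample, repeat=2))
    return all(
        f((r * x[0] + y[0], r * x[1] + y[1]))
        == tuple(r * s + t for s, t in zip(f(x), f(y)))
        for r in sample for x in vectors for y in vectors
    )

phi = hom(((1, 2), (0, 1)))
psi = hom(((0, 1), (1, 0)))
print(is_hom(add(phi, psi)), is_hom(scale(3, phi)), is_hom(compose(psi, phi)))
print(is_hom(lambda v: (v[0] + 1, v[1])))    # a translation is not a homomorphism
```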


Definition. The ring $\mathrm{Hom}_R(M, M)$ is called the endomorphism ring of M and will
often be denoted by $\mathrm{End}_R(M)$, or just End(M) when the ring R is clear from the context.
Elements of End(M) are called endomorphisms.
When R is commutative there is a natural map from R into End(M) given by
$r \mapsto rI$, where the latter endomorphism of M is just multiplication by r on M (cf.
Exercise 7). The image of R is contained in the center of End(M) so if R has an
identity, End(M) is an R-algebra. The ring homomorphism (cf. Exercise 7) from R to
$\mathrm{End}_R(M)$ may not be injective since for some r we may have rm = 0 for all $m \in M$
(e.g., $R = \mathbb{Z}$, $M = \mathbb{Z}/2\mathbb{Z}$, and r = 2). When R is a field, however, this map is injective
(in general, no unit is in the kernel of this map) and the copy of R in $\mathrm{End}_R(M)$ is called
the (subring of) scalar transformations.
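
The failure of injectivity is easy to see computationally. The sketch below assumes $R = \mathbb{Z}$ and $M = \mathbb{Z}/n\mathbb{Z}$ with residues 0, ..., n-1 as elements, and tests whether multiplication by r is the zero endomorphism; `acts_as_zero` is an illustrative name.

```python
def acts_as_zero(r, n):
    """True if multiplication by r is the zero endomorphism of Z/nZ."""
    return all((r * m) % n == 0 for m in range(n))

print(acts_as_zero(2, 2))      # True: r = 2 lies in the kernel for M = Z/2Z
print(acts_as_zero(1, 2))      # False: the unit 1 is never in the kernel
print([r for r in range(-6, 7) if acts_as_zero(r, 3)])   # [-6, -3, 0, 3, 6]
```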

Next we prove that every submodule N of an R-module M is "normal" in the
sense that we can always form the quotient module M/N, and the natural projection
$\pi : M \to M/N$ is an R-module homomorphism with kernel N. The proof of this fact
and, more generally, the subsequent proofs of the isomorphism theorems for modules
follow easily from the corresponding facts for groups. The reason is that a
module is first of all an abelian group and so every submodule is automatically a normal
subgroup and any module homomorphism is, in particular, a homomorphism of abelian
groups, all of which we have already considered in Chapter 3. What remains to be proved
in order to extend results on abelian groups to corresponding results on modules is to
check that the action of R is compatible with these group quotients and homomorphisms.
For example, the map $\pi$ above was shown to be a group homomorphism in Chapter 3
but the abelian group M/N must be shown to be an R-module (i.e., to have an action
by R) and property (b) in the definition of a module homomorphism must be checked
for $\pi$.

Proposition 3. Let R be a ring, let M be an R-module and let N be a submodule of M.
The (additive, abelian) quotient group M/N can be made into an R-module by defining
an action of elements of R by

$$r(x + N) = (rx) + N, \quad \text{for all } r \in R,\ x + N \in M/N.$$

The natural projection map $\pi : M \to M/N$ defined by $\pi(x) = x + N$ is an R-module
homomorphism with kernel N.

Proof: Since M is an abelian group under + the quotient group M/N is defined
and is an abelian group. To see that the action of the ring element r on the coset x + N is
well defined, suppose x + N = y + N, i.e., $x - y \in N$. Since N is a (left) R-submodule,
$r(x - y) \in N$. Thus $rx - ry \in N$ and rx + N = ry + N, as desired. Now since the
operations in M/N are "compatible" with those of M, the axioms for an R-module are
easily checked in the same way as was done for quotient groups. For example, axiom
2(b) holds as follows: for all $r_1, r_2 \in R$ and $x + N \in M/N$, by definition of the action
of ring elements on elements of M/N

$$\begin{aligned}
(r_1 r_2)(x + N) &= (r_1 r_2 x) + N \\
&= r_1(r_2 x + N) \\
&= r_1(r_2(x + N)).
\end{aligned}$$

The other axioms are similarly checked; the details are left as an exercise. Finally,
the natural projection map $\pi$ described above is, in particular, the natural projection
of the abelian group M onto the abelian group M/N hence is a group homomorphism
with kernel N. The kernel of any module homomorphism is the same as its kernel when
viewed as a homomorphism of the abelian group structures. It remains only to show $\pi$
is a module homomorphism, i.e., $\pi(rm) = r\pi(m)$. But

$$\begin{aligned}
\pi(rm) &= rm + N \\
&= r(m + N) && \text{(by definition of the action of } R \text{ on } M/N\text{)} \\
&= r\pi(m).
\end{aligned}$$

This completes the proof.
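
The well-definedness step of the proof can be checked exhaustively in a small case. The sketch assumes $M = \mathbb{Z}/n\mathbb{Z}$ as a $\mathbb{Z}$-module and a submodule N given as a set of residues; cosets are represented as frozensets. The names `coset` and `action_well_defined` are illustrative, and the check is meaningful only when N really is a submodule.

```python
def coset(x, N, n):
    """The coset x + N inside Z/nZ, as a frozenset."""
    return frozenset((x + a) % n for a in N)

def action_well_defined(N, n):
    """Check that r(x + N) = rx + N does not depend on the representative x.

    Assumes N is a submodule of Z/nZ, so every y in x + N satisfies
    y + N = x + N.
    """
    for r in range(n):
        for x in range(n):
            for y in coset(x, N, n):
                if coset(r * x, N, n) != coset(r * y, N, n):
                    return False
    return True

N = {0, 4, 8}                                         # the submodule 4Z/12Z
print(action_well_defined(N, 12))                     # True
print(len({coset(x, N, 12) for x in range(12)}))      # 4 cosets in M/N
```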

All the isomorphism theorems stated for groups also hold for R-modules. The
proofs are similar to that of Proposition 3 above in that they begin by invoking the
corresponding theorem for groups and then prove that the group homomorphisms are
also R-module homomorphisms. To state the Second Isomorphism Theorem we need
the following.

Definition. Let A, B be submodules of the R-module M. The sum of A and B is the
set

$$A + B = \{ a + b \mid a \in A,\ b \in B \}.$$

One can easily check that the sum of two submodules A and B is a submodule and
is the smallest submodule which contains both A and B.

Theorem 4. (Isomorphism Theorems) 

(1) (The First Isomorphism Theorem for Modules) Let M, N be R-modules and let
$\varphi : M \to N$ be an R-module homomorphism. Then $\ker \varphi$ is a submodule of
M and $M/\ker \varphi \cong \varphi(M)$.

(2) (The Second Isomorphism Theorem) Let A, B be submodules of the R-module
M. Then $(A + B)/B \cong A/(A \cap B)$.

(3) (The Third Isomorphism Theorem) Let M be an R-module, and let A and B be
submodules of M with $A \subseteq B$. Then $(M/A)/(B/A) \cong M/B$.

(4) (The Fourth or Lattice Isomorphism Theorem) Let N be a submodule of the
R-module M. There is a bijection between the submodules of M which contain
N and the submodules of M/N. The correspondence is given by $A \leftrightarrow A/N$,
for all $A \supseteq N$. This correspondence commutes with the processes of taking
sums and intersections (i.e., is a lattice isomorphism between the lattice of
submodules of M/N and the lattice of submodules of M which contain N).


Proof: Exercise. 
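
The Second Isomorphism Theorem can be made concrete for cyclic submodules of $\mathbb{Z}/n\mathbb{Z}$. The sketch below assumes that setting and checks the two facts that, combined with the First Isomorphism Theorem, give $(A + B)/B \cong A/(A \cap B)$: the map $a \mapsto a + B$ from A hits every coset of B in A + B, and its kernel is $A \cap B$. The names `generated` and `check_second_iso` are illustrative.

```python
def generated(d, n):
    """The cyclic submodule of Z/nZ generated by d."""
    return frozenset((d * k) % n for k in range(n))

def check_second_iso(a, b, n):
    """Check surjectivity and kernel of A -> (A + B)/B, x -> x + B."""
    A, B = generated(a, n), generated(b, n)
    total = frozenset((x + y) % n for x in A for y in B)        # A + B

    def coset(x):
        return frozenset((x + y) % n for y in B)                # x + B

    image = {coset(x) for x in A}               # image of A in (A + B)/B
    kernel = frozenset(x for x in A if coset(x) == B)
    return image == {coset(x) for x in total} and kernel == A & B

print(check_second_iso(4, 6, 24))   # True
```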
EXERCISES


In these exercises R is a ring with 1 and M is a left R -module. 

1. Use the submodule criterion to show that kernels and images of R-module homomorphisms
are submodules.

2. Show that the relation "is R-module isomorphic to" is an equivalence relation on any set
of R-modules.

3. Give an explicit example of a map from one R-module to another which is a group
homomorphism but not an R-module homomorphism.

4. Let A be any $\mathbb{Z}$-module, let a be any element of A and let n be a positive integer. Prove that
the map $\varphi_a : \mathbb{Z}/n\mathbb{Z} \to A$ given by $\varphi_a(\bar{k}) = ka$ is a well defined $\mathbb{Z}$-module homomorphism
if and only if na = 0. Prove that $\mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, A) \cong A_n$, where $A_n = \{ a \in A \mid na = 0 \}$
(so $A_n$ is the annihilator in A of the ideal (n) of $\mathbb{Z}$; cf. Exercise 10, Section 1).

5. Exhibit all $\mathbb{Z}$-module homomorphisms from $\mathbb{Z}/30\mathbb{Z}$ to $\mathbb{Z}/21\mathbb{Z}$.

6. Prove that $\mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \mathbb{Z}/m\mathbb{Z}) \cong \mathbb{Z}/(n, m)\mathbb{Z}$.

7. Let z be a fixed element of the center of R. Prove that the map $m \mapsto zm$ is an
R-module homomorphism from M to itself. Show that for a commutative ring R the map
from R to $\mathrm{End}_R(M)$ given by $r \mapsto rI$ is a ring homomorphism (where I is the identity
endomorphism).

8. Let $\varphi : M \to N$ be an R-module homomorphism. Prove that $\varphi(\mathrm{Tor}(M)) \subseteq \mathrm{Tor}(N)$ (cf.
Exercise 8 in Section 1).

9. Let R be a commutative ring. Prove that $\mathrm{Hom}_R(R, M)$ and M are isomorphic as left
R-modules. [Show that each element of $\mathrm{Hom}_R(R, M)$ is determined by its value on the
identity of R.]

10. Let R be a commutative ring. Prove that $\mathrm{Hom}_R(R, R)$ and R are isomorphic as rings.

11. Let $A_1, A_2, \dots, A_n$ be R-modules and let $B_i$ be a submodule of $A_i$ for each i = 1, 2, ..., n.
Prove that

$$(A_1 \times \cdots \times A_n)/(B_1 \times \cdots \times B_n) \cong (A_1/B_1) \times \cdots \times (A_n/B_n).$$

[Recall Exercise 14 in Section 5.1.]

12. Let I be a left ideal of R and let n be a positive integer. Prove

$$R^n/IR^n \cong R/IR \times \cdots \times R/IR \quad (n \text{ times})$$

where $IR^n$ is defined as in Exercise 5 of Section 1. [Use the preceding exercise.]

13. Let I be a nilpotent ideal in a commutative ring R (cf. Exercise 37, Section 7.3), let M
and N be R-modules and let $\varphi : M \to N$ be an R-module homomorphism. Show that if
the induced map $\bar{\varphi} : M/IM \to N/IN$ is surjective, then $\varphi$ is surjective.

14. Let $R = \mathbb{Z}[x]$ be the ring of polynomials in x and let $A = \mathbb{Z}[t_1, t_2, \dots]$ be the ring of
polynomials in the independent indeterminates $t_1, t_2, \dots$. Define an action of R on A as
follows: 1) let $1 \in R$ act on A as the identity, 2) for $n \ge 1$ let $x^n \circ 1 = t_n$, let $x^n \circ t_i = t_{n+i}$
for i = 1, 2, ..., and let $x^n$ act as 0 on monomials in A of (total) degree at least two, and
3) extend $\mathbb{Z}$-linearly, i.e., so that the module axioms 2(a) and 2(c) are satisfied.

(a) Show that $x^{p+q} \circ t_i = x^p \circ (x^q \circ t_i) = t_{p+q+i}$ and use this to show that under this
action the ring A is a (unital) R-module.

(b) Show that the map $\varphi : R \to A$ defined by $\varphi(r) = r \circ 1_A$ is an R-module homomorphism
of the ring R into the ring A mapping $1_R$ to $1_A$, but is not a ring homomorphism
from R to A.
10.3 GENERATION OF MODULES, DIRECT SUMS, AND
FREE MODULES


Let R be a ring with 1. As in the preceding sections the term “module” will mean “left 
module.” We first extend the notion of the sum of two submodules to sums of any finite 
number of submodules and define the submodule generated by a subset.

Definition. Let M be an R-module and let $N_1, \dots, N_n$ be submodules of M.

(1) The sum of $N_1, \dots, N_n$ is the set of all finite sums of elements from the sets $N_i$:
$\{ a_1 + a_2 + \cdots + a_n \mid a_i \in N_i \text{ for all } i \}$. Denote this sum by $N_1 + \cdots + N_n$.

(2) For any subset A of M let

$$RA = \{ r_1 a_1 + r_2 a_2 + \cdots + r_m a_m \mid r_1, \dots, r_m \in R,\ a_1, \dots, a_m \in A,\ m \in \mathbb{Z}^+ \}$$

(where by convention RA = {0} if $A = \emptyset$). If A is the finite set $\{a_1, a_2, \dots, a_n\}$
we shall write $Ra_1 + Ra_2 + \cdots + Ra_n$ for RA. Call RA the submodule of M
generated by A. If N is a submodule of M (possibly N = M) and N = RA,
for some subset A of M, we call A a set of generators or generating set for N,
and we say N is generated by A.

(3) A submodule N of M (possibly N = M) is finitely generated if there is some
finite subset A of M such that N = RA, that is, if N is generated by some finite
subset.

(4) A submodule N of M (possibly N = M) is cyclic if there exists an element
$a \in M$ such that N = Ra, that is, if N is generated by one element:

$$N = Ra = \{ ra \mid r \in R \}.$$

Note that these definitions do not require that the ring R contain a 1, however
this condition ensures that A is contained in RA. It is easy to see using the Submodule
Criterion that for any subset A of M, RA is indeed a submodule of M and is the smallest
submodule of M which contains A (i.e., any submodule of M which contains A also
contains RA). In particular, for submodules $N_1, \dots, N_n$ of M, $N_1 + \cdots + N_n$ is just
the submodule generated by the set $N_1 \cup \cdots \cup N_n$ and is the smallest submodule of M
containing $N_i$, for all i. If $N_1, \dots, N_n$ are generated by sets $A_1, \dots, A_n$ respectively,
then $N_1 + \cdots + N_n$ is generated by $A_1 \cup \cdots \cup A_n$. Note that cyclic modules are, a
fortiori, finitely generated.
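
For a finite module the submodule RA can simply be enumerated. The sketch below assumes $M = \mathbb{Z}/n\mathbb{Z}$ as a $\mathbb{Z}$-module (so coefficients r need only run over `range(n)`); `generated_by` is an illustrative name. It also illustrates the remark that a sum of submodules is generated by the union of their generating sets.

```python
def generated_by(A, n):
    """The submodule RA of M = Z/nZ (as a Z-module) generated by the subset A."""
    span = {0}                          # RA = {0} when A is empty
    for a in A:
        # adjoin every multiple of a to each element found so far
        span = {(s + r * a) % n for s in span for r in range(n)}
    return span

N1 = generated_by({4}, 12)              # {0, 4, 8}
N2 = generated_by({6}, 12)              # {0, 6}
sum_N1_N2 = {(x + y) % 12 for x in N1 for y in N2}
print(sorted(generated_by({4, 6}, 12)))          # [0, 2, 4, 6, 8, 10]
print(sum_N1_N2 == generated_by({4, 6}, 12))     # True
```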

A submodule N of an /?-module M may have many different generating sets (for 
instance the set N itself always generates N). If N is finitely generated, then there is a 
smallest nonnegative integer d such that N is generated by d elements (and no fewer). 
Any generating set consisting of d elements will be called a minimal set of generators 
for N (it is not unique in general). If N is not finitely generated, it need not have a 
minimal generating set. 

The process of generating submodules of an R-module M by taking subsets A of
M and forming all finite "R-linear combinations" of elements of A will be our primary
way of producing submodules (this notion is perhaps familiar from vector space theory
where it is referred to as taking the span of A). The obstruction which made the
analogous process so difficult for groups in general was the noncommutativity of group
operations. For abelian groups, G, however, it was much simpler to control the subgroup
$\langle A \rangle$ generated by A, for a subset A of G (see Section 2.4 for the complete discussion
of this). The situation for R-modules is similar to that of abelian groups (even if R is
a noncommutative ring) because we can always collect "like terms" in elements of A,
i.e., terms such as $r_1 a_1 + r_2 a_2 + s_1 a_1$ can always be simplified to $(r_1 + s_1) a_1 + r_2 a_2$.
This again reflects the underlying abelian group structure of modules.

Examples 

(1) Let $R = \mathbb{Z}$ and let M be any R-module, that is, any abelian group. If $a \in M$, then 
$\mathbb{Z}a$ is just the cyclic subgroup of M generated by a: $\langle a \rangle$ (compare Definition 4 above 
with the definition of a cyclic group). More generally, M is generated as a Z-module 
by a set A if and only if M is generated as a group by A (that is, the action of ring 
elements in this instance produces no elements that cannot already be obtained from 
A by addition and subtraction). The definition of finitely generated for Z-modules is 
identical to that for abelian groups found in Chapter 5. 

(2) Let R be a ring with 1 and let M be the (left) R-module R itself. Note that R is 
a finitely generated, in fact cyclic, R-module because R = R1 (i.e., we can take 
A = {1}). Recall that the submodules of R are precisely the left ideals of R, so saying 
I is a cyclic R-submodule of the left R-module R is the same as saying I is a principal 
ideal of R (usually the term “principal ideal” is used in the context of commutative 
rings). Also, saying I is a finitely generated R-submodule of R is the same as saying 
I is a finitely generated ideal. When R is a commutative ring we often write AR or 
aR for the submodule (ideal) generated by A or a respectively, as we have been doing 
for Z when we wrote $n\mathbb{Z}$. In this situation AR = RA and aR = Ra (elementwise 
as well). Thus a Principal Ideal Domain is a (commutative) integral domain R with 
identity in which every R-submodule of R is cyclic. 

Submodules of a finitely generated module need not be finitely generated: take 
M to be the cyclic R-module R itself where R is the polynomial ring in infinitely 
many variables $x_1, x_2, x_3, \dots$ with coefficients in some field F. The submodule (i.e., 
2-sided ideal) generated by $\{x_1, x_2, \dots\}$ cannot be generated by any finite set (note 
that one must show that no finite subset of this ideal will generate it). 

(3) Let R be a ring with 1 and let M be the free module of rank n over R, as described in 
the first section. For each $i \in \{1, 2, \dots, n\}$ let $e_i = (0, 0, \dots, 0, 1, 0, \dots, 0)$, where 
the 1 appears in position i. Since 

$$(s_1, s_2, \dots, s_n) = \sum_{i=1}^{n} s_i e_i$$

it is clear that M is generated by $\{e_1, \dots, e_n\}$. If R is commutative then this is a 
minimal generating set (cf. Exercises 2 and 27). 

(4) Let F be a field, let x be an indeterminate, let V be a vector space over F and let 
T be a linear transformation from V to V. Make V into an F[x]-module via T. 
Then V is a cyclic F[x]-module (with generator v) if and only if $V = \{p(x)v \mid 
p(x) \in F[x]\}$, that is, if and only if every element of V can be written as an F-linear 
combination of elements of the set $\{T^n(v) \mid n \ge 0\}$. This in turn is equivalent to 
saying $\{v, T(v), T^2(v), \dots\}$ span V as a vector space over F. 

For instance if T is the identity linear transformation from V to V or the zero linear 
transformation, then for every $v \in V$ and every $p(x) \in F[x]$ we have $p(x)v = \alpha v$ for 
some $\alpha \in F$. Thus if V has dimension > 1, V cannot be a cyclic F[x]-module. 
For another example suppose V is affine n-space and T is the “shift operator” 
described in Section 1. Let $e_i$ be the $i$th basis vector (as usual) numbered so that T 
is defined by $T^k(e_n) = e_{n-k}$ for $1 \le k < n$. Thus V is spanned by the elements 
$e_n, T(e_n), \dots, T^{n-1}(e_n)$, that is, V is a cyclic F[x]-module with generator $e_n$. For 
n > 1, V is not, however, a cyclic F-module (i.e., is not a 1-dimensional vector space 
over F). 
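The shift operator example can be checked numerically. The following Python sketch assumes the convention that T sends $e_k$ to $e_{k-1}$ and $e_1$ to 0, so that T acts on coordinate vectors by $(c_1, \dots, c_n) \mapsto (c_2, \dots, c_n, 0)$; the function names `shift`, `orbit` and `basis_vector` are hypothetical helpers introduced for illustration, with n = 4 chosen arbitrarily.

```python
def shift(v):
    """Apply T to the coordinate vector v = [c_1, ..., c_n]."""
    # The coefficient of e_k in T(v) is the coefficient of e_{k+1} in v.
    return v[1:] + [0]

def orbit(v, n):
    """Return the list [v, T(v), ..., T^(n-1)(v)]."""
    out = []
    for _ in range(n):
        out.append(v)
        v = shift(v)
    return out

def basis_vector(i, n):
    """Return e_i as a coordinate vector of length n (positions 1..n)."""
    return [1 if j == i else 0 for j in range(1, n + 1)]

n = 4
e_n = basis_vector(n, n)
vectors = orbit(e_n, n)   # e_n, T(e_n), ..., T^(n-1)(e_n)
```

The list `vectors` consists of $e_n, e_{n-1}, \dots, e_1$ in that order, which is the standard basis, so these elements span V.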

Definition. Let $M_1, \dots, M_k$ be a collection of R-modules. The collection of k-tuples 
$(m_1, m_2, \dots, m_k)$ where $m_i \in M_i$ with addition and action of R defined componentwise 
is called the direct product of $M_1, \dots, M_k$, denoted $M_1 \times \cdots \times M_k$. 

It is evident that the direct product of a collection of R-modules is again an R-module. 
The direct product of $M_1, \dots, M_k$ is also referred to as the (external) direct 
sum of $M_1, \dots, M_k$ and denoted $M_1 \oplus \cdots \oplus M_k$. The direct product and direct sum of 
an infinite number of modules (which are different in general) are defined in Exercise 
20. 

The next proposition indicates when a module is isomorphic to the direct product 
of some of its submodules and is the analogue for modules of Theorem 9 in Section 5.4 
(which determines when a group is the direct product of two of its subgroups). 

Proposition 5. Let $N_1, N_2, \dots, N_k$ be submodules of the R-module M. Then the 
following are equivalent: 

(1) The map $\pi : N_1 \times N_2 \times \cdots \times N_k \to N_1 + N_2 + \cdots + N_k$ defined by 

$$\pi(a_1, a_2, \dots, a_k) = a_1 + a_2 + \cdots + a_k$$

is an isomorphism (of R-modules): $N_1 + N_2 + \cdots + N_k \cong N_1 \times N_2 \times \cdots \times N_k$. 

(2) $N_j \cap (N_1 + N_2 + \cdots + N_{j-1} + N_{j+1} + \cdots + N_k) = 0$ for all $j \in \{1, 2, \dots, k\}$. 

(3) Every $x \in N_1 + \cdots + N_k$ can be written uniquely in the form $a_1 + a_2 + \cdots + a_k$ 
with $a_i \in N_i$. 


Proof: To prove (1) implies (2), suppose for some j that (2) fails to hold and let 
$a_j \in (N_1 + \cdots + N_{j-1} + N_{j+1} + \cdots + N_k) \cap N_j$, with $a_j \ne 0$. Then 

$$a_j = a_1 + \cdots + a_{j-1} + a_{j+1} + \cdots + a_k$$

for some $a_i \in N_i$, and $(a_1, \dots, a_{j-1}, -a_j, a_{j+1}, \dots, a_k)$ would be a nonzero element 
of $\ker \pi$, a contradiction. 

Assume now that (2) holds. If for some module elements $a_i, b_i \in N_i$ we have 

$$a_1 + a_2 + \cdots + a_k = b_1 + b_2 + \cdots + b_k$$

then for each j we have 

$$a_j - b_j = (b_1 - a_1) + \cdots + (b_{j-1} - a_{j-1}) + (b_{j+1} - a_{j+1}) + \cdots + (b_k - a_k).$$

The left hand side is in $N_j$ and the right side belongs to $N_1 + \cdots + N_{j-1} + N_{j+1} + \cdots + N_k$. 
Thus 

$$a_j - b_j \in N_j \cap (N_1 + \cdots + N_{j-1} + N_{j+1} + \cdots + N_k) = 0.$$

This shows $a_j = b_j$ for all j, and so (2) implies (3). 

Finally, to see that (3) implies (1) observe first that the map $\pi$ is clearly a surjective 
R-module homomorphism. Then (3) simply implies $\pi$ is injective, hence is an 
isomorphism, completing the proof. 
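To see conditions (2) and (3) of Proposition 5 in a concrete case, the following Python sketch takes R = Z and M = Z/6Z with the submodules $N_1 = \{0, 3\}$ and $N_2 = \{0, 2, 4\}$. This choice of module and the variable names are assumptions made only for illustration.

```python
from itertools import product

n = 6
N1 = {0, 3}      # the submodule generated by 3 in Z/6Z
N2 = {0, 2, 4}   # the submodule generated by 2 in Z/6Z

# Condition (2) for k = 2: the two submodules meet only in 0.
intersection = N1 & N2

# Condition (3): record every way of writing x as a + b with a in N1, b in N2.
representations = {}
for a, b in product(N1, N2):
    representations.setdefault((a + b) % n, []).append((a, b))
```

Each of the six elements of Z/6Z appears as a key of `representations` with exactly one pair, so in this example Z/6Z is the internal direct sum of $N_1$ and $N_2$.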

If an R-module $M = N_1 + N_2 + \cdots + N_k$ is the sum of submodules $N_1, N_2, \dots, N_k$ 
of M satisfying the equivalent conditions of the proposition above, then M is said to be 
the (internal) direct sum of $N_1, N_2, \dots, N_k$, written 

$$M = N_1 \oplus N_2 \oplus \cdots \oplus N_k.$$

By the proposition, this is equivalent to the assertion that every element m of M can be 
written uniquely as a sum of elements $m = n_1 + n_2 + \cdots + n_k$ with $n_i \in N_i$. (Note that 
part (1) of the proposition is the statement that the internal direct sum of $N_1, N_2, \dots, N_k$ 
is isomorphic to their external direct sum, which is the reason we identify them and use 
the same notation for both.) 

Definition. An R-module F is said to be free on the subset A of F if for every 
nonzero element x of F, there exist unique nonzero elements $r_1, r_2, \dots, r_n$ of R and 
unique $a_1, a_2, \dots, a_n$ in A such that $x = r_1a_1 + r_2a_2 + \cdots + r_na_n$, for some $n \in \mathbb{Z}^+$. In 
this situation we say A is a basis or set of free generators for F. If R is a commutative 
ring the cardinality of A is called the rank of F (cf. Exercise 27). 

One should be careful to note the difference between the uniqueness property of 
direct sums (Proposition 5(3)) and the uniqueness property of free modules. Namely, 
in the direct sum of two modules, say $N_1 \oplus N_2$, each element can be written uniquely 
as $n_1 + n_2$; here the uniqueness refers to the module elements $n_1$ and $n_2$. In the case of 
free modules, the uniqueness is on the ring elements as well as the module elements. 
For example, if $R = \mathbb{Z}$ and $N_1 = N_2 = \mathbb{Z}/2\mathbb{Z}$, then each element of $N_1 \oplus N_2$ has a 
unique representation in the form $n_1 + n_2$ where each $n_i \in N_i$; however, $n_1$ (for instance) 
can be expressed as $n_1$ or $3n_1$ or $5n_1$, etc., so each element does not have a unique 
representation in the form $r_1a_1 + r_2a_2$, where $r_1, r_2 \in R$, $a_1 \in N_1$ and $a_2 \in N_2$. Thus 
$\mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ is not a free Z-module on the set {(1, 0), (0, 1)}. Similarly, it is not free 
on any set. 
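The failure of uniqueness of coefficients in $\mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ can be seen directly. The Python sketch below models elements as pairs of residues mod 2 and lists the integers r in a small window with $r \cdot (1, 0) = (1, 0)$; the function name `scale` and the window from -5 to 5 are choices made only for this illustration.

```python
def scale(r, v, modulus=2):
    """Return r*v for an integer r and v in Z/2Z x Z/2Z (componentwise)."""
    return tuple((r * c) % modulus for c in v)

a1, a2 = (1, 0), (0, 1)

# Every odd integer r gives r*a1 = a1, so the coefficient of a1 is not unique.
coefficients = [r for r in range(-5, 6) if scale(r, a1) == a1]

# Likewise 2*a2 = 0*a2, although 2 and 0 are different ring elements.
collapsed = scale(2, a2) == scale(0, a2)
```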

Theorem 6. For any set A there is a free R-module F(A) on the set A and F(A) satisfies 
the following universal property: if M is any R-module and $\varphi : A \to M$ is any map 
of sets, then there is a unique R-module homomorphism $\Phi : F(A) \to M$ such that 
$\Phi(a) = \varphi(a)$, for all $a \in A$, that is, the following diagram commutes. 

[Diagram: the triangle formed by the inclusion $A \to F(A)$, the map $\varphi : A \to M$, and $\Phi : F(A) \to M$.] 

When A is the finite set $\{a_1, a_2, \dots, a_n\}$, $F(A) = Ra_1 \oplus Ra_2 \oplus \cdots \oplus Ra_n \cong R^n$. 
(Compare: Section 6.3, free groups.) 

Proof: Let $F(A) = \{0\}$ if $A = \emptyset$. If A is nonempty let F(A) be the collection of 
all set functions $f : A \to R$ such that f(a) = 0 for all but finitely many $a \in A$. Make 
F(A) into an R-module by pointwise addition of functions and pointwise multiplication 
of a ring element times a function, i.e., 

$$(f + g)(a) = f(a) + g(a) \quad\text{and}\quad (rf)(a) = r(f(a)), \quad\text{for all } a \in A,\ r \in R \text{ and } f, g \in F(A).$$

It is an easy matter to check that all the R-module axioms hold (the details are omitted). 
Identify A as a subset of F(A) by $a \mapsto f_a$, where $f_a$ is the function which is 1 at a and 
zero elsewhere. We can, in this way, think of F(A) as all finite R-linear combinations 
of elements of A by identifying each function f with the sum $r_1a_1 + r_2a_2 + \cdots + r_na_n$, 
where f takes on the value $r_i$ at $a_i$ and is zero at all other elements of A. Moreover, 
each element of F(A) has a unique expression as such a formal sum. To establish the 
universal property of F(A) suppose $\varphi : A \to M$ is a map of the set A into the R-module 
M. Define $\Phi : F(A) \to M$ by 

$$\Phi : \sum_{i=1}^{n} r_i a_i \mapsto \sum_{i=1}^{n} r_i \varphi(a_i).$$

By the uniqueness of the expression for the elements of F(A) as linear combinations 
of the $a_i$ we see easily that $\Phi$ is a well defined R-module homomorphism (the details 
are left as an exercise). By definition, the restriction of $\Phi$ to A equals $\varphi$. Finally, since 
F(A) is generated by A, once we know the values of an R-module homomorphism on 
A its values on every element of F(A) are uniquely determined, so $\Phi$ is the unique 
extension of $\varphi$ to all of F(A). 

When A is the finite set $\{a_1, a_2, \dots, a_n\}$, Proposition 5(3) shows that 
$F(A) = Ra_1 \oplus Ra_2 \oplus \cdots \oplus Ra_n$. Since $R \cong Ra_i$ for all i (under the map $r \mapsto ra_i$), 
Proposition 5(1) shows that the direct sum is isomorphic to $R^n$. 


Corollary 7. 

(1) If $F_1$ and $F_2$ are free modules on the same set A, there is a unique isomorphism 
between $F_1$ and $F_2$ which is the identity map on A. 

(2) If F is any free R-module with basis A, then $F \cong F(A)$. In particular, F enjoys 
the same universal property with respect to A as F(A) does in Theorem 6. 


Proof: Exercise. 

If F is a free R-module with basis A, we shall often (particularly in the case of 
vector spaces) define R-module homomorphisms from F into other R-modules simply 
by specifying their values on the elements of A and then saying “extend by linearity.” 
Corollary 7(2) ensures that this is permissible. 
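The phrase “extend by linearity” can be made concrete. In the following Python sketch, which is an illustration under stated assumptions rather than a general implementation, an element of F(A) is modeled as a dictionary sending finitely many elements of A to nonzero coefficients, and the module M is described by user-supplied `add`, `scale` and `zero`. The names `extend_by_linearity` and `add_formal`, and the example with R = Z and M = Z/6Z, are hypothetical choices made here.

```python
def extend_by_linearity(phi, add, scale, zero):
    """Return the map Phi on F(A) determined by a set map phi : A -> M.

    Elements of F(A) are dicts {a: r} listing finitely many coefficients;
    add, scale and zero describe the target module M.
    """
    def Phi(f):
        total = zero
        for a, r in f.items():
            total = add(total, scale(r, phi(a)))
        return total
    return Phi

def add_formal(f, g):
    """Add two elements of F(A), dropping zero coefficients."""
    h = dict(f)
    for a, r in g.items():
        h[a] = h.get(a, 0) + r
    return {a: r for a, r in h.items() if r != 0}

# R = Z, A = {"x", "y"}, M = Z/6Z, with phi(x) = 2 and phi(y) = 3.
phi = {"x": 2, "y": 3}.__getitem__
Phi = extend_by_linearity(
    phi,
    add=lambda m1, m2: (m1 + m2) % 6,
    scale=lambda r, m: (r * m) % 6,
    zero=0,
)

f = {"x": 2, "y": 1}     # the formal sum 2x + y
g = {"x": -2, "y": 4}    # the formal sum -2x + 4y
```

By construction `Phi` agrees with `phi` on the basis elements, and on this pair of elements one can check additivity: `Phi(add_formal(f, g))` equals `(Phi(f) + Phi(g)) % 6`.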

When $R = \mathbb{Z}$, the free module on a set A is called the free abelian group on A. If 
|A| = n, F(A) is called the free abelian group of rank n and is isomorphic to $\mathbb{Z} \oplus \cdots \oplus \mathbb{Z}$ 
(n times). These definitions agree with the ones given in Chapter 5. 

EXERCISES 


In these exercises R is a ring with 1 and M is a left R-module. 

1. Prove that if A and B are sets of the same cardinality, then the free modules F(A) and 
F(B) are isomorphic. 

2. Assume R is commutative. Prove that $R^n \cong R^m$ if and only if n = m, i.e., two free 
R-modules of finite rank are isomorphic if and only if they have the same rank. [Apply 
Exercise 12 of Section 2 with I a maximal ideal of R. You may assume that if F is a field, 
then $F^n \cong F^m$ if and only if n = m, i.e., two finite dimensional vector spaces over F 
are isomorphic if and only if they have the same dimension; this will be proved later in 
Section 11.1.] 

3. Show that the F[x]-modules in Exercises 18 and 19 of Section 1 are both cyclic. 

4. An R-module M is called a torsion module if for each $m \in M$ there is a nonzero element 
$r \in R$ such that rm = 0, where r may depend on m (i.e., M = Tor(M) in the notation of 
Exercise 8 of Section 1). Prove that every finite abelian group is a torsion Z-module. Give 
an example of an infinite abelian group that is a torsion Z-module. 

5. Let R be an integral domain. Prove that every finitely generated torsion R-module has a 
nonzero annihilator, i.e., there is a nonzero element $r \in R$ such that rm = 0 for all $m \in M$ 
(here r does not depend on m; the annihilator of a module was defined in Exercise 9 of 
Section 1). Give an example of a torsion R-module whose annihilator is the zero ideal. 

6. Prove that if M is a finitely generated R-module that is generated by n elements then every 
quotient of M may be generated by n (or fewer) elements. Deduce that quotients of cyclic 
modules are cyclic. 

7. Let N be a submodule of M. Prove that if both M/N and N are finitely generated then so 
is M. 

8. Let S be the collection of sequences $(a_1, a_2, a_3, \dots)$ of integers $a_1, a_2, a_3, \dots$ where all 
but finitely many of the $a_i$ are 0 (called the direct sum of infinitely many copies of Z). 
Recall that S is a ring under componentwise addition and multiplication and S does not 
have a multiplicative identity (cf. Exercise 20, Section 7.1). Prove that S is not finitely 
generated as a module over itself. 

9. An R-module M is called irreducible if $M \ne 0$ and if 0 and M are the only submodules 
of M. Show that M is irreducible if and only if $M \ne 0$ and M is a cyclic module with any 
nonzero element as generator. Determine all the irreducible Z-modules. 

10. Assume R is commutative. Show that an R-module M is irreducible if and only if M is 
isomorphic (as an R-module) to R/I where I is a maximal ideal of R. [By the previous 
exercise, if M is irreducible there is a natural map $R \to M$ defined by $r \mapsto rm$, where m 
is any fixed nonzero element of M.] 

11. Show that if $M_1$ and $M_2$ are irreducible R-modules, then any nonzero R-module 
homomorphism from $M_1$ to $M_2$ is an isomorphism. Deduce that if M is irreducible then $\mathrm{End}_R(M)$ is 
a division ring (this result is called Schur's Lemma). [Consider the kernel and the image.] 

12. Let R be a commutative ring and let A, B and M be R-modules. Prove the following 
isomorphisms of R-modules: 

(a) $\mathrm{Hom}_R(A \times B, M) \cong \mathrm{Hom}_R(A, M) \times \mathrm{Hom}_R(B, M)$ 

(b) $\mathrm{Hom}_R(M, A \times B) \cong \mathrm{Hom}_R(M, A) \times \mathrm{Hom}_R(M, B)$. 

13. Let R be a commutative ring and let F be a free R-module of finite rank. Prove the 
following isomorphism of R-modules: $\mathrm{Hom}_R(F, R) \cong F$. 

14. Let R be a commutative ring and let F be the free R-module of rank n. Prove that 
$\mathrm{Hom}_R(F, M) \cong M \times \cdots \times M$ (n times). [Use Exercise 9 in Section 2 and Exercise 12.] 

15. An element $e \in R$ is called a central idempotent if $e^2 = e$ and er = re for all $r \in R$. If e is 
a central idempotent in R, prove that $M = eM \oplus (1 - e)M$. [Recall Exercise 14 in Section 
1.] 

The next two exercises establish the Chinese Remainder Theorem for modules (cf. Section 7.6). 

16. For any ideal I of R let IM be the submodule defined in Exercise 5 of Section 1. Let 
$A_1, \dots, A_k$ be any ideals in the ring R. Prove that the map 

$$M \to M/A_1M \times \cdots \times M/A_kM \quad\text{defined by}\quad m \mapsto (m + A_1M, \dots, m + A_kM)$$

is an R-module homomorphism with kernel $A_1M \cap A_2M \cap \cdots \cap A_kM$. 

17. In the notation of the preceding exercise, assume further that the ideals $A_1, \dots, A_k$ are 
pairwise comaximal (i.e., $A_i + A_j = R$ for all $i \ne j$). Prove that 

$$M/(A_1 \cdots A_k)M \cong M/A_1M \times \cdots \times M/A_kM.$$

[See the proof of the Chinese Remainder Theorem for rings in Section 7.6.] 

18. Let R be a Principal Ideal Domain and let M be an R-module that is annihilated by the 
nonzero, proper ideal (a). Let $a = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ be the unique factorization of a into 
distinct prime powers in R. Let $M_i$ be the annihilator of $p_i^{\alpha_i}$ in M, i.e., $M_i$ is the set 
$\{m \in M \mid p_i^{\alpha_i} m = 0\}$, called the $p_i$-primary component of M. Prove that 

$$M = M_1 \oplus M_2 \oplus \cdots \oplus M_k.$$

19. Show that if M is a finite abelian group of order $a = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ then, considered as a 
Z-module, M is annihilated by (a), the $p_i$-primary component of M is the unique Sylow 
$p_i$-subgroup of M and M is isomorphic to the direct product of its Sylow subgroups. 

20. Let I be a nonempty index set and for each $i \in I$ let $M_i$ be an R-module. The direct product 
of the modules $M_i$ is defined to be their direct product as abelian groups (cf. Exercise 15 
in Section 5.1) with the action of R componentwise multiplication. The direct sum of the 
modules $M_i$ is defined to be the restricted direct product of the abelian groups $M_i$ (cf. 
Exercise 17 in Section 5.1) with the action of R componentwise multiplication. In other 
words, the direct sum of the $M_i$'s is the subset of the direct product, $\prod_{i \in I} M_i$, which consists 
of all elements $\prod_{i \in I} m_i$ such that only finitely many of the components $m_i$ are nonzero; 
the action of R on the direct product or direct sum is given by $r \prod_{i \in I} m_i = \prod_{i \in I} rm_i$ (cf. 
Appendix I for the definition of Cartesian products of infinitely many sets). The direct 
sum will be denoted by $\bigoplus_{i \in I} M_i$. 

(a) Prove that the direct product of the $M_i$'s is an R-module and the direct sum of the 
$M_i$'s is a submodule of their direct product. 

(b) Show that if $R = \mathbb{Z}$, $I = \mathbb{Z}^+$ and $M_i$ is the cyclic group of order i for each i, then the 
direct sum of the $M_i$'s is not isomorphic to their direct product. [Look at torsion.] 

21. Let I be a nonempty index set and for each $i \in I$ let $N_i$ be a submodule of M. Prove that 
the following are equivalent: 

(i) the submodule of M generated by all the $N_i$'s is isomorphic to the direct sum of the 
$N_i$'s 

(ii) if $\{i_1, i_2, \dots, i_k\}$ is any finite subset of I then $N_{i_1} \cap (N_{i_2} + \cdots + N_{i_k}) = 0$ 

(iii) if $\{i_1, i_2, \dots, i_k\}$ is any finite subset of I then $N_{i_1} + \cdots + N_{i_k} = N_{i_1} \oplus \cdots \oplus N_{i_k}$ 

(iv) for every element x of the submodule of M generated by the $N_i$'s there are unique 
elements $a_i \in N_i$ for all $i \in I$ such that all but a finite number of the $a_i$ are zero and 
x is the (finite) sum of the $a_i$. 


22. Let R be a Principal Ideal Domain, let M be a torsion R-module (cf. Exercise 4) and let p 
be a prime in R (do not assume M is finitely generated, hence it need not have a nonzero 
annihilator; cf. Exercise 5). The p-primary component of M is the set of all elements 
of M that are annihilated by some positive power of p. 

(a) Prove that the p-primary component is a submodule. [See Exercise 13 in Section 1.] 

(b) Prove that this definition of p-primary component agrees with the one given in Exer- 
cise 18 when M has a nonzero annihilator. 

(c) Prove that M is the (possibly infinite) direct sum of its p-primary components, as p 
runs over all primes of R. 

23. Show that any direct sum of free R-modules is free. 

24. (An arbitrary direct product of free modules need not be free) For each positive integer i 
let $M_i$ be the free Z-module Z, and let M be the direct product $\prod_{i \in \mathbb{Z}^+} M_i$ (cf. Exercise 
20). Each element of M can be written uniquely in the form $(a_1, a_2, a_3, \dots)$ with $a_i \in \mathbb{Z}$ 
for all i. Let N be the submodule of M consisting of all such tuples with only finitely 
many nonzero $a_i$. Assume M is a free Z-module with basis $\mathcal{B}$. 

(a) Show that N is countable. 

(b) Show that there is some countable subset $\mathcal{B}_1$ of $\mathcal{B}$ such that N is contained in the 
submodule, $N_1$, generated by $\mathcal{B}_1$. Show also that $N_1$ is countable. 

(c) Let $\overline{M} = M/N_1$. Show that $\overline{M}$ is a free Z-module. Deduce that if $\bar{x}$ is any nonzero 
element of $\overline{M}$ then there are only finitely many distinct positive integers k such that 
$\bar{x} = k\bar{m}$ for some $m \in M$ (depending on k). 

(d) Let $\mathcal{S} = \{(b_1, b_2, b_3, \dots) \mid b_i = \pm i! \text{ for all } i\}$. Prove that $\mathcal{S}$ is uncountable. Deduce 
that there is some $s \in \mathcal{S}$ with $s \notin N_1$. 

(e) Show that the assumption M is free leads to a contradiction: By (d) we may choose 
$s \in \mathcal{S}$ with $s \notin N_1$. Show that for each positive integer k there is some $m \in M$ with 
$\bar{s} = k\bar{m}$, contrary to (c). [Use the fact that $N \subseteq N_1$.] 

25. In the construction of direct limits, Exercise 8 of Section 7.6, show that if all $A_i$ are 
R-modules and the maps $\rho_{ij}$ are R-module homomorphisms, then the direct limit $A = \varinjlim A_i$ 
may be given the structure of an R-module in a natural way such that the maps $\rho_i : A_i \to A$ 
are all R-module homomorphisms. Verify the corresponding universal property (part (e)) 
for R-module homomorphisms $\varphi_i : A_i \to C$ commuting with the $\rho_{ij}$. 

26. Carry out the analysis of the preceding exercise corresponding to inverse limits to show 
that an inverse limit of R-modules is an R-module satisfying the appropriate universal 
property (cf. Exercise 10 of Section 7.6). 

27. (Free modules over noncommutative rings need not have a unique rank) Let M be the 
Z-module $\mathbb{Z} \times \mathbb{Z} \times \cdots$ of Exercise 24 and let R be its endomorphism ring, $R = \mathrm{End}_{\mathbb{Z}}(M)$ 
(cf. Exercises 29 and 30 in Section 7.1). Define $\varphi_1, \varphi_2 \in R$ by 

$$\varphi_1(a_1, a_2, a_3, \dots) = (a_1, a_3, a_5, \dots)$$

$$\varphi_2(a_1, a_2, a_3, \dots) = (a_2, a_4, a_6, \dots)$$

(a) Prove that $\{\varphi_1, \varphi_2\}$ is a free basis of the left R-module R. [Define the maps $\psi_1$ and 
$\psi_2$ by $\psi_1(a_1, a_2, \dots) = (a_1, 0, a_2, 0, \dots)$ and $\psi_2(a_1, a_2, \dots) = (0, a_1, 0, a_2, \dots)$. 
Verify that $\varphi_i\psi_i = 1$, $\varphi_1\psi_2 = 0 = \varphi_2\psi_1$ and $\psi_1\varphi_1 + \psi_2\varphi_2 = 1$. Use these relations 
to prove that $\varphi_1, \varphi_2$ are independent and generate R as a left R-module.] 

(b) Use (a) to prove that $R \cong R^2$ and deduce that $R \cong R^n$ for all $n \in \mathbb{Z}^+$. 

10.4 TENSOR PRODUCTS OF MODULES 


In this section we study the tensor product of two modules M and N over a ring (not 
necessarily commutative) containing 1. Formation of the tensor product is a general 
construction that, loosely speaking, enables one to form another module in which one 
can take “products” mn of elements $m \in M$ and $n \in N$. The general construction 
involves various left- and right- module actions, and it is instructive, by way of moti- 
vation, to first consider an important special case: the question of “extending scalars” 
or “changing the base.” 

Suppose that the ring R is a subring of the ring S. Throughout this section, we 
always assume that $1_R = 1_S$ (this ensures that S is a unital R-module). 

If N is a left S-module, then N can also be naturally considered as a left R-module 
since the elements of R (being elements of S) act on N by assumption. The S-module 
axioms for N include the relations 

$$(s_1 + s_2)n = s_1n + s_2n \quad\text{and}\quad s(n_1 + n_2) = sn_1 + sn_2 \tag{10.1}$$

for all $s, s_1, s_2 \in S$ and all $n, n_1, n_2 \in N$, and the relation 

$$(s_1s_2)n = s_1(s_2n) \quad\text{for all } s_1, s_2 \in S \text{ and all } n \in N. \tag{10.2}$$

A particular case of the latter relation is 

$$(sr)n = s(rn) \quad\text{for all } s \in S,\ r \in R \text{ and } n \in N. \tag{10.2'}$$

More generally, if $f : R \to S$ is a ring homomorphism from R into S with $f(1_R) = 1_S$ 
(for example the injection map if R is a subring of S as above) then it is easy to see that 
N can be considered as an R-module with $rn = f(r)n$ for $r \in R$ and $n \in N$. In this 
situation S can be considered as an extension of the ring R and the resulting R-module 
is said to be obtained from N by restriction of scalars from S to R. 

Suppose now that R is a subring of S and we try to reverse this, namely we start 
with an R-module N and attempt to define an S-module structure on N that extends 
the action of R on N to an action of S on N (hence “extending the scalars” from R 
to S). In general this is impossible, even in the simplest situation: the ring R itself is 
an R-module but is usually not an S-module for the larger ring S. For example, Z is 
a Z-module but it cannot be made into a Q-module (if it could, then $\frac{1}{2} \circ 1 = z$ would 
be an element of Z with z + z = 1, which is impossible). Although Z itself cannot be 
made into a Q-module it is contained in a Q-module, namely Q itself. Put another way, 
there is an injection (also called an embedding) of the Z-module Z into the Q-module Q 
(and similarly the ring R can always be embedded as an R-submodule of the S-module 
S). This raises the question of whether an arbitrary R-module N can be embedded as 
an R-submodule of some S-module, or more generally, the question of what R-module 
homomorphisms exist from N to S-modules. For example, suppose N is a nontrivial 
finite abelian group, say $N = \mathbb{Z}/2\mathbb{Z}$, and consider possible Z-module homomorphisms 
(i.e., abelian group homomorphisms) of N into some Q-module. A Q-module is just 
a vector space over Q and every nonzero element in a vector space over Q has infinite 
(additive) order. Since every element of N has finite order, every element of N must 
map to 0 under such a homomorphism. In other words there are no nonzero Z-module 
homomorphisms from this N to any Q-module, much less embeddings of N identifying 
N as a submodule of a Q-module. The two Z-modules Z and Z/2Z exhibit extremely 
different behaviors when we try to “extend scalars” from Z to Q: the first module maps 
injectively into some Q-module, the second always maps to 0 in a Q-module. 

We now construct for a general R-module N an S-module that is the “best possible” 
target in which to try to embed N. We shall also see that this module determines all of 
the possible R-module homomorphisms of N into S-modules, in particular determining 
when N is contained in some S-module (cf. Corollary 9). In the case of $R = \mathbb{Z}$ and 
$S = \mathbb{Q}$ this construction will give us Q when applied to the module $N = \mathbb{Z}$, and will give 
us 0 when applied to the module $N = \mathbb{Z}/2\mathbb{Z}$ (Examples 2 and 3 following Corollary 9). 

If the R-module N were already an S-module then of course there is no difficulty 
in “extending” the scalars from R to S, so we begin the construction by returning to 
the basic module axioms in order to examine whether we can define “products” of the 
form sn, for $s \in S$ and $n \in N$. These axioms start with an abelian group N together 
with a map from $S \times N$ to N, where the image of the pair (s, n) is denoted by sn. It is 
therefore natural to consider the free Z-module (i.e., the free abelian group) on the set 
$S \times N$, i.e., the collection of all finite commuting sums of elements of the form $(s_i, n_i)$ 
where $s_i \in S$ and $n_i \in N$. This is an abelian group where there are no relations between 
any distinct pairs (s, n) and (s', n'), i.e., no relations between the “formal products” 
sn, and in this abelian group the original module N has been thoroughly distinguished 
from the new “coefficients” from S. To satisfy the relations necessary for an S-module 
structure imposed in equation (1) and the compatibility relation with the action of R on 
N in (2'), we must take the quotient of this abelian group by the subgroup H generated 
by all elements of the form 

$$\begin{aligned} &(s_1 + s_2, n) - (s_1, n) - (s_2, n), \\ &(s, n_1 + n_2) - (s, n_1) - (s, n_2), \text{ and} \\ &(sr, n) - (s, rn), \end{aligned} \tag{10.3}$$

for $s, s_1, s_2 \in S$, $n, n_1, n_2 \in N$ and $r \in R$, where rn in the last element refers to the 
R-module structure already defined on N. 

The resulting quotient group is denoted by $S \otimes_R N$ (or just $S \otimes N$ if R is clear from 
the context) and is called the tensor product of S and N over R. If $s \otimes n$ denotes the 
coset containing (s, n) in $S \otimes_R N$ then by definition of the quotient we have forced the 
relations 

$$\begin{aligned} (s_1 + s_2) \otimes n &= s_1 \otimes n + s_2 \otimes n, \\ s \otimes (n_1 + n_2) &= s \otimes n_1 + s \otimes n_2, \text{ and} \\ sr \otimes n &= s \otimes rn. \end{aligned} \tag{10.4}$$

The elements of $S \otimes_R N$ are called tensors and can be written (non-uniquely in general) 
as finite sums of “simple tensors” of the form $s \otimes n$ with $s \in S$, $n \in N$. 
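As a quick check of how the third relation in (4) can collapse simple tensors, consider the case $R = \mathbb{Z}$, $S = \mathbb{Q}$ and $N = \mathbb{Z}/2\mathbb{Z}$ discussed earlier. The following computation is a sketch that uses only the relations in (4), together with the observation that $s \otimes 0 = 0$ for every s (because $s \otimes 0 = s \otimes (0 + 0) = s \otimes 0 + s \otimes 0$). For any $q \in \mathbb{Q}$ and $n \in \mathbb{Z}/2\mathbb{Z}$:

```latex
\[
q \otimes n
  = \Bigl(\tfrac{q}{2}\cdot 2\Bigr) \otimes n
  = \tfrac{q}{2} \otimes (2n)
  = \tfrac{q}{2} \otimes 0
  = 0 .
\]
```

Since every tensor is a finite sum of simple tensors, this indicates that $\mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z} = 0$, in agreement with the earlier remark that every Z-module homomorphism from Z/2Z to a Q-module is zero.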

We now show that the tensor product $S \otimes_R N$ is naturally a left S-module under 
the action defined by 

$$s\Bigl(\sum_{\text{finite}} s_i \otimes n_i\Bigr) = \sum_{\text{finite}} (ss_i) \otimes n_i. \tag{10.5}$$

We first check this is well defined, i.e., independent of the representation of the 
element of $S \otimes_R N$ as a sum of simple tensors. Note first that if s' is any element of S 
then 

$$\begin{aligned} &(s'(s_1 + s_2), n) - (s's_1, n) - (s's_2, n) \quad (= (s's_1 + s's_2, n) - (s's_1, n) - (s's_2, n)), \\ &(s's, n_1 + n_2) - (s's, n_1) - (s's, n_2), \text{ and} \\ &(s'(sr), n) - (s's, rn) \quad (= ((s's)r, n) - (s's, rn)) \end{aligned}$$

each belongs to the set of generators in (3), so in particular each lies in the subgroup 
H. This shows that multiplying the first entries of the generators in (3) on the left by s' 
gives another element of H (in fact another generator). Since any element of H is a sum 
of elements as in (3), it follows that for any element $\sum (s_i, n_i)$ in H also $\sum (s's_i, n_i)$ 
lies in H. Suppose now that $\sum s_i \otimes n_i = \sum s_i' \otimes n_i'$ are two representations for the 
same element in $S \otimes_R N$. Then $\sum (s_i, n_i) - \sum (s_i', n_i')$ is an element of H, and by what 
we have just seen, for any $s \in S$ also $\sum (ss_i, n_i) - \sum (ss_i', n_i')$ is an element of H. But 
this means that $\sum ss_i \otimes n_i = \sum ss_i' \otimes n_i'$ in $S \otimes_R N$, so the expression in (5) is indeed 
well defined. 

It is now straightforward using the relations in (4) to check that the action defined 
in (5) makes $S \otimes_R N$ into a left S-module. For example, on the simple tensor $s_i \otimes n_i$, 

$$\begin{aligned} (s + s')(s_i \otimes n_i) &= ((s + s')s_i) \otimes n_i && \text{by definition (5)} \\ &= (ss_i + s's_i) \otimes n_i \\ &= ss_i \otimes n_i + s's_i \otimes n_i && \text{by the first relation in (4)} \\ &= s(s_i \otimes n_i) + s'(s_i \otimes n_i) && \text{by definition (5)}. \end{aligned}$$


The module $S \otimes_R N$ is called the (left) S-module obtained by extension of scalars 
from the (left) R-module N. 

There is a natural map $\iota : N \to S \otimes_R N$ defined by $n \mapsto 1 \otimes n$ (i.e., first map 
$n \in N$ to the element (1, n) in the free abelian group and then pass to the quotient 
group). Since $1 \otimes rn = r \otimes n = r(1 \otimes n)$ by (4) and (5), it is easy to check that $\iota$ is 
an R-module homomorphism from N to $S \otimes_R N$. Since we have passed to a quotient 
group, however, $\iota$ is not injective in general. Hence, while there is a natural R-module 
homomorphism from the original left R-module N to the left S-module $S \otimes_R N$, in 
general $S \otimes_R N$ need not contain (an isomorphic copy of) N. On the other hand, the 
relations in equation (3) were the minimal relations that we had to impose in order to 
obtain an S-module, so it is reasonable to expect that the tensor product $S \otimes_R N$ is 
the “best possible” S-module to serve as target for an R-module homomorphism from 
N. The next theorem makes this more precise by showing that any other R-module 
homomorphism from N factors through this one, and is referred to as the universal 
property for the tensor product $S \otimes_R N$. The analogous result for the general tensor 
product is given in Theorem 10. 
Theorem 8. Let R be a subring of S, let N be a left R-module and let $\iota : N \to S \otimes_R N$ 
be the R-module homomorphism defined by $\iota(n) = 1 \otimes n$. Suppose that L is any left 
S-module (hence also an R-module) and that $\varphi : N \to L$ is an R-module homomorphism 
from N to L. Then there is a unique S-module homomorphism $\Phi : S \otimes_R N \to L$ such 
that $\varphi$ factors through $\Phi$, i.e., $\varphi = \Phi \circ \iota$ and the diagram 

[Diagram: the triangle formed by $\iota : N \to S \otimes_R N$, $\varphi : N \to L$ and $\Phi : S \otimes_R N \to L$.] 

commutes. Conversely, if $\Phi : S \otimes_R N \to L$ is an S-module homomorphism then 
$\varphi = \Phi \circ \iota$ is an R-module homomorphism from N to L. 

Proof: Suppose $\varphi : N \to L$ is an R-module homomorphism to the S-module L. 
By the universal property of free modules (Theorem 6 in Section 3) there is a Z-module 
homomorphism from the free Z-module F on the set $S \times N$ to L that sends each generator 
(s, n) to $s\varphi(n)$. Since $\varphi$ is an R-module homomorphism, the generators of the subgroup 
H in equation (3) all map to zero in L. Hence this Z-module homomorphism factors 
through H, i.e., there is a well defined Z-module homomorphism $\Phi$ from $F/H = 
S \otimes_R N$ to L satisfying $\Phi(s \otimes n) = s\varphi(n)$. Moreover, on simple tensors we have 

$$s'\Phi(s \otimes n) = s'(s\varphi(n)) = (s's)\varphi(n) = \Phi((s's) \otimes n) = \Phi(s'(s \otimes n))$$

for any $s' \in S$. Since $\Phi$ is additive it follows that $\Phi$ is an S-module homomorphism, 
which proves the existence statement of the theorem. The module $S \otimes_R N$ is generated 
as an S-module by elements of the form $1 \otimes n$, so any S-module homomorphism is 
uniquely determined by its values on these elements. Since $\Phi(1 \otimes n) = \varphi(n)$, it follows 
that the S-module homomorphism $\Phi$ is uniquely determined by $\varphi$, which proves the 
uniqueness statement of the theorem. The converse statement is immediate. 

The universal property of $S \otimes_R N$ in Theorem 8 shows that $R$-module homomorphisms
of $N$ into $S$-modules arise from $S$-module homomorphisms from $S \otimes_R N$. In
particular this determines when it is possible to map $N$ injectively into some $S$-module:

Corollary 9. Let $\iota : N \to S \otimes_R N$ be the $R$-module homomorphism in Theorem 8.
Then $N/\ker\iota$ is the unique largest quotient of $N$ that can be embedded in any $S$-module.
In particular, $N$ can be embedded as an $R$-submodule of some left $S$-module if and only
if $\iota$ is injective (in which case $N$ is isomorphic to the $R$-submodule $\iota(N)$ of the $S$-module
$S \otimes_R N$).


Proof: The quotient $N/\ker\iota$ is mapped injectively (by $\iota$) into the $S$-module $S \otimes_R N$.
Suppose now that $\varphi$ is an $R$-module homomorphism injecting the quotient $N/\ker\varphi$
of $N$ into an $S$-module $L$. Then, by Theorem 8, $\ker\iota$ is mapped to 0 by $\varphi$, i.e.,
$\ker\iota \subseteq \ker\varphi$. Hence $N/\ker\varphi$ is a quotient of $N/\ker\iota$ (namely, the quotient by
the submodule $\ker\varphi/\ker\iota$). It follows that $N/\ker\iota$ is the unique largest quotient of
$N$ that can be embedded in any $S$-module. The last statement in the corollary follows
immediately.




Chap. 10 Introduction to Module Theory 



Examples 

(1) For any ring $R$ and any left $R$-module $N$ we have $R \otimes_R N \cong N$ (so "extending scalars
from $R$ to $R$" does not change the module). This follows by taking $\varphi$ to be the identity
map from $N$ to itself (and $S = R$) in Theorem 8: $\iota$ is then an isomorphism with inverse
isomorphism given by $\Phi$. In particular, if $A$ is any abelian group (i.e., a $\mathbb{Z}$-module),
then $\mathbb{Z} \otimes_{\mathbb{Z}} A \cong A$.

(2) Let $R = \mathbb{Z}$, $S = \mathbb{Q}$ and let $A$ be a finite abelian group of order $n$. In this case the
$\mathbb{Q}$-module $\mathbb{Q} \otimes_{\mathbb{Z}} A$ obtained by extension of scalars from the $\mathbb{Z}$-module $A$ is 0. To see
this, observe first that in any tensor product $1 \otimes 0 = 1 \otimes (0 + 0) = 1 \otimes 0 + 1 \otimes 0$, by
the second relation in (4), so

$$ 1 \otimes 0 = 0. $$

Now, for any simple tensor $q \otimes a$ we can write the rational number $q$ as $(q/n)n$. Then
since $na = 0$ in $A$ by Lagrange's Theorem, we have

$$ q \otimes a = \Bigl(\frac{q}{n} \cdot n\Bigr) \otimes a = \frac{q}{n} \otimes (na) = (q/n) \otimes 0 = (q/n)(1 \otimes 0) = 0. $$

It follows that $\mathbb{Q} \otimes_{\mathbb{Z}} A = 0$. In particular, the map $\iota : A \to S \otimes_R A$ is the zero map.
By Theorem 8, we see again that any homomorphism of a finite abelian group into a
rational vector space is the zero map. In particular, if $A$ is nontrivial, then the original
$\mathbb{Z}$-module $A$ is not contained in the $\mathbb{Q}$-module obtained by extension of scalars.

(3) Extension of scalars for free modules: If $N \cong R^n$ is a free module of rank $n$ over $R$ then
$S \otimes_R N \cong S^n$ is a free module of rank $n$ over $S$. We shall prove this shortly (Corollary
18) when we discuss tensor products of direct sums.
In this case the module obtained by extension of scalars contains (an isomorphic copy
of) the original $R$-module $N$. For example, $\mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Z}^n \cong \mathbb{Q}^n$ and $\mathbb{Z}^n$ is a subgroup of
the abelian group $\mathbb{Q}^n$.

(4) Extension of scalars for vector spaces: As a special case of the previous example, let
$F$ be a subfield of the field $K$ and let $V$ be an $n$-dimensional vector space over $F$ (i.e.,
$V \cong F^n$). Then $K \otimes_F V \cong K^n$ is a vector space over the larger field $K$ of the same
dimension, and the original vector space $V$ is contained in $K \otimes_F V$ as an $F$-vector
subspace.

(5) Induced modules for finite groups: Let $R$ be a commutative ring with 1, let $G$ be a
finite group and let $H$ be a subgroup of $G$. As in Section 7.2 we may form the group
ring $RG$ and its subring $RH$. For any $RH$-module $N$ define the induced module
$RG \otimes_{RH} N$. In this way we obtain an $RG$-module for each $RH$-module $N$. We shall
study properties of induced modules and some of their important applications to group
theory in Chapters 17 and 19.
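
As a concrete instance of the computation in Example 2, the following worked equation takes $A = \mathbb{Z}/6\mathbb{Z}$ (so $n = 6$) and the simple tensor $\tfrac{2}{3} \otimes 5$. The particular group and the particular tensor are chosen here only for illustration.

```latex
% Worked instance of Example 2 with A = Z/6Z, q = 2/3, a = 5 (illustrative choices).
% Write q = (q/n) n with n = 6, so 2/3 = (1/9) * 6, then move the 6 across the tensor sign.
\[
  \tfrac{2}{3} \otimes 5
  \;=\; \bigl(\tfrac{1}{9}\cdot 6\bigr) \otimes 5
  \;=\; \tfrac{1}{9} \otimes (6 \cdot 5)
  \;=\; \tfrac{1}{9} \otimes 0
  \;=\; 0
  \qquad \text{in } \mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Z}/6\mathbb{Z}.
\]
```

The same manipulation applies to every simple tensor, which is why the whole tensor product vanishes.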

The general tensor product construction follows along the same lines as the extension
of scalars above, but before describing it we make two observations from this
special case. The first is that the construction of $S \otimes_R N$ as an abelian group involved
only the elements in equation (3), which in turn only required $S$ to be a right $R$-module
and $N$ to be a left $R$-module. In a similar way we shall construct an abelian group
$M \otimes_R N$ for any right $R$-module $M$ and any left $R$-module $N$. The second observation
is that the $S$-module structure on $S \otimes_R N$ defined by equation (5) required only a left
$S$-module structure on $S$ together with a "compatibility relation"

$$ s'(sr) = (s's)r \quad \text{for } s, s' \in S,\ r \in R, $$

between this left $S$-module structure and the right $R$-module structure on $S$ (this was
needed in order to deduce that (5) was well defined). We first consider the general
construction of $M \otimes_R N$ as an abelian group, after which we shall return to the question
of when this abelian group can be given a module structure.

Suppose then that $N$ is a left $R$-module and that $M$ is a right $R$-module. The
quotient of the free $\mathbb{Z}$-module on the set $M \times N$ by the subgroup generated by all
elements of the form

$$
\begin{aligned}
&(m_1 + m_2, n) - (m_1, n) - (m_2, n), \\
&(m, n_1 + n_2) - (m, n_1) - (m, n_2), \text{ and} \\
&(mr, n) - (m, rn),
\end{aligned}
\tag{10.6}
$$

for $m, m_1, m_2 \in M$, $n, n_1, n_2 \in N$ and $r \in R$ is an abelian group, denoted by $M \otimes_R N$,
or simply $M \otimes N$ if the ring $R$ is clear from the context, and is called the tensor product
of $M$ and $N$ over $R$. The elements of $M \otimes_R N$ are called tensors, and the coset, $m \otimes n$,
of $(m, n)$ in $M \otimes_R N$ is called a simple tensor. We have the relations

$$
\begin{aligned}
(m_1 + m_2) \otimes n &= m_1 \otimes n + m_2 \otimes n, \\
m \otimes (n_1 + n_2) &= m \otimes n_1 + m \otimes n_2, \text{ and} \\
mr \otimes n &= m \otimes rn.
\end{aligned}
\tag{10.7}
$$

Every tensor can be written (non-uniquely in general) as a finite sum of simple tensors.


Remark: We emphasize that care must be taken when working with tensors, since each
$m \otimes n$ represents a coset in some quotient group, and so we may have $m \otimes n = m' \otimes n'$
where $m \ne m'$ or $n \ne n'$. More generally, an element of $M \otimes N$ may be expressible in
many different ways as a sum of simple tensors. In particular, care must be taken when
defining maps from $M \otimes_R N$ to another group or module, since a map from $M \otimes N$
which is described on the generators $m \otimes n$ in terms of $m$ and $n$ is not well defined unless
it is shown to be independent of the particular choice of $m \otimes n$ as a coset representative.

Another point where care must be exercised is in reference to the element $m \otimes n$
when the modules $M$ and $N$ or the ring $R$ are not clear from the context. The first two
examples of extension of scalars give an instance where $M$ is a submodule of a larger
module $M'$, and for some $m \in M$ and $n \in N$ we have $m \otimes n = 0$ in $M' \otimes_R N$ but $m \otimes n$
is nonzero in $M \otimes_R N$. This is possible because the symbol "$m \otimes n$" represents different
cosets, hence possibly different elements, in the two tensor products. In particular, these
two examples show that $M \otimes_R N$ need not be a subgroup of $M' \otimes_R N$ even when $M$
is a submodule of $M'$ (cf. also Exercise 2).
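
To illustrate the first point of the remark, the following short computation (an illustrative example, with $M = N = R = \mathbb{Z}$ chosen here for simplicity) shows three different ordered pairs giving the same simple tensor, using only the relation $mr \otimes n = m \otimes rn$ from (7).

```latex
% Non-uniqueness of the representation of a simple tensor in Z (x)_Z Z (illustrative example).
% Each step moves an integer factor across the tensor sign via  mr (x) n = m (x) rn.
\[
  2 \otimes 3 \;=\; (1 \cdot 2) \otimes 3 \;=\; 1 \otimes (2 \cdot 3) \;=\; 1 \otimes 6
  \;=\; 1 \otimes (3 \cdot 2) \;=\; (1 \cdot 3) \otimes 2 \;=\; 3 \otimes 2 .
\]
% The pairs (2,3), (1,6), (3,2) are distinct elements of Z x Z, yet they have the same coset.
```

A formula such as "$m \otimes n \mapsto m$" therefore cannot define a map on $\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}$, since it would have to send this single element to 2, to 1 and to 3.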

Mapping $M \times N$ to the free $\mathbb{Z}$-module on $M \times N$ and then passing to the quotient
defines a map $\iota : M \times N \to M \otimes_R N$ with $\iota(m, n) = m \otimes n$. This map is in general
not a group homomorphism, but it is additive in both $m$ and $n$ separately and satisfies
$\iota(mr, n) = mr \otimes n = m \otimes rn = \iota(m, rn)$. Such maps are given a name:
Definition. Let $M$ be a right $R$-module, let $N$ be a left $R$-module and let $L$ be an
abelian group (written additively). A map $\varphi : M \times N \to L$ is called R-balanced or
middle linear with respect to R if

$$
\begin{aligned}
\varphi(m_1 + m_2, n) &= \varphi(m_1, n) + \varphi(m_2, n) \\
\varphi(m, n_1 + n_2) &= \varphi(m, n_1) + \varphi(m, n_2) \\
\varphi(m, rn) &= \varphi(mr, n)
\end{aligned}
$$

for all $m, m_1, m_2 \in M$, $n, n_1, n_2 \in N$, and $r \in R$.


With this terminology, it follows immediately from the relations in (7) that the map
$\iota : M \times N \to M \otimes_R N$ is $R$-balanced. The next theorem proves the extremely useful
universal property of the tensor product with respect to balanced maps.

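Theorem 10. Suppose $R$ is a ring with 1, $M$ is a right $R$-module, and $N$ is a left
$R$-module. Let $M \otimes_R N$ be the tensor product of $M$ and $N$ over $R$ and let $\iota : M \times N \to
M \otimes_R N$ be the $R$-balanced map defined above.

(1) If $\Phi : M \otimes_R N \to L$ is any group homomorphism from $M \otimes_R N$ to an abelian
group $L$ then the composite map $\varphi = \Phi \circ \iota$ is an $R$-balanced map from $M \times N$
to $L$.

(2) Conversely, suppose $L$ is an abelian group and $\varphi : M \times N \to L$ is any
$R$-balanced map. Then there is a unique group homomorphism $\Phi : M \otimes_R N \to L$
such that $\varphi$ factors through $\iota$, i.e., $\varphi = \Phi \circ \iota$ as in (1).

Equivalently, the correspondence $\varphi \leftrightarrow \Phi$ in the commutative diagram

$$
\begin{array}{ccc}
M \times N & \xrightarrow{\ \iota\ } & M \otimes_R N \\
 & {\scriptstyle \varphi} \searrow & \big\downarrow {\scriptstyle \Phi} \\
 & & L
\end{array}
$$

establishes a bijection

$$
\left\{ \begin{array}{c} R\text{-balanced maps} \\ \varphi : M \times N \to L \end{array} \right\}
\longleftrightarrow
\left\{ \begin{array}{c} \text{group homomorphisms} \\ \Phi : M \otimes_R N \to L \end{array} \right\}.
$$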

Proof: The proof of (1) is immediate from the properties of $\iota$ above. For (2), the
map $\varphi$ defines a unique $\mathbb{Z}$-module homomorphism $\tilde\varphi$ from the free group on $M \times N$ to
$L$ (Theorem 6 in Section 3) such that $\tilde\varphi(m, n) = \varphi(m, n) \in L$. Since $\varphi$ is $R$-balanced,
$\tilde\varphi$ maps each of the elements in equation (6) to 0; for example

$$ \tilde\varphi((mr, n) - (m, rn)) = \varphi(mr, n) - \varphi(m, rn) = 0. $$

It follows that the kernel of $\tilde\varphi$ contains the subgroup generated by these elements, hence
$\tilde\varphi$ induces a homomorphism $\Phi$ on the quotient group $M \otimes_R N$ to $L$. By definition we
then have

$$ \Phi(m \otimes n) = \tilde\varphi(m, n) = \varphi(m, n), $$

i.e., $\varphi = \Phi \circ \iota$. The homomorphism $\Phi$ is uniquely determined by this equation since
the elements $m \otimes n$ generate $M \otimes_R N$ as an abelian group. This completes the proof.
Theorem 10 is extremely useful in defining homomorphisms on $M \otimes_R N$ since it
replaces the often tedious check that maps defined on simple tensors $m \otimes n$ are well
defined with a check that a related map defined on ordered pairs $(m, n)$ is balanced.

The first consequence of the universal property in Theorem 10 is a characterization
of the tensor product $M \otimes_R N$ as an abelian group:

Corollary 11. Suppose $D$ is an abelian group and $\iota' : M \times N \to D$ is an $R$-balanced
map such that

(i) the image of $\iota'$ generates $D$ as an abelian group, and

(ii) every $R$-balanced map defined on $M \times N$ factors through $\iota'$ as in Theorem 10.
Then there is an isomorphism $f : M \otimes_R N \cong D$ of abelian groups with $\iota' = f \circ \iota$.

Proof: Since $\iota' : M \times N \to D$ is a balanced map, the universal property in (2)
of Theorem 10 implies there is a (unique) homomorphism $f : M \otimes_R N \to D$ with
$\iota' = f \circ \iota$. In particular $\iota'(m, n) = f(m \otimes n)$ for every $m \in M$, $n \in N$. By the first
assumption on $\iota'$, these elements generate $D$ as an abelian group, so $f$ is a surjective
map. Now, the balanced map $\iota : M \times N \to M \otimes_R N$ together with the second
assumption on $\iota'$ implies there is a (unique) homomorphism $g : D \to M \otimes_R N$ with
$\iota = g \circ \iota'$. Then $m \otimes n = (g \circ f)(m \otimes n)$. Since the simple tensors $m \otimes n$ generate
$M \otimes_R N$, it follows that $g \circ f$ is the identity map on $M \otimes_R N$ and so $f$ is injective,
hence an isomorphism. This establishes the corollary.

We now return to the question of giving the abelian group $M \otimes_R N$ a module
structure. As we observed in the special case of extending scalars from $R$ to $S$ for the
$R$-module $N$, the $S$-module structure on $S \otimes_R N$ required only a left $S$-module structure
on $S$ together with the compatibility relation $s'(sr) = (s's)r$ for $s, s' \in S$ and $r \in R$.
In this special case this relation was simply a consequence of the associative law in
the ring $S$. To obtain an $S$-module structure on $M \otimes_R N$ more generally we impose a
similar structure on $M$:

Definition. Let $R$ and $S$ be any rings with 1. An abelian group $M$ is called an $(S, R)$-bimodule
if $M$ is a left $S$-module, a right $R$-module, and $s(mr) = (sm)r$ for all $s \in S$,
$r \in R$ and $m \in M$.

Examples 

(1) Any ring $S$ is an $(S, R)$-bimodule for any subring $R$ with $1_R = 1_S$ by the associativity
of the multiplication in $S$. More generally, if $f : R \to S$ is any ring homomorphism
with $f(1_R) = 1_S$ then $S$ can be considered as a right $R$-module with the action
$s \cdot r = sf(r)$, and with respect to this action $S$ becomes an $(S, R)$-bimodule.

(2) Let $I$ be an ideal (two-sided) in the ring $R$. Then the quotient ring $R/I$ is an $(R/I, R)$-bimodule.
This is easy to see directly and is also a special case of the previous example
(with respect to the canonical projection homomorphism $R \to R/I$).

(3) Suppose that $R$ is a commutative ring. Then a left (respectively, right) $R$-module $M$
can always be given the structure of a right (respectively, left) $R$-module by defining
$mr = rm$ (respectively, $rm = mr$), for all $m \in M$ and $r \in R$, and this makes $M$ into
an $(R, R)$-bimodule. Hence every module (right or left) over a commutative ring $R$
has at least one natural $(R, R)$-bimodule structure.

(4) Suppose that $M$ is a left $S$-module and $R$ is a subring contained in the center of $S$ (for
example, if $S$ is commutative). Then in particular $R$ is commutative so $M$ can be given
a right $R$-module structure as in the previous example. Then for any $s \in S$, $r \in R$ and
$m \in M$ by definition of the right action of $R$ we have

$$ (sm)r = r(sm) = (rs)m = (sr)m = s(rm) = s(mr) $$

(note that we have used the fact that $r$ commutes with $s$ in the middle equality). Hence
$M$ is an $(S, R)$-bimodule with respect to this definition of the right action of $R$.

Since the situation in Example 3 occurs so frequently, we give this bimodule structure
a name:

Definition. Suppose $M$ is a left (or right) $R$-module over the commutative ring $R$.
Then the $(R, R)$-bimodule structure on $M$ defined by letting the left and right $R$-actions
coincide, i.e., $mr = rm$ for all $m \in M$ and $r \in R$, will be called the standard $R$-module
structure on $M$.

Suppose now that $N$ is a left $R$-module and $M$ is an $(S, R)$-bimodule. Then just as
in the example of extension of scalars the $(S, R)$-bimodule structure on $M$ implies that

$$ s\Bigl(\sum_{\text{finite}} m_i \otimes n_i\Bigr) = \sum_{\text{finite}} (s m_i) \otimes n_i \tag{10.8} $$

gives a well defined action of $S$ under which $M \otimes_R N$ is a left $S$-module. Note that
Theorem 10 may be used to give an alternate proof that (8) is well defined, replacing
the direct calculations on the relations defining the tensor product with the easier check
that a map is $R$-balanced, as follows. It is very easy to see that for each fixed $s \in S$
the map $(m, n) \mapsto sm \otimes n$ is an $R$-balanced map from $M \times N$ to $M \otimes_R N$. By
Theorem 10 there is a well defined group homomorphism $\lambda_s$ from $M \otimes_R N$ to itself
such that $\lambda_s(m \otimes n) = sm \otimes n$. Since the right side of (8) is then $\lambda_s(\sum m_i \otimes n_i)$,
the fact that $\lambda_s$ is well defined shows that this expression is indeed independent of
the representation of the tensor $\sum m_i \otimes n_i$ as a sum of simple tensors. Because $\lambda_s$ is
additive, equation (8) holds.

By a completely parallel argument, if $M$ is a right $R$-module and $N$ is an $(R, S)$-bimodule
then the tensor product $M \otimes_R N$ has the structure of a right $S$-module, where
$(\sum m_i \otimes n_i)s = \sum m_i \otimes (n_i s)$.

Before giving some more examples of tensor products it is worthwhile to highlight
one frequently encountered special case of the previous discussion, namely the case
when $M$ and $N$ are two left modules over a commutative ring $R$ and $S = R$ (in some
works on tensor products this is the only case considered). Then the standard $R$-module
structure on $M$ defined previously gives $M$ the structure of an $(R, R)$-bimodule, so in
this case the tensor product $M \otimes_R N$ always has the structure of a left $R$-module.

The corresponding map $\iota : M \times N \to M \otimes_R N$ maps $M \times N$ into an $R$-module
and is additive in each factor. Since $r(m \otimes n) = rm \otimes n = mr \otimes n = m \otimes rn$ it also
satisfies

$$ r\iota(m, n) = \iota(rm, n) = \iota(m, rn). $$
Such maps are given a name:

Definition. Let $R$ be a commutative ring with 1 and let $M$, $N$, and $L$ be left $R$-modules.
The map $\varphi : M \times N \to L$ is called R-bilinear if it is $R$-linear in each factor, i.e., if

$$
\begin{aligned}
\varphi(r_1 m_1 + r_2 m_2, n) &= r_1 \varphi(m_1, n) + r_2 \varphi(m_2, n), \text{ and} \\
\varphi(m, r_1 n_1 + r_2 n_2) &= r_1 \varphi(m, n_1) + r_2 \varphi(m, n_2)
\end{aligned}
$$

for all $m, m_1, m_2 \in M$, $n, n_1, n_2 \in N$ and $r_1, r_2 \in R$.


With this terminology Theorem 10 gives 

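Corollary 12. Suppose $R$ is a commutative ring. Let $M$ and $N$ be two left $R$-modules
and let $M \otimes_R N$ be the tensor product of $M$ and $N$ over $R$, where $M$ is given the standard
$R$-module structure. Then $M \otimes_R N$ is a left $R$-module with

$$ r(m \otimes n) = (rm) \otimes n = (mr) \otimes n = m \otimes (rn), $$

and the map $\iota : M \times N \to M \otimes_R N$ with $\iota(m, n) = m \otimes n$ is an $R$-bilinear map. If $L$
is any left $R$-module then there is a bijection

$$
\left\{ \begin{array}{c} R\text{-bilinear maps} \\ \varphi : M \times N \to L \end{array} \right\}
\longleftrightarrow
\left\{ \begin{array}{c} R\text{-module homomorphisms} \\ \Phi : M \otimes_R N \to L \end{array} \right\}
$$

where the correspondence between $\varphi$ and $\Phi$ is given by the commutative diagram

$$
\begin{array}{ccc}
M \times N & \xrightarrow{\ \iota\ } & M \otimes_R N \\
 & {\scriptstyle \varphi} \searrow & \big\downarrow {\scriptstyle \Phi} \\
 & & L
\end{array}
$$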

Proof: We have shown $M \otimes_R N$ is an $R$-module and that $\iota$ is bilinear. It remains
only to check that in the bijective correspondence in Theorem 10 the bilinear maps
correspond with the $R$-module homomorphisms. If $\varphi : M \times N \to L$ is bilinear then it
is an $R$-balanced map, so the corresponding $\Phi : M \otimes_R N \to L$ is a group homomorphism.
Moreover, on simple tensors $\Phi((rm) \otimes n) = \varphi(rm, n) = r\varphi(m, n) = r\Phi(m \otimes n)$,
where the middle equality holds because $\varphi$ is $R$-linear in the first variable. Since $\Phi$ is
additive this extends to sums of simple tensors to show $\Phi$ is an $R$-module homomorphism.
Conversely, if $\Phi$ is an $R$-module homomorphism it is an exercise to see that the
corresponding balanced map $\varphi$ is bilinear.

Examples 

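(1) In any tensor product $M \otimes_R N$ we have $m \otimes 0 = m \otimes (0 + 0) = (m \otimes 0) + (m \otimes 0)$,
so $m \otimes 0 = 0$. Likewise $0 \otimes n = 0$.

(2) We have $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/3\mathbb{Z} = 0$, since $3a = a$ for $a \in \mathbb{Z}/2\mathbb{Z}$ so that

$$ a \otimes b = 3a \otimes b = a \otimes 3b = a \otimes 0 = 0 $$

and every simple tensor is reduced to 0. In particular $1 \otimes 1 = 0$. It follows that there
are no nonzero balanced (or bilinear) maps from $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/3\mathbb{Z}$ to any abelian group.
On the other hand, consider the tensor product $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$, which is generated
as an abelian group by the elements $0 \otimes 0 = 1 \otimes 0 = 0 \otimes 1 = 0$ and $1 \otimes 1$. In this case
$1 \otimes 1 \ne 0$ since, for example, the map $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ defined by $(a, b) \mapsto ab$
is clearly nonzero and linear in both $a$ and $b$. Since $2(1 \otimes 1) = 2 \otimes 1 = 0 \otimes 1 = 0$,
the element $1 \otimes 1$ is of order 2. Hence $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z} \cong \mathbb{Z}/2\mathbb{Z}$.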

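(3) In general,

$$ \mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z} \cong \mathbb{Z}/d\mathbb{Z}, $$

where $d$ is the g.c.d. of the integers $m$ and $n$. To see this, observe first that

$$ a \otimes b = a \otimes (b \cdot 1) = (ab) \otimes 1 = ab(1 \otimes 1), $$

from which it follows that $\mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z}$ is a cyclic group with $1 \otimes 1$ as generator.
Since $m(1 \otimes 1) = m \otimes 1 = 0 \otimes 1 = 0$ and similarly $n(1 \otimes 1) = 1 \otimes n = 0$, and since
$d = um + vn$ for some integers $u$ and $v$, we have
$d(1 \otimes 1) = 0$, so the cyclic group has order dividing $d$. The map $\varphi : \mathbb{Z}/m\mathbb{Z} \times \mathbb{Z}/n\mathbb{Z} \to
\mathbb{Z}/d\mathbb{Z}$ defined by $\varphi(a \bmod m, b \bmod n) = ab \bmod d$ is well defined since $d$ divides
both $m$ and $n$. It is clearly $\mathbb{Z}$-bilinear. The induced map $\Phi : \mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z} \to \mathbb{Z}/d\mathbb{Z}$
from Corollary 12 maps $1 \otimes 1$ to the element $1 \in \mathbb{Z}/d\mathbb{Z}$, which is an element of order
$d$. In particular $\mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z}$ has order at least $d$. Hence $1 \otimes 1$ is an element of
order $d$ and $\Phi$ gives an isomorphism $\mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z} \cong \mathbb{Z}/d\mathbb{Z}$.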

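(4) In $\mathbb{Q}/\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Q}/\mathbb{Z}$ a simple tensor has the form $(a/b \bmod \mathbb{Z}) \otimes (c/d \bmod \mathbb{Z})$ for some
rational numbers $a/b$ and $c/d$. Then

$$
\begin{aligned}
\Bigl(\frac{a}{b} \bmod \mathbb{Z}\Bigr) \otimes \Bigl(\frac{c}{d} \bmod \mathbb{Z}\Bigr)
 &= d\Bigl(\frac{a}{bd} \bmod \mathbb{Z}\Bigr) \otimes \Bigl(\frac{c}{d} \bmod \mathbb{Z}\Bigr) \\
 &= \Bigl(\frac{a}{bd} \bmod \mathbb{Z}\Bigr) \otimes d\Bigl(\frac{c}{d} \bmod \mathbb{Z}\Bigr)
  = \Bigl(\frac{a}{bd} \bmod \mathbb{Z}\Bigr) \otimes 0 = 0
\end{aligned}
$$

and so

$$ \mathbb{Q}/\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Q}/\mathbb{Z} = 0. $$

In a similar way, $A \otimes_{\mathbb{Z}} B = 0$ for any divisible abelian group $A$ and torsion abelian
group $B$ (an abelian group in which every element has finite order). For example

$$ \mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Q}/\mathbb{Z} = 0. $$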

(5) The structure of a tensor product can vary considerably depending on the ring over
which the tensors are taken. For example $\mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Q}$ and $\mathbb{Q} \otimes_{\mathbb{Q}} \mathbb{Q}$ are isomorphic as left
$\mathbb{Q}$-modules (both are one dimensional vector spaces over $\mathbb{Q}$; cf. the exercises). On
the other hand we shall see at the end of this section that $\mathbb{C} \otimes_{\mathbb{C}} \mathbb{C}$ and $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ are
not isomorphic $\mathbb{C}$-modules (the former is a 1-dimensional vector space over $\mathbb{C}$ and the
latter is 2-dimensional over $\mathbb{C}$).

(6) General extension of scalars or change of base: Let $f : R \to S$ be a ring homomorphism
with $f(1_R) = 1_S$. Then $s \cdot r = sf(r)$ gives $S$ the structure of a right $R$-module
with respect to which $S$ is an $(S, R)$-bimodule. Then for any left $R$-module $N$, the
resulting tensor product $S \otimes_R N$ is a left $S$-module obtained by changing the base
from $R$ to $S$. This gives a slight generalization of the notion of extension of scalars
(where $R$ was a subring of $S$).

(7) Let $f : R \to S$ be a ring homomorphism as in the preceding example. Then we
have $S \otimes_R R \cong S$ as left $S$-modules, as follows. The map $\varphi : S \times R \to S$ defined
by $(s, r) \mapsto sr$ (where $sr = sf(r)$ by definition of the right $R$-action on $S$), is an
$R$-balanced map, as is easily checked. For example,

$$ \varphi(s_1 + s_2, r) = (s_1 + s_2)r = s_1 r + s_2 r = \varphi(s_1, r) + \varphi(s_2, r) $$

and

$$ \varphi(sr, r') = (sr)r' = s(rr') = \varphi(s, rr'). $$

By Theorem 10 we have an associated group homomorphism $\Phi : S \otimes_R R \to S$ with
$\Phi(s \otimes r) = sr$. Since $\Phi(s'(s \otimes r)) = \Phi(s's \otimes r) = s'sr = s'\Phi(s \otimes r)$, it follows that
$\Phi$ is also an $S$-module homomorphism. The map $\Phi' : S \to S \otimes_R R$ with $s \mapsto s \otimes 1$ is
an $S$-module homomorphism that is inverse to $\Phi$ because $\Phi \circ \Phi'(s) = \Phi(s \otimes 1) = s$
gives $\Phi\Phi' = 1$, and

$$ \Phi' \circ \Phi(s \otimes r) = \Phi'(sr) = sr \otimes 1 = s \otimes r $$

shows that $\Phi'\Phi$ is the identity on simple tensors, hence $\Phi'\Phi = 1$.

(8) Let $R$ be a ring (not necessarily commutative), let $I$ be a two sided ideal in $R$, and let $N$
be a left $R$-module. Then as previously mentioned, $R/I$ is an $(R/I, R)$-bimodule, so
the tensor product $R/I \otimes_R N$ is a left $R/I$-module. This is an example of "extension
of scalars" with respect to the natural projection homomorphism $R \to R/I$.

Define

$$ IN = \Bigl\{ \sum_{\text{finite}} a_i n_i \;\Big|\; a_i \in I,\ n_i \in N \Bigr\}, $$

which is easily seen to be a left $R$-submodule of $N$ (cf. Exercise 5, Section 1). Then

$$ (R/I) \otimes_R N \cong N/IN, $$

as left $R$-modules, as follows. The tensor product is generated as an abelian group by
the simple tensors $(r \bmod I) \otimes n = r(1 \otimes n)$ for $r \in R$ and $n \in N$ (viewing the
$R/I$-module tensor product as an $R$-module on which $I$ acts trivially). Hence the elements
$1 \otimes n$ generate $(R/I) \otimes_R N$ as an $R/I$-module. The map $N \to (R/I) \otimes_R N$ defined
by $n \mapsto 1 \otimes n$ is a left $R$-module homomorphism and, by the previous observation,
is surjective. Under this map $a_i n_i$ with $a_i \in I$ and $n_i \in N$ maps to $1 \otimes a_i n_i =
a_i \otimes n_i = 0$, and so $IN$ is contained in the kernel. This induces a surjective $R$-module
homomorphism $f : N/IN \to (R/I) \otimes_R N$ with $f(n \bmod IN) = 1 \otimes n$. We show $f$
is an isomorphism by exhibiting its inverse. The map $(R/I) \times N \to N/IN$ defined
by mapping $(r \bmod I, n)$ to $(rn \bmod IN)$ is well defined and easily checked to be
$R$-balanced. It follows by Theorem 10 that there is an associated group homomorphism
$g : (R/I) \otimes N \to N/IN$ with $g((r \bmod I) \otimes n) = rn \bmod IN$. As usual, $fg = 1$ and
$gf = 1$, so $f$ is a bijection and $(R/I) \otimes_R N \cong N/IN$, as claimed.

As an example, let $R = \mathbb{Z}$ with ideal $I = m\mathbb{Z}$ and let $N$ be the $\mathbb{Z}$-module $\mathbb{Z}/n\mathbb{Z}$.
Then $IN = m(\mathbb{Z}/n\mathbb{Z}) = (m\mathbb{Z} + n\mathbb{Z})/n\mathbb{Z} = d\mathbb{Z}/n\mathbb{Z}$ where $d$ is the g.c.d. of $m$ and $n$.
Then $N/IN \cong \mathbb{Z}/d\mathbb{Z}$ and we recover the isomorphism $\mathbb{Z}/m\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/n\mathbb{Z} \cong \mathbb{Z}/d\mathbb{Z}$ of
Example 3 above.
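
The isomorphisms in Examples 3 and 8 can be checked numerically for small values. The following is a minimal sketch, not part of the text's argument: the function names are hypothetical, and the second function relies on the observation from Example 3 that a bilinear map on $\mathbb{Z}/m\mathbb{Z} \times \mathbb{Z}/n\mathbb{Z}$ is determined by the image $c$ of $(1, 1)$. By Corollary 12, the number of such maps into $\mathbb{Z}/k\mathbb{Z}$ should equal the number of homomorphisms $\mathbb{Z}/d\mathbb{Z} \to \mathbb{Z}/k\mathbb{Z}$, which is $\gcd(d, k)$.

```python
from math import gcd


def order_of_N_mod_IN(m: int, n: int) -> int:
    """Order of N/IN for R = Z, I = mZ, N = Z/nZ, found by listing IN = m(Z/nZ)."""
    submodule_IN = {(m * x) % n for x in range(n)}
    return n // len(submodule_IN)


def count_bilinear_maps(m: int, n: int, k: int) -> int:
    """Count Z-bilinear maps Z/mZ x Z/nZ -> Z/kZ.

    Every such map has the form (a, b) -> a*b*c with c the image of (1, 1),
    so we count the c in Z/kZ for which this formula does not depend on
    the integer representatives chosen for a and b.
    """
    count = 0
    for c in range(k):
        well_defined = all(
            ((a + m) * b * c - a * b * c) % k == 0
            and (a * (b + n) * c - a * b * c) % k == 0
            # representatives 1..m and 1..n, so that a = 1 and b = 1 always occur
            for a in range(1, m + 1)
            for b in range(1, n + 1)
        )
        if well_defined:
            count += 1
    return count


if __name__ == "__main__":
    for m, n in [(4, 6), (9, 12), (5, 7)]:
        d = gcd(m, n)
        print(m, n, "|N/IN| =", order_of_N_mod_IN(m, n), "gcd =", d)
        print("  bilinear maps into Z/12Z:", count_bilinear_maps(m, n, 12),
              "expected:", gcd(d, 12))
```

The brute-force check is only feasible for small $m$, $n$ and $k$; it is meant to make the universal property tangible rather than to compute tensor products in general.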


We now establish some of the basic properties of tensor products. Note the frequent
application of Theorem 10 to establish the existence of homomorphisms.

Theorem 13. (The "Tensor Product" of Two Homomorphisms) Let $M$, $M'$ be right
$R$-modules, let $N$, $N'$ be left $R$-modules, and suppose $\varphi : M \to M'$ and $\psi : N \to N'$
are $R$-module homomorphisms.

(1) There is a unique group homomorphism, denoted by $\varphi \otimes \psi$, mapping $M \otimes_R N$
into $M' \otimes_R N'$ such that $(\varphi \otimes \psi)(m \otimes n) = \varphi(m) \otimes \psi(n)$ for all $m \in M$ and
$n \in N$.

(2) If $M$, $M'$ are also $(S, R)$-bimodules for some ring $S$ and $\varphi$ is also an $S$-module
homomorphism, then $\varphi \otimes \psi$ is a homomorphism of left $S$-modules. In particular,
if $R$ is commutative then $\varphi \otimes \psi$ is always an $R$-module homomorphism for the
standard $R$-module structures.

(3) If $\lambda : M' \to M''$ and $\mu : N' \to N''$ are $R$-module homomorphisms then

$$ (\lambda \otimes \mu) \circ (\varphi \otimes \psi) = (\lambda \circ \varphi) \otimes (\mu \circ \psi). $$

Proof: The map $(m, n) \mapsto \varphi(m) \otimes \psi(n)$ from $M \times N$ to $M' \otimes_R N'$ is clearly
$R$-balanced, so (1) follows immediately from Theorem 10.

In (2) the definition of the (left) action of $S$ on $M$ together with the assumption that
$\varphi$ is an $S$-module homomorphism imply that on simple tensors

$$ (\varphi \otimes \psi)(s(m \otimes n)) = (\varphi \otimes \psi)(sm \otimes n) = \varphi(sm) \otimes \psi(n) = s\varphi(m) \otimes \psi(n). $$

Since $\varphi \otimes \psi$ is additive, this extends to sums of simple tensors to show that $\varphi \otimes \psi$ is
an $S$-module homomorphism. This gives (2).

The uniqueness condition in Theorem 10 implies (3), which completes the proof.
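
For free modules of finite rank over a commutative ring, the map $\varphi \otimes \psi$ can be made concrete. The sketch below assumes (this is standard, but not established in the text above) that with respect to the bases $e_i \otimes f_j$ in lexicographic order the matrix of $\varphi \otimes \psi$ is the Kronecker product of the matrices of $\varphi$ and $\psi$. The helper names `matmul` and `kron` and the sample integer matrices are illustrative choices. Under that assumption, parts (1) and (3) of Theorem 13 become matrix identities that can be checked directly.

```python
def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))]
            for i in range(len(A))]


def kron(A, B):
    """Kronecker product of A and B.

    Rows are indexed by pairs (i, k) and columns by pairs (j, l), both in
    lexicographic order, with entry A[i][j] * B[k][l].
    """
    return [[A[i][j] * B[k][l]
             for j in range(len(A[0])) for l in range(len(B[0]))]
            for i in range(len(A)) for k in range(len(B))]


if __name__ == "__main__":
    phi = [[1, 2], [3, 4]]
    psi = [[0, 1], [1, 1]]
    lam = [[2, 0], [1, 1]]
    mu = [[1, 1], [0, 2]]

    # Part (1): (phi (x) psi)(m (x) n) = phi(m) (x) psi(n), with column vectors m, n.
    m = [[1], [2]]
    n = [[3], [-1]]
    print(matmul(kron(phi, psi), kron(m, n)) == kron(matmul(phi, m), matmul(psi, n)))

    # Part (3): (lam (x) mu) o (phi (x) psi) = (lam o phi) (x) (mu o psi).
    print(matmul(kron(lam, mu), kron(phi, psi)) == kron(matmul(lam, phi), matmul(mu, psi)))
```

Both comparisons are instances of the mixed-product property of the Kronecker product, which is the matrix form of part (3).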

The next result shows that we may write $M \otimes N \otimes L$, or more generally, an $n$-fold
tensor product $M_1 \otimes M_2 \otimes \cdots \otimes M_n$, unambiguously whenever it is defined.

Theorem 14. (Associativity of the Tensor Product) Suppose $M$ is a right $R$-module, $N$
is an $(R, T)$-bimodule, and $L$ is a left $T$-module. Then there is a unique isomorphism

$$ (M \otimes_R N) \otimes_T L \cong M \otimes_R (N \otimes_T L) $$

of abelian groups such that $(m \otimes n) \otimes l \mapsto m \otimes (n \otimes l)$. If $M$ is an $(S, R)$-bimodule,
then this is an isomorphism of $S$-modules.

Proof: Note first that the $(R, T)$-bimodule structure on $N$ makes $M \otimes_R N$ into a
right $T$-module and $N \otimes_T L$ into a left $R$-module, so both sides of the isomorphism are
well defined. For each fixed $l \in L$, the mapping $(m, n) \mapsto m \otimes (n \otimes l)$ is $R$-balanced,
so by Theorem 10 there is a homomorphism $M \otimes_R N \to M \otimes_R (N \otimes_T L)$ with
$m \otimes n \mapsto m \otimes (n \otimes l)$. This shows that the map from $(M \otimes_R N) \times L$ to $M \otimes_R (N \otimes_T L)$
given by $(m \otimes n, l) \mapsto m \otimes (n \otimes l)$ is well defined. Since it is easily seen to be
$T$-balanced, another application of Theorem 10 implies that it induces a homomorphism
$(M \otimes_R N) \otimes_T L \to M \otimes_R (N \otimes_T L)$ such that $(m \otimes n) \otimes l \mapsto m \otimes (n \otimes l)$. In a
similar way we can construct a homomorphism in the opposite direction that is inverse
to this one. This proves the group isomorphism.

Assume in addition $M$ is an $(S, R)$-bimodule. Then for $s \in S$ and $t \in T$ we have

$$ s((m \otimes n)t) = s(m \otimes nt) = sm \otimes nt = (sm \otimes n)t = (s(m \otimes n))t $$

so that $M \otimes_R N$ is an $(S, T)$-bimodule. Hence $(M \otimes_R N) \otimes_T L$ is a left $S$-module.
Since $N \otimes_T L$ is a left $R$-module, also $M \otimes_R (N \otimes_T L)$ is a left $S$-module. The group
isomorphism just established is easily seen to be a homomorphism of left $S$-modules
by the same arguments used in previous proofs: it is additive and is $S$-linear on simple
tensors since $s((m \otimes n) \otimes l) = s(m \otimes n) \otimes l = (sm \otimes n) \otimes l$ maps to the element
$sm \otimes (n \otimes l) = s(m \otimes (n \otimes l))$. The proof is complete.
Corollary 15. Suppose $R$ is commutative and $M$, $N$, and $L$ are left $R$-modules. Then

$$ (M \otimes N) \otimes L \cong M \otimes (N \otimes L) $$

as $R$-modules for the standard $R$-module structures on $M$, $N$ and $L$.

There is a natural extension of the notion of a bilinear map: 

Definition. Let $R$ be a commutative ring with 1 and let $M_1, M_2, \ldots, M_n$ and $L$ be
$R$-modules with the standard $R$-module structures. A map $\varphi : M_1 \times \cdots \times M_n \to L$ is
called n-multilinear over R (or simply multilinear if $n$ and $R$ are clear from the context)
if it is an $R$-module homomorphism in each component when the other component
entries are kept constant, i.e., for each $i$

$$
\begin{aligned}
\varphi(m_1, \ldots, m_{i-1}, &\, r m_i + r' m_i', m_{i+1}, \ldots, m_n) \\
 &= r\varphi(m_1, \ldots, m_i, \ldots, m_n) + r'\varphi(m_1, \ldots, m_i', \ldots, m_n)
\end{aligned}
$$

for all $m_i, m_i' \in M_i$ and $r, r' \in R$. When $n = 2$ (respectively, 3) one says $\varphi$ is bilinear
(respectively trilinear) rather than 2-multilinear (or 3-multilinear).

One may construct the $n$-fold tensor product $M_1 \otimes M_2 \otimes \cdots \otimes M_n$ from first
principles and prove its analogous universal property with respect to multilinear maps
from $M_1 \times \cdots \times M_n$ to $L$. By the previous theorem and corollary, however, an $n$-fold
tensor product may be obtained unambiguously by iterating the tensor product of
pairs of modules since any bracketing of $M_1 \otimes \cdots \otimes M_n$ into tensor products of pairs
gives an isomorphic $R$-module. The universal property of the tensor product of a pair
of modules in Theorem 10 and Corollary 12 then implies that multilinear maps factor
uniquely through the $R$-module $M_1 \otimes \cdots \otimes M_n$, i.e., this tensor product is the universal
object with respect to multilinear functions:

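Corollary 16. Let $R$ be a commutative ring and let $M_1, \ldots, M_n, L$ be $R$-modules. Let
$M_1 \otimes M_2 \otimes \cdots \otimes M_n$ denote any bracketing of the tensor product of these modules and
let

$$ \iota : M_1 \times \cdots \times M_n \to M_1 \otimes \cdots \otimes M_n $$

be the map defined by $\iota(m_1, \ldots, m_n) = m_1 \otimes \cdots \otimes m_n$. Then

(1) for every $R$-module homomorphism $\Phi : M_1 \otimes \cdots \otimes M_n \to L$ the map $\varphi = \Phi \circ \iota$
is $n$-multilinear from $M_1 \times \cdots \times M_n$ to $L$, and

(2) if $\varphi : M_1 \times \cdots \times M_n \to L$ is an $n$-multilinear map then there is a unique
$R$-module homomorphism $\Phi : M_1 \otimes \cdots \otimes M_n \to L$ such that $\varphi = \Phi \circ \iota$.

Hence there is a bijection

$$
\left\{ \begin{array}{c} n\text{-multilinear maps} \\ \varphi : M_1 \times \cdots \times M_n \to L \end{array} \right\}
\longleftrightarrow
\left\{ \begin{array}{c} R\text{-module homomorphisms} \\ \Phi : M_1 \otimes \cdots \otimes M_n \to L \end{array} \right\}
$$

with respect to which the following diagram commutes:

$$
\begin{array}{ccc}
M_1 \times \cdots \times M_n & \xrightarrow{\ \iota\ } & M_1 \otimes \cdots \otimes M_n \\
 & {\scriptstyle \varphi} \searrow & \big\downarrow {\scriptstyle \Phi} \\
 & & L
\end{array}
$$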
We have already seen examples where $M_1 \otimes_R N$ is not contained in $M \otimes_R N$
even when $M_1$ is an $R$-submodule of $M$. The next result shows in particular that (an
isomorphic copy of) $M_1 \otimes_R N$ is contained in $M \otimes_R N$ if $M_1$ is an $R$-module direct
summand of $M$.

Theorem 17. (Tensor Products of Direct Sums) Let $M$, $M'$ be right $R$-modules and let
$N$, $N'$ be left $R$-modules. Then there are unique group isomorphisms

$$
\begin{aligned}
(M \oplus M') \otimes_R N &\cong (M \otimes_R N) \oplus (M' \otimes_R N) \\
M \otimes_R (N \oplus N') &\cong (M \otimes_R N) \oplus (M \otimes_R N')
\end{aligned}
$$

such that $(m, m') \otimes n \mapsto (m \otimes n, m' \otimes n)$ and $m \otimes (n, n') \mapsto (m \otimes n, m \otimes n')$ respectively.
If $M$, $M'$ are also $(S, R)$-bimodules, then these are isomorphisms of left $S$-modules. In
particular, if $R$ is commutative, these are isomorphisms of $R$-modules.

Proof: The map $(M \oplus M') \times N \to (M \otimes_R N) \oplus (M' \otimes_R N)$ defined by $((m, m'), n) \mapsto
(m \otimes n, m' \otimes n)$ is well defined since $m$ and $m'$ in $M \oplus M'$ are uniquely defined in
the direct sum. The map is clearly $R$-balanced, so induces a homomorphism $f$ from
$(M \oplus M') \otimes N$ to $(M \otimes_R N) \oplus (M' \otimes_R N)$ with

$$ f((m, m') \otimes n) = (m \otimes n, m' \otimes n). $$

In the other direction, the $R$-balanced maps $M \times N \to (M \oplus M') \otimes_R N$ and $M' \times N \to
(M \oplus M') \otimes_R N$ given by $(m, n) \mapsto (m, 0) \otimes n$ and $(m', n) \mapsto (0, m') \otimes n$, respectively,
define homomorphisms from $M \otimes_R N$ and $M' \otimes_R N$ to $(M \oplus M') \otimes_R N$. These in turn
give a homomorphism $g$ from the direct sum $(M \otimes_R N) \oplus (M' \otimes_R N)$ to $(M \oplus M') \otimes_R N$
with

$$ g((m \otimes n_1, m' \otimes n_2)) = (m, 0) \otimes n_1 + (0, m') \otimes n_2. $$

An easy check shows that $f$ and $g$ are inverse homomorphisms and are $S$-module
isomorphisms when $M$ and $M'$ are $(S, R)$-bimodules. This completes the proof.

The previous theorem clearly extends by induction to any finite direct sum of
$R$-modules. The corresponding result is also true for arbitrary direct sums. For example

$$ M \otimes \Bigl(\bigoplus_{i \in I} N_i\Bigr) \cong \bigoplus_{i \in I} (M \otimes N_i), $$

where $I$ is any index set (cf. the exercises). This result is referred to by saying that
tensor products commute with direct sums.
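
As an illustration of how Theorem 17 combines with the earlier examples, the following short computation (with the specific groups chosen here only as an example) decomposes a tensor product over $\mathbb{Z}$ summand by summand.

```latex
% Illustrative computation over R = Z, using Theorem 17, then Z (x) A = A
% (Example 1 after Corollary 9) and Z/mZ (x) Z/nZ = Z/dZ with d = gcd(m, n) (Example 3).
\[
  (\mathbb{Z} \oplus \mathbb{Z}/6\mathbb{Z}) \otimes_{\mathbb{Z}} \mathbb{Z}/4\mathbb{Z}
  \;\cong\; (\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/4\mathbb{Z})
     \oplus (\mathbb{Z}/6\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/4\mathbb{Z})
  \;\cong\; \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z},
\]
% since gcd(6, 4) = 2.
```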

Corollary 18. (Extension of Scalars for Free Modules) The module obtained from the
free $R$-module $N \cong R^n$ by extension of scalars from $R$ to $S$ is the free $S$-module $S^n$,
i.e.,

$$ S \otimes_R R^n \cong S^n $$

as left $S$-modules.

Proof: This follows immediately from Theorem 17 and the isomorphism $S \otimes_R R \cong
S$ proved in Example 7 previously.
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Corollary 19. Let R be a commutative ring and let M = R s and N = R 1 be free 

/^-modules with bases /«i , . ,.,m s and n\ n,, respectively. Then M® R AT is a free 

R -module of rank st y with basis /h, ® nj, 1 < i < s and 1 < j < r, i.e., 

Remark : More generally, the tensor product of two free modules of arbitrary rank over 
a commutative ring is free (cf. the exercises). 

Proof: This follows easily from Theorem 17 and the first example following Corol- 
lary 9. 
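
The basis statement in Corollary 19 can be made concrete by modelling simple tensors of coordinate vectors as Kronecker products. The sketch below is only an illustration under stated assumptions: it takes $R = \mathbb{Z}$, $s = 2$, $t = 3$, and identifies $m \otimes n$ with the length-$st$ vector of products $m_i n_j$; the helper names `kron` and `std_basis` are ours and do not come from the text.

```python
from itertools import product

def kron(m, n):
    """Coordinates of the simple tensor m (x) n in the basis m_i (x) n_j."""
    return [a * b for a, b in product(m, n)]

def std_basis(k):
    """Standard basis vectors of Z^k."""
    return [[1 if i == j else 0 for j in range(k)] for i in range(k)]

s, t = 2, 3
tensor_basis = [kron(m, n) for m in std_basis(s) for n in std_basis(t)]

# The s*t simple tensors m_i (x) n_j are exactly the standard basis of Z^(st).
assert tensor_basis == std_basis(s * t)

# Bilinearity in the first factor: (2*m1 + m2) (x) n = 2*(m1 (x) n) + (m2 (x) n).
m1, m2, n = [1, 0], [0, 1], [4, 5, 6]
lhs = kron([2 * a + b for a, b in zip(m1, m2)], n)
rhs = [2 * x + y for x, y in zip(kron(m1, n), kron(m2, n))]
assert lhs == rhs
```

The identification with Kronecker products is special to free modules of finite rank; it says nothing about tensor products of modules with torsion.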

Proposition 20. Suppose $R$ is a commutative ring and $M$, $N$ are left $R$-modules,
considered with the standard $R$-module structures. Then there is a unique $R$-module
isomorphism

\[ M \otimes_R N \cong N \otimes_R M \]

mapping $m \otimes n$ to $n \otimes m$.

Proof: The map $M \times N \to N \otimes M$ defined by $(m, n) \mapsto n \otimes m$ is $R$-balanced.
Hence it induces a unique homomorphism $f$ from $M \otimes N$ to $N \otimes M$ with $f(m \otimes n) =
n \otimes m$. Similarly, we have a unique homomorphism $g$ from $N \otimes M$ to $M \otimes N$ with
$g(n \otimes m) = m \otimes n$ giving the inverse of $f$, and both maps are easily seen to be $R$-module
isomorphisms.

Remark: When $M = N$ it is not in general true that $a \otimes b = b \otimes a$ for $a, b \in M$. We
shall study “symmetric tensors” in Section 11.6.

We end this section by showing that the tensor product of $R$-algebras is again an
$R$-algebra.

Proposition 21. Let $R$ be a commutative ring and let $A$ and $B$ be $R$-algebras. Then the
multiplication $(a \otimes b)(a' \otimes b') = aa' \otimes bb'$ is well defined and makes $A \otimes_R B$ into an
$R$-algebra.

Proof: Note first that the definition of an $R$-algebra shows that

\[ r(a \otimes b) = ra \otimes b = ar \otimes b = a \otimes rb = a \otimes br = (a \otimes b)r \]

for every $r \in R$, $a \in A$ and $b \in B$. To show that $A \otimes B$ is an $R$-algebra the main task is,
as usual, showing that the specified multiplication is well defined. One way to proceed is
to use two applications of Corollary 16, as follows. The map $\varphi : A \times B \times A \times B \to A \otimes B$
defined by $\varphi(a, b, a', b') = aa' \otimes bb'$ is multilinear over $R$. For example,

\[
\begin{aligned}
\varphi(a, r_1 b_1 + r_2 b_2, a', b') &= aa' \otimes (r_1 b_1 + r_2 b_2) b' \\
&= aa' \otimes r_1 b_1 b' + aa' \otimes r_2 b_2 b' \\
&= r_1 \varphi(a, b_1, a', b') + r_2 \varphi(a, b_2, a', b').
\end{aligned}
\]

By Corollary 16, there is a corresponding $R$-module homomorphism $\Phi$ from $A \otimes B \otimes
A \otimes B$ to $A \otimes B$ with $\Phi(a \otimes b \otimes a' \otimes b') = aa' \otimes bb'$. Viewing $A \otimes B \otimes A \otimes B$ as
$(A \otimes B) \otimes (A \otimes B)$, we can apply Corollary 16 once more to obtain a well defined
$R$-bilinear mapping $\varphi'$ from $(A \otimes B) \times (A \otimes B)$ to $A \otimes B$ with $\varphi'(a \otimes b, a' \otimes b') = aa' \otimes bb'$.

This shows that the multiplication is indeed well defined (and also that it satisfies the
distributive laws). It is now a simple matter (left to the exercises) to check that with
this multiplication $A \otimes B$ is an $R$-algebra.


Example 

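The tensor product $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ is free of rank 4 as a module over $\mathbb{R}$ with basis given by
$e_1 = 1 \otimes 1$, $e_2 = 1 \otimes i$, $e_3 = i \otimes 1$, and $e_4 = i \otimes i$ (by Corollary 19). By Proposition 21,
this tensor product is also a (commutative) ring with $e_1 = 1$, and, for example,

\[ e_4^2 = (i \otimes i)(i \otimes i) = i^2 \otimes i^2 = (-1) \otimes (-1) = (-1)(-1) \otimes 1 = 1. \]

Then $(e_4 - 1)(e_4 + 1) = 0$, so $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ is not an integral domain.

The ring $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ is an $\mathbb{R}$-algebra and the left and right $\mathbb{R}$-actions are the same: $xr = rx$
for every $r \in \mathbb{R}$ and $x \in \mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$. The ring $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ has a structure of a left $\mathbb{C}$-module
because the first $\mathbb{C}$ is a $(\mathbb{C}, \mathbb{R})$-bimodule. It also has a right $\mathbb{C}$-module structure because
the second $\mathbb{C}$ is an $(\mathbb{R}, \mathbb{C})$-bimodule. For example,

\[ i \cdot e_1 = i \cdot (1 \otimes 1) = (i \cdot 1) \otimes 1 = i \otimes 1 = e_3 \]

and

\[ e_1 \cdot i = (1 \otimes 1) \cdot i = 1 \otimes (1 \cdot i) = 1 \otimes i = e_2. \]

This example also shows that even when the rings involved are commutative there may be
natural left and right module structures (over some ring) that are not the same.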
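
The multiplication in this example can be checked mechanically. The following is a minimal sketch, assuming elements of $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ are stored as lists of four real coordinates with respect to $e_1, e_2, e_3, e_4$; the encoding of $i^p \otimes i^q$ as index `2*p + q` and the function name `mul` are our own choices, not notation from the text.

```python
from itertools import product

# Basis of C (x)_R C over R: index 2*p + q stands for i^p (x) i^q, so
# e1 = 1(x)1, e2 = 1(x)i, e3 = i(x)1, e4 = i(x)i are indices 0, 1, 2, 3.

def mul(x, y):
    """Multiply two elements given as lists of 4 real coordinates."""
    out = [0, 0, 0, 0]
    for (p, q), (p2, q2) in product(product(range(2), repeat=2), repeat=2):
        # (i^p (x) i^q)(i^p2 (x) i^q2) = i^(p+p2) (x) i^(q+q2); each i^2 = -1
        # is a real scalar and can be pulled out in front of the tensor.
        sign = (-1) ** ((p + p2) // 2 + (q + q2) // 2)
        k = 2 * ((p + p2) % 2) + (q + q2) % 2
        out[k] += sign * x[2 * p + q] * y[2 * p2 + q2]
    return out

e1, e2, e3, e4 = [1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]
sub = lambda x, y: [a - b for a, b in zip(x, y)]
add = lambda x, y: [a + b for a, b in zip(x, y)]

assert mul(e4, e4) == e1                               # e4^2 = 1
assert mul(sub(e4, e1), add(e4, e1)) == [0, 0, 0, 0]   # (e4 - 1)(e4 + 1) = 0
assert mul(e3, e3) == [-1, 0, 0, 0]                    # (i (x) 1)^2 = -1
assert mul(e3, e2) == e4                               # (i (x) 1)(1 (x) i) = i (x) i
```

In this encoding the left action of $i$ is multiplication by $e_3$ and the right action of $i$ is multiplication by $e_2$, which makes the difference between the two $\mathbb{C}$-module structures visible.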


EXERCISES 


Let R be a ring with 1. 

1. Let $f : R \to S$ be a ring homomorphism from the ring $R$ to the ring $S$ with $f(1_R) = 1_S$.
Verify the details that $sr = sf(r)$ defines a right $R$-action on $S$ under which $S$ is an
$(S, R)$-bimodule.

2. Show that the element “$2 \otimes 1$” is 0 in $\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$ but is nonzero in $2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z}$.

3. Show that $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ and $\mathbb{C} \otimes_{\mathbb{C}} \mathbb{C}$ are both left $\mathbb{R}$-modules but are not isomorphic as $\mathbb{R}$-modules.

4. Show that $\mathbb{Q} \otimes_{\mathbb{Z}} \mathbb{Q}$ and $\mathbb{Q} \otimes_{\mathbb{Q}} \mathbb{Q}$ are isomorphic left $\mathbb{Q}$-modules. [Show they are both
1-dimensional vector spaces over $\mathbb{Q}$.]

5. Let $A$ be a finite abelian group of order $n$ and let $p^k$ be the largest power of the prime $p$
dividing $n$. Prove that $\mathbb{Z}/p^k\mathbb{Z} \otimes_{\mathbb{Z}} A$ is isomorphic to the Sylow $p$-subgroup of $A$.

6. If $R$ is any integral domain with quotient field $Q$, prove that $(Q/R) \otimes_R (Q/R) = 0$.

7. If $R$ is any integral domain with quotient field $Q$ and $N$ is a left $R$-module, prove that
every element of the tensor product $Q \otimes_R N$ can be written as a simple tensor of the form
$(1/d) \otimes n$ for some nonzero $d \in R$ and some $n \in N$.

8. Suppose $R$ is an integral domain with quotient field $Q$ and let $N$ be any $R$-module. Let
$U = R^{\times}$ be the set of nonzero elements in $R$ and define $U^{-1}N$ to be the set of equivalence
classes of ordered pairs of elements $(u, n)$ with $u \in U$ and $n \in N$ under the equivalence
relation $(u, n) \sim (u', n')$ if and only if $u'n = un'$ in $N$.

(a) Prove that $U^{-1}N$ is an abelian group under the addition defined by $(u_1, n_1) +
(u_2, n_2) = (u_1 u_2, u_2 n_1 + u_1 n_2)$. Prove that $r(u, n) = (u, rn)$ defines an action of $R$
on $U^{-1}N$ making it into an $R$-module. [This is an example of localization considered
in general in Section 4 of Chapter 15, cf. also Section 5 in Chapter 7.]

(b) Show that the map from $Q \times N$ to $U^{-1}N$ defined by sending $(a/b, n)$ to $(b, an)$
for $a \in R$, $b \in U$, $n \in N$, is an $R$-balanced map, so induces a homomorphism $f$
from $Q \otimes_R N$ to $U^{-1}N$. Show that the map $g$ from $U^{-1}N$ to $Q \otimes_R N$ defined by
$g((u, n)) = (1/u) \otimes n$ is well defined and is an inverse homomorphism to $f$. Conclude
that $Q \otimes_R N \cong U^{-1}N$ as $R$-modules.

(c) Conclude from (b) that $(1/d) \otimes n$ is 0 in $Q \otimes_R N$ if and only if $rn = 0$ for some
nonzero $r \in R$.

(d) If $A$ is an abelian group, show that $\mathbb{Q} \otimes_{\mathbb{Z}} A = 0$ if and only if $A$ is a torsion abelian
group (i.e., every element of $A$ has finite order).

9. Suppose $R$ is an integral domain with quotient field $Q$ and let $N$ be any $R$-module. Let
$Q \otimes_R N$ be the module obtained from $N$ by extension of scalars from $R$ to $Q$. Prove that
the kernel of the $R$-module homomorphism $\iota : N \to Q \otimes_R N$ is the torsion submodule of
$N$ (cf. Exercise 8 in Section 1). [Use the previous exercise.]

10. Suppose $R$ is commutative and $N \cong R^n$ is a free $R$-module of rank $n$ with $R$-module basis
$e_1, \dots, e_n$.

(a) For any nonzero $R$-module $M$ show that every element of $M \otimes N$ can be written
uniquely in the form $\sum_{i=1}^{n} m_i \otimes e_i$ where $m_i \in M$. Deduce that if $\sum_{i=1}^{n} m_i \otimes e_i = 0$
in $M \otimes N$ then $m_i = 0$ for $i = 1, \dots, n$.

(b) Show that if $\sum m_i \otimes n_i = 0$ in $M \otimes N$ where the $n_i$ are merely assumed to be
$R$-linearly independent then it is not necessarily true that all the $m_i$ are 0. [Consider
$R = \mathbb{Z}$, $n = 1$, $M = \mathbb{Z}/2\mathbb{Z}$, and the element $1 \otimes 2$.]

11. Let $\{e_1, e_2\}$ be a basis of $V = \mathbb{R}^2$. Show that the element $e_1 \otimes e_2 + e_2 \otimes e_1$ in $V \otimes_{\mathbb{R}} V$
cannot be written as a simple tensor $v \otimes w$ for any $v, w \in \mathbb{R}^2$.

12. Let $V$ be a vector space over the field $F$ and let $v, v'$ be nonzero elements of $V$. Prove that
$v \otimes v' = v' \otimes v$ in $V \otimes_F V$ if and only if $v = av'$ for some $a \in F$.

13. Prove that the usual dot product of vectors defined by letting $(a_1, \dots, a_n) \cdot (b_1, \dots, b_n)$ be
$a_1 b_1 + \cdots + a_n b_n$ is a bilinear map from $\mathbb{R}^n \times \mathbb{R}^n$ to $\mathbb{R}$.

14. Let $I$ be an arbitrary nonempty index set and for each $i \in I$ let $N_i$ be a left $R$-module. Let
$M$ be a right $R$-module. Prove the group isomorphism: $M \otimes (\oplus_{i \in I} N_i) \cong \oplus_{i \in I} (M \otimes N_i)$,
where the direct sum of an arbitrary collection of modules is defined in Exercise 20,
Section 3. [Use the same argument as for the direct sum of two modules, taking care to
note where the direct sum hypothesis is needed; cf. the next exercise.]

15. Show that tensor products do not commute with direct products in general. [Consider
the extension of scalars from $\mathbb{Z}$ to $\mathbb{Q}$ of the direct product of the modules $M_i = \mathbb{Z}/2^i\mathbb{Z}$,
$i = 1, 2, \dots$.]

16. Suppose $R$ is commutative and let $I$ and $J$ be ideals of $R$, so $R/I$ and $R/J$ are naturally
$R$-modules.

(a) Prove that every element of $R/I \otimes_R R/J$ can be written as a simple tensor of the form
$(1 \bmod I) \otimes (r \bmod J)$.

(b) Prove that there is an $R$-module isomorphism $R/I \otimes_R R/J \cong R/(I + J)$ mapping
$(r \bmod I) \otimes (r' \bmod J)$ to $rr' \bmod (I + J)$.

17. Let $I = (2, x)$ be the ideal generated by 2 and $x$ in the ring $R = \mathbb{Z}[x]$. The ring $\mathbb{Z}/2\mathbb{Z} = R/I$
is naturally an $R$-module annihilated by both 2 and $x$.

(a) Show that the map $\varphi : I \times I \to \mathbb{Z}/2\mathbb{Z}$ defined by

\[ \varphi(a_0 + a_1 x + \cdots + a_n x^n,\ b_0 + b_1 x + \cdots + b_m x^m) = \frac{a_0}{2} b_1 \bmod 2 \]

is $R$-bilinear.

(b) Show that there is an $R$-module homomorphism from $I \otimes_R I \to \mathbb{Z}/2\mathbb{Z}$ mapping
$p(x) \otimes q(x)$ to $\frac{p(0)}{2} q'(0)$ where $q'$ denotes the usual polynomial derivative of $q$.

(c) Show that $2 \otimes x \neq x \otimes 2$ in $I \otimes_R I$.

18. Suppose $I$ is a principal ideal in the integral domain $R$. Prove that the $R$-module $I \otimes_R I$
has no nonzero torsion elements (i.e., $rm = 0$ with $0 \neq r \in R$ and $m \in I \otimes_R I$ implies
that $m = 0$).

19. Let $I = (2, x)$ be the ideal generated by 2 and $x$ in the ring $R = \mathbb{Z}[x]$ as in Exercise 17.
Show that the nonzero element $2 \otimes x - x \otimes 2$ in $I \otimes_R I$ is a torsion element. Show in
fact that $2 \otimes x - x \otimes 2$ is annihilated by both 2 and $x$ and that the submodule of $I \otimes_R I$
generated by $2 \otimes x - x \otimes 2$ is isomorphic to $R/I$.

20. Let $I = (2, x)$ be the ideal generated by 2 and $x$ in the ring $R = \mathbb{Z}[x]$. Show that the
element $2 \otimes 2 + x \otimes x$ in $I \otimes_R I$ is not a simple tensor, i.e., cannot be written as $a \otimes b$ for
some $a, b \in I$.


21. Suppose $R$ is commutative and let $I$ and $J$ be ideals of $R$.

(a) Show there is a surjective $R$-module homomorphism from $I \otimes_R J$ to the product ideal
$IJ$ mapping $i \otimes j$ to the element $ij$.

(b) Give an example to show that the map in (a) need not be injective (cf. Exercise 17).

22. Suppose that $M$ is a left and a right $R$-module such that $rm = mr$ for all $r \in R$ and
$m \in M$. Show that the elements $r_1 r_2$ and $r_2 r_1$ act the same on $M$ for every $r_1, r_2 \in R$.
(This explains why the assumption that $R$ is commutative in the definition of an $R$-algebra
is a fairly natural one.)

23. Verify the details that the multiplication in Proposition 21 makes $A \otimes_R B$ into an $R$-algebra.

24. Prove that the extension of scalars from $\mathbb{Z}$ to the Gaussian integers $\mathbb{Z}[i]$ of the ring $\mathbb{R}$ is
isomorphic to $\mathbb{C}$ as a ring: $\mathbb{Z}[i] \otimes_{\mathbb{Z}} \mathbb{R} \cong \mathbb{C}$ as rings.

25. Let $R$ be a subring of the commutative ring $S$ and let $x$ be an indeterminate over $S$. Prove
that $S[x]$ and $S \otimes_R R[x]$ are isomorphic as $S$-algebras.

26. Let $S$ be a commutative ring containing $R$ (with $1_S = 1_R$) and let $x_1, \dots, x_n$ be independent
indeterminates over the ring $S$. Show that for every ideal $I$ in the polynomial ring
$R[x_1, \dots, x_n]$ that $S \otimes_R (R[x_1, \dots, x_n]/I) \cong S[x_1, \dots, x_n]/I S[x_1, \dots, x_n]$ as $S$-algebras.


The next exercise shows the ring $\mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ introduced at the end of this section is isomorphic
to $\mathbb{C} \times \mathbb{C}$. One may also prove this via Exercise 26 and Proposition 16 in Section 9.5, since
$\mathbb{C} \cong \mathbb{R}[x]/(x^2 + 1)$. The ring $\mathbb{C} \times \mathbb{C}$ is also discussed in Exercise 23 of Section 1.


27. (a) Write down a formula for the multiplication of two elements $a \cdot 1 + b \cdot e_2 + c \cdot e_3 + d \cdot e_4$
and $a' \cdot 1 + b' \cdot e_2 + c' \cdot e_3 + d' \cdot e_4$ in the example $A = \mathbb{C} \otimes_{\mathbb{R}} \mathbb{C}$ following Proposition
21 (where $1 = 1 \otimes 1$ is the identity of $A$).

(b) Let $\epsilon_1 = \frac{1}{2}(1 \otimes 1 + i \otimes i)$ and $\epsilon_2 = \frac{1}{2}(1 \otimes 1 - i \otimes i)$. Show that $\epsilon_1 \epsilon_2 = 0$, $\epsilon_1 + \epsilon_2 = 1$, and
$\epsilon_j^2 = \epsilon_j$ for $j = 1, 2$ ($\epsilon_1$ and $\epsilon_2$ are called orthogonal idempotents in $A$). Deduce that
$A$ is isomorphic as a ring to the direct product of two principal ideals: $A \cong A\epsilon_1 \times A\epsilon_2$
(cf. Exercise 1, Section 7.6).

(c) Prove that the map $\varphi : \mathbb{C} \times \mathbb{C} \to \mathbb{C} \times \mathbb{C}$ by $\varphi(z_1, z_2) = (z_1 z_2, z_1 \overline{z_2})$, where $\overline{z_2}$ denotes
the complex conjugate of $z_2$, is an $\mathbb{R}$-bilinear map.

(d) Let $\Phi$ be the $\mathbb{R}$-module homomorphism from $A$ to $\mathbb{C} \times \mathbb{C}$ obtained from $\varphi$ in (c).
Show that $\Phi(\epsilon_1) = (0, 1)$ and $\Phi(\epsilon_2) = (1, 0)$. Show also that $\Phi$ is $\mathbb{C}$-linear, where
the action of $\mathbb{C}$ is on the left tensor factor in $A$ and on both factors in $\mathbb{C} \times \mathbb{C}$. Deduce
that $\Phi$ is surjective. Show that $\Phi$ is a $\mathbb{C}$-algebra isomorphism.


10.5 EXACT SEQUENCES— PROJECTIVE, INJECTIVE, AND 
FLAT MODULES 

One of the fundamental results for studying the structure of an algebraic object B (e.g., 
a group, a ring, or a module) is the First Isomorphism Theorem, which relates the 
subobjects of B (the normal subgroups, the ideals, or the submodules, respectively) 
with the possible homomorphic images of B. We have already seen many examples 
applying this theorem to understand the structure of B from an understanding of its 
“smaller” constituents, for example in analyzing the structure of the dihedral group
$D_8$ by determining its center and the resulting quotient by the center.

In most of these examples we began first with a given $B$ and then determined some
of its basic properties by constructing a homomorphism $\varphi$ (often given implicitly by
the specification of $\ker \varphi \subseteq B$) and examining both $\ker \varphi$ and the resulting quotient
$B / \ker \varphi$. We now consider in some greater detail the reverse situation, namely whether
we may first specify the “smaller constituents.” More precisely, we consider whether,
given two modules $A$ and $C$, there exists a module $B$ containing (an isomorphic copy
of) $A$ such that the resulting quotient module $B/A$ is isomorphic to $C$, in which case
$B$ is said to be an extension of $C$ by $A$. It is then natural to ask how many such $B$ exist
for a given $A$ and $C$, and the extent to which properties of $B$ are determined by the
corresponding properties of $A$ and $C$. There are, of course, analogous problems in the
contexts of groups and rings. This is the extension problem first discussed (for groups)
in Section 3.4; in this section we shall be primarily concerned with left modules over
a ring $R$, making note where necessary of the modifications required for some other
structures, notably noncommutative groups. As in the previous section, throughout this
section all rings contain a 1.

We first introduce a very convenient notation. To say that $A$ is isomorphic to a
submodule of $B$, is to say that there is an injective homomorphism $\psi : A \to B$ (so
then $A \cong \psi(A) \subseteq B$). To say that $C$ is isomorphic to the resulting quotient is to say
that there is a surjective homomorphism $\varphi : B \to C$ with $\ker \varphi = \psi(A)$. In particular
this gives us a pair of homomorphisms:

\[ A \xrightarrow{\psi} B \xrightarrow{\varphi} C \]

with image $\psi = \ker \varphi$. A pair of homomorphisms with this property is given a name:

Definition.

(1) The pair of homomorphisms $X \xrightarrow{\alpha} Y \xrightarrow{\beta} Z$ is said to be exact (at $Y$) if
image $\alpha = \ker \beta$.

(2) A sequence $\cdots \to X_{n-1} \to X_n \to X_{n+1} \to \cdots$ of homomorphisms is said to be
an exact sequence if it is exact at every $X_n$ between a pair of homomorphisms.

With this terminology, the pair of homomorphisms $A \xrightarrow{\psi} B \xrightarrow{\varphi} C$ above is exact at $B$.
We can also use this terminology to express the fact that for these maps $\psi$ is injective
and $\varphi$ is surjective:

Proposition 22. Let $A$, $B$ and $C$ be $R$-modules over some ring $R$. Then

(1) The sequence $0 \to A \xrightarrow{\psi} B$ is exact (at $A$) if and only if $\psi$ is injective.

(2) The sequence $B \xrightarrow{\varphi} C \to 0$ is exact (at $C$) if and only if $\varphi$ is surjective.

Proof: The (uniquely defined) homomorphism $0 \to A$ has image 0 in $A$. This will
be the kernel of $\psi$ if and only if $\psi$ is injective. Similarly, the kernel of the (uniquely
defined) zero homomorphism $C \to 0$ is all of $C$, which is the image of $\varphi$ if and only if
$\varphi$ is surjective.

Corollary 23. The sequence $0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0$ is exact if and only if $\psi$ is
injective, $\varphi$ is surjective, and image $\psi = \ker \varphi$, i.e., $B$ is an extension of $C$ by $A$.

Definition. The exact sequence $0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0$ is called a short exact
sequence.
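
For finite abelian groups the three conditions of Corollary 23 can be tested by direct enumeration. The sketch below is an illustration only: it assumes the sequence $0 \to \mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/4\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0$ with $\psi(a) = 2a$ and $\varphi$ reduction mod 2 (our choice of example, not one taken from the text), and the helper names `image` and `kernel` are hypothetical.

```python
def image(f, domain):
    """Set of values taken by f on a finite domain."""
    return {f(x) for x in domain}

def kernel(f, domain, zero=0):
    """Elements of a finite domain that f sends to the identity."""
    return {x for x in domain if f(x) == zero}

Z2, Z4 = range(2), range(4)
psi = lambda a: (2 * a) % 4      # Z/2Z -> Z/4Z, a mod 2 |-> 2a mod 4
phi = lambda b: b % 2            # Z/4Z -> Z/2Z, reduction mod 2

injective = kernel(psi, Z2) == {0}            # exactness at A
surjective = image(phi, Z4) == set(Z2)        # exactness at C
exact_at_B = image(psi, Z2) == kernel(phi, Z4)

assert injective and surjective and exact_at_B

# Composing to zero is necessary but not sufficient: replacing psi by the
# zero map keeps phi o psi = 0 but breaks exactness at B.
zero_map = lambda a: 0
assert all(phi(zero_map(a)) == 0 for a in Z2)
assert image(zero_map, Z2) != kernel(phi, Z4)
```

The last two assertions separate the weaker condition $\varphi \circ \psi = 0$ (image contained in kernel) from exactness (image equal to kernel).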

In terms of this notation, the extension problem can be stated succinctly as follows:
given modules $A$ and $C$, determine all the short exact sequences

\[ 0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0. \tag{10.9} \]

We shall see below that the exact sequence notation is also extremely convenient for
analyzing the extent to which properties of $A$ and $C$ determine the corresponding
properties of $B$. If $A$, $B$ and $C$ are groups written multiplicatively, the sequence (9) will be
written

\[ 1 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 1 \tag{10.9'} \]

where 1 denotes the trivial group. Both Proposition 22 and Corollary 23 are valid with
the obvious notational changes.

Note that any exact sequence can be written as a succession of short exact sequences
since to say $X \xrightarrow{\alpha} Y \xrightarrow{\beta} Z$ is exact at $Y$ is the same as saying that the sequence
$0 \to \alpha(X) \to Y \to Y/\ker \beta \to 0$ is a short exact sequence.

Examples 

(1) Given modules $A$ and $C$ we can always form their direct sum $B = A \oplus C$ and the
sequence

\[ 0 \to A \xrightarrow{\iota} A \oplus C \xrightarrow{\pi} C \to 0 \]

where $\iota(a) = (a, 0)$ and $\pi(a, c) = c$ is a short exact sequence. In particular, it follows
that there always exists at least one extension of $C$ by $A$.

(2) As a special case of the previous example, consider the two $\mathbb{Z}$-modules $A = \mathbb{Z}$ and
$C = \mathbb{Z}/n\mathbb{Z}$:

\[ 0 \to \mathbb{Z} \xrightarrow{\iota} \mathbb{Z} \oplus (\mathbb{Z}/n\mathbb{Z}) \xrightarrow{\pi} \mathbb{Z}/n\mathbb{Z} \to 0, \]

giving one extension of $\mathbb{Z}/n\mathbb{Z}$ by $\mathbb{Z}$.

Another extension of $\mathbb{Z}/n\mathbb{Z}$ by $\mathbb{Z}$ is given by the short exact sequence

\[ 0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z} \xrightarrow{\pi} \mathbb{Z}/n\mathbb{Z} \to 0 \]

where $n$ denotes the map $x \mapsto nx$ given by multiplication by $n$, and $\pi$ denotes the
natural projection. Note that the modules in the middle of the previous two exact
sequences are not isomorphic even though the respective “$A$” and “$C$” terms are
isomorphic. Thus there are (at least) two “essentially different” or “inequivalent”
ways of extending $\mathbb{Z}/n\mathbb{Z}$ by $\mathbb{Z}$.

(3) If $\varphi : B \to C$ is any homomorphism we may form an exact sequence:

\[ 0 \to \ker \varphi \xrightarrow{\iota} B \xrightarrow{\varphi} \operatorname{image} \varphi \to 0 \]

where $\iota$ is the inclusion map. In particular, if $\varphi$ is surjective, the sequence $\varphi : B \to C$
may be extended to a short exact sequence with $A = \ker \varphi$.

(4) One particularly important instance of the preceding example is when $M$ is an $R$-module
and $S$ is a set of generators for $M$. Let $F(S)$ be the free $R$-module on $S$. Then

\[ 0 \to K \xrightarrow{\iota} F(S) \xrightarrow{\varphi} M \to 0 \]

is the short exact sequence where $\varphi$ is the unique $R$-module homomorphism which is
the identity on $S$ (cf. Theorem 6) and $K = \ker \varphi$.

More generally, when $M$ is any group (possibly non-abelian) the above short exact
sequence (with 1's at the ends, if $M$ is written multiplicatively) describes a presentation
of $M$, where $K$ is the normal subgroup of $F(S)$ generated by the relations defining $M$
(cf. Section 6.3).

(5) Two “inequivalent” extensions $G$ of the Klein 4-group by the cyclic group $Z_2$ of order
two are

\[ 1 \to Z_2 \xrightarrow{\iota} D_8 \xrightarrow{\varphi} Z_2 \times Z_2 \to 1, \quad \text{and} \]

\[ 1 \to Z_2 \xrightarrow{\iota} Q_8 \xrightarrow{\varphi} Z_2 \times Z_2 \to 1, \]

where in each case $\iota$ maps $Z_2$ injectively into the center of $G$ (recall that both $D_8$ and
$Q_8$ have centers of order two), and $\varphi$ is the natural projection of $G$ onto $G/Z(G)$.

Two other inequivalent extensions $G$ of the Klein 4-group by $Z_2$ occur when $G$
is either of the abelian groups $Z_2 \times Z_2 \times Z_2$ or $Z_2 \times Z_4$ for appropriate maps $\iota$ and $\varphi$.


Examples 2 and 5 above show that, for a fixed A and C, in general there may be 
several extensions of C by A. To distinguish different extensions we define the notion 
of a homomorphism (and isomorphism) between two exact sequences. Recall first that 
a diagram involving various homomorphisms is said to commute if any compositions of 
homomorphisms with the same starting and ending points are equal, i.e., the composite 
map defined by following a path of homomorphisms in the diagram depends only on 
the starting and ending points and not on the choice of the path taken. 

Definition. Let $0 \to A \to B \to C \to 0$ and $0 \to A' \to B' \to C' \to 0$ be two short
exact sequences of modules.

(1) A homomorphism of short exact sequences is a triple $\alpha, \beta, \gamma$ of module
homomorphisms such that the following diagram commutes:

\[
\begin{array}{ccccccccc}
0 & \to & A & \to & B & \to & C & \to & 0 \\
 & & \downarrow{\scriptstyle \alpha} & & \downarrow{\scriptstyle \beta} & & \downarrow{\scriptstyle \gamma} & & \\
0 & \to & A' & \to & B' & \to & C' & \to & 0
\end{array}
\]

The homomorphism is an isomorphism of short exact sequences if $\alpha, \beta, \gamma$ are all
isomorphisms, in which case the extensions $B$ and $B'$ are said to be isomorphic
extensions.

(2) The two exact sequences are called equivalent if $A = A'$, $C = C'$, and there is
an isomorphism between them as in (1) that is the identity maps on $A$ and $C$
(i.e., $\alpha$ and $\gamma$ are the identity). In this case the corresponding extensions $B$ and
$B'$ are said to be equivalent extensions.

If B and B' are isomorphic extensions then in particular B and B' are isomorphic 
as R -modules, but more is true: there is an R -module isomorphism between B and 
B' that restricts to an isomorphism from A to A' and induces an isomorphism on the 
quotients C and C' . For a given A and C the condition that two extensions B and B' 
of C by A are equivalent is stronger still: there must exist an R -module isomorphism 
between B and B' that restricts to the identity map on A and induces the identity map 
on C. The notion of isomorphic extensions measures how many different extensions of 
C by A there are, allowing for C and A to be changed by an isomorphism. The notion 
of equivalent extensions measures how many different extensions of C by A there are 
when A and C are rigidly fixed. 

Homomorphisms and isomorphisms between short exact sequences of multiplicative
groups (10.9') are defined similarly.

It is an easy exercise to see that the composition of homomorphisms of short exact
sequences is also a homomorphism. Likewise, if the triple $\alpha, \beta, \gamma$ is an isomorphism
(or equivalence) then $\alpha^{-1}, \beta^{-1}, \gamma^{-1}$ is an isomorphism (equivalence, respectively) in
the reverse direction. It follows that “isomorphism” (or equivalence) is an equivalence
relation on any set of short exact sequences.
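
The commutativity condition in the definition of a homomorphism of short exact sequences can be tested on a window of integers. As a rough sketch, the code below maps the sequence $0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z} \to \mathbb{Z}/n\mathbb{Z} \to 0$ to $0 \to \mathbb{Z}/k\mathbb{Z} \to \mathbb{Z}/m\mathbb{Z} \to \mathbb{Z}/n\mathbb{Z} \to 0$ with $m = nk$, using reduction maps as the vertical arrows. The values $n = 3$, $k = 4$, the finite test window, and all function names are assumptions made for illustration.

```python
n, k = 3, 4
m = n * k

# Top row: 0 -> Z --(times n)--> Z --(mod n)--> Z/nZ -> 0
top_inj = lambda x: n * x
top_proj = lambda x: x % n

# Bottom row: 0 -> Z/kZ --iota--> Z/mZ --pi'--> Z/nZ -> 0
iota = lambda a: (n * a) % m        # a mod k |-> n*a mod m
pi_prime = lambda b: b % n          # well defined because n divides m

# Vertical maps: reduction mod k, reduction mod m, identity.
alpha = lambda x: x % k
beta = lambda x: x % m
gamma = lambda c: c

window = range(-50, 51)             # finite sample of Z, not a proof
left_square = all(beta(top_inj(x)) == iota(alpha(x)) for x in window)
right_square = all(pi_prime(beta(x)) == gamma(top_proj(x)) for x in window)
assert left_square and right_square

# iota is well defined on Z/kZ: representatives differing by k agree.
assert all(iota(a) == iota(a + k) for a in range(k))
```

Checking a finite window only gives evidence; the actual verification is the one-line congruence $n x \equiv n (x \bmod k) \pmod{nk}$.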

Examples 

(1) Let $m$ and $n$ be integers greater than 1. Assume $n$ divides $m$ and let $k = m/n$. Define
a map from the exact sequence of $\mathbb{Z}$-modules in Example 2 of the preceding set of
examples:

\[
\begin{array}{ccccccccc}
0 & \to & \mathbb{Z} & \xrightarrow{\ n\ } & \mathbb{Z} & \xrightarrow{\ \pi\ } & \mathbb{Z}/n\mathbb{Z} & \to & 0 \\
 & & \downarrow{\scriptstyle \alpha} & & \downarrow{\scriptstyle \beta} & & \downarrow{\scriptstyle \gamma} & & \\
0 & \to & \mathbb{Z}/k\mathbb{Z} & \xrightarrow{\ \iota\ } & \mathbb{Z}/m\mathbb{Z} & \xrightarrow{\ \pi'\ } & \mathbb{Z}/n\mathbb{Z} & \to & 0
\end{array}
\]

where $\alpha$ and $\beta$ are the natural projections, $\gamma$ is the identity map, $\iota$ maps $a \bmod k$ to
$na \bmod m$, and $\pi'$ is the natural projection of $\mathbb{Z}/m\mathbb{Z}$ onto its quotient $(\mathbb{Z}/m\mathbb{Z})/(n\mathbb{Z}/m\mathbb{Z})$
(which is isomorphic to $\mathbb{Z}/n\mathbb{Z}$). One easily checks that this is a homomorphism of
short exact sequences.

(2) If again $0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z} \xrightarrow{\pi} \mathbb{Z}/n\mathbb{Z} \to 0$ is the short exact sequence of $\mathbb{Z}$-modules defined
previously, map each module to itself by $x \mapsto -x$. This triple of homomorphisms
gives an isomorphism of the exact sequence with itself. This isomorphism is not an
equivalence of sequences since it is not the identity on the first $\mathbb{Z}$.

(3) The short exact sequences in Examples 1 and 2 following Corollary 23 are not
isomorphic: the extension modules are not isomorphic $\mathbb{Z}$-modules (abelian groups).
Likewise the two extensions, $D_8$ and $Q_8$, in Example 5 of the same set are not
isomorphic (hence not equivalent), even though the two end terms “$A$” and “$C$” are the
same for both sequences.

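(4) Consider the maps

\[
\begin{array}{ccccccccc}
0 & \to & \mathbb{Z}/2\mathbb{Z} & \xrightarrow{\ \psi\ } & \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} & \xrightarrow{\ \varphi\ } & \mathbb{Z}/2\mathbb{Z} & \to & 0 \\
 & & \downarrow{\scriptstyle \mathrm{id}} & & \downarrow{\scriptstyle \beta} & & \downarrow{\scriptstyle \mathrm{id}} & & \\
0 & \to & \mathbb{Z}/2\mathbb{Z} & \xrightarrow{\ \psi'\ } & \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} & \xrightarrow{\ \varphi'\ } & \mathbb{Z}/2\mathbb{Z} & \to & 0
\end{array}
\]

where $\psi$ maps $\mathbb{Z}/2\mathbb{Z}$ injectively into the first component of the direct sum and $\varphi$ projects
the direct sum onto its second component. Also $\psi'$ embeds $\mathbb{Z}/2\mathbb{Z}$ into the second
component of the direct sum and $\varphi'$ projects the direct sum onto its first component.
If $\beta$ maps the direct sum $\mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ to itself by interchanging the two factors,
then this diagram is seen to commute, hence giving an equivalence of the two exact
sequences that is not the identity isomorphism.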

(5) We exhibit two isomorphic but inequivalent $\mathbb{Z}$-module extensions. For $i = 1, 2$ define

\[ 0 \to \mathbb{Z}/2\mathbb{Z} \xrightarrow{\psi} \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} \xrightarrow{\varphi_i} \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} \to 0 \]

where $\psi : 1 \mapsto (2, 0)$ in both sequences, $\varphi_1$ is defined by $\varphi_1(a \bmod 4, b \bmod 2) =
(a \bmod 2, b \bmod 2)$, and $\varphi_2(a \bmod 4, b \bmod 2) = (b \bmod 2, a \bmod 2)$. It is easy to see
that the resulting two sequences are both short exact sequences.

An evident isomorphism between these two exact sequences is provided by the
triple of maps id, id, $\gamma$, where $\gamma : \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ is the map
$\gamma((c, d)) = (d, c)$ that interchanges the two direct factors.

We now check that these two isomorphic sequences are not equivalent, as
follows. Since $\varphi_1(0, 1) = (0, 1)$, any equivalence, id, $\beta$, id, from the first sequence to
the second must map $(0, 1) \in \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ to either $(1, 0)$ or $(3, 0)$ in $\mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$,
since these are the two possible elements mapping to $(0, 1)$ by $\varphi_2$. This is impossible,
however, since the isomorphism $\beta$ cannot send an element of order 2 to an element of
order 4.

Put another way, equivalences involving the same extension module $B$ are
automorphisms of $B$ that restrict to the identity on both $\psi(A)$ and $B/\psi(A)$. Any such
automorphism of $B = \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ must fix the coset $(0, 1) + \psi(A)$ since this
is the unique nonidentity coset containing elements of order 2. Thus maps which
send this coset to different elements in $C$ give inequivalent extensions. In particular,
there is yet a third inequivalent extension involving the same modules $A = \mathbb{Z}/2\mathbb{Z}$,
$B = \mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ and $C = \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$, that maps the coset $(0, 1) + \psi(A)$ to the
element $(1, 1) \in \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$.

By similar reasoning there are three inequivalent but isomorphic group extensions
of $Z_2 \times Z_2$ by $Z_2$ with $B \cong D_8$ (cf. the exercises).
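
The inequivalence argument for the two $\mathbb{Z}/4\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$ extensions in Example 5 can also be confirmed by brute force, since the middle group has only 8 elements. The following sketch enumerates every automorphism $\beta$ of $B$ and tests the two conditions for an equivalence; the tuple encoding of elements and the helper names (`add`, `scale`, `automorphisms`) are our own assumptions.

```python
from itertools import product

B = list(product(range(4), range(2)))          # Z/4Z (+) Z/2Z

def add(x, y):
    return ((x[0] + y[0]) % 4, (x[1] + y[1]) % 2)

def scale(k, x):
    return ((k * x[0]) % 4, (k * x[1]) % 2)

psi = lambda a: ((2 * a) % 4, 0)               # 1 |-> (2, 0)
phi1 = lambda x: (x[0] % 2, x[1] % 2)
phi2 = lambda x: (x[1] % 2, x[0] % 2)
swap = lambda c: (c[1], c[0])                  # the map gamma

# The triple (id, id, gamma) is an isomorphism of the two sequences.
assert all(swap(phi1(x)) == phi2(x) for x in B)

def automorphisms():
    """All automorphisms of B, determined by the images u, v of (1,0), (0,1)."""
    for u, v in product(B, repeat=2):
        if scale(2, v) != (0, 0):              # (0,1) has order 2, so 2v = 0
            continue
        beta = {x: add(scale(x[0], u), scale(x[1], v)) for x in B}
        if len(set(beta.values())) == len(B):  # keep only bijective maps
            yield beta

# An equivalence (id, beta, id) needs beta o psi = psi and phi2 o beta = phi1.
equivalences = [
    beta for beta in automorphisms()
    if all(beta[psi(a)] == psi(a) for a in range(2))
    and all(phi2(beta[x]) == phi1(x) for x in B)
]
assert equivalences == []
```

Every endomorphism of $B$ has the form $(a, b) \mapsto au + bv$ with $2v = 0$, so the enumeration is exhaustive; the empty result matches the order argument given in the example.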

The homomorphisms $\alpha, \beta, \gamma$ in a homomorphism of short exact sequences are not
independent. The next result gives some relations among these three homomorphisms.

Proposition 24. (The Short Five Lemma) Let $\alpha, \beta, \gamma$ be a homomorphism of short
exact sequences

\[
\begin{array}{ccccccccc}
0 & \to & A & \to & B & \to & C & \to & 0 \\
 & & \downarrow{\scriptstyle \alpha} & & \downarrow{\scriptstyle \beta} & & \downarrow{\scriptstyle \gamma} & & \\
0 & \to & A' & \to & B' & \to & C' & \to & 0
\end{array}
\]

(1) If $\alpha$ and $\gamma$ are injective then so is $\beta$.

(2) If $\alpha$ and $\gamma$ are surjective then so is $\beta$.

(3) If $\alpha$ and $\gamma$ are isomorphisms then so is $\beta$ (and then the two sequences are
isomorphic).

Remark: These results hold also for short exact sequences of (possibly non-abelian)
groups (as the proof demonstrates).

Proof: We shall prove (1), leaving the proof of (2) as an exercise (and (3) follows
immediately from (1) and (2)). Suppose then that $\alpha$ and $\gamma$ are injective and suppose
$b \in B$ with $\beta(b) = 0$. Let $\psi : A \to B$ and $\varphi : B \to C$ denote the homomorphisms in
the first short exact sequence. Since $\beta(b) = 0$, it follows in particular that the image
of $\beta(b)$ in the quotient $C'$ is also 0. By the commutativity of the diagram this implies
that $\gamma(\varphi(b)) = 0$, and since $\gamma$ is assumed injective, we obtain $\varphi(b) = 0$, i.e., $b$ is
in the kernel of $\varphi$. By the exactness of the first sequence, this means that $b$ is in the
image of $\psi$, i.e., $b = \psi(a)$ for some $a \in A$. Then, again by the commutativity of
the diagram, the image of $\alpha(a)$ in $B'$ is the same as $\beta(\psi(a)) = \beta(b) = 0$. But $\alpha$ and
the map from $A'$ to $B'$ are injective by assumption, and it follows that $a = 0$. Finally,
$b = \psi(a) = \psi(0) = 0$ and we see that $\beta$ is indeed injective.

We have already seen that there is always at least one extension of a module $C$ by $A$,
namely the direct sum $B = A \oplus C$. In this case the module $B$ contains a submodule $C'$
isomorphic to $C$ (namely $C' = 0 \oplus C$) as well as the submodule $A$, and this submodule
complement to $A$ “splits” $B$ into a direct sum. In the case of groups the existence of
a subgroup complement $C'$ to a normal subgroup in $B$ implies that $B$ is a semidirect
product (cf. Section 5 in Chapter 5). The fact that $B$ is a direct sum in the context
of modules is a reflection of the fact that the underlying group structure in this case is
abelian; for abelian groups semidirect products are direct products. In either case the
corresponding short exact sequence is said to “split”:

Definition.

(1) Let $R$ be a ring and let $0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0$ be a short exact sequence of
$R$-modules. The sequence is said to be split if there is an $R$-module complement
to $\psi(A)$ in $B$. In this case, up to isomorphism, $B = A \oplus C$ (more precisely,
$B = \psi(A) \oplus C'$ for some submodule $C'$, and $C'$ is mapped isomorphically onto
$C$ by $\varphi$: $\varphi(C') = C$).

(2) If $1 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 1$ is a short exact sequence of groups, then the
sequence is said to be split if there is a subgroup complement to $\psi(A)$ in $B$. In
this case, up to isomorphism, $B = A \rtimes C$ (more precisely, $B = \psi(A) \rtimes C'$ for
some subgroup $C'$, and $C'$ is mapped isomorphically onto $C$ by $\varphi$: $\varphi(C') = C$).

In either case the extension $B$ is said to be a split extension of $C$ by $A$.

The question of whether an extension splits is the question of the existence of a
complement to $\psi(A)$ in $B$ isomorphic (by $\varphi$) to $C$, so the notion of a split extension
may equivalently be phrased in the language of homomorphisms:

Proposition 25. The short exact sequence $0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0$ of $R$-modules is
split if and only if there is an $R$-module homomorphism $\mu : C \to B$ such that $\varphi \circ \mu$
is the identity map on $C$. Similarly, the short exact sequence $1 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 1$
of groups is split if and only if there is a group homomorphism $\mu : C \to B$ such that
$\varphi \circ \mu$ is the identity map on $C$.

Proof: This follows directly from the definitions: if $\mu$ is given define $C' = \mu(C) \subseteq
B$ and if $C'$ is given define $\mu = \varphi^{-1} : C \cong C' \subseteq B$.

Definition. With notation as in Proposition 25, any set map $\mu : C \to B$ such that
$\varphi \circ \mu = \mathrm{id}$ is called a section of $\varphi$. If $\mu$ is a homomorphism as in Proposition 25 then
$\mu$ is called a splitting homomorphism for the sequence.

Note that a section of $\varphi$ is nothing more than a choice of coset representatives in $B$
for the quotient $B/\ker \varphi \cong C$. A section is a (splitting) homomorphism if this set of
coset representatives forms a submodule (respectively, subgroup) in $B$, in which case
this submodule (respectively, subgroup) gives a complement to $\psi(A)$ in $B$.
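
The description of sections as choices of coset representatives lends itself to enumeration in small cases. The sketch below lists every set-theoretic section of a surjection between finite abelian groups and filters out those that are homomorphisms. The two test sequences (reduction $\mathbb{Z}/4\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$, and projection $\mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ onto the second factor) are finite stand-ins chosen by us for illustration, and `sections` and `is_hom` are hypothetical helper names.

```python
from itertools import product

def sections(phi, B, C):
    """All set maps mu: C -> B with phi(mu(c)) = c, returned as dicts."""
    fibers = [[b for b in B if phi(b) == c] for c in C]
    return [dict(zip(C, choice)) for choice in product(*fibers)]

def is_hom(mu, add_C, add_B, C):
    """Check mu(c + d) = mu(c) + mu(d) for all c, d in C."""
    return all(mu[add_C(c, d)] == add_B(mu[c], mu[d]) for c in C for d in C)

add2 = lambda c, d: (c + d) % 2

# Z/4Z -> Z/2Z by reduction mod 2: four sections, no splitting homomorphism.
Z4, Z2 = list(range(4)), list(range(2))
secs_Z4 = sections(lambda b: b % 2, Z4, Z2)
split_Z4 = [mu for mu in secs_Z4 if is_hom(mu, add2, lambda x, y: (x + y) % 4, Z2)]
assert len(secs_Z4) == 4 and split_Z4 == []

# Z/2Z (+) Z/2Z -> Z/2Z by projection onto the second factor: it splits.
V = list(product(range(2), repeat=2))
addV = lambda x, y: ((x[0] + y[0]) % 2, (x[1] + y[1]) % 2)
secs_V = sections(lambda b: b[1], V, Z2)
split_V = [mu for mu in secs_V if is_hom(mu, add2, addV, Z2)]
assert len(secs_V) == 4 and len(split_V) == 2
```

In both cases there are four sections, one for each choice of representative in the two cosets, but only in the direct sum do some of these choices form a subgroup.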

Examples 

(1) The split short exact sequence $0 \to A \xrightarrow{\iota} A \oplus C \xrightarrow{\pi} C \to 0$ has the evident splitting
homomorphism $\mu(c) = (0, c)$.

(2) The extension $0 \to \mathbb{Z} \xrightarrow{\iota} \mathbb{Z} \oplus (\mathbb{Z}/n\mathbb{Z}) \xrightarrow{\pi} \mathbb{Z}/n\mathbb{Z} \to 0$ of $\mathbb{Z}/n\mathbb{Z}$ by $\mathbb{Z}$ is split (with
splitting homomorphism $\mu$ mapping $\mathbb{Z}/n\mathbb{Z}$ isomorphically onto the second factor of
the direct sum). On the other hand, the exact sequence of $\mathbb{Z}$-modules $0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z} \xrightarrow{\pi}
\mathbb{Z}/n\mathbb{Z} \to 0$ is not split since there is no nonzero homomorphism of $\mathbb{Z}/n\mathbb{Z}$ into $\mathbb{Z}$.

(3) Neither $D_8$ nor $Q_8$ is a split extension of $Z_2 \times Z_2$ by $Z_2$ because in neither group is
there a subgroup complement to the center (Section 2.5 gives the subgroup structures
of these groups).

(4) The group $D_8$ is a split extension of $Z_2$ by $Z_4$, i.e., there is a split short exact sequence

\[ 1 \to Z_4 \xrightarrow{\iota} D_8 \xrightarrow{\pi} Z_2 \to 1, \]

namely,

\[ 1 \to \langle r \rangle \xrightarrow{\iota} D_8 \xrightarrow{\pi} \langle \bar{s} \rangle \to 1, \]

using our usual set of generators for $D_8$. Here $\iota$ is the inclusion map and $\pi : r^a s^b \mapsto \bar{s}^b$
is the projection onto the quotient $D_8/\langle r \rangle \cong Z_2$. The splitting homomorphism $\mu$
maps $\langle \bar{s} \rangle$ isomorphically onto the complement $\langle s \rangle$ for $\langle r \rangle$ in $D_8$. Equivalently, $D_8$
is the semidirect product of the normal subgroup $\langle r \rangle$ (isomorphic to $Z_4$) with $\langle s \rangle$
(isomorphic to $Z_2$).

On the other hand, while $Q_8$ is also an extension of $Z_2$ by $Z_4$ (for example,
$\langle i \rangle \cong Z_4$ has quotient isomorphic to $Z_2$), $Q_8$ is not a split extension of $Z_2$ by $Z_4$: no
cyclic subgroup of $Q_8$ of order 4 has a complement in $Q_8$.

Section 5.5 contains many more examples of split extensions of groups. 

Proposition 25 shows that an extension $B$ of $C$ by $A$ is a split extension if and only
if there is a splitting homomorphism of the projection map $\varphi : B \to C$ from $B$ to the
quotient $C$. The next proposition shows in particular that for modules this is equivalent
to the existence of a splitting homomorphism for $\psi$ at the other end of the sequence.


Proposition 26. Let $0 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 0$ be a short exact sequence of modules
(respectively, $1 \to A \xrightarrow{\psi} B \xrightarrow{\varphi} C \to 1$ a short exact sequence of groups). Then $B =
\psi(A) \oplus C'$ for some submodule $C'$ of $B$ with $\varphi(C') = C$ (respectively, $B = \psi(A) \times C'$
for some subgroup $C'$ of $B$ with $\varphi(C') = C$) if and only if there is a homomorphism
$\lambda : B \to A$ such that $\lambda \circ \psi$ is the identity map on $A$.

Proof: This is similar to the proof of Proposition 25. If $\lambda$ is given, define $C' =
\ker \lambda \subseteq B$ and if $C'$ is given define $\lambda : B = \psi(A) \oplus C' \to A$ by $\lambda((\psi(a), c')) = a$.
Note that in this case $C' = \ker \lambda$ is normal in $B$, so that $C'$ is a normal complement to
$\psi(A)$ in $B$, which in turn implies that $B$ is the direct sum of $\psi(A)$ and $C'$ (cf. Theorem
9 of Section 5.4).

Proposition 26 shows that for general group extensions, the existence of a splitting 
homomorphism X on the left end of the sequence is stronger than the condition that 
the extension splits: in this case the extension group is a direct product, and not just 
a semidirect product. The fact that these two notions are equivalent in the context of
modules is again a reflection of the abelian nature of the underlying groups, where 
semidirect products are always direct products. 
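
As an illustration of this remark, one can test the split extension $1 \to \langle r \rangle \to D_8 \to Z_2 \to 1$ from the examples above for a splitting homomorphism on the left. The sketch below is an assumption-laden illustration rather than part of the text: it encodes $r^a s^b$ as the pair $(a, b)$, identifies $\langle r \rangle$ with $\mathbb{Z}/4\mathbb{Z}$ via $r^a \leftrightarrow a$, and uses the fact that any homomorphism $\lambda : D_8 \to \mathbb{Z}/4\mathbb{Z}$ is determined by $x = \lambda(r)$ and $y = \lambda(s)$.

```python
def d8_mul(x, y):
    # (a, b) stands for r^a s^b in D8, using the relation s r = r^{-1} s
    a, b = x
    c, d = y
    return ((a + (c if b == 0 else -c)) % 4, (b + d) % 2)

D8 = [(a, b) for a in range(4) for b in range(2)]

def candidate(x, y):
    # the only possible homomorphism D8 -> Z/4Z with r |-> x and s |-> y
    return lambda g: (g[0] * x + g[1] * y) % 4

def is_hom(lam):
    return all(lam(d8_mul(g, h)) == (lam(g) + lam(h)) % 4 for g in D8 for h in D8)

homs = [(x, y) for x in range(4) for y in range(4) if is_hom(candidate(x, y))]
# a left splitting must restrict to the identity on <r>, i.e. send r to 1
retractions = [(x, y) for (x, y) in homs if x == 1]
print(homs, retractions)  # [(0, 0), (0, 2), (2, 0), (2, 2)] []
```

No homomorphism sends $r$ to a generator of $\mathbb{Z}/4\mathbb{Z}$, so this extension splits on the right without $D_8$ being a direct product, consistent with Proposition 26.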

Projective Modules and $\operatorname{Hom}_R(D, \_)$

Let $R$ be a ring with 1 and suppose the $R$-module $M$ is an extension of $N$ by $L$, with

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

the corresponding short exact sequence of $R$-modules. It is natural to ask whether
properties for $L$ and $N$ imply related properties for the extension $M$. The first situation
we shall consider is whether an $R$-module homomorphism from some fixed $R$-module
$D$ to either $L$ or $N$ implies there is also an $R$-module homomorphism from $D$ to $M$.

The question of obtaining a homomorphism from $D$ to $M$ given a homomorphism
from $D$ to $L$ is easily disposed of: if $f \in \operatorname{Hom}_R(D, L)$ is an $R$-module
homomorphism from $D$ to $L$ then the composite $f' = \psi \circ f$ is an $R$-module homomorphism
from $D$ to $M$. The relation between these maps can be indicated pictorially by the commutative
diagram

[Commutative diagram: $f : D \to L$, $\psi : L \to M$, and $f' = \psi \circ f : D \to M$.]

Put another way, composition with $\psi$ induces a map

\[ \psi' : \operatorname{Hom}_R(D, L) \to \operatorname{Hom}_R(D, M), \qquad f \mapsto f' = \psi \circ f. \]

Recall that, by Proposition 2, $\operatorname{Hom}_R(D, L)$ and $\operatorname{Hom}_R(D, M)$ are abelian groups.


Proposition 27. Let $D$, $L$ and $M$ be $R$-modules and let $\psi : L \to M$ be an $R$-module
homomorphism. Then the map

\[ \psi' : \operatorname{Hom}_R(D, L) \to \operatorname{Hom}_R(D, M), \qquad f \mapsto f' = \psi \circ f \]

is a homomorphism of abelian groups. If $\psi$ is injective, then $\psi'$ is also injective, i.e.,

if $0 \to L \xrightarrow{\psi} M$ is exact,
then $0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M)$ is also exact.


Proof: The fact that $\psi'$ is a homomorphism is immediate. If $\psi$ is injective, then
distinct homomorphisms $f$ and $g$ from $D$ into $L$ give distinct homomorphisms $\psi \circ f$
and $\psi \circ g$ from $D$ into $M$, which is to say that $\psi'$ is also injective.


While obtaining homomorphisms into $M$ from homomorphisms into the submodule
$L$ is straightforward, the situation for homomorphisms into the quotient $N$ is much less
evident. More precisely, given an $R$-module homomorphism $f : D \to N$ the question
is whether there exists an $R$-module homomorphism $F : D \to M$ that extends or lifts
$f$ to $M$, i.e., that makes the following diagram commute:

[Commutative diagram: $f : D \to N$, $\varphi : M \to N$, and a lift $F : D \to M$ with $\varphi \circ F = f$.]

As before, composition with the homomorphism $\varphi$ induces a homomorphism of abelian
groups

\[ \varphi' : \operatorname{Hom}_R(D, M) \to \operatorname{Hom}_R(D, N), \qquad F \mapsto F' = \varphi \circ F. \]

In terms of $\varphi'$, the homomorphism $f$ to $N$ lifts to a homomorphism to $M$ if and only if
$f$ is in the image of $\varphi'$ (namely, $f$ is the image of the lift $F$).
In general it may not be possible to lift a homomorphism $f$ from $D$ to $N$ to a
homomorphism from $D$ to $M$. For example, consider the nonsplit exact sequence
$0 \to \mathbb{Z} \xrightarrow{2} \mathbb{Z} \xrightarrow{\pi} \mathbb{Z}/2\mathbb{Z} \to 0$ from the previous
set of examples. Let $D = \mathbb{Z}/2\mathbb{Z}$ and let $f$ be the identity map from $D$ into $N$.
Any homomorphism $F$ of $D$ into $M = \mathbb{Z}$ must map $D$ to 0 (since $\mathbb{Z}$ has no
elements of order 2), hence $\pi \circ F$ maps $D$ to 0 in $N$, and in particular,
$\pi \circ F \neq f$. Phrased in terms of the map $\varphi'$, this shows that

if $M \xrightarrow{\varphi} N \to 0$ is exact,

then $\operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N) \to 0$ is not necessarily exact.

These results relating the homomorphisms into L and N to the homomorphisms 
into M can be neatly summarized as part of the following theorem. 

Theorem 28. Let $D$, $L$, $M$, and $N$ be $R$-modules. If

$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is exact,
then the associated sequence

\[ 0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N) \text{ is exact.} \tag{10.10} \]

A homomorphism $f : D \to N$ lifts to a homomorphism $F : D \to M$ if and only if
$f \in \operatorname{Hom}_R(D, N)$ is in the image of $\varphi'$. In general
$\varphi' : \operatorname{Hom}_R(D, M) \to \operatorname{Hom}_R(D, N)$ need not be surjective; the map
$\varphi'$ is surjective if and only if every homomorphism from $D$ to $N$ lifts to a homomorphism
from $D$ to $M$, in which case the sequence (10) can be extended to a short exact sequence.

The sequence (10) is exact for all $R$-modules $D$ if and only if the sequence

$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N$ is exact.


Proof: The only item in the first statement that has not already been proved is the
exactness of (10) at $\operatorname{Hom}_R(D, M)$, i.e., $\ker \varphi' = \operatorname{image} \psi'$.
Suppose $F : D \to M$ is an element of $\operatorname{Hom}_R(D, M)$ lying in the kernel of $\varphi'$,
i.e., with $\varphi \circ F = 0$ as homomorphisms from $D$ to $N$. If $d \in D$ is any element of
$D$, this implies that $\varphi(F(d)) = 0$ and $F(d) \in \ker \varphi$. By the exactness of the
sequence defining the extension $M$ we have $\ker \varphi = \operatorname{image} \psi$, so there is
some element $l \in L$ with $F(d) = \psi(l)$. Since $\psi$ is injective, the element $l$ is unique,
so this gives a well defined map $F' : D \to L$ given by $F'(d) = l$. It is an easy check to
verify that $F'$ is a homomorphism, i.e., $F' \in \operatorname{Hom}_R(D, L)$. Since
$\psi \circ F'(d) = \psi(l) = F(d)$, we have $F = \psi'(F')$ which shows that $F$ is in the image
of $\psi'$, proving that $\ker \varphi' \subseteq \operatorname{image} \psi'$. Conversely, if $F$ is
in the image of $\psi'$ then $F = \psi'(F')$ for some $F' \in \operatorname{Hom}_R(D, L)$ and so
$\varphi(F(d)) = \varphi(\psi(F'(d)))$ for any $d \in D$. Since $\ker \varphi = \operatorname{image} \psi$
we have $\varphi \circ \psi = 0$, and it follows that $\varphi(F(d)) = 0$ for any $d \in D$, i.e.,
$\varphi'(F) = 0$. Hence $F$ is in the kernel of $\varphi'$, proving the reverse containment:
$\operatorname{image} \psi' \subseteq \ker \varphi'$.

For the last statement in the theorem, note first that the surjectivity of $\varphi$ was not
required for the proof that (10) is exact, so the “if” portion of the statement has already
been proved. For the converse, suppose that the sequence (10) is exact for all $R$-modules
$D$. In general, $\operatorname{Hom}_R(R, X) \cong X$ for any left $R$-module $X$, the isomorphism being
given by mapping a homomorphism to its value on the element $1 \in R$ (cf. Exercise 10(b)).
Taking $D = R$ in (10), the exactness of the sequence
$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N$ follows easily.
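
Theorem 28 can be sanity-checked on a small finite example. The sketch below is illustrative only and rests on assumptions not made in the text: it uses the exact sequence $0 \to \mathbb{Z}/2\mathbb{Z} \xrightarrow{\psi} \mathbb{Z}/4\mathbb{Z} \xrightarrow{\varphi} \mathbb{Z}/2\mathbb{Z} \to 0$ with $\psi(x) = 2x$ and $\varphi$ reduction modulo 2, takes $D = \mathbb{Z}/d\mathbb{Z}$, and records a homomorphism out of $D$ by the image of 1. The names `homs` and `check` are hypothetical.

```python
def homs(d, n):
    # homomorphisms Z/dZ -> Z/nZ, each recorded by the image of 1
    return [x for x in range(n) if (d * x) % n == 0]

def check(d):
    hom_L = homs(d, 2)   # Hom(D, L) with L = Z/2Z
    hom_M = homs(d, 4)   # Hom(D, M) with M = Z/4Z
    hom_N = homs(d, 2)   # Hom(D, N) with N = Z/2Z
    psi_prime = {g: (2 * g) % 4 for g in hom_L}   # g |-> psi o g
    phi_prime = {F: F % 2 for F in hom_M}         # F |-> phi o F
    injective = len(set(psi_prime.values())) == len(hom_L)
    image_psi = set(psi_prime.values())
    kernel_phi = {F for F, value in phi_prime.items() if value == 0}
    surjective = set(phi_prime.values()) == set(hom_N)
    # (psi' injective, exact at Hom(D, M), phi' surjective)
    return injective, image_psi == kernel_phi, surjective

print(check(2))  # (True, True, False)
print(check(4))  # (True, True, True)
```

For $D = \mathbb{Z}/2\mathbb{Z}$ the Hom sequence is exact on the left and in the middle but $\varphi'$ is not surjective, while for $D = \mathbb{Z}/4\mathbb{Z}$ it happens to be surjective as well.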

By Theorem 28, the sequence

\[ 0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N) \to 0 \tag{10.11} \]

is in general not a short exact sequence since the homomorphism $\varphi'$ need not be
surjective. The question of whether this sequence is exact precisely measures the extent
to which the homomorphisms from $D$ into $M$ are uniquely determined by pairs of
homomorphisms from $D$ into $L$ and $D$ into $N$. More precisely, this sequence is exact if
and only if there is a bijection $F \leftrightarrow (g, f)$ between homomorphisms $F : D \to M$ and
pairs of homomorphisms $g : D \to L$ and $f : D \to N$ given by $F|_{\psi(L)} = \psi'(g)$ and
$f = \varphi'(F)$.

One situation in which the sequence (11) is exact occurs when the original sequence
$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a split exact sequence, i.e., when
$M = L \oplus N$. In this case the sequence (11) is also a split exact sequence, as the first part
of the following proposition shows.

Proposition 29. Let $D$, $L$ and $N$ be $R$-modules. Then

(1) $\operatorname{Hom}_R(D, L \oplus N) \cong \operatorname{Hom}_R(D, L) \oplus \operatorname{Hom}_R(D, N)$, and

(2) $\operatorname{Hom}_R(L \oplus N, D) \cong \operatorname{Hom}_R(L, D) \oplus \operatorname{Hom}_R(N, D)$.

Proof: Let $\pi_1 : L \oplus N \to L$ be the natural projection from $L \oplus N$ to $L$ and similarly
let $\pi_2$ be the natural projection to $N$. If $f \in \operatorname{Hom}_R(D, L \oplus N)$ then the
compositions $\pi_1 \circ f$ and $\pi_2 \circ f$ give elements in $\operatorname{Hom}_R(D, L)$ and
$\operatorname{Hom}_R(D, N)$, respectively. This defines a map from $\operatorname{Hom}_R(D, L \oplus N)$ to
$\operatorname{Hom}_R(D, L) \oplus \operatorname{Hom}_R(D, N)$ which is easily seen to be a homomorphism.
Conversely, given $f_1 \in \operatorname{Hom}_R(D, L)$ and $f_2 \in \operatorname{Hom}_R(D, N)$, define the
map $f \in \operatorname{Hom}_R(D, L \oplus N)$ by $f(d) = (f_1(d), f_2(d))$. This defines a map from
$\operatorname{Hom}_R(D, L) \oplus \operatorname{Hom}_R(D, N)$ to $\operatorname{Hom}_R(D, L \oplus N)$ that is
easily checked to be a homomorphism inverse to the map above, proving the isomorphism in
(1). The proof of (2) is similar and is left as an exercise.
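
Part (1) can be made concrete for finite cyclic groups. The following sketch is illustrative and assumes $D = \mathbb{Z}/d\mathbb{Z}$, $L = \mathbb{Z}/l\mathbb{Z}$ and $N = \mathbb{Z}/n\mathbb{Z}$, with each homomorphism out of $D$ recorded by the image of 1; the function names are hypothetical.

```python
from math import gcd

def homs(d, n):
    # homomorphisms Z/dZ -> Z/nZ, each recorded by the image of 1
    return [x for x in range(n) if (d * x) % n == 0]

def homs_into_sum(d, l, n):
    # homomorphisms Z/dZ -> Z/lZ (+) Z/nZ, recorded by the image (x, y) of 1;
    # the image must be killed by d in each coordinate
    return [(x, y) for x in range(l) for y in range(n)
            if (d * x) % l == 0 and (d * y) % n == 0]

def split(f):
    # f |-> (pi_1 o f, pi_2 o f): read off the two coordinates of f(1)
    return f[0], f[1]

def prop29_part1_holds(d, l, n):
    pairs = {(f1, f2) for f1 in homs(d, l) for f2 in homs(d, n)}
    images = [split(f) for f in homs_into_sum(d, l, n)]
    # the map f |-> (pi_1 o f, pi_2 o f) should be a bijection onto all pairs
    return len(set(images)) == len(images) and set(images) == pairs

print(prop29_part1_holds(4, 6, 8), len(homs_into_sum(4, 6, 8)))  # True 8
```

The count 8 agrees with $\gcd(4, 6) \cdot \gcd(4, 8)$, the product of the orders of the two Hom groups on the right hand side.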

The results in Proposition 29 extend immediately by induction to any finite direct
sum of $R$-modules. These results are referred to by saying that Hom commutes with
finite direct sums in either variable (compare to Theorem 17 for a corresponding result
for tensor products). For infinite direct sums the situation is more complicated. Part
(1) remains true if the direct sums on both sides are replaced by direct products. Part
(2) remains true if $L \oplus N$ is replaced by an arbitrary direct sum and the direct sum on
the right hand side is replaced by a direct product (Exercise 13 shows that the direct
product is necessary).

This proposition shows that if the sequence

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

is a split short exact sequence of $R$-modules, then

\[ 0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N) \to 0 \]

is also a split short exact sequence of abelian groups for every $R$-module $D$. Exercise 14
shows that a converse holds: if
$0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N) \to 0$
is exact for every $R$-module $D$ then $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is
a split short exact sequence (which then implies that if the original Hom sequence is
exact for every $D$, then in fact it is split exact for every $D$).

Proposition 29 identifies a situation in which the sequence (11) is exact in terms 
of the modules L, M, and N. The next result adopts a slightly different perspective, 
characterizing instead the modules D having the property that the sequence (10) in 
Theorem 28 can always be extended to a short exact sequence: 


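Proposition 30. Let $P$ be an $R$-module. Then the following are equivalent:

(1) For any $R$-modules $L$, $M$, and $N$, if

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

is a short exact sequence, then

\[ 0 \to \operatorname{Hom}_R(P, L) \xrightarrow{\psi'} \operatorname{Hom}_R(P, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(P, N) \to 0 \]

is also a short exact sequence.

(2) For any $R$-modules $M$ and $N$, if $M \xrightarrow{\varphi} N \to 0$ is exact, then every $R$-module
homomorphism from $P$ into $N$ lifts to an $R$-module homomorphism into $M$,
i.e., given $f \in \operatorname{Hom}_R(P, N)$ there is a lift $F \in \operatorname{Hom}_R(P, M)$ making the
following diagram commute:

[Commutative diagram: $f : P \to N$, the exact row $M \xrightarrow{\varphi} N \to 0$, and a lift $F : P \to M$ with $\varphi \circ F = f$.]

(3) If $P$ is a quotient of the $R$-module $M$ then $P$ is isomorphic to a direct summand
of $M$, i.e., every short exact sequence $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} P \to 0$ splits.

(4) $P$ is a direct summand of a free $R$-module.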


Proof: The equivalence of (1) and (2) is a restatement of a result in Theorem 28.
Suppose now that (2) is satisfied, and let $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} P \to 0$ be
exact. By (2), the identity map from $P$ to $P$ lifts to a homomorphism $\mu$ making the following
diagram commute:

[Commutative diagram: the identity map $\mathrm{id} : P \to P$, the exact row $M \xrightarrow{\varphi} P \to 0$, and the lift $\mu : P \to M$.]

Then $\varphi \circ \mu = 1$, so $\mu$ is a splitting homomorphism for the sequence, which proves (3).
Every module $P$ is the quotient of a free module (for example, the free module on the
set of elements in $P$), so there is always an exact sequence
$0 \to \ker \varphi \to \mathcal{F} \xrightarrow{\varphi} P \to 0$
where $\mathcal{F}$ is a free $R$-module (cf. Example 4 following Corollary 23). If (3) is satisfied,
then this sequence splits, so $\mathcal{F}$ is isomorphic to the direct sum of $\ker \varphi$ and $P$,
which proves (4).

Finally, to prove (4) implies (2), suppose that $P$ is a direct summand of a free
$R$-module on some set $S$, say $\mathcal{F}(S) = P \oplus K$, and that we are given a homomorphism $f$
from $P$ to $N$ as in (2). Let $\pi$ denote the natural projection from $\mathcal{F}(S)$ to $P$, so that
$f \circ \pi$ is a homomorphism from $\mathcal{F}(S)$ to $N$. For any $s \in S$ define
$n_s = f \circ \pi(s) \in N$ and let $m_s \in M$ be any element of $M$ with $\varphi(m_s) = n_s$ (which
exists because $\varphi$ is surjective). By the universal property for free modules (Theorem 6 of
Section 3), there is a unique $R$-module homomorphism $F'$ from $\mathcal{F}(S)$ to $M$ with
$F'(s) = m_s$. The diagram is the following:

[Commutative diagram: $\pi : \mathcal{F}(S) = P \oplus K \to P$, $f : P \to N$, the exact row $M \xrightarrow{\varphi} N \to 0$, and $F' : \mathcal{F}(S) \to M$.]

By definition of the homomorphism $F'$ we have
$\varphi \circ F'(s) = \varphi(m_s) = n_s = f \circ \pi(s)$,
from which it follows that $\varphi \circ F' = f \circ \pi$ on $\mathcal{F}(S)$, i.e., the diagram above is
commutative. Now define a map $F : P \to M$ by $F(d) = F'((d, 0))$. Since $F$ is the
composite of the injection $P \to \mathcal{F}(S)$ with the homomorphism $F'$, it follows that $F$ is
an $R$-module homomorphism. Then

\[ \varphi \circ F(d) = \varphi \circ F'((d, 0)) = f \circ \pi((d, 0)) = f(d) \]

i.e., $\varphi \circ F = f$, so the diagram

[Commutative diagram: $f : P \to N$, the exact row $M \xrightarrow{\varphi} N \to 0$, and the lift $F : P \to M$.]

commutes, which proves that (4) implies (2) and completes the proof.


Definition. An $R$-module $P$ is called projective if it satisfies any of the equivalent
conditions of Proposition 30.


The third statement in Proposition 30 can be rephrased as saying that any module 
M that projects onto P has (an isomorphic copy of) P as a direct summand, which 
explains the terminology. 

The following result is immediate from Proposition 30 (and its proof): 

Corollary 31. Free modules are projective. A finitely generated module is projective 
if and only if it is a direct summand of a finitely generated free module. Every module 
is a quotient of a projective module. 
If $D$ is fixed, then given any $R$-module $X$ we have an associated abelian group
$\operatorname{Hom}_R(D, X)$. Further, an $R$-module homomorphism $\alpha : X \to Y$ induces an abelian
group homomorphism $\alpha' : \operatorname{Hom}_R(D, X) \to \operatorname{Hom}_R(D, Y)$, defined by
$\alpha'(f) = \alpha \circ f$. Put another way, the map $\operatorname{Hom}_R(D, \_)$ is a covariant functor
from the category of $R$-modules to the category of abelian groups (cf. Appendix II).
Theorem 28 shows that applying this functor to the terms in the exact sequence

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

produces an exact sequence

\[ 0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\psi'} \operatorname{Hom}_R(D, M) \xrightarrow{\varphi'} \operatorname{Hom}_R(D, N). \]

This is referred to by saying that $\operatorname{Hom}_R(D, \_)$ is a left exact functor. By Proposition
30, the functor $\operatorname{Hom}_R(D, \_)$ is exact, i.e., always takes short exact sequences to short
exact sequences, if and only if $D$ is projective. We summarize this as

Corollary 32. If $D$ is an $R$-module, then the functor $\operatorname{Hom}_R(D, \_)$ from the category
of $R$-modules to the category of abelian groups is left exact. It is exact if and only if $D$
is a projective $R$-module.

Note that if $\operatorname{Hom}_R(D, \_)$ takes short exact sequences to short exact sequences,
then it takes exact sequences of any length to exact sequences since any exact sequence
can be broken up into a succession of short exact sequences.

As we have seen, the functor $\operatorname{Hom}_R(D, \_)$ is in general not exact on the right.
Measuring the extent to which functors such as $\operatorname{Hom}_R(D, \_)$ fail to be exact leads to
the notions of “homological algebra,” considered in Chapter 17.

Examples 

(1) We shall see in Section 11.1 that if $R = F$ is a field then every $F$-module is projective
(although we only prove this for finitely generated modules).

(2) By Corollary 31, $\mathbb{Z}$ is a projective $\mathbb{Z}$-module. This can be seen directly as follows:
suppose $f$ is a map from $\mathbb{Z}$ to $N$ and $M \xrightarrow{\varphi} N \to 0$ is exact. The homomorphism
$f$ is uniquely determined by the value $n = f(1)$. Then $f$ can be lifted to a homomorphism
$F : \mathbb{Z} \to M$ by first defining $F(1) = m$, where $m$ is any element in $M$ mapped to $n$
by $\varphi$, and then extending $F$ to all of $\mathbb{Z}$ by additivity.

By the first statement in Proposition 30, since $\mathbb{Z}$ is projective, if

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

is an exact sequence of $\mathbb{Z}$-modules, then

\[ 0 \to \operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}, L) \xrightarrow{\psi'} \operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}, M) \xrightarrow{\varphi'} \operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}, N) \to 0 \]

is also an exact sequence. This can also be seen directly using the isomorphism
$\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}, M) \cong M$ of abelian groups, which shows that the two exact sequences
above are essentially the same.

(3) Free $\mathbb{Z}$-modules have no nonzero elements of finite order so no nonzero finite abelian
group can be isomorphic to a submodule of a free module. By Corollary 31 it follows
that no nonzero finite abelian group is a projective $\mathbb{Z}$-module.

(4) As a particular case of the preceding example, we see that for $n \geq 2$ the $\mathbb{Z}$-module
$\mathbb{Z}/n\mathbb{Z}$ is not projective. By Theorem 28 it must be possible to find a short exact
sequence which after applying the functor $\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \_)$ is no longer exact on the
right. One such sequence is the exact sequence of Example 2 following Corollary 23:

\[ 0 \to \mathbb{Z} \xrightarrow{n} \mathbb{Z} \xrightarrow{\pi} \mathbb{Z}/n\mathbb{Z} \to 0, \]

for $n \geq 2$. Note first that $\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \mathbb{Z}) = 0$ since there are no nonzero
$\mathbb{Z}$-module homomorphisms from $\mathbb{Z}/n\mathbb{Z}$ to $\mathbb{Z}$. It is also easy to see that
$\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \mathbb{Z}/n\mathbb{Z}) \cong \mathbb{Z}/n\mathbb{Z}$, as follows. Every homomorphism $f$ is
uniquely determined by $f(1) = a \in \mathbb{Z}/n\mathbb{Z}$, and given any $a \in \mathbb{Z}/n\mathbb{Z}$ there is a
unique homomorphism $f_a$ with $f_a(1) = a$; the map $f_a \mapsto a$ is easily checked to be an
isomorphism from $\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \mathbb{Z}/n\mathbb{Z})$ to $\mathbb{Z}/n\mathbb{Z}$.

Applying $\operatorname{Hom}_{\mathbb{Z}}(\mathbb{Z}/n\mathbb{Z}, \_)$ to the short exact sequence above thus gives the
sequence

\[ 0 \to 0 \xrightarrow{n'} 0 \xrightarrow{\pi'} \mathbb{Z}/n\mathbb{Z} \to 0 \]

which is not exact at its only nonzero term.

(5) Since $\mathbb{Q}/\mathbb{Z}$ is a torsion $\mathbb{Z}$-module it is not a submodule of a free $\mathbb{Z}$-module, hence is
not projective. Note also that the exact sequence $0 \to \mathbb{Z} \to \mathbb{Q} \to \mathbb{Q}/\mathbb{Z} \to 0$ does not
split since $\mathbb{Q}$ contains no submodule isomorphic to $\mathbb{Q}/\mathbb{Z}$.

(6) The $\mathbb{Z}$-module $\mathbb{Q}$ is not projective (cf. the exercises).

(7) We shall see in Chapter 12 that a finitely generated $\mathbb{Z}$-module is projective if and only
if it is free.

(8) Let $R$ be the commutative ring $\mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$ under componentwise addition and
multiplication. If $P_1$ and $P_2$ are the principal ideals generated by $(1, 0)$ and $(0, 1)$
respectively then $R = P_1 \oplus P_2$, hence both $P_1$ and $P_2$ are projective $R$-modules by
Proposition 30. Neither $P_1$ nor $P_2$ is free, since any free module has order a multiple
of four.

(9) The direct sum of two projective modules is again projective (cf. Exercise 3).

(10) We shall see in Part VI that if $F$ is any field and $n \in \mathbb{Z}^+$ then the ring $R = M_n(F)$ of all
$n \times n$ matrices with entries from $F$ has the property that every $R$-module is projective.
We shall also see that if $G$ is a finite group of order $n$ and $n \neq 0$ in the field $F$ then the
group ring $FG$ also has the property that every module is projective.
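
Example (8) above is small enough to verify by direct enumeration. The sketch below is illustrative only; elements of $R = \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2\mathbb{Z}$ are encoded as pairs, and the helper names are assumptions rather than notation from the text.

```python
from itertools import product

R = list(product(range(2), repeat=2))   # the four elements of Z/2Z x Z/2Z

def add(x, y):
    return ((x[0] + y[0]) % 2, (x[1] + y[1]) % 2)

def mul(x, y):
    return ((x[0] * y[0]) % 2, (x[1] * y[1]) % 2)

def principal_ideal(g):
    # all multiples r*g of the generator g
    return sorted({mul(r, g) for r in R})

P1 = principal_ideal((1, 0))
P2 = principal_ideal((0, 1))

# R = P1 (+) P2: every element of R is p + q for exactly one pair (p, q)
decompositions = {r: [(p, q) for p in P1 for q in P2 if add(p, q) == r] for r in R}
is_direct_sum = all(len(v) == 1 for v in decompositions.values())

# a free R-module of finite rank k has |R|**k = 4**k elements
free_orders = {len(R) ** k for k in range(4)}
print(P1, P2, is_direct_sum, len(P1) in free_orders)
# [(0, 0), (1, 0)] [(0, 0), (0, 1)] True False
```

Each $P_i$ has two elements, so it is a direct summand of the free module $R$ without being free itself.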


Injective Modules and $\operatorname{Hom}_R(\_, D)$

If $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a short exact sequence of $R$-modules
then, instead of considering maps from an $R$-module $D$ into $L$ or $N$ and the extent to which
these determine maps from $D$ into $M$, we can consider the “dual” question of maps from
$L$ or $N$ to $D$. In this case, it is easy to dispose of the situation of a map from $N$ to
$D$: an $R$-module map from $N$ to $D$ immediately gives a map from $M$ to $D$ simply by
composing with $\varphi$. It is easy to check that this defines an injective homomorphism of
abelian groups

\[ \varphi' : \operatorname{Hom}_R(N, D) \to \operatorname{Hom}_R(M, D), \qquad f \mapsto f' = f \circ \varphi, \]

or, put another way,

if $M \xrightarrow{\varphi} N \to 0$ is exact,

then $0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D)$ is exact.

(Note that the associated maps on the homomorphism groups are in the reverse direction
from the original maps.)

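On the other hand, given an $R$-module homomorphism $f$ from $L$ to $D$ it may not
be possible to extend $f$ to a map $F$ from $M$ to $D$, i.e., given $f$ it may not be possible
to find a map $F$ making the following diagram commute:

[Commutative diagram: $\psi : L \to M$, $f : L \to D$, and an extension $F : M \to D$ with $F \circ \psi = f$.]

For example, consider the exact sequence
$0 \to \mathbb{Z} \xrightarrow{\psi} \mathbb{Z} \xrightarrow{\varphi} \mathbb{Z}/2\mathbb{Z} \to 0$ of
$\mathbb{Z}$-modules, where $\psi$ is multiplication by 2 and $\varphi$ is the natural projection. Take
$D = \mathbb{Z}/2\mathbb{Z}$ and let $f : \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ be reduction modulo 2 on the first
$\mathbb{Z}$ in the sequence. There is only one nonzero homomorphism $F$ from the second
$\mathbb{Z}$ in the sequence to $\mathbb{Z}/2\mathbb{Z}$ (namely, reduction modulo 2), but this $F$ does not
lift the map $f$ since $F \circ \psi(\mathbb{Z}) = F(2\mathbb{Z}) = 0$, so $F \circ \psi \neq f$.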
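
The failure in this example can be seen numerically. In the sketch below (an illustration with hypothetical names, not part of the text) a homomorphism $\mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}$ is determined by the image $c$ of 1, so there are only two candidates for $F$; comparing $F \circ \psi$ with $f$ on a small window of integers is enough to exhibit a disagreement.

```python
def hom_from_Z(c):
    # the homomorphism Z -> Z/2Z determined by 1 |-> c
    return lambda k: (c * k) % 2

def psi(k):
    # multiplication by 2 on Z
    return 2 * k

f = hom_from_Z(1)                                   # reduction modulo 2
candidates = {c: hom_from_Z(c) for c in (0, 1)}     # all homomorphisms Z -> Z/2Z
window = range(-5, 6)
extends = {c: all(F(psi(k)) == f(k) for k in window) for c, F in candidates.items()}
print(extends)  # {0: False, 1: False}
```

Both candidates already fail at $k = 1$, where $F(\psi(1)) = F(2) = 0$ but $f(1) = 1$.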

Composition with $\psi$ induces an abelian group homomorphism $\psi'$ from
$\operatorname{Hom}_R(M, D)$ to $\operatorname{Hom}_R(L, D)$, and in terms of the map $\psi'$, the homomorphism
$f \in \operatorname{Hom}_R(L, D)$ can be lifted to a homomorphism from $M$ to $D$ if and only if $f$ is
in the image of $\psi'$. The example above shows that

if $0 \to L \xrightarrow{\psi} M$ is exact,

then $\operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D) \to 0$ is not necessarily exact.

We can summarize these results in the following dual version of Theorem 28:


Theorem 33. Let $D$, $L$, $M$, and $N$ be $R$-modules. If

$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is exact,
then the associated sequence

\[ 0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D) \text{ is exact.} \tag{10.12} \]

A homomorphism $f : L \to D$ lifts to a homomorphism $F : M \to D$ if and only if
$f \in \operatorname{Hom}_R(L, D)$ is in the image of $\psi'$. In general
$\psi' : \operatorname{Hom}_R(M, D) \to \operatorname{Hom}_R(L, D)$ need not be surjective; the map $\psi'$ is
surjective if and only if every homomorphism from $L$ to $D$ lifts to a homomorphism from $M$
to $D$, in which case the sequence (12) can be extended to a short exact sequence.

The sequence (12) is exact for all $R$-modules $D$ if and only if the sequence

$L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is exact.
Proof: The only item remaining to be proved in the first statement is the exactness
of (12) at $\operatorname{Hom}_R(M, D)$. The proof of this statement is very similar to the proof of
the corresponding result in Theorem 28 and is left as an exercise. Note also that the
injectivity of $\psi$ is not required, which proves the “if” portion of the final statement of
the theorem.

Suppose now that the sequence (12) is exact for all $R$-modules $D$. We first show
that $\varphi : M \to N$ is a surjection. Take $D = N/\varphi(M)$. If $\pi_1 : N \to N/\varphi(M)$ is
the natural projection homomorphism, then $\pi_1 \circ \varphi(M) = 0$ by definition of $\pi_1$. Since
$\pi_1 \circ \varphi = \varphi'(\pi_1)$ this means that the element
$\pi_1 \in \operatorname{Hom}_R(N, N/\varphi(M))$ is mapped to 0 by $\varphi'$. Since $\varphi'$ is assumed to be
injective for all modules $D$, this means $\pi_1$ is the zero map, i.e., $N = \varphi(M)$ and so
$\varphi$ is a surjection. We next show that $\varphi \circ \psi = 0$, which will imply that
$\operatorname{image} \psi \subseteq \ker \varphi$. For this we take $D = N$ and observe that the identity
map $\mathrm{id}_N$ on $N$ is contained in $\operatorname{Hom}_R(N, N)$, hence
$\varphi'(\mathrm{id}_N) \in \operatorname{Hom}_R(M, N)$. Then the exactness of (12) for $D = N$ implies that
$\varphi'(\mathrm{id}_N) \in \ker \psi'$, so $\psi'(\varphi'(\mathrm{id}_N)) = 0$. Then
$\mathrm{id}_N \circ \varphi \circ \psi = 0$, i.e., $\varphi \circ \psi = 0$, as claimed. Finally, we show
that $\ker \varphi \subseteq \operatorname{image} \psi$. Let $D = M/\psi(L)$ and let
$\pi_2 : M \to M/\psi(L)$ be the natural projection. Then $\psi'(\pi_2) = 0$ since
$\pi_2(\psi(L)) = 0$ by definition of $\pi_2$. The exactness of (12) for this $D$ then implies that
$\pi_2$ is in the image of $\varphi'$, say $\pi_2 = \varphi'(f)$ for some homomorphism
$f \in \operatorname{Hom}_R(N, M/\psi(L))$, i.e., $\pi_2 = f \circ \varphi$. If $m \in \ker \varphi$ then
$\pi_2(m) = f(\varphi(m)) = 0$, which means that $m \in \psi(L)$ since $\pi_2$ is just the projection
from $M$ into the quotient $M/\psi(L)$. Hence $\ker \varphi \subseteq \operatorname{image} \psi$,
completing the proof.

By Theorem 33, the sequence

\[ 0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D) \to 0 \]

is in general not a short exact sequence since $\psi'$ need not be surjective, and the question
of whether this sequence is exact precisely measures the extent to which homomorphisms
from $M$ to $D$ are uniquely determined by pairs of homomorphisms from $L$ and $N$ to $D$.

The second statement in Proposition 29 shows that this sequence is exact when the
original exact sequence $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a split exact
sequence. In fact in this case the sequence
$0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D) \to 0$
is also a split exact sequence of abelian groups for every $R$-module $D$. Exercise 14 shows
that a converse holds: if
$0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D) \to 0$
is exact for every $R$-module $D$ then $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a split
short exact sequence (which then implies that if the Hom sequence is exact for every $D$, then
in fact it is split exact for every $D$).

There is also a dual version of the first three parts of Proposition 30, which describes 
the R -modules D having the property that the sequence (12) in Theorem 33 can always 
be extended to a short exact sequence: 

Proposition 34. Let $Q$ be an $R$-module. Then the following are equivalent:

(1) For any $R$-modules $L$, $M$, and $N$, if

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

is a short exact sequence, then

\[ 0 \to \operatorname{Hom}_R(N, Q) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, Q) \xrightarrow{\psi'} \operatorname{Hom}_R(L, Q) \to 0 \]

is also a short exact sequence.

(2) For any $R$-modules $L$ and $M$, if $0 \to L \xrightarrow{\psi} M$ is exact, then every $R$-module
homomorphism from $L$ into $Q$ lifts to an $R$-module homomorphism of $M$ into
$Q$, i.e., given $f \in \operatorname{Hom}_R(L, Q)$ there is a lift $F \in \operatorname{Hom}_R(M, Q)$ making the
following diagram commute:

[Commutative diagram: the exact row $0 \to L \xrightarrow{\psi} M$, $f : L \to Q$, and a lift $F : M \to Q$ with $F \circ \psi = f$.]

(3) If $Q$ is a submodule of the $R$-module $M$ then $Q$ is a direct summand of $M$, i.e.,
every short exact sequence $0 \to Q \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ splits.

Proof: The equivalence of (1) and (2) is part of Theorem 33. Suppose now that (2)
is satisfied and let $0 \to Q \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ be exact. Taking $L = Q$ and
$f$ the identity map from $Q$ to itself, it follows by (2) that there is a homomorphism
$F : M \to Q$ with $F \circ \psi = 1$, so $F$ is a splitting homomorphism for the sequence, which
proves (3). The proof that (3) implies (2) is outlined in the exercises.

Definition. An R -module Q is called injective if it satisfies any of the equivalent 
conditions of Proposition 34. 

The third statement in Proposition 34 can be rephrased as saying that any module 
M into which Q injects has (an isomorphic copy of) Q as a direct summand, which 
explains the terminology. 

If $D$ is fixed, then given any $R$-module $X$ we have an associated abelian group
$\operatorname{Hom}_R(X, D)$. Further, an $R$-module homomorphism $\alpha : X \to Y$ induces an abelian
group homomorphism $\alpha' : \operatorname{Hom}_R(Y, D) \to \operatorname{Hom}_R(X, D)$, defined by
$\alpha'(f) = f \circ \alpha$, that “reverses” the direction of the arrow. Put another way, the map
$\operatorname{Hom}_R(\_, D)$ is a contravariant functor from the category of $R$-modules to the category
of abelian groups (cf. Appendix II). Theorem 33 shows that applying this functor to the terms
in the exact sequence

\[ 0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0 \]

produces an exact sequence

\[ 0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\varphi'} \operatorname{Hom}_R(M, D) \xrightarrow{\psi'} \operatorname{Hom}_R(L, D). \]

This is referred to by saying that $\operatorname{Hom}_R(\_, D)$ is a left exact (contravariant) functor.
Note that the functor $\operatorname{Hom}_R(\_, D)$ and the functor $\operatorname{Hom}_R(D, \_)$ considered earlier
are both left exact; the former reverses the directions of the maps in the original short
exact sequence, the latter maintains the directions of the maps.

By Proposition 34, the functor $\operatorname{Hom}_R(\_, D)$ is exact, i.e., always takes short exact
sequences to short exact sequences (and hence exact sequences of any length to exact
sequences), if and only if $D$ is injective. We summarize this in the following proposition,
which is dual to the covariant result of Corollary 32.

Corollary 35. If $D$ is an $R$-module, then the functor $\operatorname{Hom}_R(\_, D)$ from the category
of $R$-modules to the category of abelian groups is left exact. It is exact if and only if $D$
is an injective $R$-module.

We have seen that an $R$-module is projective if and only if it is a direct summand
of a free $R$-module. Providing such a simple characterization of injective $R$-modules
is not so easy. The next result gives a criterion for $Q$ to be an injective $R$-module (a
result due to Baer, who introduced the notion of injective modules around 1940), and
using it we can give a characterization of injective modules when $R = \mathbb{Z}$ (or, more
generally, when $R$ is a P.I.D.). Recall that a $\mathbb{Z}$-module $A$ (i.e., an abelian group, written
additively) is said to be divisible if $A = nA$ for all nonzero integers $n$. For example,
both $\mathbb{Q}$ and $\mathbb{Q}/\mathbb{Z}$ are divisible (cf. Exercises 18 and 19 in Section 2.4 and Exercise 15
in Section 3.1).

Proposition 36. Let $Q$ be an $R$-module.

(1) (Baer's Criterion) The module $Q$ is injective if and only if for every left ideal $I$
of $R$ any $R$-module homomorphism $g : I \to Q$ can be extended to an $R$-module
homomorphism $G : R \to Q$.

(2) If $R$ is a P.I.D. then $Q$ is injective if and only if $rQ = Q$ for every nonzero
$r \in R$. In particular, a $\mathbb{Z}$-module is injective if and only if it is divisible. When
$R$ is a P.I.D., quotient modules of injective $R$-modules are again injective.

Proof: If $Q$ is injective and $g : I \to Q$ is an $R$-module homomorphism from the
nonzero ideal $I$ of $R$ into $Q$, then $g$ can be extended to an $R$-module homomorphism
from $R$ into $Q$ by Proposition 34(2) applied to the exact sequence $0 \to I \to R$, which
proves the “only if” portion of (1). Suppose conversely that every homomorphism
$g : I \to Q$ can be lifted to a homomorphism $G : R \to Q$. To show that $Q$ is
injective we must show that if $0 \to L \to M$ is exact and $f : L \to Q$ is an $R$-module
homomorphism then there is a lift $F : M \to Q$ extending $f$. If $\mathcal{S}$ is the
collection $(f', L')$ of lifts $f' : L' \to Q$ of $f$ to a submodule $L'$ of $M$ containing $L$,
then the ordering $(f', L') \leq (f'', L'')$ if $L' \subseteq L''$ and $f'' = f'$ on $L'$ partially orders
$\mathcal{S}$. Since $\mathcal{S} \neq \emptyset$, by Zorn's Lemma there is a maximal element $(F, M')$ in
$\mathcal{S}$. The map $F : M' \to Q$ is a lift of $f$ and it suffices to show that $M' = M$. Suppose
that there is some element $m \in M$ not contained in $M'$ and let $I = \{ r \in R \mid rm \in M' \}$.
It is easy to check that $I$ is a left ideal in $R$, and the map $g : I \to Q$ defined by
$g(x) = F(xm)$ is an $R$-module homomorphism from $I$ to $Q$. By hypothesis, there is a lift
$G : R \to Q$ of $g$. Consider the submodule $M' + Rm$ of $M$, and define the map
$F' : M' + Rm \to Q$ by $F'(m' + rm) = F(m') + G(r)$. If $m_1 + r_1 m = m_2 + r_2 m$ then
$(r_1 - r_2)m = m_2 - m_1$ shows that $r_1 - r_2 \in I$, so that

\[ G(r_1 - r_2) = g(r_1 - r_2) = F((r_1 - r_2)m) = F(m_2 - m_1), \]

and so $F(m_1) + G(r_1) = F(m_2) + G(r_2)$. Hence $F'$ is well defined and it is then
immediate that $F'$ is an $R$-module homomorphism extending $f$ to $M' + Rm$. This
contradicts the maximality of $M'$, so that $M' = M$, which completes the proof of (1).

To prove (2), suppose $R$ is a P.I.D. Any nonzero ideal $I$ of $R$ is of the form $I = (r)$
for some nonzero element $r$ of $R$. An $R$-module homomorphism $f : I \to Q$ is
completely determined by the image $f(r) = q$ in $Q$. This homomorphism can be
extended to a homomorphism $F : R \to Q$ if and only if there is an element $q'$ in $Q$
with $F(1) = q'$ satisfying $q = f(r) = F(r) = rq'$. It follows that Baer's criterion for
$Q$ is satisfied if and only if $rQ = Q$, which proves the first two statements in (2). The
final statement follows since a quotient of a module $Q$ with $rQ = Q$ for all $r \neq 0$ in $R$
has the same property.
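
For $R = \mathbb{Z}$ the condition $rQ = Q$ in part (2) is easy to probe computationally. The sketch below is illustrative, with hypothetical function names: it tests whether multiplication by $r$ is onto $\mathbb{Z}/m\mathbb{Z}$, and it carries out the extension step of the proof for $Q = \mathbb{Q}$, where a map $f : (r) \to \mathbb{Q}$ with $f(r) = q$ extends by choosing $F(1) = q' = q/r$.

```python
from fractions import Fraction

def multiplication_is_onto(r, m):
    # is x |-> r*x surjective on Z/mZ, i.e. is r(Z/mZ) = Z/mZ ?
    return {(r * x) % m for x in range(m)} == set(range(m))

def extend_over_Q(r, q):
    # Given f : (r) -> Q with f(r) = q for a nonzero integer r,
    # return q' = F(1) for the extension F : Z -> Q, so that r * q' = q.
    return Fraction(q) / r

q_prime = extend_over_Q(6, Fraction(5, 7))
print(multiplication_is_onto(5, 6),
      multiplication_is_onto(6, 6),
      6 * q_prime == Fraction(5, 7))  # True False True
```

Multiplication by 6 kills $\mathbb{Z}/6\mathbb{Z}$, so that module is not divisible and hence not injective, whereas in $\mathbb{Q}$ the required $q'$ always exists.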

Examples 

(1) Since $\mathbb{Z}$ is not divisible, $\mathbb{Z}$ is not an injective $\mathbb{Z}$-module. This also follows from the
fact that the exact sequence $0 \to \mathbb{Z} \xrightarrow{2} \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z} \to 0$ corresponding to
multiplication by 2 does not split.

(2) The rational numbers $\mathbb{Q}$ form an injective $\mathbb{Z}$-module.

(3) The quotient $\mathbb{Q}/\mathbb{Z}$ of the injective $\mathbb{Z}$-module $\mathbb{Q}$ is an injective $\mathbb{Z}$-module.

(4) It is immediate that a direct sum of divisible $\mathbb{Z}$-modules is again divisible, hence a
direct sum of injective $\mathbb{Z}$-modules is again injective. For example, $\mathbb{Q} \oplus \mathbb{Q}/\mathbb{Z}$ is an
injective $\mathbb{Z}$-module. (See also Exercise 4.)

(5) We shall see in Chapter 12 that no nonzero finitely generated $\mathbb{Z}$-module is injective.

(6) Suppose that the ring $R$ is an integral domain. An $R$-module $A$ is said to be a divisible
$R$-module if $rA = A$ for every nonzero $r \in R$. The proof of Proposition 36 shows
that in this case an injective $R$-module is divisible.

(7) We shall see in Section 11.1 that if $R = F$ is a field then every $F$-module is injective.

(8) We shall see in Part VI that if $F$ is any field and $n \in \mathbb{Z}^+$ then the ring $R = M_n(F)$
of all $n \times n$ matrices with entries from $F$ has the property that every $R$-module is
injective (and also projective). We shall also see that if $G$ is a finite group of order
$n$ and $n \neq 0$ in the field $F$ then the group ring $FG$ also has the property that every
module is injective (and also projective).

Corollary 37. Every Z-module is a submodule of an injective Z-module. 

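Proof: Let $M$ be a $\mathbb{Z}$-module and let $A$ be any set of $\mathbb{Z}$-module generators of $M$.
Let $\mathcal{F} = F(A)$ be the free $\mathbb{Z}$-module on the set $A$. Then by Theorem 6 there is a
surjective $\mathbb{Z}$-module homomorphism from $\mathcal{F}$ to $M$ and if $\mathcal{K}$ denotes the kernel of this
homomorphism then $\mathcal{K}$ is a $\mathbb{Z}$-submodule of $\mathcal{F}$ and we can identify
$M = \mathcal{F}/\mathcal{K}$. Let $\mathcal{Q}$ be the free $\mathbb{Q}$-module on the set $A$. Then $\mathcal{Q}$ is a direct
sum of a number of copies of $\mathbb{Q}$, so is a divisible, hence (by Proposition 36) injective,
$\mathbb{Z}$-module containing $\mathcal{F}$. Then $\mathcal{K}$ is also a $\mathbb{Z}$-submodule of $\mathcal{Q}$, so the quotient
$\mathcal{Q}/\mathcal{K}$ is injective, again by Proposition 36. Since
$M = \mathcal{F}/\mathcal{K} \subseteq \mathcal{Q}/\mathcal{K}$, it follows that $M$ is contained in an injective
$\mathbb{Z}$-module.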
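
For a concrete instance of this construction, take $M = \mathbb{Z}/n\mathbb{Z}$ with the single generator 1, so that the free module is $\mathbb{Z}$, the kernel is $n\mathbb{Z}$, and $M$ sits inside $\mathbb{Q}/n\mathbb{Z}$. Dividing by $n$ identifies $\mathbb{Q}/n\mathbb{Z}$ with $\mathbb{Q}/\mathbb{Z}$, and the inclusion becomes $a + n\mathbb{Z} \mapsto a/n + \mathbb{Z}$. The sketch below, with hypothetical helper names and elements of $\mathbb{Q}/\mathbb{Z}$ represented by fractions in $[0, 1)$, checks that this map is an injective homomorphism for small $n$.

```python
from fractions import Fraction

def embed(a, n):
    # a + nZ  |->  a/n + Z, returned as the representative in [0, 1)
    return Fraction(a % n, n)

def add_mod_1(x, y):
    # addition in Q/Z on representatives in [0, 1)
    return (x + y) % 1

def is_injective_hom(n):
    images = [embed(a, n) for a in range(n)]
    additive = all(embed((a + b) % n, n) == add_mod_1(embed(a, n), embed(b, n))
                   for a in range(n) for b in range(n))
    return len(set(images)) == n and additive

print(all(is_injective_hom(n) for n in range(1, 13)))  # True
```

This exhibits each finite cyclic group as a submodule of the divisible, hence injective, $\mathbb{Z}$-module $\mathbb{Q}/\mathbb{Z}$.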

Corollary 37 can be used to prove the following more general version valid for
arbitrary $R$-modules. This theorem is the injective analogue of the results in Theorem 6
and Corollary 31 showing that every $R$-module is a quotient of a projective $R$-module.
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Theorem 38. Let R be a ring with 1 and let M be an R-module. Then M is contained 
in an injective R-module. 

Proof: A proof is outlined in Exercises 15 to 17. 

It is possible to prove a sharper result than Theorem 38, namely that there is a 
minimal injective R-module H containing M in the sense that any injective map of 
M into an injective R-module Q factors through H. More precisely, if $M \subseteq Q$ for 
an injective R-module Q then there is an injection $\iota : H \hookrightarrow Q$ that restricts to the 
identity map on M; using $\iota$ to identify H as a subset of Q we have $M \subseteq H \subseteq Q$ (cf. 
Theorem 57.13 in Representation Theory of Finite Groups and Associative Algebras 
by C. Curtis and I. Reiner, John Wiley & Sons, 1966). This module H is called the 
injective hull or injective envelope of M. The universal property of the injective hull of 
M with respect to inclusions of M into injective R-modules should be compared to the 
universal property with respect to homomorphisms of M of the free module $F(A)$ on a 
set of generators A for M in Theorem 6. For example, the injective hull of $\mathbb{Z}$ is $\mathbb{Q}$, and 
the injective hull of any field is itself (cf. the exercises). 

Flat Modules and $D \otimes_R \_$ 

We now consider the behavior of extensions $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ of 
R-modules with respect to tensor products. 

Suppose that D is a right R-module. For any homomorphism $f : X \to Y$ of left 
R-modules we obtain a homomorphism $1 \otimes f : D \otimes_R X \to D \otimes_R Y$ of abelian groups 
(Theorem 13). If in addition D is an (S, R)-bimodule (for example, when S = R is 
commutative and D is given the standard (R, R)-bimodule structure as in Section 4), 
then $1 \otimes f$ is a homomorphism of left S-modules. Put another way, 

$$D \otimes_R \_ \;:\; X \longrightarrow D \otimes_R X$$

is a covariant functor from the category of left R-modules to the category of abelian 
groups (respectively, to the category of left S-modules when D is an (S, R)-bimodule), 
cf. Appendix II. In a similar way, if D is a left R-module then $\_ \otimes_R D$ is a covariant 
functor from the category of right R-modules to the category of abelian groups 
(respectively, to the category of right S-modules when D is an (R, S)-bimodule). Note that, 
unlike Hom, the tensor product is covariant in both variables, and we shall therefore 
concentrate on $D \otimes_R \_$, leaving as an exercise the minor alterations necessary for 
$\_ \otimes_R D$. 

We have already seen examples where the map $1 \otimes \psi : D \otimes_R L \to D \otimes_R M$ 
induced by an injective map $\psi : L \hookrightarrow M$ is no longer injective (for example the 
injection $\mathbb{Z} \hookrightarrow \mathbb{Q}$ of Z-modules induces the zero map from $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z} = \mathbb{Z}/2\mathbb{Z}$ to 
$\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Q} = 0$). On the other hand, suppose that $\varphi : M \to N$ is a surjective R-module 
homomorphism. The tensor product $D \otimes_R N$ is generated as an abelian group by the 
simple tensors $d \otimes n$ for $d \in D$ and $n \in N$. The surjectivity of $\varphi$ implies that $n = \varphi(m)$ 
for some $m \in M$, and then $(1 \otimes \varphi)(d \otimes m) = d \otimes \varphi(m) = d \otimes n$ shows that $1 \otimes \varphi$ is 
a surjective homomorphism of abelian groups from $D \otimes_R M$ to $D \otimes_R N$. This proves 
most of the following theorem. 

Theorem 39. Suppose that D is a right R-module and that L, M and N are left 
R-modules. If 

$$0 \to L \xrightarrow{\;\psi\;} M \xrightarrow{\;\varphi\;} N \to 0$$

is exact, then the associated sequence of abelian groups 

$$D \otimes_R L \xrightarrow{\;1 \otimes \psi\;} D \otimes_R M \xrightarrow{\;1 \otimes \varphi\;} D \otimes_R N \to 0 \qquad (10.13)$$

is exact. If D is an (S, R)-bimodule then (13) is an exact sequence of left S-modules. In 
particular, if S = R is a commutative ring, then (13) is an exact sequence of R-modules with 
respect to the standard R-module structures. The map $1 \otimes \psi$ is not in general injective, 
i.e., the sequence (13) cannot in general be extended to a short exact sequence. 

The sequence (13) is exact for all right R-modules D if and only if 

$$L \xrightarrow{\;\psi\;} M \xrightarrow{\;\varphi\;} N \to 0$$

is exact. 


Proof: For the first statement it remains to prove the exactness of (13) at $D \otimes_R M$. 
Since $\varphi \circ \psi = 0$, we have 

$$(1 \otimes \varphi)\Bigl(\sum d_i \otimes \psi(l_i)\Bigr) = \sum d_i \otimes (\varphi \circ \psi)(l_i) = 0$$

and it follows that $\operatorname{image}(1 \otimes \psi) \subseteq \ker(1 \otimes \varphi)$. In particular, there is a natural projection 
$\pi : (D \otimes_R M)/\operatorname{image}(1 \otimes \psi) \to (D \otimes_R M)/\ker(1 \otimes \varphi) = D \otimes_R N$. The composite 
of the two projection homomorphisms 

$$D \otimes_R M \longrightarrow (D \otimes_R M)/\operatorname{image}(1 \otimes \psi) \xrightarrow{\;\pi\;} D \otimes_R N$$

is the quotient of $D \otimes_R M$ by $\ker(1 \otimes \varphi)$, so is just the map $1 \otimes \varphi$. We shall show that 
$\pi$ is an isomorphism, which will show that the kernel of $1 \otimes \varphi$ is just the kernel of the 
first projection above, i.e., $\operatorname{image}(1 \otimes \psi)$, giving the exactness of (13) at $D \otimes_R M$. To 
see that $\pi$ is an isomorphism we define an inverse map. First define $\pi' : D \times N \to 
(D \otimes_R M)/\operatorname{image}(1 \otimes \psi)$ by $\pi'((d, n)) = d \otimes m$ for any $m \in M$ with $\varphi(m) = n$. 
Note that this is well defined: any other element $m' \in M$ mapping to n differs from 
m by an element in $\ker \varphi = \operatorname{image} \psi$, i.e., $m' = m + \psi(l)$ for some $l \in L$, and 
$d \otimes \psi(l) \in \operatorname{image}(1 \otimes \psi)$. It is easy to check that $\pi'$ is a balanced map, so induces a 
homomorphism $\tilde{\pi} : D \otimes_R N \to (D \otimes_R M)/\operatorname{image}(1 \otimes \psi)$ with $\tilde{\pi}(d \otimes n) = d \otimes m$. 
Then $\tilde{\pi} \circ \pi(d \otimes m) = \tilde{\pi}(d \otimes \varphi(m)) = d \otimes m$ shows that $\tilde{\pi} \circ \pi = 1$. Similarly, 
$\pi \circ \tilde{\pi} = 1$, so that $\pi$ and $\tilde{\pi}$ are inverse isomorphisms, completing the proof that (13) is 
exact. Note also that the injectivity of $\psi$ was not required for the proof. 

Finally, suppose (13) is exact for every right R-module D. In general, $R \otimes_R X \cong X$ 
for any left R-module X (Example 1 following Corollary 9). Taking D = R, the 
exactness of the sequence $L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ follows. 

By Theorem 39, the sequence 

$$0 \to D \otimes_R L \xrightarrow{\;1 \otimes \psi\;} D \otimes_R M \xrightarrow{\;1 \otimes \varphi\;} D \otimes_R N \to 0$$

is not in general exact since $1 \otimes \psi$ need not be injective. If $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is 
a split short exact sequence, however, then since tensor products commute with direct 
sums by Theorem 17, it follows that 

$$0 \to D \otimes_R L \xrightarrow{\;1 \otimes \psi\;} D \otimes_R M \xrightarrow{\;1 \otimes \varphi\;} D \otimes_R N \to 0$$

is also a split short exact sequence. 
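
For a concrete instance of the failure of left exactness (a standard example, worked here under the identifications $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z} \cong \mathbb{Z}/2\mathbb{Z}$ and $\mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}/2\mathbb{Z} \cong \mathbb{Z}/2\mathbb{Z}$), one can tensor the non-split sequence given by multiplication by 2 on $\mathbb{Z}$ with $D = \mathbb{Z}/2\mathbb{Z}$:

```latex
% The original short exact sequence of Z-modules:
\[
  0 \to \mathbb{Z} \xrightarrow{\;2\;} \mathbb{Z}
    \to \mathbb{Z}/2\mathbb{Z} \to 0 .
\]
% After applying Z/2Z \otimes_Z (-), multiplication by 2 becomes the zero map:
\[
  \mathbb{Z}/2\mathbb{Z} \xrightarrow{\;2 \,=\, 0\;} \mathbb{Z}/2\mathbb{Z}
  \xrightarrow{\;\cong\;} \mathbb{Z}/2\mathbb{Z} \to 0 .
\]
% Exactness at the middle term: the image of the zero map is 0, which is the
% kernel of the isomorphism. Exactness on the right: the isomorphism is onto.
```

The tensored sequence is exact at the middle and right-hand terms, as Theorem 39 predicts, but its first map is zero and so is not injective.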

The following result relating to modules D having the property that (13) can always 
be extended to a short exact sequence is immediate from Theorem 39: 

Proposition 40. Let A be a right R-module. Then the following are equivalent: 

(1) For any left R-modules L, M, and N, if 

$$0 \to L \xrightarrow{\;\psi\;} M \xrightarrow{\;\varphi\;} N \to 0$$

is a short exact sequence, then 

$$0 \to A \otimes_R L \xrightarrow{\;1 \otimes \psi\;} A \otimes_R M \xrightarrow{\;1 \otimes \varphi\;} A \otimes_R N \to 0$$

is also a short exact sequence. 

(2) For any left R-modules L and M, if $0 \to L \xrightarrow{\psi} M$ is an exact sequence of 
left R-modules (i.e., $\psi : L \to M$ is injective) then $0 \to A \otimes_R L \xrightarrow{1 \otimes \psi} A \otimes_R M$ 
is an exact sequence of abelian groups (i.e., $1 \otimes \psi : A \otimes_R L \to A \otimes_R M$ is 
injective). 

Definition. A right R-module A is called flat if it satisfies either of the two equivalent 
conditions of Proposition 40. 

For a fixed right R-module D, the first part of Theorem 39 is referred to by saying 
that the functor $D \otimes_R \_$ is right exact. 

Corollary 41. If D is a right R-module, then the functor $D \otimes_R \_$ from the category 
of left R-modules to the category of abelian groups is right exact. If D is an (S, R)- 
bimodule (for example when S = R is commutative and D is given the standard 
R-module structure), then $D \otimes_R \_$ is a right exact functor from the category of left 
R-modules to the category of left S-modules. The functor is exact if and only if D is a 
flat R-module. 

We have already seen some flat modules: 

Corollary 42. Free modules are flat; more generally, projective modules are flat. 

Proof: To show that the free R-module F is flat it suffices to show that for any 
injective map $\psi : L \to M$ of R-modules L and M the induced map $1 \otimes \psi : F \otimes_R L \to 
F \otimes_R M$ is also injective. Suppose first that $F \cong R^n$ is a finitely generated free R- 
module. In this case $F \otimes_R L = R^n \otimes_R L \cong L^n$ since $R \otimes_R L \cong L$ and tensor products 
commute with direct sums. Similarly $F \otimes_R M \cong M^n$ and under these isomorphisms 
the map $1 \otimes \psi : F \otimes_R L \to F \otimes_R M$ is just the natural map of $L^n$ to $M^n$ induced 
by the inclusion $\psi$ in each component. In particular, $1 \otimes \psi$ is injective and it follows 
that any finitely generated free module is flat. Suppose now that F is an arbitrary free 
module and that the element $\sum f_i \otimes l_i \in F \otimes_R L$ is mapped to 0 by $1 \otimes \psi$. This means 
that the element $\sum (f_i, \psi(l_i))$ can be written as a sum of generators as in equation (6) 
in the previous section in the free group on $F \times M$. Since this sum of elements is finite, 
all of the first coordinates of the resulting equation lie in some finitely generated free 
submodule $F'$ of F. Then this equation implies that $\sum f_i \otimes l_i \in F' \otimes_R L$ is mapped to 
0 in $F' \otimes_R M$. Since $F'$ is a finitely generated free module, the injectivity we proved 
above shows that $\sum f_i \otimes l_i$ is 0 in $F' \otimes_R L$ and so also in $F \otimes_R L$. It follows that $1 \otimes \psi$ 
is injective and hence that F is flat. 

Suppose now that P is a projective module. Then P is a direct summand of a 
free module F (Proposition 30), say $F = P \oplus P'$. If $\psi : L \to M$ is injective then 
$1 \otimes \psi : F \otimes_R L \to F \otimes_R M$ is also injective by what we have already shown. Since 
$F = P \oplus P'$ and tensor products commute with direct sums, this shows that 

$$1 \otimes \psi : (P \otimes_R L) \oplus (P' \otimes_R L) \to (P \otimes_R M) \oplus (P' \otimes_R M)$$

is injective. Hence $1 \otimes \psi : P \otimes_R L \to P \otimes_R M$ is injective, proving that P is flat. 

Examples 

(1) Since $\mathbb{Z}$ is a projective Z-module it is flat. The example before Theorem 39 shows 
that $\mathbb{Z}/2\mathbb{Z}$ is not a flat Z-module. 

(2) The Z-module $\mathbb{Q}$ is a flat Z-module, as follows. Suppose $\psi : L \to M$ is an injective 
map of Z-modules. Every element of $\mathbb{Q} \otimes_{\mathbb{Z}} L$ can be written in the form $(1/d) \otimes l$ for 
some nonzero integer d and some $l \in L$ (Exercise 7 in Section 4). If $(1/d) \otimes l$ is in the 
kernel of $1 \otimes \psi$ then $(1/d) \otimes \psi(l)$ is 0 in $\mathbb{Q} \otimes_{\mathbb{Z}} M$. By Exercise 8 in Section 4 this means 
$c\,\psi(l) = 0$ in M for some nonzero integer c. Then $\psi(c \cdot l) = 0$, and the injectivity of 
$\psi$ implies $c \cdot l = 0$ in L. But this implies that $(1/d) \otimes l = (1/cd) \otimes (c \cdot l) = 0$ in $\mathbb{Q} \otimes_{\mathbb{Z}} L$, 
which shows that $1 \otimes \psi$ is injective. 

(3) The Z-module $\mathbb{Q}/\mathbb{Z}$ is injective (by Proposition 36), but is not flat: the injective 
map $\psi(z) = 2z$ from $\mathbb{Z}$ to $\mathbb{Z}$ does not remain injective after tensoring with $\mathbb{Q}/\mathbb{Z}$ 
($1 \otimes \psi : \mathbb{Q}/\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z} \to \mathbb{Q}/\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}$ has the nonzero element $(\tfrac{1}{2} + \mathbb{Z}) \otimes 1$ in its kernel; 
identifying $\mathbb{Q}/\mathbb{Z} = \mathbb{Q}/\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}$ this is the statement that multiplication by 2 has the 
element 1/2 in its kernel). 

(4) The direct sum of flat modules is flat (Exercise 5). In particular, $\mathbb{Q} \oplus \mathbb{Z}$ is flat. This 
module is neither projective nor injective (since $\mathbb{Q}$ is not projective by Exercise 8 and 
$\mathbb{Z}$ is not injective by Proposition 36; cf. Exercises 3 and 4). 
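
The two non-flatness claims in Examples 1 and 3 rest on short computations with simple tensors. The following sketch uses only the bilinearity of the tensor product over $\mathbb{Z}$; the symbols $\bar{a}$ and $q$ are introduced here for illustration and denote arbitrary elements of $\mathbb{Z}/2\mathbb{Z}$ and $\mathbb{Q}$:

```latex
% Example 1: every simple tensor in Z/2Z \otimes_Z Q vanishes, because q is
% divisible by 2 in Q while 2 kills Z/2Z.
\[
  \bar{a} \otimes q
  \;=\; \bar{a} \otimes 2 \cdot \tfrac{q}{2}
  \;=\; 2\bar{a} \otimes \tfrac{q}{2}
  \;=\; 0 \otimes \tfrac{q}{2}
  \;=\; 0 ,
  \qquad\text{so}\qquad
  \mathbb{Z}/2\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Q} = 0 .
\]
% Example 3: with \psi(z) = 2z, the element (1/2 + Z) \otimes 1 is sent to 0.
\[
  (1 \otimes \psi)\bigl((\tfrac12 + \mathbb{Z}) \otimes 1\bigr)
  \;=\; (\tfrac12 + \mathbb{Z}) \otimes 2
  \;=\; (1 + \mathbb{Z}) \otimes 1
  \;=\; 0 .
\]
```

In the first computation the map induced by $\mathbb{Z} \hookrightarrow \mathbb{Q}$ has nonzero source $\mathbb{Z}/2\mathbb{Z}$ and zero target, so it cannot be injective; in the second, $\tfrac12 + \mathbb{Z}$ is a nonzero element of $\mathbb{Q}/\mathbb{Z}$ lying in the kernel.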

We close this section with an important relation between Hom and tensor products: 

Theorem 43. (Adjoint Associativity) Let R and S be rings, let A be a right R-module, let 
B be an (R, S)-bimodule and let C be a right S-module. Then there is an isomorphism 
of abelian groups: 

$$\operatorname{Hom}_S(A \otimes_R B, C) \cong \operatorname{Hom}_R(A, \operatorname{Hom}_S(B, C))$$

(the homomorphism groups are right module homomorphisms; note that $\operatorname{Hom}_S(B, C)$ 
has the structure of a right R-module, cf. the exercises). If R = S is commutative this 
is an isomorphism of R-modules with the standard R-module structures. 

Proof: Suppose $\varphi : A \otimes_R B \to C$ is a homomorphism. For any fixed $a \in A$ define 
the map $\Phi(a)$ from B to C by $\Phi(a)(b) = \varphi(a \otimes b)$. It is easy to check that $\Phi(a)$ 
is a homomorphism of right S-modules and that the map $\Phi$ from A to $\operatorname{Hom}_S(B, C)$ 
given by mapping a to $\Phi(a)$ is a homomorphism of right R-modules. Then $f(\varphi) = \Phi$ 
defines a group homomorphism from $\operatorname{Hom}_S(A \otimes_R B, C)$ to $\operatorname{Hom}_R(A, \operatorname{Hom}_S(B, C))$. 
Conversely, suppose $\Phi : A \to \operatorname{Hom}_S(B, C)$ is a homomorphism. The map from 
$A \times B$ to C defined by mapping $(a, b)$ to $\Phi(a)(b)$ is an R-balanced map, so induces a 
homomorphism $\varphi$ from $A \otimes_R B$ to C. Then $g(\Phi) = \varphi$ defines a group homomorphism 
inverse to f and gives the isomorphism in the theorem. 
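
The claim that the two maps in this proof are inverse to each other can be checked on simple tensors. A brief verification in the notation of the proof, writing $f$ for the map $\varphi \mapsto \Phi$ and $g$ for the map $\Phi \mapsto \varphi$:

```latex
% g after f: start with \varphi on A \otimes_R B, pass to \Phi = f(\varphi),
% then rebuild a map on A \otimes_R B from \Phi.
\[
  g\bigl(f(\varphi)\bigr)(a \otimes b)
  \;=\; f(\varphi)(a)(b)
  \;=\; \varphi(a \otimes b).
\]
% f after g: start with \Phi : A -> Hom_S(B, C), pass to \varphi = g(\Phi),
% then recover a map A -> Hom_S(B, C).
\[
  f\bigl(g(\Phi)\bigr)(a)(b)
  \;=\; g(\Phi)(a \otimes b)
  \;=\; \Phi(a)(b).
\]
```

Since the simple tensors $a \otimes b$ generate $A \otimes_R B$ as an abelian group, the first identity gives $g \circ f = 1$, and the second identity gives $f \circ g = 1$.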

As a first application of Theorem 43 we give an alternate proof of the first result 
in Theorem 39 that the tensor product is right exact in the case where S = R is a 
commutative ring. If $0 \to L \to M \to N \to 0$ is exact, then by Theorem 33 the 
sequence 

$$0 \to \operatorname{Hom}_R(N, E) \to \operatorname{Hom}_R(M, E) \to \operatorname{Hom}_R(L, E)$$

is exact for every R-module E. Then by Theorem 28, the sequence 

$$0 \to \operatorname{Hom}_R(D, \operatorname{Hom}_R(N, E)) \to \operatorname{Hom}_R(D, \operatorname{Hom}_R(M, E)) \to \operatorname{Hom}_R(D, \operatorname{Hom}_R(L, E))$$

is exact for all D and all E. By adjoint associativity, this means the sequence 

$$0 \to \operatorname{Hom}_R(D \otimes_R N, E) \to \operatorname{Hom}_R(D \otimes_R M, E) \to \operatorname{Hom}_R(D \otimes_R L, E)$$

is exact for any D and all E. Then, by the second part of Theorem 33, it follows that 
the sequence 

$$D \otimes_R L \to D \otimes_R M \to D \otimes_R N \to 0$$

is exact for all D, which is the right exactness of the tensor product. 

As a second application of Theorem 43 we prove that the tensor product of two 
projective modules over a commutative ring R is again projective (see also Exercise 9 
for a more direct proof). 

Corollary 44. If R is commutative then the tensor product of two projective R-modules 
is projective. 

Proof: Let $P_1$ and $P_2$ be projective modules. Then by Corollary 32, $\operatorname{Hom}_R(P_2, \_)$ 
is an exact functor from the category of R-modules to the category of R-modules. Then 
the composition $\operatorname{Hom}_R(P_1, \operatorname{Hom}_R(P_2, \_))$ is an exact functor by the same corollary. 
By Theorem 43 this means that $\operatorname{Hom}_R(P_1 \otimes_R P_2, \_)$ is an exact functor on R-modules. 
It follows again from Corollary 32 that $P_1 \otimes_R P_2$ is projective. 

Summary 

Each of the functors $\operatorname{Hom}_R(A, \_)$, $\operatorname{Hom}_R(\_, A)$, and $A \otimes_R \_$ maps left R-modules 
to abelian groups; the functor $\_ \otimes_R A$ maps right R-modules to abelian groups. When 
R is commutative all four functors map R-modules to R-modules. 

(1) Let A be a left R-module. The functor $\operatorname{Hom}_R(A, \_)$ is covariant and left exact; 
the module A is projective if and only if $\operatorname{Hom}_R(A, \_)$ is exact (i.e., is also right 
exact). 

(2) Let A be a left R-module. The functor $\operatorname{Hom}_R(\_, A)$ is contravariant and left exact; 
the module A is injective if and only if $\operatorname{Hom}_R(\_, A)$ is exact. 

(3) Let A be a right R-module. The functor $A \otimes_R \_$ is covariant and right exact; the 
module A is flat if and only if $A \otimes_R \_$ is exact (i.e., is also left exact). 

(4) Let A be a left R-module. The functor $\_ \otimes_R A$ is covariant and right exact; the 
module A is flat if and only if $\_ \otimes_R A$ is exact. 

(5) Projective modules are flat. The Z-module $\mathbb{Q}/\mathbb{Z}$ is injective but not flat. The 
Z-module $\mathbb{Z} \oplus \mathbb{Q}$ is flat but neither projective nor injective. 

EXERCISES 

Let R be a ring with 1. 

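1. Suppose that 

$$\begin{array}{ccccc}
A & \xrightarrow{\;\psi\;} & B & \xrightarrow{\;\varphi\;} & C \\
\big\downarrow{\scriptstyle \alpha} & & \big\downarrow{\scriptstyle \beta} & & \big\downarrow{\scriptstyle \gamma} \\
A' & \xrightarrow{\;\psi'\;} & B' & \xrightarrow{\;\varphi'\;} & C'
\end{array}$$

is a commutative diagram of groups and that the rows are exact. Prove that 

(a) if $\varphi$ and $\alpha$ are surjective, and $\beta$ is injective then $\gamma$ is injective. [If $c \in \ker \gamma$, show there 
is a $b \in B$ with $\varphi(b) = c$. Show that $\varphi'(\beta(b)) = 0$ and deduce that $\beta(b) = \psi'(a')$ 
for some $a' \in A'$. Show there is an $a \in A$ with $\alpha(a) = a'$ and that $\beta(\psi(a)) = \beta(b)$. 
Conclude that $b = \psi(a)$ and hence $c = \varphi(b) = 0$.] 

(b) if $\psi'$, $\alpha$, and $\gamma$ are injective, then $\beta$ is injective, 

(c) if $\varphi$, $\alpha$, and $\gamma$ are surjective, then $\beta$ is surjective, 

(d) if $\beta$ is injective, $\alpha$ and $\varphi$ are surjective, then $\gamma$ is injective, 

(e) if $\beta$ is surjective, $\gamma$ and $\psi'$ are injective, then $\alpha$ is surjective. 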

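2. Suppose that 

$$\begin{array}{ccccccc}
A & \longrightarrow & B & \longrightarrow & C & \longrightarrow & D \\
\big\downarrow{\scriptstyle \alpha} & & \big\downarrow{\scriptstyle \beta} & & \big\downarrow{\scriptstyle \gamma} & & \big\downarrow{\scriptstyle \delta} \\
A' & \longrightarrow & B' & \longrightarrow & C' & \longrightarrow & D'
\end{array}$$

is a commutative diagram of groups, and that the rows are exact. Prove that 

(a) if $\alpha$ is surjective, and $\beta$, $\delta$ are injective, then $\gamma$ is injective. 

(b) if $\delta$ is injective, and $\alpha$, $\gamma$ are surjective, then $\beta$ is surjective. 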

3. Let $P_1$ and $P_2$ be R-modules. Prove that $P_1 \oplus P_2$ is a projective R-module if and only if 
both $P_1$ and $P_2$ are projective. 

4. Let $Q_1$ and $Q_2$ be R-modules. Prove that $Q_1 \oplus Q_2$ is an injective R-module if and only 
if both $Q_1$ and $Q_2$ are injective. 

5. Let $A_1$ and $A_2$ be R-modules. Prove that $A_1 \oplus A_2$ is a flat R-module if and only if both $A_1$ 
and $A_2$ are flat. More generally, prove that an arbitrary direct sum $\bigoplus A_i$ of R-modules is 
flat if and only if each $A_i$ is flat. [Use the fact that tensor product commutes with arbitrary 
direct sums.] 

6. Prove that the following are equivalent for a ring R: 

(i) Every R-module is projective. 

(ii) Every R-module is injective. 

7. Let A be a nonzero finite abelian group. 

(a) Prove that A is not a projective Z-module. 

(b) Prove that A is not an injective Z-module. 

8. Let Q be a nonzero divisible Z-module. Prove that Q is not a projective Z-module. Deduce 
that the rational numbers $\mathbb{Q}$ is not a projective Z-module. [Show first that if F is any free 
module then $\bigcap_{n=1}^{\infty} nF = 0$ (use a basis of F to prove this). Now suppose to the contrary 
that Q is projective and derive a contradiction from Proposition 30(4).] 

9. Assume R is commutative with 1. 

(a) Prove that the tensor product of two free R-modules is free. [Use the fact that tensor 
products commute with direct sums.] 

(b) Use (a) to prove that the tensor product of two projective R-modules is projective. 

10. Let R and S be rings with 1 and let M and N be left R-modules. Assume also that M is 
an (R, S)-bimodule. 

(a) For $s \in S$ and for $\varphi \in \operatorname{Hom}_R(M, N)$ define $(s\varphi) : M \to N$ by $(s\varphi)(m) = \varphi(ms)$. 
Prove that $s\varphi$ is a homomorphism of left R-modules, and that this action of S on 
$\operatorname{Hom}_R(M, N)$ makes it into a left S-module. 

(b) Let S = R and let M = R (considered as an (R, R)-bimodule by left and right 
ring multiplication on itself). For each $n \in N$ define $\varphi_n : R \to N$ by $\varphi_n(r) = rn$, 
i.e., $\varphi_n$ is the unique R-module homomorphism mapping $1_R$ to n. Show that $\varphi_n \in 
\operatorname{Hom}_R(R, N)$. Use part (a) to show that the map $n \mapsto \varphi_n$ is an isomorphism of left 
R-modules: $N \cong \operatorname{Hom}_R(R, N)$. 

(c) Deduce that if N is a free (respectively, projective, injective, flat) left R-module, then 
$\operatorname{Hom}_R(R, N)$ is also a free (respectively, projective, injective, flat) left R-module. 

11. Let R and S be rings with 1 and let M and N be left R-modules. Assume also that N is an 
(R, S)-bimodule. 

(a) For $s \in S$ and for $\varphi \in \operatorname{Hom}_R(M, N)$ define $(\varphi s) : M \to N$ by $(\varphi s)(m) = \varphi(m)s$. 
Prove that $\varphi s$ is a homomorphism of left R-modules, and that this action of S on 
$\operatorname{Hom}_R(M, N)$ makes it into a right S-module. Deduce that $\operatorname{Hom}_R(M, R)$ is a right 
R-module, for any R-module M, called the dual module to M. 

(b) Let N = R be considered as an (R, R)-bimodule as usual. Under the action 
defined in part (a) show that the map $r \mapsto \varphi_r$ is an isomorphism of right R-modules: 
$\operatorname{Hom}_R(R, R) \cong R$, where $\varphi_r$ is the homomorphism that maps $1_R$ to r. Deduce that 
if M is a finitely generated free left R-module, then $\operatorname{Hom}_R(M, R)$ is a free right 
R-module of the same rank. (cf. also Exercise 13.) 

(c) Show that if M is a finitely generated projective R-module then its dual module 
$\operatorname{Hom}_R(M, R)$ is also projective. 

12. Let A be an R-module, let I be any nonempty index set and for each $i \in I$ let $B_i$ be an 
R-module. Prove the following isomorphisms of abelian groups; when R is commutative 
prove also that these are R-module isomorphisms. (Arbitrary direct sums and direct 
products of modules are introduced in Exercise 20 of Section 3.) 

(a) $\operatorname{Hom}_R\bigl(\bigoplus_{i \in I} B_i, A\bigr) \cong \prod_{i \in I} \operatorname{Hom}_R(B_i, A)$ 

(b) $\operatorname{Hom}_R\bigl(A, \prod_{i \in I} B_i\bigr) \cong \prod_{i \in I} \operatorname{Hom}_R(A, B_i)$. 

13. (a) Show that the dual of the free Z-module with countable basis is not free. [Use the 
preceding exercise and Exercise 24, Section 3.] (See also Exercise 5 in Section 11.3.) 

(b) Show that the dual of the free Z-module with countable basis is also not projective. 
[You may use the fact that any submodule of a free Z-module is free.] 

14. Let $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ be a sequence of R-modules. 

(a) Prove that the associated sequence 

$$0 \to \operatorname{Hom}_R(D, L) \xrightarrow{\;\psi'\;} \operatorname{Hom}_R(D, M) \xrightarrow{\;\varphi'\;} \operatorname{Hom}_R(D, N) \to 0$$

is a short exact sequence of abelian groups for all R-modules D if and only if the 
original sequence is a split short exact sequence. [To show the sequence splits, take 
D = N and show the lift of the identity map in $\operatorname{Hom}_R(N, N)$ to $\operatorname{Hom}_R(N, M)$ is a 
splitting homomorphism for $\varphi$.] 

(b) Prove that the associated sequence 

$$0 \to \operatorname{Hom}_R(N, D) \xrightarrow{\;\varphi'\;} \operatorname{Hom}_R(M, D) \xrightarrow{\;\psi'\;} \operatorname{Hom}_R(L, D) \to 0$$

is a short exact sequence of abelian groups for all R-modules D if and only if the 
original sequence is a split short exact sequence. 

15. Let M be a left R-module where R is a ring with 1. 

(a) Show that $\operatorname{Hom}_{\mathbb{Z}}(R, M)$ is a left R-module under the action $(r\varphi)(r') = \varphi(r'r)$ (see 
Exercise 10). 

(b) Suppose that $0 \to A \xrightarrow{\psi} B$ is an exact sequence of R-modules. Prove that if every 
homomorphism f from A to M lifts to a homomorphism F from B to M with $f = 
F \circ \psi$, then every homomorphism $f'$ from A to $\operatorname{Hom}_{\mathbb{Z}}(R, M)$ lifts to a homomorphism 
$F'$ from B to $\operatorname{Hom}_{\mathbb{Z}}(R, M)$ with $f' = F' \circ \psi$. [Given $f'$, show that $f(a) = f'(a)(1_R)$ 
defines a homomorphism of A to M. If F is the associated lift of f to B, show that 
$F'(b)(r) = F(rb)$ defines a homomorphism from B to $\operatorname{Hom}_{\mathbb{Z}}(R, M)$ that lifts $f'$.] 

(c) Prove that if Q is an injective R-module then $\operatorname{Hom}_{\mathbb{Z}}(R, Q)$ is also an injective R- 
module. 

16. This exercise proves Theorem 38 that every left R-module M is contained in an injective 
left R-module. 

(a) Show that M is contained in an injective Z-module Q. [M is a Z-module; use 
Corollary 37.] 

(b) Show that $\operatorname{Hom}_R(R, M) \subseteq \operatorname{Hom}_{\mathbb{Z}}(R, M) \subseteq \operatorname{Hom}_{\mathbb{Z}}(R, Q)$. 

(c) Use the R-module isomorphism $M \cong \operatorname{Hom}_R(R, M)$ (Exercise 10) and the previous 
exercise to conclude that M is contained in an injective module. 

17. This exercise completes the proof of Proposition 34. Suppose that Q is an R-module with 
the property that every short exact sequence $0 \to Q \to M_1 \to N \to 0$ splits and suppose 
that the sequence $0 \to L \xrightarrow{\psi} M$ is exact. Prove that every R-module homomorphism f 
from L to Q can be lifted to an R-module homomorphism F from M to Q with $f = F \circ \psi$. 
[By the previous exercise, Q is contained in an injective R-module. Use the splitting 
property together with Exercise 4 (noting that Exercise 4 can be proved using (2) in 
Proposition 34 as the definition of an injective module).] 

18. Prove that the injective hull of the Z-module $\mathbb{Z}$ is $\mathbb{Q}$. [Let H be the injective hull of $\mathbb{Z}$ 
and argue that $\mathbb{Q}$ contains an isomorphic copy of H. Use the divisibility of H to show 
$1/n \in H$ for all nonzero integers n, and deduce that $H = \mathbb{Q}$.] 

19. If F is a field, prove that the injective hull of F is F. 

20. Prove that the polynomial ring $R[x]$ in the indeterminate x over the commutative ring R 
is a flat R-module. 

21. Let R and S be rings with 1 and suppose M is a right R-module, and N is an (R, S)- 
bimodule. If M is flat over R and N is flat as an S-module prove that $M \otimes_R N$ is flat as a 
right S-module. 

22. Suppose that R is a commutative ring and that M and N are flat R-modules. Prove that 
$M \otimes_R N$ is a flat R-module. [Use the previous exercise.] 

23. Prove that the (right) module $M \otimes_R S$ obtained by changing the base from the ring R to the 
ring S (by some homomorphism $f : R \to S$ with $f(1_R) = 1_S$, cf. Example 6 following 
Corollary 12 in Section 4) of the flat (right) R-module M is a flat S-module. 

24. Prove that A is a flat R-module if and only if for any left R-modules L and M where L is 
finitely generated, then $\psi : L \to M$ injective implies that also $1 \otimes \psi : A \otimes_R L \to A \otimes_R M$ 
is injective. [Use the techniques in the proof of Corollary 42.] 

25. (A Flatness Criterion) Parts (a)-(c) of this exercise prove that A is a flat R-module if and 
only if for every finitely generated ideal I of R, the map from $A \otimes_R I \to A \otimes_R R \cong A$ 
induced by the inclusion $I \subseteq R$ is again injective (or, equivalently, $A \otimes_R I \cong AI \subseteq A$). 

(a) Prove that if A is flat then $A \otimes_R I \to A \otimes_R R$ is injective. 

(b) If $A \otimes_R I \to A \otimes_R R$ is injective for every finitely generated ideal I, prove that 
$A \otimes_R I \to A \otimes_R R$ is injective for every ideal I. Show that if K is any submodule of 
a finitely generated free module F then $A \otimes_R K \to A \otimes_R F$ is injective. Show that 
the same is true for any free module F. [Cf. the proof of Corollary 42.] 

(c) Under the assumption in (b), suppose L and M are R-modules and $L \xrightarrow{\psi} M$ is injective. 
Prove that $A \otimes_R L \xrightarrow{1 \otimes \psi} A \otimes_R M$ is injective and conclude that A is flat. [Write M as 
a quotient of the free module F, giving a short exact sequence 

$$0 \to K \to F \xrightarrow{\;f\;} M \to 0.$$

Show that if $J = f^{-1}(\psi(L))$ and $\iota : J \to F$ is the natural injection, then the diagram 

$$\begin{array}{ccccccccc}
0 & \to & K & \to & J & \to & L & \to & 0 \\
 & & \big\downarrow{\scriptstyle \mathrm{id}} & & \big\downarrow{\scriptstyle \iota} & & \big\downarrow{\scriptstyle \psi} & & \\
0 & \to & K & \to & F & \xrightarrow{\;f\;} & M & \to & 0
\end{array}$$

is commutative with exact rows. Show that the induced diagram 

$$\begin{array}{ccccccc}
A \otimes_R K & \to & A \otimes_R J & \to & A \otimes_R L & \to & 0 \\
\big\downarrow{\scriptstyle \mathrm{id}} & & \big\downarrow{\scriptstyle 1 \otimes \iota} & & \big\downarrow{\scriptstyle 1 \otimes \psi} & & \\
A \otimes_R K & \to & A \otimes_R F & \xrightarrow{\;1 \otimes f\;} & A \otimes_R M & \to & 0
\end{array}$$

is commutative with exact rows. Use (b) to show that $1 \otimes \iota$ is injective, then use 
Exercise 1 to conclude that $1 \otimes \psi$ is injective.] 

(d) (A Flatness Criterion for quotients) Suppose A = F/K where F is flat (e.g., if F is 
free) and K is an R-submodule of F. Prove that A is flat if and only if $FI \cap K = KI$ 
for every finitely generated ideal I of R. [Use (a) to prove $F \otimes_R I \cong FI$ and observe 
the image of $K \otimes_R I$ is $KI$; tensor the exact sequence $0 \to K \to F \to A \to 0$ with 
I to prove that $A \otimes_R I \cong FI/KI$, and apply the flatness criterion.] 


26. Suppose R is a P.I.D. This exercise proves that A is a flat R-module if and only if A is a 
torsion free R-module (i.e., if $a \in A$ is nonzero and $r \in R$, then $ra = 0$ implies $r = 0$). 

(a) Suppose that A is flat and for fixed $r \in R$ consider the map $\psi_r : R \to R$ defined 
by multiplication by r: $\psi_r(x) = rx$. If r is nonzero show that $\psi_r$ is an injection. 
Conclude from the flatness of A that the map from A to A defined by mapping a to 
ra is injective and that A is torsion free. 

(b) Suppose that A is torsion free. If I is a nonzero ideal of R, then $I = rR$ for some 
nonzero $r \in R$. Show that the map $\psi_r$ in (a) induces an isomorphism $R \cong I$ of 
R-modules and that the composite $R \xrightarrow{\psi_r} I \xrightarrow{\iota} R$ of $\psi_r$ with the inclusion $\iota : I \subseteq R$ 
is multiplication by r. Prove that the composite $A \otimes_R R \xrightarrow{1 \otimes \psi_r} A \otimes_R I \xrightarrow{1 \otimes \iota} A \otimes_R R$ 
corresponds to the map $a \mapsto ra$ under the identification $A \otimes_R R = A$ and that this 
composite is injective since A is torsion free. Show that $1 \otimes \psi_r$ is an isomorphism 
and deduce that $1 \otimes \iota$ is injective. Use the previous exercise to conclude that A is flat. 

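27. Let M, A and B be R-modules. 

(a) Suppose $f : A \to M$ and $g : B \to M$ are R-module homomorphisms. Prove that 
$X = \{(a, b) \mid a \in A,\ b \in B \text{ with } f(a) = g(b)\}$ is an R-submodule of the direct sum 
$A \oplus B$ (called the pullback or fiber product of f and g) and that there is a commutative 
diagram 

$$\begin{array}{ccc}
X & \xrightarrow{\;\pi_2\;} & B \\
\big\downarrow{\scriptstyle \pi_1} & & \big\downarrow{\scriptstyle g} \\
A & \xrightarrow{\;f\;} & M
\end{array}$$

where $\pi_1$ and $\pi_2$ are the natural projections onto the first and second components. 

(b) Suppose $f' : M \to A$ and $g' : M \to B$ are R-module homomorphisms. Prove that 
the quotient Y of $A \oplus B$ by $\{(f'(m), -g'(m)) \mid m \in M\}$ is an R-module (called the 
pushout or fiber sum of $f'$ and $g'$) and that there is a commutative diagram 

$$\begin{array}{ccc}
M & \xrightarrow{\;g'\;} & B \\
\big\downarrow{\scriptstyle f'} & & \big\downarrow{\scriptstyle \pi_2'} \\
A & \xrightarrow{\;\pi_1'\;} & Y
\end{array}$$

where $\pi_1'$ and $\pi_2'$ are the natural maps to the quotient induced by the maps into the 
first and second components. 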

28. (a) (Schanuel's Lemma) If $0 \to K \to P \xrightarrow{\varphi} M \to 0$ and $0 \to K' \to P' \xrightarrow{\varphi'} M \to 0$ are 
exact sequences of R-modules where P and $P'$ are projective, prove $P \oplus K' \cong P' \oplus K$ 
as R-modules. [Show that there is an exact sequence $0 \to \ker \pi \to X \xrightarrow{\pi} P \to 0$ 
with $\ker \pi \cong K'$, where X is the fiber product of $\varphi$ and $\varphi'$ as in the previous exercise. 
Deduce that $X \cong P \oplus K'$. Show similarly that $X \cong P' \oplus K$.] 

(b) If $0 \to M \to Q \to L \to 0$ and $0 \to M \to Q' \to L' \to 0$ are exact sequences of 
R-modules where Q and $Q'$ are injective, prove $Q \oplus L' \cong Q' \oplus L$ as R-modules. 

The R-modules M and N are said to be projectively equivalent if $M \oplus P \cong N \oplus P'$ for some 
projective modules P, $P'$. Similarly, M and N are injectively equivalent if $M \oplus Q \cong N \oplus Q'$ 
for some injective modules Q, $Q'$. The previous exercise shows K and $K'$ are projectively 
equivalent and L and $L'$ are injectively equivalent. 

CHAPTER 11 


Vector Spaces 


In this chapter we review the basic theory of finite dimensional vector spaces over 
an arbitrary field F (some infinite dimensional vector space theory is covered in the 
exercises). Since the proofs are identical to the corresponding arguments for real vector 
spaces our treatment is very terse. For the most part we include only those results which 
are used in other parts of the text so basic topics such as Gauss-Jordan elimination, 
row echelon forms, methods for finding bases of subspaces, elementary properties of 
matrices, etc., are not covered or are discussed in the exercises. The reader should 
therefore consider this chapter as a refresher in linear algebra and as a prelude to field 
theory and Galois theory. Characteristic polynomials and eigenvalues will be reviewed 
and treated in a larger context in the next chapter. 


11.1 DEFINITIONS AND BASIC THEORY 

The terminology for vector spaces is slightly different from that of modules, that is, 
when the ring R is a field there are different names for many of the properties of R- 
modules which we defined in the last chapter. The following is a dictionary of these new 
terms (many of which may already be familiar). The definition of each corresponding 
vector space property is the same (verbatim) as the module-theoretic definition with 
the only added assumption being that the ring R is a field (so these definitions are not 
repeated here). 

Terminology for R any Ring                          Terminology for R a Field 

M is an R-module                                    M is a vector space over R 
m is an element of M                                m is a vector in M 
α is a ring element                                 α is a scalar 
N is a submodule of M                               N is a subspace of M 
M/N is a quotient module                            M/N is a quotient space 
M is a free module of rank n                        M is a vector space of dimension n 
M is a finitely generated module                    M is a finite dimensional vector space 
M is a nonzero cyclic module                        M is a 1-dimensional vector space 
φ : M → N is an R-module homomorphism               φ : M → N is a linear transformation 
M and N are isomorphic as R-modules                 M and N are isomorphic vector spaces 
the subset A of M generates M                       the subset A of M spans M 
M = RA                                              each element of M is a linear combination 
                                                    of elements of A, i.e., M = Span(A) 

For the remainder of this chapter F is a field and V is a vector space over F. 

One of the first results we shall prove about vector spaces is that they are free F- 
modules, that is, they have bases. Although our arguments treat only the case of finite 
dimensional spaces, the corresponding result for arbitrary vector spaces is proved in the 
exercises as an application of Zorn’s Lemma. The reader may first wish to review the 
section in the previous chapter on free modules, especially their properties pertaining 
to homomorphisms. 

Definition. 

(1) A subset S of V is called a set of linearly independent vectors if an equation 

$\alpha_1 v_1 + \alpha_2 v_2 + \cdots + \alpha_n v_n = 0$ with $\alpha_1, \alpha_2, \ldots, \alpha_n \in F$ and $v_1, v_2, \ldots, v_n \in S$ 

implies $\alpha_1 = \alpha_2 = \cdots = \alpha_n = 0$. 

(2) A basis of a vector space V is an ordered set of linearly independent vectors 
which span V. In particular two bases will be considered different even if one 
is simply a rearrangement of the other. This is sometimes referred to as an 
ordered basis. 

Examples 

(1) The space $V = F[x]$ of polynomials in the variable x with coefficients from the 
field F is in particular a vector space over F. The elements $1, x, x^2, \ldots$ are linearly 
independent by definition (i.e., a polynomial is 0 if and only if all its coefficients are 
0). Since these elements also span V by definition, they are a basis for V. 

(2) The collection of solutions of a linear, homogeneous, constant coefficient differential 
equation (for example, $y'' - 3y' + 2y = 0$) over $\mathbb{C}$ form a vector space over $\mathbb{C}$ 
since differentiation is a linear operator. Elements of this vector space are linearly 
independent if they are linearly independent as functions. For example, $e^t$ and $e^{2t}$ are 
easily seen to be solutions of the equation $y'' - 3y' + 2y = 0$ (differentiation with 
respect to t). They are linearly independent functions since $ae^t + be^{2t} = 0$ implies 
$a + b = 0$ (let $t = 0$) and $ae + be^2 = 0$ (let $t = 1$) and the only solution to these two 
equations is $a = b = 0$. It is a theorem in differential equations that these elements 
span the set of solutions of this equation, hence are a basis for this space. 


Proposition 1. Assume the set $A = \{v_1, v_2, \dots, v_n\}$ spans the vector space V but no
proper subset of A spans V. Then A is a basis of V. In particular, any finitely generated
(i.e., finitely spanned) vector space over F is a free F-module.

Proof: It is only necessary to prove that $v_1, v_2, \dots, v_n$ are linearly independent.
Suppose $\alpha_1 v_1 + \alpha_2 v_2 + \cdots + \alpha_n v_n = 0$ where not all of the $\alpha_i$ are 0. By reordering,
we may assume that $\alpha_1 \ne 0$ and then

\[ v_1 = -\frac{1}{\alpha_1}(\alpha_2 v_2 + \cdots + \alpha_n v_n). \]

It follows that $\{v_2, v_3, \dots, v_n\}$ also spans V since any linear combination of $v_1, v_2, \dots, v_n$
can be written as a linear combination of $v_2, v_3, \dots, v_n$ using the equation above. This
contradicts the assumption that no proper subset of A spans V.

Example

Let F be a field and consider $F[x]/(f(x))$ where $f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$.
The ideal $(f(x))$ is a subspace of the vector space $F[x]$ and the quotient $F[x]/(f(x))$ is
also a vector space over F. By the Euclidean Algorithm, every polynomial $a(x) \in F[x]$
can be written uniquely in the form $a(x) = q(x)f(x) + r(x)$ where $r(x) \in F[x]$ and
$0 \le \deg r(x) \le n - 1$. Since $q(x)f(x) \in (f(x))$, it follows that every element of
the quotient is represented by a polynomial $r(x)$ of degree $\le n - 1$. Two distinct such
polynomials cannot be the same in the quotient since this would say their difference (which
is a nonzero polynomial of degree at most $n - 1$) would be divisible by $f(x)$ (which is
of degree n). It follows that the elements $\bar{1}, \bar{x}, \bar{x}^2, \dots, \bar{x}^{n-1}$ (the bar denotes the image of
these elements in the quotient, as usual) span $F[x]/(f(x))$ as a vector space over F and
that no proper subset of these elements also spans, hence these elements give a basis for
$F[x]/(f(x))$.
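
The reduction to a remainder of degree at most n - 1 can be sketched in code. The following is a minimal illustration, assuming coefficients in Q and a monic f(x); the function name `poly_mod` and the convention of listing coefficients from degree 0 upward are choices made here, not part of the text.

```python
from fractions import Fraction

def poly_mod(a, f):
    """Remainder of a(x) on division by a monic f(x).

    Polynomials are lists of coefficients, lowest degree first (an assumed
    convention). The result has exactly deg(f) entries: the coordinates of
    the class of a(x) with respect to the basis 1, x, ..., x^(n-1).
    """
    n = len(f) - 1                      # degree of f
    assert f[n] == 1, "f is assumed to be monic"
    r = [Fraction(c) for c in a]
    while len(r) > n:
        lead = r.pop()                  # leading coefficient, degree len(r) after the pop
        shift = len(r) - n              # subtract lead * x^shift * f(x)
        for i in range(n):
            r[shift + i] -= lead * f[i]
    return r + [Fraction(0)] * (n - len(r))

# x^3 + 2x + 5 reduced modulo x^2 + 1
coords = poly_mod([5, 2, 0, 1], [1, 0, 1])
print(coords)
```

Two polynomials represent the same element of the quotient exactly when `poly_mod` returns the same coordinate list for both.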

Corollary 2. Assume the finite set A spans the vector space V. Then A contains a 
basis of V . 

Proof: Any subset B of A spanning V such that no proper subset of B also spans
V (there clearly exist such subsets) is a basis for V by Proposition 1.
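
For vectors in Q^n, a basis contained in a finite spanning set can be found mechanically. The sketch below uses a greedy variant of the idea (keep a vector only if it is not in the span of those already kept) rather than the minimal-subset argument of the proof; the helper names `rank` and `extract_basis` are hypothetical.

```python
from fractions import Fraction

def rank(vectors):
    """Rank over Q of a nonempty list of vectors, by Gaussian elimination."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def extract_basis(spanning_set):
    """Return a linearly independent subset with the same span."""
    basis = []
    for v in spanning_set:
        # v enlarges the span exactly when the rank goes up by one
        if rank(basis + [v]) > len(basis):
            basis.append(v)
    return basis

A = [(1, 0, 1), (2, 0, 2), (0, 1, 1), (1, 1, 2), (0, 0, 1)]
basis = extract_basis(A)
print(basis)
```

The subset returned depends on the order in which the vectors are listed, which reflects the fact that a spanning set generally contains many different bases.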

Theorem 3. (A Replacement Theorem) Assume $A = \{a_1, a_2, \dots, a_n\}$ is a basis for
V containing n elements and $\{b_1, b_2, \dots, b_m\}$ is a set of linearly independent vectors
in V. Then there is an ordering $a_1, a_2, \dots, a_n$ such that for each $k \in \{1, 2, \dots, m\}$
the set $\{b_1, b_2, \dots, b_k, a_{k+1}, a_{k+2}, \dots, a_n\}$ is a basis of V. In other words, the elements
$b_1, b_2, \dots, b_m$ can be used to successively replace the elements of the basis A, still
retaining a basis. In particular, $n \ge m$.

Proof: Proceed by induction on k. If k = 0 there is nothing to prove, since A is
given as a basis for V. Suppose now that $\{b_1, b_2, \dots, b_k, a_{k+1}, \dots, a_n\}$ is a basis
for V. Then in particular this is a spanning set, so $b_{k+1}$ is a linear combination:

\[ b_{k+1} = \beta_1 b_1 + \cdots + \beta_k b_k + \alpha_{k+1} a_{k+1} + \cdots + \alpha_n a_n. \tag{11.1} \]

Not all of the $\alpha_i$ can be 0, since this would imply $b_{k+1}$ is a linear combination of
$b_1, b_2, \dots, b_k$, contrary to the linear independence of these elements. By reordering
if necessary, we may assume $\alpha_{k+1} \ne 0$. Then solving this last equation for $a_{k+1}$ as a
linear combination of $b_{k+1}$ and $b_1, b_2, \dots, b_k, a_{k+2}, \dots, a_n$ shows

\[ \operatorname{Span}\{b_1, b_2, \dots, b_k, b_{k+1}, a_{k+2}, \dots, a_n\} = \operatorname{Span}\{b_1, b_2, \dots, b_k, a_{k+1}, a_{k+2}, \dots, a_n\} \]

and so this is a spanning set for V. It remains to show $b_1, \dots, b_k, b_{k+1}, a_{k+2}, \dots, a_n$
are linearly independent. If

\[ \beta_1 b_1 + \cdots + \beta_k b_k + \beta_{k+1} b_{k+1} + \alpha_{k+2} a_{k+2} + \cdots + \alpha_n a_n = 0 \tag{11.2} \]

then substituting for $b_{k+1}$ from the expression for $b_{k+1}$ in equation (1), we obtain a linear
combination of $\{b_1, b_2, \dots, b_k, a_{k+1}, a_{k+2}, \dots, a_n\}$ equal to 0, where the coefficient of
$a_{k+1}$ is $\beta_{k+1}\alpha_{k+1}$ (here $\alpha_{k+1} \ne 0$ is the coefficient from equation (1)). Since this last
set is a basis by induction, all the coefficients in this linear combination must be 0; in
particular $\beta_{k+1}\alpha_{k+1} = 0$, so $\beta_{k+1} = 0$. But then equation (2) is

\[ \beta_1 b_1 + \cdots + \beta_k b_k + \alpha_{k+2} a_{k+2} + \cdots + \alpha_n a_n = 0. \]

Again by the induction hypothesis all the other coefficients must be 0 as well. Thus
$\{b_1, b_2, \dots, b_k, b_{k+1}, a_{k+2}, \dots, a_n\}$ is a basis for V, and the induction is complete.


Corollary 4. 

(1) Suppose V has a finite basis with n elements. Any set of linearly independent
vectors has $\le n$ elements. Any spanning set has $\ge n$ elements.

(2) If V has some finite basis then any two bases of V have the same cardinality. 


Proof: (1) This is a restatement of the last result of Theorem 3 and Corollary 2. 
(2) This is immediate from (1) since a basis is both a spanning set and a linearly 
independent set. 


Definition. If V is a finitely generated F-module (i.e., has a finite basis) the cardinality
of any basis is called the dimension of V and is denoted by $\dim_F V$, or just dim V when
F is clear from the context, and V is said to be finite dimensional over F. If V is not
finitely generated, V is said to be infinite dimensional (written $\dim V = \infty$).

Examples 

(1) The dimension of the space of solutions to the differential equation $y'' - 3y' + 2y = 0$
over $\mathbb{C}$ is 2 (with basis $e^t$, $e^{2t}$, for example). In general, it is a theorem in differential
equations that the space of solutions of an $n$th order linear, homogeneous, constant
coefficient differential equation over $\mathbb{C}$ forms a vector space over $\mathbb{C}$ of
dimension n.

(2) The dimension over F of the quotient $F[x]/(f(x))$ by the nonzero polynomial $f(x)$
considered above is $n = \deg f(x)$. The space $F[x]$ and its subspace $(f(x))$ are infinite
dimensional vector spaces over F.


Corollary 5. (Building-Up Lemma) If A is a set of linearly independent vectors in the 
finite dimensional space V then there exists a basis of V containing A . 

Proof: This is also immediate from Theorem 3, since we can use the elements of 
A to successively replace the elements of any given basis for V (which exists by the 
assumption that V is finite dimensional). 


Theorem 6. If V is an n dimensional vector space over F, then $V \cong F^n$. In particular,
any two finite dimensional vector spaces over F of the same dimension are isomorphic.


Proof: Let $v_1, v_2, \dots, v_n$ be a basis for V. Define the map

\[ \varphi : F^n \to V \quad\text{by}\quad \varphi(\alpha_1, \alpha_2, \dots, \alpha_n) = \alpha_1 v_1 + \alpha_2 v_2 + \cdots + \alpha_n v_n. \]

The map $\varphi$ is clearly F-linear, is surjective since the $v_i$ span V, and is injective since
the $v_i$ are linearly independent, hence is an isomorphism.

Examples

(1) Let F be a finite field with q elements and let W be a k-dimensional vector space over
F. We show that the number of distinct bases of W is

\[ (q^k - 1)(q^k - q)(q^k - q^2) \cdots (q^k - q^{k-1}). \]

Every basis of W can be built up as follows. Any nonzero vector $w_1$ can be the first
element of a basis. Since W is isomorphic to $F^k$, $|W| = q^k$, so there are $q^k - 1$
choices for $w_1$. Any vector not in the 1-dimensional space spanned by $w_1$ is linearly
independent from $w_1$ and so may be chosen for the second basis element, $w_2$. A
1-dimensional space is isomorphic to F and so has q elements. Thus there are $q^k - q$
choices for $w_2$. Proceeding in this way one sees that at the $i$th stage any vector not in the
$(i - 1)$-dimensional space spanned by $w_1, w_2, \dots, w_{i-1}$ will be linearly independent
from $w_1, w_2, \dots, w_{i-1}$ and so may be chosen for the $i$th basis vector $w_i$. An $(i - 1)$-dimensional
space is isomorphic to $F^{i-1}$ and so has $q^{i-1}$ elements. Thus there are
$q^k - q^{i-1}$ choices for $w_i$. The process terminates when $w_k$ is chosen, for then we have
k linearly independent vectors in a k-dimensional space, hence a basis.

(2) Let F be a finite field with q elements and let V be an n-dimensional vector space
over F. For each $k \in \{1, 2, \dots, n\}$ we show that the number of subspaces of V of
dimension k is

\[ \frac{(q^n - 1)(q^n - q) \cdots (q^n - q^{k-1})}{(q^k - 1)(q^k - q) \cdots (q^k - q^{k-1})}. \]

Any k-dimensional space is spanned by k independent vectors. By arguing as in the
preceding example the numerator of the above expression is the number of ways of
picking k independent vectors from an n-dimensional space. Two sets of k independent
vectors span the same space W if and only if they are both bases of the k-dimensional
space W. In order to obtain the formula for the number of distinct subspaces of
dimension k we must divide by the number of repetitions, i.e., the number of bases of
a fixed k-dimensional space. This factor which appears in the denominator is precisely
the number computed in Example 1.
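
Both counting formulas can be checked against a direct enumeration for small cases. The sketch below assumes F = Z/pZ for a prime p, so that arithmetic modulo p models the field; the function names are hypothetical.

```python
from itertools import product

def count_bases(q, k):
    """(q^k - 1)(q^k - q)...(q^k - q^(k-1)): ordered bases of a k-dimensional space."""
    total = 1
    for i in range(k):
        total *= q**k - q**i
    return total

def count_subspaces(q, n, k):
    """Number of k-dimensional subspaces of an n-dimensional space over a field of order q."""
    numerator = 1
    for i in range(k):
        numerator *= q**n - q**i
    return numerator // count_bases(q, k)

def brute_force_subspaces(p, n, k):
    """Enumerate the k-dimensional subspaces of (Z/pZ)^n directly (p prime)."""
    vectors = list(product(range(p), repeat=n))
    subspaces = set()
    for gens in product(vectors, repeat=k):
        span = {
            tuple(sum(c * g[j] for c, g in zip(coeffs, gens)) % p for j in range(n))
            for coeffs in product(range(p), repeat=k)
        }
        # k vectors are independent exactly when their span has p^k elements
        if len(span) == p**k:
            subspaces.add(frozenset(span))
    return len(subspaces)

print(count_bases(2, 3), count_subspaces(2, 3, 2), brute_force_subspaces(2, 3, 2))
```

The enumeration grows very quickly with p, n and k, so it is only useful as a sanity check on the closed formulas.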

Next, we prove an important relation between the dimension of a subspace, the 
dimension of its associated quotient space and the dimension of the whole space: 

Theorem 7. Let V be a vector space over F and let W be a subspace of V. Then V / W 
is a vector space with dim V = dim W + dim V/W (where if one side is infinite then 
both are). 

Proof: Suppose W has dimension m and V has dimension n over F and let
$w_1, w_2, \dots, w_m$ be a basis for W. By Corollary 5, these linearly independent elements
of V can be extended to a basis $w_1, w_2, \dots, w_m, v_{m+1}, \dots, v_n$ of V. The natural
surjective projection map of V into V/W maps each $w_i$ to 0. No nonzero linear combination of
the $v_i$ is mapped to 0, since this would imply this linear combination is an element of
W, contrary to the choice of the $v_i$. Hence, the image V/W of this projection map is
isomorphic to the subspace of V spanned by the $v_i$, hence $\dim V/W = n - m$, which is
the theorem when the dimensions are finite. If either side is infinite it is an easy exercise
to produce an infinite number of linearly independent vectors showing the other side is
also infinite.

Corollary 8. Let $\varphi : V \to U$ be a linear transformation of vector spaces over F. Then
$\ker \varphi$ is a subspace of V, $\varphi(V)$ is a subspace of U and $\dim V = \dim \ker \varphi + \dim \varphi(V)$.

Proof: This follows immediately from Theorem 7 applied to $W = \ker \varphi$, since
$\varphi(V) \cong V/\ker \varphi$. Note that the proof of Theorem
7 is in fact the special case of Corollary 8 where U is the quotient V/W and $\varphi$ is the
natural projection homomorphism.
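
For a linear transformation given by a matrix over Q, the identity of Corollary 8 can be verified by computing a basis of the kernel explicitly. The following is a sketch under that assumption; `rref` and `kernel_basis` are hypothetical helper names and the matrix is an arbitrary example.

```python
from fractions import Fraction

def rref(matrix):
    """Reduced row echelon form over Q, together with the list of pivot columns."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivots, r = [], 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        rows[r] = [x / rows[r][c] for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    return rows, pivots

def kernel_basis(matrix):
    """One kernel vector per free (non-pivot) column."""
    rows, pivots = rref(matrix)
    n = len(matrix[0])
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for r, p in enumerate(pivots):
            v[p] = -rows[r][free]
        basis.append(v)
    return basis

# A represents a map Q^4 -> Q^3; its third row is the sum of the first two.
A = [[1, 2, 0, 1], [0, 1, 1, 0], [1, 3, 1, 1]]
rank_A = len(rref(A)[1])
nullity_A = len(kernel_basis(A))
print(rank_A, nullity_A)
```

Here the rank is the number of pivot columns and the nullity is the number of free columns, so their sum is the number of columns, which is the dimension of the domain.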

Corollary 9. Let $\varphi : V \to W$ be a linear transformation of vector spaces of the same
finite dimension. Then the following are equivalent:

(1) $\varphi$ is an isomorphism

(2) $\varphi$ is injective, i.e., $\ker \varphi = 0$

(3) $\varphi$ is surjective, i.e., $\varphi(V) = W$

(4) $\varphi$ sends a basis of V to a basis of W.

Proof: The equivalence of these conditions follows from Corollary 8 by counting 
dimensions. 

Definition. If $\varphi : V \to U$ is a linear transformation of vector spaces over F, $\ker \varphi$ is
sometimes called the null space of $\varphi$ and the dimension of $\ker \varphi$ is called the nullity of
$\varphi$. The dimension of $\varphi(V)$ is called the rank of $\varphi$. If $\ker \varphi = 0$, the transformation is
said to be nonsingular.

Example 

Let F be a finite field with q elements and let V be an n-dimensional vector space over
F. Recall that the general linear group GL(V) is the group of all nonsingular linear
transformations from V to V (the group operation being composition). We show that the
order of this group is

\[ |GL(V)| = (q^n - 1)(q^n - q)(q^n - q^2) \cdots (q^n - q^{n-1}). \]

To see this, fix a basis $v_1, \dots, v_n$ of V. A linear transformation is nonsingular if and only
if it sends this basis to another basis of V. Moreover, if $w_1, \dots, w_n$ is any basis of V, by
Theorem 6 in Section 10.3 there is a unique linear transformation which sends $v_i$ to $w_i$,
$1 \le i \le n$. Thus the number of nonsingular linear transformations from V to itself equals
the number of distinct bases of V. This number, which was computed in Example 1 above
(with k = n), is the order of GL(V).
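
As a sanity check, the order formula can be compared with a direct count that applies the definition of nonsingular (trivial kernel) to every matrix. This sketch assumes F = Z/pZ with p prime and identifies linear transformations with n x n matrices; the function names are hypothetical.

```python
from itertools import product

def gl_order_formula(q, n):
    """(q^n - 1)(q^n - q)...(q^n - q^(n-1))."""
    order = 1
    for i in range(n):
        order *= q**n - q**i
    return order

def is_nonsingular(matrix, p):
    """True when A x = 0 (mod p) has only the zero solution."""
    n = len(matrix)
    for x in product(range(p), repeat=n):
        if any(x) and all(sum(a * b for a, b in zip(row, x)) % p == 0 for row in matrix):
            return False
    return True

def gl_order_brute_force(p, n):
    """Count the nonsingular n x n matrices over Z/pZ by exhaustive search."""
    count = 0
    for flat in product(range(p), repeat=n * n):
        matrix = [flat[i * n:(i + 1) * n] for i in range(n)]
        if is_nonsingular(matrix, p):
            count += 1
    return count

print(gl_order_formula(2, 3), gl_order_brute_force(2, 3))
```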


EXERCISES 

1. Let $V = \mathbb{R}^n$ and let $(a_1, a_2, \dots, a_n)$ be a fixed vector in V. Prove that the collection of
elements $(x_1, x_2, \dots, x_n)$ of V with $a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = 0$ is a subspace of V.
Determine the dimension of this subspace and find a basis.

2. Let V be the collection of polynomials with coefficients in $\mathbb{Q}$ in the variable x of degree
at most 5. Prove that V is a vector space over $\mathbb{Q}$ of dimension 6, with $1, x, x^2, \dots, x^5$ as
basis. Prove that $1, 1 + x, 1 + x + x^2, \dots, 1 + x + x^2 + x^3 + x^4 + x^5$ is also a basis for V.

3. Let $\varphi$ be the linear transformation $\varphi : \mathbb{R}^4 \to \mathbb{R}^1$ such that

$\varphi((1, 0, 0, 0)) = 1$, $\varphi((1, -1, 0, 0)) = 0$,
$\varphi((1, -1, 1, 0)) = 1$, $\varphi((1, -1, 1, -1)) = 0$.

Determine $\varphi((a, b, c, d))$.

4. Prove that the space of real-valued functions on the closed interval [a, b] is an infinite
dimensional vector space over $\mathbb{R}$, where a < b.

5. Prove that the space of continuous real-valued functions on the closed interval [a, b] is an
infinite dimensional vector space over $\mathbb{R}$, where a < b.

6. Let V be a vector space of finite dimension. If $\varphi$ is any linear transformation from V to V
prove there is an integer m such that the intersection of the image of $\varphi^m$ and the kernel of
$\varphi^m$ is {0}.

7. Let $\varphi$ be a linear transformation from a vector space V of dimension n to itself that satisfies
$\varphi^2 = 0$. Prove that the image of $\varphi$ is contained in the kernel of $\varphi$ and hence that the rank
of $\varphi$ is at most n/2.

8. Let V be a vector space over F and let $\varphi$ be a linear transformation of the vector space
V to itself. A nonzero element $v \in V$ satisfying $\varphi(v) = \lambda v$ for some $\lambda \in F$ is called
an eigenvector of $\varphi$ with eigenvalue $\lambda$. Prove that for any fixed $\lambda \in F$ the collection of
eigenvectors of $\varphi$ with eigenvalue $\lambda$ together with 0 forms a subspace of V.

9. Let V be a vector space over F and let $\varphi$ be a linear transformation of the vector space V
to itself. Suppose for $i = 1, 2, \dots, k$ that $v_i \in V$ is an eigenvector for $\varphi$ with eigenvalue
$\lambda_i \in F$ (cf. the preceding exercise) and that all the eigenvalues $\lambda_i$ are distinct. Prove that
$v_1, v_2, \dots, v_k$ are linearly independent. [Use induction on k: write a linear dependence
relation among the $v_i$ and apply $\varphi$ to get another linear dependence relation among the $v_i$
involving the eigenvalues; now subtract a suitable multiple of the first linear relation to get
a linear dependence relation on fewer elements.] Conclude that any linear transformation
on an n-dimensional vector space has at most n distinct eigenvalues.

In the following exercises let V be a vector space of arbitrary dimension over a field F. 

10. Prove that any vector space V has a basis (by convention the null set is the basis for the
zero space). [Let $\mathcal{S}$ be the set of subsets of V consisting of linearly independent vectors,
partially ordered under inclusion; apply Zorn's Lemma to $\mathcal{S}$ and show a maximal element
of $\mathcal{S}$ is a basis.]

11. Refine your argument in the preceding exercise to prove that any set of linearly independent
vectors of V is contained in a basis of V.

12. If F is a field with a finite or countable number of elements and V is an infinite dimensional
vector space over F with basis $\mathcal{B}$, prove that the cardinality of V equals the cardinality of
$\mathcal{B}$. Deduce in this case that any two bases of V have the same cardinality.

13. Prove that as vector spaces over $\mathbb{Q}$, $\mathbb{R}^n \cong \mathbb{R}$, for all $n \in \mathbb{Z}^+$ (note that, in particular, this
means $\mathbb{R}^n$ and $\mathbb{R}$ are isomorphic as additive abelian groups).

14. Let $\mathcal{A}$ be a basis for the infinite dimensional space V. Prove that V is isomorphic to the
direct sum of copies of the field F indexed by the set $\mathcal{A}$. Prove that the direct product of
copies of F indexed by $\mathcal{A}$ is a vector space over F and it has strictly larger dimension than
the dimension of V (see the exercises in Section 10.3 for the definitions of direct sum and
direct product of infinitely many modules).


11.2 THE MATRIX OF A LINEAR TRANSFORMATION


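Throughout this section let V, W be vector spaces over the same field F, let
$\mathcal{B} = \{v_1, v_2, \dots, v_n\}$ be an (ordered) basis of V, let $\mathcal{E} = \{w_1, w_2, \dots, w_m\}$ be an (ordered)
basis of W and let $\varphi \in \operatorname{Hom}(V, W)$ be a linear transformation from V to W. For each
$j \in \{1, 2, \dots, n\}$ write the image of $v_j$ under $\varphi$ in terms of the basis $\mathcal{E}$:

\[ \varphi(v_j) = \sum_{i=1}^{m} \alpha_{ij} w_i. \tag{11.3} \]

Let $M_{\mathcal{B}}^{\mathcal{E}}(\varphi) = (\alpha_{ij})$ be the $m \times n$ matrix whose i, j entry is $\alpha_{ij}$ (that is, use the coefficients
of the $w_i$'s in the above computation of $\varphi(v_j)$ for the $j$th column of this matrix). The
matrix $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ is called the matrix of $\varphi$ with respect to the bases $\mathcal{B}$, $\mathcal{E}$. The domain basis
is the lower and the codomain basis the upper letters appearing after the "M". Given
this matrix, we can recover the linear transformation $\varphi$ as follows: to compute $\varphi(v)$ for
$v \in V$, write v in terms of the basis $\mathcal{B}$:

\[ v = \sum_{i=1}^{n} \alpha_i v_i, \qquad \alpha_i \in F, \]

and then calculate the product of the $m \times n$ and $n \times 1$ matrices

\[ M_{\mathcal{B}}^{\mathcal{E}}(\varphi) \begin{pmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_n \end{pmatrix} = \begin{pmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_m \end{pmatrix}. \]

The image of v under $\varphi$ is given by

\[ \varphi(v) = \sum_{i=1}^{m} \beta_i w_i, \]

i.e., the column vector of coordinates of $\varphi(v)$ with respect to the basis $\mathcal{E}$ is obtained
by multiplying the matrix $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ by the column vector of coordinates of v with respect
to the basis $\mathcal{B}$ (sometimes denoted $[\varphi(v)]_{\mathcal{E}} = M_{\mathcal{B}}^{\mathcal{E}}(\varphi)[v]_{\mathcal{B}}$).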

Definition. The $m \times n$ matrix $A = (\alpha_{ij})$ associated to the linear transformation $\varphi$
above is said to represent the linear transformation $\varphi$ with respect to the bases $\mathcal{B}$, $\mathcal{E}$.
Similarly, $\varphi$ is the linear transformation represented by A with respect to the bases $\mathcal{B}$, $\mathcal{E}$.

Examples 

(1) Let $V = \mathbb{R}^3$ with the standard basis $\mathcal{B} = \{(1, 0, 0), (0, 1, 0), (0, 0, 1)\}$ and let
$W = \mathbb{R}^2$ with the standard basis $\mathcal{E} = \{(1, 0), (0, 1)\}$. Let $\varphi$ be the linear transformation
$\varphi(x, y, z) = (x + 2y, x + y + z)$. Since $\varphi(1, 0, 0) = (1, 1)$, $\varphi(0, 1, 0) = (2, 1)$,
$\varphi(0, 0, 1) = (0, 1)$, the matrix $A = M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ is the matrix

\[ \begin{pmatrix} 1 & 2 & 0 \\ 1 & 1 & 1 \end{pmatrix}. \]

(2) Let V = W be the 2-dimensional space of solutions of the differential equation
$y'' - 3y' + 2y = 0$ over $\mathbb{C}$ and let $\mathcal{B} = \mathcal{E}$ be the basis $v_1 = e^t$, $v_2 = e^{2t}$. Since the
coefficients of this equation are constants it is easy to check that if y is a solution then
its derivative $y'$ is also a solution. It follows that the map $\varphi = d/dt$ = differentiation
(with respect to t) is a linear transformation from V to itself. Since $\varphi(v_1) = d(e^t)/dt =
e^t = v_1$ and $\varphi(v_2) = d(e^{2t})/dt = 2e^{2t} = 2v_2$ we see that the corresponding matrix
with respect to these bases is the diagonal matrix

\[ \begin{pmatrix} 1 & 0 \\ 0 & 2 \end{pmatrix}. \]

(3) Let $V = W = \mathbb{Q}^3 = \{(x, y, z) \mid x, y, z \in \mathbb{Q}\}$ be the usual 3-dimensional vector space
of ordered 3-tuples with entries from the field $F = \mathbb{Q}$ of rational numbers and suppose
$\varphi$ is the linear transformation

\[ \varphi(x, y, z) = (9x + 4y + 5z,\; -4x - 3z,\; -6x - 4y - 2z), \qquad x, y, z \in \mathbb{Q} \]

from V to itself. Take the standard basis $e_1 = (1, 0, 0)$, $e_2 = (0, 1, 0)$, $e_3 = (0, 0, 1)$
for V and for W = V. Since $\varphi(1, 0, 0) = (9, -4, -6)$, $\varphi(0, 1, 0) = (4, 0, -4)$,
$\varphi(0, 0, 1) = (5, -3, -2)$, the matrix A representing this linear transformation with
respect to these bases is

\[ A = \begin{pmatrix} 9 & 4 & 5 \\ -4 & 0 & -3 \\ -6 & -4 & -2 \end{pmatrix}. \]
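
The recipe "the jth column is the image of the jth basis vector" translates directly into code when both bases are the standard ones. The sketch below covers only that special case (for other bases one would also have to solve for coordinates); `matrix_of` and `apply` are hypothetical helper names, and the two maps are those of Examples 1 and 3.

```python
def matrix_of(phi, n):
    """Matrix of phi : F^n -> F^m with respect to the standard bases."""
    columns = []
    for j in range(n):
        e_j = [1 if i == j else 0 for i in range(n)]
        columns.append(phi(*e_j))        # column j holds the coordinates of phi(e_j)
    m = len(columns[0])
    return [[columns[j][i] for j in range(n)] for i in range(m)]

def apply(matrix, v):
    """Multiply the matrix by the coordinate column of v."""
    return tuple(sum(a * x for a, x in zip(row, v)) for row in matrix)

def phi1(x, y, z):                        # Example 1
    return (x + 2 * y, x + y + z)

def phi3(x, y, z):                        # Example 3
    return (9 * x + 4 * y + 5 * z, -4 * x - 3 * z, -6 * x - 4 * y - 2 * z)

A1 = matrix_of(phi1, 3)
A3 = matrix_of(phi3, 3)
print(A1)
print(A3)
```

Multiplying `A1` by the coordinates of a vector gives the same result as evaluating `phi1` on it, which is the statement that the matrix recovers the linear transformation.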


Theorem 10. Let V be a vector space over F of dimension n and let W be a vector space
over F of dimension m, with bases $\mathcal{B}$, $\mathcal{E}$ respectively. Then the map $\operatorname{Hom}_F(V, W) \to
M_{m \times n}(F)$ from the space of linear transformations from V to W to the space of $m \times n$
matrices with coefficients in F defined by $\varphi \mapsto M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ is a vector space isomorphism.
In particular, there is a bijective correspondence between linear transformations and
their associated matrices with respect to a fixed choice of bases.

Proof: The columns of the matrix $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ are determined by the action of $\varphi$ on
the basis $\mathcal{B}$ as in equation (3). This shows in particular that the map $\varphi \mapsto M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ is
an F-linear map since $\varphi$ is F-linear. This map is surjective since given a matrix M,
the map $\varphi$ defined by equation (3) on a basis and then extended by linearity is a linear
transformation with matrix M. The map is injective since two linear transformations
agreeing on a basis are the same.

Note that different choices of bases give rise to different isomorphisms, so in the
same sense that there is no natural choice of basis for a vector space, there is no natural
isomorphism between $\operatorname{Hom}_F(V, W)$ and $M_{m \times n}(F)$.

Corollary 11. The dimension of $\operatorname{Hom}_F(V, W)$ is (dim V)(dim W).

Proof: The dimension of $M_{m \times n}(F)$ is mn.

Definition. An $m \times n$ matrix A is called nonsingular if $Ax = 0$ with $x \in F^n$ implies
$x = 0$.

The connection of the term nonsingular applied to matrices and to linear transformations
is the following: let $A = M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ be the matrix associated to the linear
transformation $\varphi$ (with some choice of bases $\mathcal{B}$, $\mathcal{E}$). Then independently of the choice
of bases, the $m \times n$ matrix A is nonsingular if and only if the linear transformation $\varphi$ is a
nonsingular linear transformation from the n-dimensional space V to the m-dimensional
space W (cf. the exercises).

Assume now that U, V and W are all finite dimensional vector spaces over F
with ordered bases $\mathcal{D}$, $\mathcal{B}$ and $\mathcal{E}$ respectively, where $\mathcal{B}$ and $\mathcal{E}$ are as before and suppose
$\mathcal{D} = \{u_1, u_2, \dots, u_k\}$. Assume $\psi : U \to V$ and $\varphi : V \to W$ are linear transformations.
Their composite, $\varphi \circ \psi$, is a linear transformation from U to W, so we can compute its
matrix with respect to the appropriate bases; namely, $M_{\mathcal{D}}^{\mathcal{E}}(\varphi \circ \psi)$ is found by computing

\[ \varphi \circ \psi(u_j) = \sum_{i=1}^{m} \gamma_{ij} w_i \]

and putting the coefficients $\gamma_{ij}$ down the $j$th column of $M_{\mathcal{D}}^{\mathcal{E}}(\varphi \circ \psi)$. Next, compute the
matrices of $\psi$ and $\varphi$ separately:

\[ \psi(u_j) = \sum_{p=1}^{n} \alpha_{pj} v_p \qquad\text{and}\qquad \varphi(v_p) = \sum_{i=1}^{m} \beta_{ip} w_i \]

so that $M_{\mathcal{D}}^{\mathcal{B}}(\psi) = (\alpha_{pj})$ and $M_{\mathcal{B}}^{\mathcal{E}}(\varphi) = (\beta_{ip})$.

Using these coefficients we can find an expression for the $\gamma$'s in terms of the $\alpha$'s
and $\beta$'s as follows:

\[ \varphi \circ \psi(u_j) = \varphi\Bigl(\sum_{p=1}^{n} \alpha_{pj} v_p\Bigr)
 = \sum_{p=1}^{n} \alpha_{pj}\, \varphi(v_p)
 = \sum_{p=1}^{n} \alpha_{pj} \sum_{i=1}^{m} \beta_{ip} w_i
 = \sum_{p=1}^{n} \sum_{i=1}^{m} \alpha_{pj} \beta_{ip} w_i. \]

By interchanging the order of summation in the above double sum we see that $\gamma_{ij}$, which
is the coefficient of $w_i$ in the above expression, is

\[ \gamma_{ij} = \sum_{p=1}^{n} \alpha_{pj} \beta_{ip}. \]

Computing the product of the matrices for $\varphi$ and $\psi$ (in that order) we obtain

\[ (\beta_{ij})(\alpha_{ij}) = (\delta_{ij}), \qquad\text{where } \delta_{ij} = \sum_{p=1}^{n} \beta_{ip} \alpha_{pj}. \]

By comparing the two sums above and using the commutativity of field multiplication,
we see that for all i and j, $\gamma_{ij} = \delta_{ij}$. This computation proves the following result:

Theorem 12. With notations as above, $M_{\mathcal{D}}^{\mathcal{E}}(\varphi \circ \psi) = M_{\mathcal{B}}^{\mathcal{E}}(\varphi) M_{\mathcal{D}}^{\mathcal{B}}(\psi)$, i.e., with respect
to a compatible choice of bases, the product of the matrices representing the linear
transformations $\varphi$ and $\psi$ is the matrix representing the composite linear transformation
$\varphi \circ \psi$.

Corollary 13. Matrix multiplication is associative and distributive (whenever the di- 
mensions are such as to make products defined). An n x n matrix A is nonsingular if 
and only if it is invertible. 

Proof: Let A, B and C be matrices such that the products (AB)C and A(BC) are
defined, and let S, T and R denote the associated linear transformations. By Theorem
12, the linear transformation corresponding to AB is the composite $S \circ T$ so the linear
transformation corresponding to (AB)C is the composite $(S \circ T) \circ R$. Similarly, the
linear transformation corresponding to A(BC) is the composite $S \circ (T \circ R)$. Since
function composition is associative, these two linear transformations are the same, and
so (AB)C = A(BC) by Theorem 10. The distributivity is proved similarly. Note also
that it is possible to prove these results by straightforward (albeit tedious) calculations
with matrices.

If A is invertible, then $Ax = 0$ implies $x = A^{-1}Ax = A^{-1}0 = 0$, so A is
nonsingular. Conversely, if A is nonsingular, fix bases $\mathcal{B}$, $\mathcal{E}$ for V and let $\varphi$ be the
linear transformation of V to itself represented by A with respect to these bases. By
Corollary 9, $\varphi$ is an isomorphism of V to itself, hence has an inverse, $\varphi^{-1}$. Let B
be the matrix representing $\varphi^{-1}$ with respect to the bases $\mathcal{E}$, $\mathcal{B}$ (note the order). Then
$AB = M_{\mathcal{B}}^{\mathcal{E}}(\varphi) M_{\mathcal{E}}^{\mathcal{B}}(\varphi^{-1}) = M_{\mathcal{E}}^{\mathcal{E}}(\varphi \circ \varphi^{-1}) = M_{\mathcal{E}}^{\mathcal{E}}(1) = I$. Similarly, $BA = I$ so B is
the inverse of A.
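
Theorem 12 and the associativity statement of Corollary 13 can be illustrated numerically. In this sketch all bases are standard, the map `psi` is a hypothetical example chosen here, and `phi` is the map of Example 1 earlier in this section.

```python
def matrix_of(f, n):
    """Matrix of f : F^n -> F^m with respect to the standard bases."""
    columns = [f(*[1 if i == j else 0 for i in range(n)]) for j in range(n)]
    return [[col[i] for col in columns] for i in range(len(columns[0]))]

def matmul(X, Y):
    """Product of an a x b matrix X with a b x c matrix Y."""
    return [[sum(X[i][p] * Y[p][j] for p in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

def psi(s, t):                 # psi : Q^2 -> Q^3 (hypothetical example)
    return (s + t, 2 * s, s - 3 * t)

def phi(x, y, z):              # phi : Q^3 -> Q^2
    return (x + 2 * y, x + y + z)

def composite(s, t):           # phi o psi : Q^2 -> Q^2
    return phi(*psi(s, t))

M_phi = matrix_of(phi, 3)
M_psi = matrix_of(psi, 2)
M_comp = matrix_of(composite, 2)
print(M_comp, matmul(M_phi, M_psi))

# Associativity on one example: (M_phi M_psi) C = M_phi (M_psi C)
C = [[1, 2], [3, 4]]
left = matmul(matmul(M_phi, M_psi), C)
right = matmul(M_phi, matmul(M_psi, C))
```

Note the order of the factors: the matrix of the map applied first (`psi`) stands on the right, matching the order of composition.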

Corollary 14. 

(1) If $\mathcal{B}$ is a basis of the n-dimensional space V, the map $\varphi \mapsto M_{\mathcal{B}}^{\mathcal{B}}(\varphi)$ is a ring and
a vector space isomorphism of $\operatorname{Hom}_F(V, V)$ onto the space $M_n(F)$ of $n \times n$
matrices with coefficients in F.

(2) $GL(V) \cong GL_n(F)$ where dim V = n. In particular, if F is a finite field
the order of the finite group $GL_n(F)$ (which equals $|GL(V)|$) is given by the
formula at the end of Section 1.

Proof: (1) We have already seen in Theorem 10 that this map is an isomorphism
of vector spaces over F. Corollary 13 shows that $M_n(F)$ is a ring under matrix
multiplication, and then Theorem 12 shows that multiplication is preserved under this map,
hence it is also a ring isomorphism.

(2) This is immediate from (1) since a ring isomorphism sends units to units. 

Definition. If A is any $m \times n$ matrix with entries from F, the row rank (respectively,
column rank) of A is the maximal number of linearly independent rows (respectively,
columns) of A (where the rows or columns of A are considered as vectors in affine
n-space, m-space, respectively).

The relation between the rank of a matrix and the rank of the associated linear
transformation is the following: the rank of $\varphi$ as a linear transformation equals the
column rank of the matrix $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$ (cf. the exercises). We shall also see that the row
rank and the column rank of any matrix are the same.

We now consider the relation of two matrices associated to the same linear transformation
of a vector space to itself but with respect to two different choices of bases (cf.
the exercises for the general statement regarding a linear transformation from a vector
space V to another vector space W).

Definition. Two $n \times n$ matrices A and B are said to be similar if there is an invertible
(i.e., nonsingular) $n \times n$ matrix P such that $P^{-1}AP = B$. Two linear transformations
$\varphi$ and $\psi$ from a vector space V to itself are said to be similar if there is a nonsingular
linear transformation $\xi$ from V to V such that $\xi^{-1}\varphi\xi = \psi$.

Suppose $\mathcal{B}$ and $\mathcal{E}$ are two bases of the same vector space V and let $\varphi \in \operatorname{Hom}_F(V, V)$.
Let I be the identity map from V to V and let $P = M_{\mathcal{E}}^{\mathcal{B}}(I)$ be its associated matrix
(in other words, write the elements of the basis $\mathcal{E}$ in terms of the basis $\mathcal{B}$ (note the
order) and use the resulting coordinates for the columns of the matrix P). Note that
if $\mathcal{B} \ne \mathcal{E}$ then P is not the identity matrix. Then $P^{-1} M_{\mathcal{B}}^{\mathcal{B}}(\varphi) P = M_{\mathcal{E}}^{\mathcal{E}}(\varphi)$. If $[v]_{\mathcal{B}}$ is
the $n \times 1$ matrix of coordinates for $v \in V$ with respect to the basis $\mathcal{B}$, and similarly
$[v]_{\mathcal{E}}$ is the $n \times 1$ matrix of coordinates for $v \in V$ with respect to the basis $\mathcal{E}$, then
$[v]_{\mathcal{B}} = P[v]_{\mathcal{E}}$. The matrix P is called the transition or change of basis matrix from $\mathcal{B}$
to $\mathcal{E}$ and this similarity action on $M_{\mathcal{B}}^{\mathcal{B}}(\varphi)$ is called a change of basis. This shows that
the matrices associated to the same linear transformation with respect to two different
bases are similar.

Conversely, suppose A and B are $n \times n$ matrices similar by a nonsingular matrix P.
Let $\mathcal{B}$ be a basis for the n-dimensional vector space V. Define the linear transformation
$\varphi$ of V (with basis $\mathcal{B}$) to V (again with basis $\mathcal{B}$) by equation (3) using the given matrix
A, i.e.,

\[ \varphi(v_j) = \sum_{i=1}^{n} \alpha_{ij} v_i. \]

Then $A = M_{\mathcal{B}}^{\mathcal{B}}(\varphi)$ by definition of $\varphi$. Define a new basis $\mathcal{E}$ of V by using the $i$th
column of P for the coordinates of $w_i$ in terms of the basis $\mathcal{B}$ (so $P = M_{\mathcal{E}}^{\mathcal{B}}(I)$ by
definition). Then $B = P^{-1}AP = P^{-1} M_{\mathcal{B}}^{\mathcal{B}}(\varphi) P = M_{\mathcal{E}}^{\mathcal{E}}(\varphi)$ is the matrix associated to
$\varphi$ with respect to the basis $\mathcal{E}$. This shows that any two similar $n \times n$ matrices arise in
this fashion as the matrices representing the same linear transformation with respect to
two different choices of bases.

Note that change of basis for a linear transformation from V to itself is the same as
conjugation by some element of the group GL(V) of nonsingular linear transformations
of V to V. In particular, the relation "similarity" is an equivalence relation whose
equivalence classes are the orbits of GL(V) acting by conjugation on $\operatorname{Hom}_F(V, V)$. If
$\varphi \in GL(V)$ (i.e., $\varphi$ is an invertible linear transformation), then the similarity class of
$\varphi$ is none other than the conjugacy class of $\varphi$ in the group GL(V).


Example 

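Let $V = \mathbb{Q}^3$ and let $\varphi$ be the linear transformation

\[ \varphi(x, y, z) = (9x + 4y + 5z,\; -4x - 3z,\; -6x - 4y - 2z), \qquad x, y, z \in \mathbb{Q} \]

from V to itself we considered in an earlier example. With respect to the standard basis
$\mathcal{B}$, $b_1 = (1, 0, 0)$, $b_2 = (0, 1, 0)$, $b_3 = (0, 0, 1)$, we saw that the matrix A representing this
linear transformation is

\[ A = M_{\mathcal{B}}^{\mathcal{B}}(\varphi) = \begin{pmatrix} 9 & 4 & 5 \\ -4 & 0 & -3 \\ -6 & -4 & -2 \end{pmatrix}. \]

Take now the basis $\mathcal{E}$, $e_1 = (2, -1, -2)$, $e_2 = (1, 0, -1)$, $e_3 = (3, -2, -2)$ for V (we
shall see that this is in fact a basis momentarily). Since

\[ \begin{aligned}
\varphi(e_1) &= \varphi(2, -1, -2) = (4, -2, -4) = 2 \cdot e_1 + 0 \cdot e_2 + 0 \cdot e_3 \\
\varphi(e_2) &= \varphi(1, 0, -1) = (4, -1, -4) = 1 \cdot e_1 + 2 \cdot e_2 + 0 \cdot e_3 \\
\varphi(e_3) &= \varphi(3, -2, -2) = (9, -6, -6) = 0 \cdot e_1 + 0 \cdot e_2 + 3 \cdot e_3,
\end{aligned} \]

the matrix representing $\varphi$ with respect to this basis is the matrix

\[ B = M_{\mathcal{E}}^{\mathcal{E}}(\varphi) = \begin{pmatrix} 2 & 1 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}. \]

Writing the elements of the basis $\mathcal{E}$ in terms of the basis $\mathcal{B}$ we have

\[ \begin{aligned}
e_1 &= 2b_1 - b_2 - 2b_3 \\
e_2 &= b_1 - b_3 \\
e_3 &= 3b_1 - 2b_2 - 2b_3
\end{aligned} \]

so the matrix

\[ P = M_{\mathcal{E}}^{\mathcal{B}}(I) = \begin{pmatrix} 2 & 1 & 3 \\ -1 & 0 & -2 \\ -2 & -1 & -2 \end{pmatrix}
\quad\text{with inverse}\quad
P^{-1} = \begin{pmatrix} -2 & -1 & -2 \\ 2 & 2 & 1 \\ 1 & 0 & 1 \end{pmatrix} \]

conjugates A into B, i.e., $P^{-1}AP = B$, as can easily be checked. (Note incidentally that
since P is invertible this proves that $\mathcal{E}$ is indeed a basis for V.)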
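
The check that $P^{-1}AP = B$ is a routine matrix computation and can be carried out mechanically. The sketch below uses the four matrices of this example with plain integer arithmetic; `matmul` is a hypothetical helper name.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][p] * Y[p][j] for p in range(len(Y))) for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[9, 4, 5], [-4, 0, -3], [-6, -4, -2]]        # matrix of phi in the standard basis
P = [[2, 1, 3], [-1, 0, -2], [-2, -1, -2]]        # columns: e_1, e_2, e_3 in standard coordinates
P_inv = [[-2, -1, -2], [2, 2, 1], [1, 0, 1]]
B = [[2, 1, 0], [0, 2, 0], [0, 0, 3]]             # matrix of phi in the basis e_1, e_2, e_3
I3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

conjugate = matmul(matmul(P_inv, A), P)
print(conjugate)

# Change of coordinates: the vector e_1 + e_2 + e_3 has coordinates (1, 1, 1)
# in the new basis; multiplying by P gives its standard coordinates.
standard_coords = [row[0] for row in matmul(P, [[1], [1], [1]])]
print(standard_coords)
```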

We observe in passing that the matrix B representing this linear transformation <p is 
much simpler than the matrix A representing <p. The study of the simplest possible matrix 
representing a given linear transformation (and which basis to choose to realize it) is the 
study of canonical forms considered in the next chapter. 


Linear Transformations on Tensor Products of Vector Spaces 

For convenience we reiterate Corollaries 18 and 19 of Section 10.4 for the special case 
of vector spaces. 


Proposition 15. Let F be a subfield of the field K. If W is an m-dimensional vector
space over F with basis $w_1, \dots, w_m$, then $K \otimes_F W$ is an m-dimensional vector space
over K with basis $1 \otimes w_1, \dots, 1 \otimes w_m$.

Proposition 16. Let V and W be finite dimensional vector spaces over the field F with
bases $v_1, \dots, v_n$ and $w_1, \dots, w_m$ respectively. Then $V \otimes_F W$ is a vector space over F
of dimension nm with basis $v_i \otimes w_j$, $1 \le i \le n$ and $1 \le j \le m$.


Remark: If v and w are nonzero elements of V and W, respectively, then it follows from
the proposition that $v \otimes w$ is a nonzero element of $V \otimes_F W$, because we may always
build bases of V and W whose first basis vectors are v, w, respectively. In a tensor
product $M \otimes_R N$ of two R-modules where R is not a field it is in general substantially
more difficult to determine when the tensor product $m \otimes n$ of two nonzero elements is
zero.


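Now let $V$, $W$, $X$, $Y$ be finite dimensional vector spaces over $F$ and let
\[ \varphi : V \to X \quad\text{and}\quad \psi : W \to Y \]
be linear transformations. We compute a matrix of the linear transformation
\[ \varphi \otimes \psi : V \otimes W \to X \otimes Y. \]
Let $\mathcal{B}_1 = \{v_1, \dots, v_n\}$ and $\mathcal{B}_2 = \{w_1, \dots, w_m\}$ be (ordered) bases of $V$ and $W$ respectively,
and let $\mathcal{E}_1 = \{x_1, \dots, x_r\}$ and $\mathcal{E}_2 = \{y_1, \dots, y_s\}$ be (ordered) bases of $X$ and $Y$
respectively. Let $\mathcal{B} = \{v_i \otimes w_j\}$ and $\mathcal{E} = \{x_i \otimes y_j\}$ be the bases of $V \otimes W$ and $X \otimes Y$
given by Proposition 16; we shall order these shortly. Suppose
\[ \varphi(v_i) = \sum_{p=1}^{r} \alpha_{pi} x_p \quad\text{and}\quad \psi(w_j) = \sum_{q=1}^{s} \beta_{qj} y_q. \]
Then
\[
\begin{aligned}
(\varphi \otimes \psi)(v_i \otimes w_j) &= (\varphi(v_i)) \otimes (\psi(w_j)) \\
&= \Big(\sum_{p=1}^{r} \alpha_{pi} x_p\Big) \otimes \Big(\sum_{q=1}^{s} \beta_{qj} y_q\Big) \\
&= \sum_{p=1}^{r} \sum_{q=1}^{s} \alpha_{pi}\beta_{qj}\,(x_p \otimes y_q).
\end{aligned}
\tag{11.8}
\]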


In view of the order of summation in (11.8) we order the basis $\mathcal{E}$ into $r$ ordered sets,
with the $p$th list being $x_p \otimes y_1, x_p \otimes y_2, \dots, x_p \otimes y_s$, and similarly order the basis $\mathcal{B}$.
Then equation (8) determines the column entries for the corresponding matrix of $\varphi \otimes \psi$.
The resulting matrix $M_{\mathcal{B}}^{\mathcal{E}}(\varphi \otimes \psi)$ is an $r \times n$ block matrix whose $p, q$ block is the $s \times m$
matrix $\alpha_{pq} M_{\mathcal{B}_2}^{\mathcal{E}_2}(\psi)$. In other words, the matrix for $\varphi \otimes \psi$ is obtained by taking the
matrix for $\varphi$ and multiplying each entry by the matrix for $\psi$. Such matrices have a
name:


Definition. Let $A = (\alpha_{ij})$ and $B$ be $r \times n$ and $s \times m$ matrices, respectively, with
coefficients from any commutative ring. The Kronecker product or tensor product of
$A$ and $B$, denoted by $A \otimes B$, is the $rs \times nm$ matrix consisting of an $r \times n$ block matrix
whose $i, j$ block is the $s \times m$ matrix $\alpha_{ij}B$.

With this terminology we have 
Proposition 17. Let $\varphi : V \to X$ and $\psi : W \to Y$ be linear transformations of finite
dimensional vector spaces. Then the Kronecker product of matrices representing $\varphi$ and
$\psi$ is a matrix representation of $\varphi \otimes \psi$.


Example 

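Let $V = X = \mathbb{R}^3$, both with basis $v_1, v_2, v_3$, and $W = Y = \mathbb{R}^2$, both with basis $w_1, w_2$.
Suppose $\varphi : \mathbb{R}^3 \to \mathbb{R}^3$ is the linear transformation given by $\varphi(av_1 + bv_2 + cv_3) =
cv_1 + 2av_2 - 3bv_3$ and $\psi : \mathbb{R}^2 \to \mathbb{R}^2$ is the linear transformation given by $\psi(aw_1 + bw_2) =
(a + 3b)w_1 + (4b - 2a)w_2$. With respect to the chosen bases, the matrices for $\varphi$ and $\psi$ are
\[
\begin{pmatrix} 0 & 0 & 1 \\ 2 & 0 & 0 \\ 0 & -3 & 0 \end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix} 1 & 3 \\ -2 & 4 \end{pmatrix}
\]
respectively. Then with respect to the ordered basis
\[ \mathcal{B} = \{v_1 \otimes w_1,\ v_1 \otimes w_2,\ v_2 \otimes w_1,\ v_2 \otimes w_2,\ v_3 \otimes w_1,\ v_3 \otimes w_2\} \]
we have
\[
M_{\mathcal{B}}^{\mathcal{B}}(\varphi \otimes \psi) =
\left(\begin{array}{cc|cc|cc}
0 & 0 & 0 & 0 & 1 & 3 \\
0 & 0 & 0 & 0 & -2 & 4 \\ \hline
2 & 6 & 0 & 0 & 0 & 0 \\
-4 & 8 & 0 & 0 & 0 & 0 \\ \hline
0 & 0 & -3 & -9 & 0 & 0 \\
0 & 0 & 6 & -12 & 0 & 0
\end{array}\right),
\]
obtained (as indicated by the dividing lines) by multiplying the $2 \times 2$ matrix for $\psi$ successively
by the entries in the matrix for $\varphi$.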
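
The block construction in the definition of the Kronecker product can be checked mechanically. The following is a minimal sketch in Python, not part of the original text: the function name `kronecker` and the list-of-rows representation are assumptions made for illustration, and the two input matrices are the matrices of $\varphi$ and $\psi$ read off from the Example above.

```python
def kronecker(a, b):
    """Kronecker product of a (r x n) and b (s x m), both given as lists of rows.

    The result is an rs x nm list of rows whose (i, j) block is a[i][j] * b.
    """
    s, m = len(b), len(b[0])
    return [
        [a_row[j] * b[k][l] for j in range(len(a_row)) for l in range(m)]
        for a_row in a          # block row i
        for k in range(s)       # row k inside each block
    ]


# Matrices of phi and psi from the Example (column j holds the image of the
# j-th basis vector).
phi = [[0, 0, 1],
       [2, 0, 0],
       [0, -3, 0]]
psi = [[1, 3],
       [-2, 4]]

for row in kronecker(phi, psi):
    print(row)
```

The printed rows can be compared entry by entry with the $6 \times 6$ matrix of the Example; the same helper can be used to experiment with the trace identity of Exercise 38.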


EXERCISES 

1. Let $V$ be the collection of polynomials with coefficients in $\mathbb{Q}$ in the variable $x$ of degree at
most 5. Determine the transition matrix from the basis $1, x, x^2, \dots, x^5$ for $V$ to the basis
$1,\ 1 + x,\ 1 + x + x^2,\ \dots,\ 1 + x + x^2 + x^3 + x^4 + x^5$ for $V$.

2. Let $V$ be the vector space of the preceding exercise. Let $\varphi = d/dx$ be the linear
transformation of $V$ to itself given by usual differentiation of a polynomial with respect to $x$.
Determine the matrix of $\varphi$ with respect to the two bases for $V$ in the previous exercise.

3. Let $V$ be the collection of polynomials with coefficients in $F$ in the variable $x$ of degree
at most $n$. Determine the transition matrix from the basis $1, x, x^2, \dots, x^n$ for $V$ to the
elements
\[ 1,\ x - \lambda,\ \dots,\ (x - \lambda)^{n-1},\ (x - \lambda)^n \]
where $\lambda$ is a fixed element of $F$. Conclude that these elements are a basis for $V$.

4. Let $\varphi$ be the linear transformation of $\mathbb{R}^2$ to itself given by rotation counterclockwise around
the origin through an angle $\theta$. Show that the matrix of $\varphi$ with respect to the standard basis
for $\mathbb{R}^2$ is
\[ \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}. \]

5. Show that the $m \times n$ matrix $A$ is nonsingular if and only if the linear transformation $\varphi$ is a
nonsingular linear transformation from the $n$-dimensional space $V$ to the $m$-dimensional
space $W$, where $A = M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$, regardless of the choice of bases $\mathcal{B}$ and $\mathcal{E}$.

6. Prove if $\varphi \in \mathrm{Hom}_F(F^n, F^m)$, and $\mathcal{B}$, $\mathcal{E}$ are the natural bases of $F^n$, $F^m$ respectively, then
the range of $\varphi$ equals the span of the set of columns of $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$. Deduce that the rank of $\varphi$
(as a linear transformation) equals the column rank of $M_{\mathcal{B}}^{\mathcal{E}}(\varphi)$.

7. Prove that any two similar matrices have the same row rank and the same column rank.

8. Let $V$ be an $n$-dimensional vector space over $F$ and let $\varphi$ be a linear transformation of the
vector space $V$ to itself.

(a) Prove that if $V$ has a basis consisting of eigenvectors for $\varphi$ (cf. Exercise 8 of Section 1)
then the matrix representing $\varphi$ with respect to this basis (for both domain and range)
is diagonal with the eigenvalues as diagonal entries.

(b) If $A$ is the $n \times n$ matrix representing $\varphi$ with respect to a given basis for $V$ (for both
domain and range) prove that $A$ is similar to a diagonal matrix if and only if $V$ has a
basis of eigenvectors for $\varphi$.

9. If $W$ is a subspace of the vector space $V$ stable under the linear transformation $\varphi$ (i.e.,
$\varphi(W) \subseteq W$), show that $\varphi$ induces linear transformations $\varphi|_W$ on $W$ and $\bar{\varphi}$ on the quotient
vector space $V/W$. If $\varphi|_W$ and $\bar{\varphi}$ are nonsingular prove $\varphi$ is nonsingular. Prove the converse
holds if $V$ has finite dimension and give a counterexample with $V$ infinite dimensional.

10. Let $V$ be an $n$-dimensional vector space and let $\varphi$ be a linear transformation of $V$ to itself.
Suppose $W$ is a subspace of $V$ of dimension $m$ that is stable under $\varphi$.

(a) Prove that there is a basis for $V$ with respect to which the matrix for $\varphi$ is of the form
\[ \begin{pmatrix} A & B \\ 0 & C \end{pmatrix} \]
where $A$ is an $m \times m$ matrix, $B$ is an $m \times (n - m)$ matrix and $C$ is an $(n - m) \times (n - m)$
matrix (such a matrix is called block upper triangular).

(b) Prove that if there is a subspace $W'$ invariant under $\varphi$ so that $V = W \oplus W'$ decomposes
as a direct sum then the bases for $W$ and $W'$ give a basis for $V$ with respect to which
the matrix for $\varphi$ is block diagonal:
\[ \begin{pmatrix} A & 0 \\ 0 & C \end{pmatrix} \]
where $A$ is an $m \times m$ matrix and $C$ is an $(n - m) \times (n - m)$ matrix.

(c) Prove conversely that if there is a basis for $V$ with respect to which $\varphi$ is block diagonal
as in (b) then there are $\varphi$-invariant subspaces $W$ and $W'$ of dimensions $m$ and $n - m$,
respectively, with $V = W \oplus W'$.


11. Let $\varphi$ be a linear transformation from the finite dimensional vector space $V$ to itself such
that $\varphi^2 = \varphi$.

(a) Prove that image $\varphi \cap \ker \varphi = 0$.

(b) Prove that $V = \text{image } \varphi \oplus \ker \varphi$.

(c) Prove that there is a basis of $V$ such that the matrix of $\varphi$ with respect to this basis is
a diagonal matrix whose entries are all 0 or 1.


A linear transformation $\varphi$ satisfying $\varphi^2 = \varphi$ is called an idempotent linear transformation.
This exercise proves that idempotent linear transformations are simply projections onto
some subspace.


12. Let $V = \mathbb{R}^2$, $v_1 = (1, 0)$, $v_2 = (0, 1)$, so that $v_1, v_2$ are a basis for $V$. Let $\varphi$ be the linear
transformation of $V$ to itself whose matrix with respect to this basis is
\[ \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}. \]
Prove that if $W$ is the subspace generated by $v_1$ then $W$ is stable under the action of $\varphi$. Prove
that there is no subspace $W'$ invariant under $\varphi$ so that $V = W \oplus W'$.
13. Let $V$ be a vector space of dimension $n$ and let $W$ be a vector space of dimension $m$ over
a field $F$. Suppose $A$ is the $m \times n$ matrix representing a linear transformation $\varphi$ from $V$ to
$W$ with respect to the bases $\mathcal{B}_1$ for $V$ and $\mathcal{E}_1$ for $W$. Suppose similarly that $B$ is the $m \times n$
matrix representing $\varphi$ with respect to the bases $\mathcal{B}_2$ for $V$ and $\mathcal{E}_2$ for $W$. Let $P = M_{\mathcal{B}_2}^{\mathcal{B}_1}(I)$
where $I$ denotes the identity map from $V$ to $V$, and let $Q = M_{\mathcal{E}_2}^{\mathcal{E}_1}(I)$ where $I$ denotes the
identity map from $W$ to $W$. Prove that $Q^{-1} = M_{\mathcal{E}_1}^{\mathcal{E}_2}(I)$ and that $Q^{-1}AP = B$, giving
the general relation between matrices representing the same linear transformation but with
respect to different choices of bases.

The following exercises recall the Gauss-Jordan elimination process. This is one of the fastest
computational methods for the solution of a number of problems involving vector spaces:
solving systems of linear equations, determining inverses of matrices, computing determinants,
determining the span of a set of vectors, determining linear independence of a set of vectors,
etc.

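Consider the system of $m$ linear equations
\[
\begin{aligned}
a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n &= c_1 \\
a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n &= c_2 \\
&\ \,\vdots \\
a_{m1}x_1 + a_{m2}x_2 + \cdots + a_{mn}x_n &= c_m
\end{aligned}
\tag{11.4}
\]
in the $n$ unknowns $x_1, x_2, \dots, x_n$ where $a_{ij}, c_i$, $i = 1, 2, \dots, m$, $j = 1, 2, \dots, n$ are elements
of the field $F$. Associated to this system is the coefficient matrix:
\[
A = \begin{pmatrix}
a_{11} & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn}
\end{pmatrix}
\]
and the augmented matrix:
\[
(A \mid C) = \left(\begin{array}{cccc|c}
a_{11} & a_{12} & \cdots & a_{1n} & c_1 \\
a_{21} & a_{22} & \cdots & a_{2n} & c_2 \\
\vdots & \vdots & & \vdots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn} & c_m
\end{array}\right)
\]
(the term augmented refers to the presence of the column matrix $C = (c_i)$ in addition to the
coefficient matrix $A = (a_{ij})$). The set of solutions in $F$ of this system of equations is not
altered if we perform any of the following three operations: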


(1) interchange any two equations 

(2) add a multiple of one equation to another 

(3) multiply any equation by a nonzero element from F, 

which correspond to the following three elementary row operations on the augmented matrix: 


(1) interchange any two rows 

(2) add a multiple of one row to another 

(3) multiply any row by a unit in F, i.e., by any nonzero element in F. 


If a matrix A can be transformed into a matrix C by a series of elementary row operations then 
A is said to be row reduced to C. 
14. Prove that if $A$ can be row reduced to $C$ then $C$ can be row reduced to $A$. Prove that the
relation “$A \sim C$ if and only if $A$ can be row reduced to $C$” is an equivalence relation.
[Observe that the elementary row operations are reversible.]

Matrices lying in the same equivalence class under this equivalence relation are said to be row 
equivalent. 

15. Prove that the row rank of two row equivalent matrices is the same. [It suffices to prove 
this for two matrices differing by an elementary row operation.] 

An $m \times n$ matrix is said to be in reduced row echelon form if

(a) the first nonzero entry $a_{ij_i}$ in row $i$ is 1 and all other entries in the corresponding $j_i$th column
are zero, and

(b) $j_1 < j_2 < \cdots < j_r$ where $r$ is the number of nonzero rows, i.e., the number of initial zeros
in each row is strictly increasing (hence the term echelon).

An augmented matrix $(A \mid C)$ is said to be in reduced row echelon form if its coefficient
matrix $A$ is in reduced row echelon form. For example, the following two matrices are in
reduced row echelon form:
\[
\left(\begin{array}{cccccc|c}
1 & 0 & 5 & 7 & 0 & 3 & 0 \\
0 & 1 & -1 & 1 & 0 & -4 & -1 \\
0 & 0 & 0 & 0 & 1 & 6 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{array}\right)
\qquad
\left(\begin{array}{cccc|c}
0 & 1 & -1 & 0 & 0 \\
0 & 0 & 0 & 1 & 5 \\
0 & 0 & 0 & 0 & -3
\end{array}\right)
\]
(with $j_1 = 1$, $j_2 = 2$, $j_3 = 5$ for the first matrix and $j_1 = 2$, $j_2 = 4$ for the second matrix).
The first nonzero entry in any given row of the coefficient matrix of a reduced row echelon
augmented matrix (in position $(i, j_i)$ by definition) is sometimes referred to as a pivotal element
(so the pivotal elements in the first matrix are in positions (1,1), (2,2) and (3,5) and the pivotal
elements in the second matrix are in positions (1,2) and (2,4)). The columns containing pivotal
elements will be called pivotal columns and the columns of the coefficient matrix not containing
pivotal elements will be called nonpivotal.

16. Prove by induction that any augmented matrix can be put in reduced row echelon form by 
a series of elementary row operations. 

17. Let A and C be two matrices in reduced row echelon form. Prove that if A and C are row 
equivalent then A = C. 

18. Prove that the row rank of a matrix in reduced row echelon form is the number of nonzero
rows.

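19. Prove that the reduced row echelon forms of the matrices
\[
\left(\begin{array}{cccccc|c}
1 & 1 & 4 & 8 & 0 & -1 & -1 \\
1 & 2 & 3 & 9 & 0 & -5 & -2 \\
0 & -2 & 2 & -2 & 1 & 14 & 3 \\
1 & 4 & 1 & 11 & 0 & -13 & -4
\end{array}\right)
\qquad
\left(\begin{array}{cccc|c}
0 & -3 & 3 & 1 & 5 \\
0 & 1 & -1 & 0 & 0 \\
0 & 2 & -2 & 0 & -3
\end{array}\right)
\]
are the two matrices preceding Exercise 16.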

The point of the reduced row echelon form is that the corresponding system of linear equations
is in a particularly simple form, from which the solutions to the system $AX = C$ in (4) can be
determined immediately:

20. (Solving Systems of Linear Equations) Let $(A' \mid C')$ be the reduced row echelon form of
the augmented matrix $(A \mid C)$. The number of zero rows of $A'$ is clearly at least as great
as the number of zero rows of $(A' \mid C')$.

(a) Prove that if the number of zero rows of $A'$ is strictly larger than the number of zero
rows of $(A' \mid C')$ then there are no solutions to $AX = C$.


By (a) we may assume that $A'$ and $(A' \mid C')$ have the same number, $r$, of nonzero rows
(so $n \ge r$).

(b) Prove that if $r = n$ then there is precisely one solution to the system of equations
$AX = C$.

(c) Prove that if $r < n$ then there are infinitely many solutions to the system of equations
$AX = C$. Prove in fact that the values of the $n - r$ variables corresponding to the
nonpivotal columns of $(A' \mid C')$ can be chosen arbitrarily and that the remaining
$r$ variables corresponding to the pivotal columns of $(A' \mid C')$ are then determined
uniquely.


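21. Determine the solutions of the following systems of equations:

(a)
\[
\begin{aligned}
-3x + 3y + z &= 5 \\
x - y &= 0 \\
2x - 2y &= -3
\end{aligned}
\]

(b)
\[
\begin{aligned}
x - 2y + z &= 5 \\
x - 4y + 6z &= 10 \\
4x - 11y + 11z &= 12
\end{aligned}
\]

(c)
\[
\begin{aligned}
x - 2y + z &= 5 \\
y - 2z &= 17 \\
2x - 3y &= 27
\end{aligned}
\]

(d)
\[
\begin{aligned}
x + y - 3z + 2u &= 2 \\
3x - 2y + 5z + u &= 1 \\
6x + y - 4z + 3u &= 7 \\
2x + 2y - 6z &= 4
\end{aligned}
\]

(e)
\[
\begin{aligned}
x + y + 4z + 8u - w &= -1 \\
x + 2y + 3z + 9u - 5w &= -2 \\
-2y + 2z - 2u + v + 14w &= 3 \\
x + 4y + z + 11u - 13w &= -4
\end{aligned}
\]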

22. Suppose $A$ and $B$ are two row equivalent $m \times n$ matrices.

(a) Prove that the set
\[ \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix} \]
of solutions to the homogeneous linear equations $AX = 0$ as in equation (4) above
is the same as the set of solutions to the homogeneous linear equations $BX = 0$. [It
suffices to prove this for two matrices differing by an elementary row operation.]

(b) Prove that any linear dependence relation satisfied by the columns of $A$ viewed as
vectors in $F^m$ is also satisfied by the columns of $B$.

(c) Conclude from (b) that the number of linearly independent columns of $A$ is the same
as the number of linearly independent columns of $B$.

23. Let A' be a matrix in reduced row echelon form. 

(a) Prove that the nonzero rows of $A'$ are linearly independent. Prove that the pivotal
columns of $A'$ are linearly independent and that the nonpivotal columns of $A'$ are
linearly dependent on the pivotal columns. (Note the role the pivotal elements play.)

(b) Prove that the number of linearly independent columns of a matrix in reduced row 
echelon form is the same as the number of linearly independent rows, i.e., the row 
rank and the column rank of such a matrix are the same. 

24. Use the previous two exercises and Exercise 15 above to prove in general that the row rank
and the column rank of a matrix are the same.

25. (Computing Inverses of Matrices) Let $A$ be an $n \times n$ matrix.

(a) Show that $A$ has an inverse matrix $B$ with columns $B_1, B_2, \dots, B_n$ if and only if the
systems of equations:
\[
AB_1 = \begin{pmatrix} 1 \\ 0 \\ \vdots \\ 0 \end{pmatrix}, \quad
AB_2 = \begin{pmatrix} 0 \\ 1 \\ \vdots \\ 0 \end{pmatrix}, \quad \dots, \quad
AB_n = \begin{pmatrix} 0 \\ \vdots \\ 0 \\ 1 \end{pmatrix}
\]
have solutions.

(b) Prove that $A$ has an inverse if and only if $A$ is row equivalent to the $n \times n$ identity
matrix.

(c) Prove that $A$ has an inverse $B$ if and only if the augmented matrix $(A \mid I)$ can be row
reduced to the augmented matrix $(I \mid B)$ where $I$ is the $n \times n$ identity matrix.
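
The procedure in part (c) of Exercise 25 can be sketched in a few lines of Python. This is an illustration only, not part of the original exercises: the function name `inverse_by_row_reduction` is hypothetical, matrices are assumed to be lists of rows with rational entries, and exact arithmetic is done with `fractions.Fraction` so that no rounding interferes with the test for a zero pivot.

```python
from fractions import Fraction


def inverse_by_row_reduction(a):
    """Row reduce (A | I) to (I | B) and return B, or None if A has no inverse."""
    n = len(a)
    # Build the augmented matrix (A | I) with exact rational entries.
    aug = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        # Look for a row at or below the diagonal with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            return None  # A is not row equivalent to the identity matrix
        aug[col], aug[pivot] = aug[pivot], aug[col]      # (1) interchange two rows
        lead = aug[col][col]
        aug[col] = [x / lead for x in aug[col]]          # (3) scale a row by a unit
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                # (2) add a multiple of the pivot row to another row
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    # The right-hand block is now the inverse B.
    return [row[n:] for row in aug]


print(inverse_by_row_reduction([[2, 1], [1, 1]]))
print(inverse_by_row_reduction([[1, 2], [2, 4]]))
```

The first call returns the inverse with `Fraction` entries; the second returns `None` because the second column has no available pivot after the first elimination step, in line with part (b).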

26. Determine the inverses of the following matrices using row reduction: 


A =( \ 

-i 

l 

1 ) 

B = 

/ 1 

0 

0 

1 

2 

2 

0 

1 

0 

2 \ 

-1 

0 


0 



\-i 

1 

1 

0 / 


27. (Computing Spans, Linear Independence and Linear Dependencies in Vector Spaces) Let
$V$ be an $m$-dimensional vector space with basis $e_1, e_2, \dots, e_m$ and let $v_1, v_2, \dots, v_n$ be
vectors in $V$. Let $A$ be the $m \times n$ matrix whose columns are the coordinates of the vectors
$v_i$ (with respect to the basis $e_1, e_2, \dots, e_m$) and let $A'$ be the reduced row echelon form of
$A$.

(a) Let $B$ be any matrix row equivalent to $A$. Let $w_1, w_2, \dots, w_n$ be the vectors whose
coordinates (with respect to the basis $e_1, e_2, \dots, e_m$) are the columns of $B$. Prove that
any linear relation
\[ x_1v_1 + x_2v_2 + \cdots + x_nv_n = 0 \tag{11.5} \]
satisfied by $v_1, v_2, \dots, v_n$ is also satisfied when $v_i$ is replaced by $w_i$, $i = 1, 2, \dots, n$.

(b) Prove that the vectors whose coordinates are given by the pivotal columns of $A'$
are linearly independent and that the vectors whose coordinates are given by the
nonpivotal columns of $A'$ are linearly dependent on these.

(c) (Determining Linear Independence of Vectors) Prove that the vectors $v_1, v_2, \dots, v_n$
are linearly independent if and only if $A'$ has $n$ nonzero rows (i.e., has rank $n$).

(d) (Determining Linear Dependencies of Vectors) By (c), the vectors $v_1, v_2, \dots, v_n$ are
linearly dependent if and only if $A'$ has nonpivotal columns. The solutions to (5)
defining linear dependence relations among $v_1, v_2, \dots, v_n$ are given by the linear
equations defined by $A'$. Show that each of the variables $x_1, x_2, \dots, x_n$ in (5)
corresponding to the nonpivotal columns of $A'$ can be prescribed arbitrarily and the values
of the remaining variables are then uniquely determined to give a linear dependence
relation among $v_1, v_2, \dots, v_n$ as in (5).

(e) (Determining the Span of a Set of Vectors) Prove that the subspace $W$ spanned by
$v_1, v_2, \dots, v_n$ has dimension $r$ where $r$ is the number of nonzero rows of $A'$ and that
a basis for $W$ is given by the original vectors $v_{j_i}$ ($i = 1, 2, \dots, r$) corresponding to
the pivotal columns of $A'$.

28. Let $V = \mathbb{R}^5$ with the standard basis and consider the vectors
\[
\begin{gathered}
v_1 = (1, 1, 3, -2, 3), \quad v_2 = (0, 1, 0, -1, 0), \quad v_3 = (2, 3, 6, -5, 6), \\
v_4 = (0, 3, 1, -3, 1), \quad v_5 = (2, -1, -1, -1, -1).
\end{gathered}
\]

(a) Show that the reduced row echelon form of the matrix
\[
A = \begin{pmatrix}
1 & 0 & 2 & 0 & 2 \\
1 & 1 & 3 & 3 & -1 \\
3 & 0 & 6 & 1 & -1 \\
-2 & -1 & -5 & -3 & -1 \\
3 & 0 & 6 & 1 & -1
\end{pmatrix}
\]
whose columns are the coordinates of $v_1, v_2, v_3, v_4, v_5$ is the matrix
\[
A' = \begin{pmatrix}
1 & 0 & 2 & 0 & 2 \\
0 & 1 & 1 & 0 & 18 \\
0 & 0 & 0 & 1 & -7 \\
0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0
\end{pmatrix}
\]
where the 1st, 2nd and 4th columns are pivotal and the remaining two are nonpivotal.

(b) Conclude that these vectors are linearly dependent, that the subspace $W$ spanned by
$v_1, v_2, v_3, v_4, v_5$ is 3-dimensional and that the vectors
\[ v_1 = (1, 1, 3, -2, 3), \quad v_2 = (0, 1, 0, -1, 0) \quad\text{and}\quad v_4 = (0, 3, 1, -3, 1) \]
are a basis for $W$.

(c) Conclude from (a) that the coefficients $x_1, x_2, x_3, x_4, x_5$ of any linear relation
\[ x_1v_1 + x_2v_2 + x_3v_3 + x_4v_4 + x_5v_5 = 0 \]
satisfied by $v_1, v_2, v_3, v_4, v_5$ are given by the equations
\[
\begin{aligned}
x_1 + 2x_3 + 2x_5 &= 0 \\
x_2 + x_3 + 18x_5 &= 0 \\
x_4 - 7x_5 &= 0.
\end{aligned}
\]
Deduce that the 3rd and 5th variables, namely $x_3$ and $x_5$, corresponding to the
nonpivotal columns of $A'$, can be prescribed arbitrarily and the remaining variables are
then uniquely determined as:
\[
\begin{aligned}
x_1 &= -2x_3 - 2x_5 \\
x_2 &= -x_3 - 18x_5 \\
x_4 &= 7x_5
\end{aligned}
\]
to give all the linear dependence relations satisfied by $v_1, v_2, v_3, v_4, v_5$. In particular
show that
\[ -2v_1 - v_2 + v_3 = 0 \]
and
\[ -2v_1 - 18v_2 + 7v_4 + v_5 = 0 \]
corresponding to $(x_3 = 1, x_5 = 0)$ and $(x_3 = 0, x_5 = 1)$, respectively.
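
The computations of Exercises 27 and 28 can be reproduced with a short script. What follows is a hedged sketch rather than part of the text: the helper names `rref` and `dependence_relations` are assumptions introduced here, column indices are 0-based (so the 1st, 2nd and 4th columns appear as 0, 1 and 3), and the data are the five vectors of Exercise 28.

```python
from fractions import Fraction


def rref(matrix):
    """Return (reduced row echelon form, list of 0-based pivotal column indices)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivots = []
    r = 0  # index of the row that will receive the next pivot
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue  # column c is nonpivotal
        rows[r], rows[pivot] = rows[pivot], rows[r]
        lead = rows[r][c]
        rows[r] = [x / lead for x in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
        if r == len(rows):
            break
    return rows, pivots


def dependence_relations(reduced, pivots, n):
    """One relation (x_1, ..., x_n) per nonpivotal column: that variable is set
    to 1, the other nonpivotal variables to 0, and the pivotal ones are solved for."""
    relations = []
    for free in (c for c in range(n) if c not in pivots):
        x = [Fraction(0)] * n
        x[free] = Fraction(1)
        for row, p in zip(reduced, pivots):
            x[p] = -row[free]
        relations.append(x)
    return relations


vectors = [(1, 1, 3, -2, 3), (0, 1, 0, -1, 0), (2, 3, 6, -5, 6),
           (0, 3, 1, -3, 1), (2, -1, -1, -1, -1)]
A = [list(row) for row in zip(*vectors)]  # the columns of A are the vectors

reduced, pivots = rref(A)
print(pivots)
for relation in dependence_relations(reduced, pivots, len(vectors)):
    print([int(x) for x in relation])
```

The pivotal columns pick out $v_1, v_2, v_4$ as a basis of the span, and the two printed relations are the coefficient lists of the two dependence relations stated in part (c).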

29. For each exercise below, determine whether the given vectors in $\mathbb{R}^4$ are linearly
independent. If they are linearly dependent, determine an explicit linear dependence among
them.

(a) $(1, -4, 3, 0)$, $(0, -1, 4, -3)$, $(1, -1, 1, -1)$, $(2, 2, -1, -3)$.

(b) $(1, -2, 4, 1)$, $(2, -3, 9, -1)$, $(1, 0, 6, -5)$, $(2, -5, 7, 5)$.

(c) $(1, -2, 0, 1)$, $(2, -2, 0, 0)$, $(-1, 3, 0, -2)$, $(-2, 1, 0, 1)$.

(d) $(0, 1, 1, 0)$, $(1, 0, 1, 1)$, $(2, 2, 2, 0)$, $(0, -1, 1, 1)$.

30. For each exercise below, determine the subspace spanned in $\mathbb{R}^4$ by the given vectors and
give a basis for this subspace.

(a) $(1, -2, 5, 3)$, $(2, 3, 1, -4)$, $(3, 8, -3, -5)$.

(b) $(2, -5, 3, 0)$, $(0, -2, 5, -3)$, $(1, -1, 1, -1)$, $(-3, 2, -1, 2)$.

(c) $(1, -2, 0, 1)$, $(2, -2, 0, 0)$, $(-1, 3, 0, -2)$, $(-2, 1, 0, 1)$.

(d) $(1, 1, 0, -1)$, $(1, 2, 3, 0)$, $(2, 3, 3, -1)$, $(1, 2, 2, -2)$, $(2, 3, 2, -3)$, $(1, 3, 4, -3)$.


31. (Computing the Image and Kernel of a Linear Transformation) Let $V$ be an $n$-dimensional
vector space with basis $e_1, e_2, \dots, e_n$ and let $W$ be an $m$-dimensional vector space with
basis $f_1, f_2, \dots, f_m$. Let $\varphi$ be a linear transformation from $V$ to $W$ and let $A$ be the
corresponding $m \times n$ matrix with respect to these bases: $A = (a_{ij})$ where
\[ \varphi(e_j) = \sum_{i=1}^{m} a_{ij} f_i, \qquad j = 1, 2, \dots, n, \]
i.e., the columns of $A$ are the coordinates of the vectors $\varphi(e_1), \varphi(e_2), \dots, \varphi(e_n)$ with respect
to the basis $f_1, f_2, \dots, f_m$ of $W$. Let $A'$ be the reduced row echelon form of $A$.

(a) (Determining the Image of a Linear Transformation) Prove that the image $\varphi(V)$ of
$V$ under $\varphi$ has dimension $r$ where $r$ is the number of nonzero rows of $A'$ and that a
basis for $\varphi(V)$ is given by the vectors $\varphi(e_{j_i})$ ($i = 1, 2, \dots, r$), i.e., the columns of
$A$ corresponding to the pivotal columns of $A'$ give the coordinates of a basis for the
image of $\varphi$.

(b) (Determining the Kernel of a Linear Transformation) The elements in the kernel of
$\varphi$ are the vectors in $V$ whose coordinates $(x_1, x_2, \dots, x_n)$ with respect to the basis
$e_1, e_2, \dots, e_n$ satisfy the equation
\[ A \begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix} = 0, \]
and the solutions $x_1, x_2, \dots, x_n$ to this system of linear equations are determined by
the matrix $A'$.

(i) Prove that $\varphi$ is injective if and only if $A'$ has $n$ nonzero rows (i.e., has rank $n$).

(ii) By (i), the kernel of $\varphi$ is nontrivial if and only if $A'$ has nonpivotal columns. Show that
each of the variables $x_1, x_2, \dots, x_n$ above corresponding to the nonpivotal columns
of $A'$ can be prescribed arbitrarily and the values of the remaining variables are then
uniquely determined to give an element $x_1e_1 + x_2e_2 + \cdots + x_ne_n$ in the kernel of
$\varphi$. In particular, show that the coordinates of a basis for the kernel are obtained
by successively setting one nonpivotal variable equal to 1 and all other nonpivotal
variables to 0 and solving for the remaining pivotal variables. Conclude that the
kernel of $\varphi$ has dimension $n - r$ where $r$ is the rank of $A$.

32. Let $V = \mathbb{R}^5$ and $W = \mathbb{R}^4$ with the standard bases. Let $\varphi$ be the linear transformation
$\varphi : V \to W$ defined by
\[ \varphi(x, y, z, u, v) = (x + 2y + 3z + 4u + 4v,\ -2x - 4y + 2v,\ x + 2y + u - 2v,\ x + 2y - v). \]

(a) Prove that the matrix $A$ corresponding to $\varphi$ and these bases is
\[
A = \begin{pmatrix}
1 & 2 & 3 & 4 & 4 \\
-2 & -4 & 0 & 0 & 2 \\
1 & 2 & 0 & 1 & -2 \\
1 & 2 & 0 & 0 & -1
\end{pmatrix}
\]
and that the reduced row echelon matrix $A'$ row equivalent to $A$ is
\[
A' = \begin{pmatrix}
1 & 2 & 0 & 0 & -1 \\
0 & 0 & 1 & 0 & 3 \\
0 & 0 & 0 & 1 & -1 \\
0 & 0 & 0 & 0 & 0
\end{pmatrix}
\]
where the 1st, 3rd and 4th columns are pivotal and the remaining two are nonpivotal.

(b) Conclude that the image of $\varphi$ is 3-dimensional and that the images of the 1st, 3rd and
4th basis elements of $V$, namely, $(1, -2, 1, 1)$, $(3, 0, 0, 0)$ and $(4, 0, 1, 0)$ give a basis
for the image $\varphi(V)$ of $V$.

(c) Conclude from (a) that the elements in the kernel of $\varphi$ are the vectors $(x, y, z, u, v)$
satisfying the equations
\[
\begin{aligned}
x + 2y - v &= 0 \\
z + 3v &= 0 \\
u - v &= 0.
\end{aligned}
\]
Deduce that the 2nd and 5th variables, namely $y$ and $v$, corresponding to the nonpivotal
columns of $A'$ can be prescribed arbitrarily and the remaining variables are then
uniquely determined as
\[
\begin{aligned}
x &= -2y + v \\
z &= -3v \\
u &= v.
\end{aligned}
\]
Show that $(-2, 1, 0, 0, 0)$ and $(1, 0, -3, 1, 1)$ give a basis for the 2-dimensional kernel
of $\varphi$, corresponding to $(y = 1, v = 0)$ and $(y = 0, v = 1)$, respectively.
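
The conclusions of Exercise 32 can be spot-checked numerically. The snippet below is an illustrative sketch only: the helper `apply` (matrix times coordinate vector) is a name assumed here, and the matrix and vectors are copied from the statement of the exercise. It confirms that the two proposed kernel vectors are sent to zero and that the images of the 1st, 3rd and 4th standard basis vectors are the stated columns; it does not by itself prove linear independence.

```python
# Matrix of phi with respect to the standard bases, from Exercise 32(a).
A = [[1, 2, 3, 4, 4],
     [-2, -4, 0, 0, 2],
     [1, 2, 0, 1, -2],
     [1, 2, 0, 0, -1]]


def apply(matrix, vector):
    """Multiply a matrix (list of rows) by a coordinate vector."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]


def standard_basis_vector(i, n):
    """The i-th (0-based) standard basis vector of length n."""
    return [int(j == i) for j in range(n)]


# Proposed kernel basis from part (c): (y, v) = (1, 0) and (y, v) = (0, 1).
kernel_basis = [(-2, 1, 0, 0, 0), (1, 0, -3, 1, 1)]
for k in kernel_basis:
    print(k, "->", apply(A, k))

# Images of the 1st, 3rd and 4th basis elements (0-based indices 0, 2, 3).
image_basis = [apply(A, standard_basis_vector(i, 5)) for i in (0, 2, 3)]
print(image_basis)
```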

33. Let $\varphi$ be the linear transformation from $\mathbb{R}^4$ to itself defined by the matrix
\[
\begin{pmatrix}
1 & -1 & 0 & 3 \\
-1 & 2 & 1 & -1 \\
-1 & 1 & 0 & -3 \\
1 & -2 & -1 & 1
\end{pmatrix}
\]
with respect to the standard basis for $\mathbb{R}^4$. Determine a basis for the image and for the
kernel of $\varphi$.

34. Let $\varphi$ be the linear transformation $\varphi : \mathbb{R}^4 \to \mathbb{R}^2$ such that
\[
\begin{aligned}
\varphi((1, 0, 0, 0)) &= (1, -1) & \varphi((1, -1, 0, 0)) &= (0, 0) \\
\varphi((1, -1, 1, 0)) &= (1, -1) & \varphi((1, -1, 1, -1)) &= (0, 0).
\end{aligned}
\]
Determine a basis for the image and for the kernel of $\varphi$.

35. Let $V$ be the set of all $2 \times 2$ matrices with real entries and let $\varphi : V \to \mathbb{R}$ be the map
defined by sending a matrix $A \in V$ to the sum of the diagonal entries of $A$ (the trace of
$A$).

(a) Show that
\[
\begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}
\]
is a basis for $V$.

(b) Prove that $\varphi$ is a linear transformation and determine the matrix of $\varphi$ with respect to
the basis in (a) for $V$. Determine the dimension of and a basis for the kernel of $\varphi$.

36. Let $V$ be the 6-dimensional vector space over $\mathbb{Q}$ consisting of the polynomials in the
variable $x$ of degree at most 5. Let $\varphi$ be the map of $V$ to itself defined by $\varphi(f) =
x^2 f'' - 6x f' + 12 f$, where $f''$ denotes the usual second derivative (with respect to $x$) of
the polynomial $f \in V$ and $f'$ similarly denotes the usual first derivative.

(a) Prove that $\varphi$ is a linear transformation of $V$ to itself.

(b) Determine a basis for the image and for the kernel of $\varphi$.

37. Let $V$ be the 7-dimensional vector space over the field $F$ consisting of the polynomials in
the variable $x$ of degree at most 6. Let $\varphi$ be the linear transformation of $V$ to itself defined
by $\varphi(f) = f'$, where $f'$ denotes the usual derivative (with respect to $x$) of the polynomial
$f \in V$. For each of the fields below, determine a basis for the image and for the kernel of
$\varphi$.

(a) $F = \mathbb{R}$

(b) $F = \mathbb{F}_2$, the finite field of 2 elements (note that, for example, $(x^2)' = 2x = 0$ over
this field)

(c) $F = \mathbb{F}_3$

(d) $F = \mathbb{F}_5$.

38. Let $A$ and $B$ be square matrices. Prove that the trace of their Kronecker product is the
product of their traces: $\mathrm{tr}(A \otimes B) = \mathrm{tr}(A)\,\mathrm{tr}(B)$. (Recall that the trace of a square matrix
is the sum of its diagonal entries.)

39. Let $F$ be a subfield of $K$ and let $\psi : V \to W$ be a linear transformation of finite dimensional
vector spaces over $F$.

(a) Prove that $1 \otimes \psi$ is a $K$-linear transformation from the vector spaces $K \otimes_F V$ to
$K \otimes_F W$ over $K$. (Here 1 denotes the identity map from $K$ to itself.)

(b) Let $\mathcal{B} = \{v_1, \dots, v_n\}$ and $\mathcal{E} = \{w_1, \dots, w_m\}$ be bases of $V$ and $W$ respectively.
Prove that the matrix of $1 \otimes \psi$ with respect to the bases $\{1 \otimes v_1, \dots, 1 \otimes v_n\}$ and
$\{1 \otimes w_1, \dots, 1 \otimes w_m\}$ is the same as the matrix of $\psi$ with respect to $\mathcal{B}$ and $\mathcal{E}$.

11.3 DUAL VECTOR SPACES 
Definition. 

(1) For $V$ any vector space over $F$ let $V^* = \mathrm{Hom}_F(V, F)$ be the space of linear
transformations from $V$ to $F$, called the dual space of $V$. Elements of $V^*$ are
called linear functionals.





(2) If $\mathcal{B} = \{v_1, v_2, \dots, v_n\}$ is a basis of the finite dimensional space $V$, define $v_i^* \in V^*$
for each $i \in \{1, 2, \dots, n\}$ by its action on the basis $\mathcal{B}$:
\[
v_i^*(v_j) = \begin{cases} 1, & \text{if } i = j \\ 0, & \text{if } i \ne j \end{cases}
\qquad 1 \le j \le n. \tag{11.6}
\]

Proposition 18. With notations as above, $\{v_1^*, v_2^*, \dots, v_n^*\}$ is a basis of $V^*$. In particular,
if $V$ is finite dimensional then $V^*$ has the same dimension as $V$.

Proof: Observe that since $V$ is finite dimensional, $\dim V^* = \dim \mathrm{Hom}_F(V, F) =
\dim V = n$ (Corollary 11), so since there are $n$ of the $v_i^*$'s it suffices to prove that they
are linearly independent. If
\[ \alpha_1 v_1^* + \alpha_2 v_2^* + \cdots + \alpha_n v_n^* = 0 \quad\text{in } \mathrm{Hom}_F(V, F), \]
then applying this element to $v_i$ and using equation (6) above we obtain $\alpha_i = 0$. Since
$i$ is arbitrary these elements are linearly independent.

Definition. The basis $\{v_1^*, v_2^*, \dots, v_n^*\}$ of $V^*$ is called the dual basis to $\{v_1, v_2, \dots, v_n\}$.

The exercises later show that if $V$ is infinite dimensional it is always true that
$\dim V < \dim V^*$. For spaces of arbitrary dimension the space $V^*$ is the “algebraic”
dual space to $V$. If $V$ has some additional structure, for example a continuous structure
(i.e., a topology), then one may define other types of dual spaces (e.g., the continuous
dual of $V$, defined by requiring the linear functionals to be continuous maps). One has
to be careful when reading other works (particularly analysis books) to ascertain what
qualifiers are implicit in the use of the terms “dual space” and “linear functional.”

Example 

Let $[a, b]$ be a closed interval in $\mathbb{R}$ and let $V$ be the real vector space of all continuous
functions $f : [a, b] \to \mathbb{R}$. If $a < b$, $V$ is infinite dimensional. For each $g \in V$ the function
$\varphi_g : V \to \mathbb{R}$ defined by $\varphi_g(f) = \int_a^b f(t)g(t)\,dt$ is a linear functional on $V$.

Definition. The dual of V *, namely V **, is called the double dual or second dual of 
V. 


Note that for a finite dimensional space $V$, $\dim V = \dim V^*$ and also $\dim V^* =
\dim V^{**}$, hence $V$ and $V^{**}$ are isomorphic vector spaces. For infinite dimensional
spaces $\dim V < \dim V^{**}$ (cf. the exercises) so $V$ and $V^{**}$ cannot be isomorphic. In the
case of finite dimensional spaces there is a natural, i.e., basis independent or coordinate
free way of exhibiting the isomorphism between a vector space and its second dual.
The basic idea, in a more general setting, is as follows: if $X$ is any set and $S$ is any set
of functions of $X$ into the field $F$, we normally think of choosing or fixing an $f \in S$
and computing $f(x)$ as $x$ ranges over all of $X$. Alternatively, we could think of fixing
a point $x$ in $X$ and computing $f(x)$ as $f$ ranges over all of $S$. The latter process, called
evaluation at $x$, shows that for each $x \in X$ there is a function $E_x : S \to F$ defined by
$E_x(f) = f(x)$ (i.e., evaluate $f$ at $x$). This gives a map $x \mapsto E_x$ of $X$ into the set of
$F$-valued functions on $S$. If $S$ “separates points” in the sense that for distinct points
$x$ and $y$ of $X$ there is some $f \in S$ such that $f(x) \ne f(y)$, then the map $x \mapsto E_x$
is injective. The proof of the next theorem applies this “role reversal” process to the
situation where $X = V$ and $S = V^*$, proves $E_x$ is a linear $F$-valued function on $S$,
that is, $E_x$ belongs to the dual space of $V^*$, and proves the map $x \mapsto E_x$ is a linear
transformation from $V$ into $V^{**}$. Note that throughout this process there is no mention
of the word “basis” (although it is convenient to know the dimension of $V^{**}$, a fact
we established by picking bases). In particular, the proof does not start with the familiar
phrase “pick a basis of $V$.”

Theorem 19. There is a natural injective linear transformation from V to V**. If V is 
finite dimensional then this linear transformation is an isomorphism. 


Proof: Let $v \in V$. Define the map (evaluation at $v$)
\[ E_v : V^* \to F \quad\text{by}\quad E_v(f) = f(v). \]
Then $E_v(f + \alpha g) = (f + \alpha g)(v) = f(v) + \alpha g(v) = E_v(f) + \alpha E_v(g)$, so that $E_v$ is a
linear transformation from $V^*$ to $F$. Hence $E_v$ is an element of $\mathrm{Hom}_F(V^*, F) = V^{**}$.
This defines a natural map
\[ \varphi : V \to V^{**} \quad\text{by}\quad \varphi(v) = E_v. \]
The map $\varphi$ is a linear map, as follows: for $v, w \in V$ and $\alpha \in F$,
\[ E_{v + \alpha w}(f) = f(v + \alpha w) = f(v) + \alpha f(w) = E_v(f) + \alpha E_w(f) \]
for every $f \in V^*$, and so
\[ \varphi(v + \alpha w) = E_{v + \alpha w} = E_v + \alpha E_w = \varphi(v) + \alpha\varphi(w). \]
To see that $\varphi$ is injective let $v$ be any nonzero vector in $V$. By the Building Up Lemma
there is a basis $\mathcal{B}$ containing $v$. Let $f$ be the linear transformation from $V$ to $F$
defined by sending $v$ to 1 and every element of $\mathcal{B} - \{v\}$ to zero. Then $f \in V^*$ and
$E_v(f) = f(v) = 1$. Thus $\varphi(v) = E_v$ is not zero in $V^{**}$. This proves $\ker \varphi = 0$, i.e., $\varphi$
is injective.

If $V$ has finite dimension $n$ then by Proposition 18, $V^*$ and hence also $V^{**}$ has
dimension $n$. In this case $\varphi$ is an injective linear transformation from $V$ to a finite
dimensional vector space of the same dimension, hence is an isomorphism.

Let V, W be finite dimensional vector spaces over F with bases B, £, respectively 
and let B* , £* be the dual bases. Fix some <p € Homf (V, W). Then for each / e W*, 
the composite / o <p is a linear transformation from V to F, that is / o tp e V*. Thus 
the map / / o<p defines a function from W* to V*. We denote this induced function 

on dual spaces by (p*. 
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Theorem 20. With notations as above, (p* is a linear transformation from W* to V* and 
Mj 1‘ (cp*) is the transpose of the matrix Mg(<p) (recall that the transpose of the matrix 
(jdij) is the matrix (aj,)). 


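Proof: The map $\varphi^*$ is linear because $(f + \alpha g) \circ \varphi = (f \circ \varphi) + \alpha(g \circ \varphi)$. Write
$\mathcal{B} = \{v_1, \dots, v_n\}$ and $\mathcal{E} = \{w_1, \dots, w_m\}$. The
equations which define $\varphi$ are (from its matrix)

\[ \varphi(v_j) = \sum_{i=1}^{m} \alpha_{ij} w_i, \qquad 1 \le j \le n. \]

To compute the matrix for $\varphi^*$, observe that by the definitions of $\varphi^*$ and $w_k^*$

\[ \varphi^*(w_k^*)(v_j) = (w_k^* \circ \varphi)(v_j) = w_k^*\Bigl(\sum_{i=1}^{m} \alpha_{ij} w_i\Bigr) = \alpha_{kj}. \]

Also

\[ \Bigl(\sum_{i=1}^{n} \alpha_{ki} v_i^*\Bigr)(v_j) = \alpha_{kj} \]

for all $j$. This shows that the two linear functionals below agree on a basis of $V$, hence
they are the same element of $V^*$:

\[ \varphi^*(w_k^*) = \sum_{i=1}^{n} \alpha_{ki} v_i^*. \]

This determines the matrix for $\varphi^*$ with respect to the bases $\mathcal{E}^*$ and $\mathcal{B}^*$ as the transpose
of the matrix for $\varphi$.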
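
The computation in Theorem 20 can be checked on a small example. The following Python sketch is an illustration only: the sample matrix `A` and the helper names `apply_matrix`, `pullback` and `transpose` are assumptions made for this example and are not notation from the text. It takes $V = \mathbb{Q}^3$ and $W = \mathbb{Q}^2$ with their standard bases, stores a linear functional as the list of its values on the basis, and computes the coordinates of each $\varphi^*(w_k^*)$ in the dual basis of $V^*$.

```python
# Illustrative sketch: phi : Q^3 -> Q^2 has matrix A, whose columns are the
# coordinates of phi(v_1), phi(v_2), phi(v_3).  (Hypothetical sample data.)
A = [[1, 2, 3],
     [4, 5, 6]]

def apply_matrix(A, v):
    """Coordinates of phi(v) in W, given the coordinates of v in V."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def pullback(A, f):
    """Coordinates of phi^*(f) = f o phi in the dual basis of V^*.

    A functional f on W is stored as its values (f(w_1), ..., f(w_m));
    the j-th dual coordinate of f o phi is the value (f o phi)(v_j).
    """
    n = len(A[0])
    coords = []
    for j in range(n):
        v_j = [1 if t == j else 0 for t in range(n)]   # basis vector v_j
        image = apply_matrix(A, v_j)                   # phi(v_j)
        coords.append(sum(c * y for c, y in zip(f, image)))
    return coords

def transpose(M):
    return [list(col) for col in zip(*M)]

# The columns of the matrix of phi^* are the coordinate vectors of phi^*(w_k^*).
m = len(A)
dual_columns = [pullback(A, [1 if t == k else 0 for t in range(m)])
                for k in range(m)]
matrix_of_phi_star = transpose(dual_columns)   # a 3 x 2 matrix
```

In this example `matrix_of_phi_star` coincides with `transpose(A)`, as the theorem predicts.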

Corollary 21. For any matrix A, the row rank of A equals the column rank of A. 

Proof: Let $\varphi : V \to W$ be a linear transformation whose matrix with respect to
some fixed bases of $V$ and $W$ is $A$. By Theorem 20 the matrix of $\varphi^* : W^* \to V^*$ with
respect to the dual bases is the transpose of $A$. The column rank of $A$ is the rank of $\varphi$
and the row rank of $A$ (= the column rank of the transpose of $A$) is the rank of $\varphi^*$ (cf.
Exercise 6 of Section 2). It therefore suffices to show that $\varphi$ and $\varphi^*$ have the same rank.
Now

\[ f \in \ker\varphi^* \iff \varphi^*(f) = 0 \iff f \circ \varphi(v) = 0 \text{ for all } v \in V \iff \varphi(V) \subseteq \ker f \iff f \in \operatorname{Ann}(\varphi(V)), \]

where $\operatorname{Ann}(S)$ is the annihilator of $S$ described in Exercise 3 below. Thus $\operatorname{Ann}(\varphi(V)) =
\ker\varphi^*$. By Exercise 3, $\dim \operatorname{Ann}(\varphi(V)) = \dim W - \dim \varphi(V)$. By Corollary 8,
$\dim \ker\varphi^* = \dim W^* - \dim \varphi^*(W^*)$. Since $W$ and $W^*$ have the same dimension,
$\dim \varphi(V) = \dim \varphi^*(W^*)$ as needed.
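
As a numerical illustration of Corollary 21, the sketch below computes the rank of a rational matrix and of its transpose by Gaussian elimination. It is only a check on one example, not part of the proof; the function names `rank` and `transpose` and the sample matrix are assumptions introduced here.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix over Q, computed by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in M]
    ncols = len(rows[0]) if rows else 0
    r = 0                                   # number of pivots found so far
    for c in range(ncols):
        # look for a row at or below position r with a nonzero entry in column c
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

def transpose(M):
    return [list(col) for col in zip(*M)]

# Hypothetical sample matrix: the second row is twice the first.
A = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [1, 0, 1, 0]]
row_rank = rank(transpose(A))   # column rank of the transpose = row rank of A
column_rank = rank(A)
```

Both `row_rank` and `column_rank` come out equal for this matrix, in agreement with the corollary.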
EXERCISES


1. Let $V$ be a finite dimensional vector space. Prove that the map $\varphi \mapsto \varphi^*$ in Theorem 20
gives a ring isomorphism of $\operatorname{End}(V)$ with $\operatorname{End}(V^*)$.

2. Let $V$ be the collection of polynomials with coefficients in $\mathbb{Q}$ in the variable $x$ of degree
at most 5 with $1, x, x^2, \dots, x^5$ as basis. Prove that the following are elements of the dual
space of $V$ and express them as linear combinations of the dual basis:

(a) $E : V \to \mathbb{Q}$ defined by $E(p(x)) = p(3)$ (i.e., evaluation at $x = 3$).

(b) $\varphi : V \to \mathbb{Q}$ defined by $\varphi(p(x)) = \int_0^1 p(t)\,dt$.

(c) $\varphi : V \to \mathbb{Q}$ defined by $\varphi(p(x)) = \int_0^1 t^2 p(t)\,dt$.

(d) $\varphi : V \to \mathbb{Q}$ defined by $\varphi(p(x)) = p'(5)$ where $p'(x)$ denotes the usual derivative of
the polynomial $p(x)$ with respect to $x$.

3. Let $S$ be any subset of $V^*$ for some finite dimensional space $V$. Define $\operatorname{Ann}(S) = \{v \in
V \mid f(v) = 0 \text{ for all } f \in S\}$. ($\operatorname{Ann}(S)$ is called the annihilator of $S$ in $V$).

(a) Prove that $\operatorname{Ann}(S)$ is a subspace of $V$.

(b) Let $W_1$ and $W_2$ be subspaces of $V^*$. Prove that $\operatorname{Ann}(W_1 + W_2) = \operatorname{Ann}(W_1) \cap \operatorname{Ann}(W_2)$
and $\operatorname{Ann}(W_1 \cap W_2) = \operatorname{Ann}(W_1) + \operatorname{Ann}(W_2)$.

(c) Let $W_1$ and $W_2$ be subspaces of $V^*$. Prove that $W_1 = W_2$ if and only if $\operatorname{Ann}(W_1) =
\operatorname{Ann}(W_2)$.

(d) Prove that the annihilator of $S$ is the same as the annihilator of the subspace of $V^*$
spanned by $S$.

(e) Assume $V$ is finite dimensional with basis $v_1, \dots, v_n$. Prove that if $S = \{v_1^*, \dots, v_k^*\}$
for some $k \le n$, then $\operatorname{Ann}(S)$ is the subspace spanned by $\{v_{k+1}, \dots, v_n\}$.

(f) Assume $V$ is finite dimensional. Prove that if $W^*$ is any subspace of $V^*$ then
$\dim \operatorname{Ann}(W^*) = \dim V - \dim W^*$.

4. If $V$ is infinite dimensional with basis $\mathcal{A}$, prove that $\mathcal{A}^* = \{v^* \mid v \in \mathcal{A}\}$ does not span $V^*$.

5. If $V$ is infinite dimensional with basis $\mathcal{A}$, prove that $V^*$ is isomorphic to the direct product
of copies of $F$ indexed by $\mathcal{A}$. Deduce that $\dim V^* > \dim V$. [Use Exercise 14, Section 1.]


11.4 DETERMINANTS 

Although we shall be using the theory primarily for vector spaces over a field, the theory
of determinants can be developed with no extra effort over arbitrary commutative rings
with 1. Thus in this section $R$ is any commutative ring with 1 and $V_1, V_2, \dots, V_n$, $V$ and
$W$ are $R$-modules. For convenience we repeat the definition of multilinear functions
from Section 10.4.

Definition. 

(1) A map $\varphi : V_1 \times V_2 \times \cdots \times V_n \to W$ is called multilinear if for each fixed $i$
and fixed elements $v_j \in V_j$, $j \ne i$, the map

\[ V_i \to W \quad\text{defined by}\quad x \mapsto \varphi(v_1, \dots, v_{i-1}, x, v_{i+1}, \dots, v_n) \]

is an $R$-module homomorphism. If $V_i = V$, $i = 1, 2, \dots, n$, then $\varphi$ is called
an $n$-multilinear function on $V$, and if in addition $W = R$, $\varphi$ is called an $n$-multilinear
form on $V$.

(2) An $n$-multilinear function $\varphi$ on $V$ is called alternating if $\varphi(v_1, v_2, \dots, v_n) = 0$
whenever $v_i = v_{i+1}$ for some $i \in \{1, 2, \dots, n - 1\}$ (i.e., $\varphi$ is zero whenever
two consecutive arguments are equal). The function $\varphi$ is called symmetric if
interchanging $v_i$ and $v_j$ for any $i$ and $j$ in $(v_1, v_2, \dots, v_n)$ does not alter the
value of $\varphi$ on this $n$-tuple.

When $n = 2$ (respectively, 3) one says $\varphi$ is bilinear (respectively, trilinear) rather
than 2-multilinear (respectively, 3-multilinear). Also, when $n$ is clear from the context
we shall simply say $\varphi$ is multilinear.

Example 

For any fixed $m \ge 0$ the usual dot product on $V = \mathbb{R}^m$ is a bilinear form (here the ring $R$
is the field of real numbers).

Proposition 22. Let $\varphi$ be an $n$-multilinear alternating function on $V$. Then

(1) $\varphi(v_1, \dots, v_{i-1}, v_{i+1}, v_i, v_{i+2}, \dots, v_n) = -\varphi(v_1, v_2, \dots, v_n)$ for any $i \in
\{1, 2, \dots, n - 1\}$, i.e., the value of $\varphi$ on an $n$-tuple is negated if two adjacent
components are interchanged.

(2) For each $\sigma \in S_n$, $\varphi(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(n)}) = \epsilon(\sigma)\varphi(v_1, v_2, \dots, v_n)$, where
$\epsilon(\sigma)$ is the sign of the permutation $\sigma$ (cf. Section 3.5).

(3) If $v_i = v_j$ for any pair of distinct $i, j \in \{1, 2, \dots, n\}$ then $\varphi(v_1, v_2, \dots, v_n) = 0$.

(4) If $v_i$ is replaced by $v_i + \alpha v_j$ in $(v_1, \dots, v_n)$ for any $j \ne i$ and any $\alpha \in R$, the
value of $\varphi$ on this $n$-tuple is not changed.

Proof: (1) Let $\psi(x, y)$ be the function $\varphi$ with variable entries $x$ and $y$ in positions
$i$ and $i + 1$ respectively and fixed entries $v_j$ in position $j$, for all other $j$. Thus (1) is the
same as showing $\psi(y, x) = -\psi(x, y)$. Since $\varphi$ is alternating $\psi(x + y, x + y) = 0$.
Expanding $x + y$ in each variable in turn gives $\psi(x + y, x + y) = \psi(x, x) + \psi(x, y) +
\psi(y, x) + \psi(y, y)$. Again, by the alternating property of $\varphi$, the first and last terms on
the right hand side of the latter equation are zero. Thus $0 = \psi(x, y) + \psi(y, x)$, which
gives (1).

(2) Every permutation can be written as a product of transpositions (cf. Section
3.5). Furthermore, every transposition may be written as a product of transpositions
which interchange two successive integers (cf. Exercise 3 of Section 3.5). Thus every
permutation $\sigma$ can be written as $\tau_1 \cdots \tau_m$, where $\tau_k$ is a transposition interchanging two
successive integers, for all $k$. It follows from $m$ applications of (1) that

\[ \varphi(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(n)}) = \epsilon(\tau_m) \cdots \epsilon(\tau_1)\varphi(v_1, v_2, \dots, v_n). \]

Finally, since $\epsilon$ is a homomorphism into the abelian group $\pm 1$ (so the order of the factors
$\pm 1$ does not matter), $\epsilon(\tau_1) \cdots \epsilon(\tau_m) = \epsilon(\tau_1 \cdots \tau_m) = \epsilon(\sigma)$. This proves (2).

(3) Choose $\sigma$ to be any permutation which fixes $i$ and moves $j$ to $i + 1$. Thus
$(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(n)})$ has two equal adjacent components so $\varphi$ is zero on this $n$-tuple.
By (2), $\varphi(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(n)}) = \pm\varphi(v_1, v_2, \dots, v_n)$. This implies (3).

(4) This follows immediately from (3) on expanding by linearity in the $i^{\text{th}}$ position.
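
Proposition 23. Assume $\varphi$ is an $n$-multilinear alternating function on $V$ and that for
some $v_1, v_2, \dots, v_n$ and $w_1, w_2, \dots, w_n \in V$ and some $\alpha_{ij} \in R$ we have

\[ \begin{aligned}
w_1 &= \alpha_{11}v_1 + \alpha_{21}v_2 + \cdots + \alpha_{n1}v_n \\
w_2 &= \alpha_{12}v_1 + \alpha_{22}v_2 + \cdots + \alpha_{n2}v_n \\
&\;\;\vdots \\
w_n &= \alpha_{1n}v_1 + \alpha_{2n}v_2 + \cdots + \alpha_{nn}v_n
\end{aligned} \]

(we have purposely written the indices of the $\alpha_{ij}$ in "column format"). Then

\[ \varphi(w_1, w_2, \dots, w_n) = \sum_{\sigma \in S_n} \epsilon(\sigma)\,\alpha_{\sigma(1)1}\alpha_{\sigma(2)2} \cdots \alpha_{\sigma(n)n}\,\varphi(v_1, v_2, \dots, v_n). \]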


Proof: If we expand $\varphi(w_1, w_2, \dots, w_n)$ by multilinearity we obtain a sum of $n^n$
terms of the form $\alpha_{i_1 1}\alpha_{i_2 2} \cdots \alpha_{i_n n}\,\varphi(v_{i_1}, v_{i_2}, \dots, v_{i_n})$, where the indices $i_1, i_2, \dots, i_n$
each run over $1, 2, \dots, n$. By Proposition 22(3), $\varphi$ is zero on the terms where two
or more of the $i_j$'s are equal. Thus in this expansion we need only consider the
terms where $i_1, \dots, i_n$ are distinct. Such sequences are in bijective correspondence
with permutations in $S_n$, so each nonzero term may be written as $\alpha_{\sigma(1)1}\alpha_{\sigma(2)2} \cdots
\alpha_{\sigma(n)n}\,\varphi(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(n)})$, for some $\sigma \in S_n$. Applying (2) of the previous
proposition to each of these terms in the expansion of $\varphi(w_1, w_2, \dots, w_n)$ gives the expression
in the proposition.

Definition. An $n \times n$ determinant function on $R$ is any function

\[ \det : M_{n \times n}(R) \to R \]

that satisfies the following two axioms:

(1) det is an $n$-multilinear alternating form on $R^n$ ($= V$), where the $n$-tuples are the
$n$ columns of the matrices in $M_{n \times n}(R)$

(2) $\det(I) = 1$, where $I$ is the $n \times n$ identity matrix.

On occasion we shall write $\det(A_1, A_2, \dots, A_n)$ for $\det A$, where $A_1, A_2, \dots, A_n$
are the columns of $A$.

Theorem 24. There is a unique $n \times n$ determinant function on $R$ and it can be computed
for any $n \times n$ matrix $(\alpha_{ij})$ by the formula:

\[ \det(\alpha_{ij}) = \sum_{\sigma \in S_n} \epsilon(\sigma)\,\alpha_{\sigma(1)1}\alpha_{\sigma(2)2} \cdots \alpha_{\sigma(n)n}. \]

Proof: Let $A_1, A_2, \dots, A_n$ be the column vectors in a general $n \times n$ matrix $(\alpha_{ij})$.
We leave it as an exercise to check that the formula given in the statement of the theorem
does satisfy the axioms of a determinant function; this gives existence of a determinant
function. To prove uniqueness let $e_i$ be the column $n$-tuple with 1 in position $i$ and zeros
in all other positions. Then

\[ \begin{aligned}
A_1 &= \alpha_{11}e_1 + \alpha_{21}e_2 + \cdots + \alpha_{n1}e_n \\
A_2 &= \alpha_{12}e_1 + \alpha_{22}e_2 + \cdots + \alpha_{n2}e_n \\
&\;\;\vdots \\
A_n &= \alpha_{1n}e_1 + \alpha_{2n}e_2 + \cdots + \alpha_{nn}e_n.
\end{aligned} \]

By Proposition 23, $\det A = \sum_{\sigma \in S_n} \epsilon(\sigma)\,\alpha_{\sigma(1)1}\alpha_{\sigma(2)2} \cdots \alpha_{\sigma(n)n} \det(e_1, e_2, \dots, e_n)$. Since
by axiom (2) of a determinant function $\det(e_1, e_2, \dots, e_n) = 1$, the value of $\det A$ is as
claimed.
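
The formula of Theorem 24 translates directly into a short program. The Python sketch below is a minimal illustration for small integer matrices; the names `sign` and `det_leibniz` and the sample matrix are assumptions made for this example. The sign of a permutation is computed from the parity of its number of inversions, which agrees with the homomorphism $\epsilon$.

```python
from itertools import permutations

def sign(perm):
    """Sign of a permutation of 0..n-1, via the parity of its inversion count."""
    n = len(perm)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n)
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def det_leibniz(A):
    """det A = sum over sigma of sign(sigma) * product over j of A[sigma(j)][j]."""
    n = len(A)
    total = 0
    for sigma in permutations(range(n)):
        term = sign(sigma)
        for j in range(n):
            term *= A[sigma[j]][j]      # the entry alpha_{sigma(j) j}
        total += term
    return total

I3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
A = [[2, 0, 1],
     [1, 3, 2],
     [1, 1, 4]]                          # hypothetical sample matrix
# the same matrix with its first two columns interchanged
A_swapped = [[row[1], row[0], row[2]] for row in A]
```

On this data `det_leibniz(I3)` is 1 (axiom (2)) and interchanging two adjacent columns negates the value, as in Proposition 22(1). The sum has $n!$ terms, so this approach is only practical for small $n$.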

Corollary 25. The determinant is an $n$-multilinear function of the rows of $M_{n \times n}(R)$
and for any $n \times n$ matrix $A$, $\det A = \det(A^t)$, where $A^t$ is the transpose of $A$.

Proof: The first statement is an immediate consequence of the second, so it suffices
to prove that a matrix and its transpose have the same determinant. For $A = (\alpha_{ij})$ one
calculates that

\[ \det A^t = \sum_{\sigma \in S_n} \epsilon(\sigma)\,\alpha_{1\sigma(1)}\alpha_{2\sigma(2)} \cdots \alpha_{n\sigma(n)}. \]

Each number from 1 to $n$ appears exactly once among $\sigma(1), \dots, \sigma(n)$ so we may
rearrange the product $\alpha_{1\sigma(1)}\alpha_{2\sigma(2)} \cdots \alpha_{n\sigma(n)}$ as $\alpha_{\sigma^{-1}(1)1}\alpha_{\sigma^{-1}(2)2} \cdots \alpha_{\sigma^{-1}(n)n}$. Also, the
homomorphism $\epsilon$ takes values in $\{\pm 1\}$ so $\epsilon(\sigma) = \epsilon(\sigma^{-1})$. Thus the sum for $\det A^t$ may
be rewritten as

\[ \sum_{\sigma \in S_n} \epsilon(\sigma^{-1})\,\alpha_{\sigma^{-1}(1)1}\alpha_{\sigma^{-1}(2)2} \cdots \alpha_{\sigma^{-1}(n)n}. \]

The latter sum is over all permutations, so the index $\sigma^{-1}$ may be replaced by $\sigma$. The
resulting expression is the sum for $\det A$. This completes the proof.

Theorem 26. (Cramer's Rule) If $A_1, A_2, \dots, A_n$ are the columns of an $n \times n$ matrix
$A$ and $B = \beta_1 A_1 + \beta_2 A_2 + \cdots + \beta_n A_n$, for some $\beta_1, \dots, \beta_n \in R$, then

\[ \beta_i \det A = \det(A_1, \dots, A_{i-1}, B, A_{i+1}, \dots, A_n). \]

Proof: This follows immediately from Proposition 22(3) on replacing the given
expression for $B$ in the $i^{\text{th}}$ position and expanding by multilinearity in that position.
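
Cramer's Rule in this form involves no division, so it can be checked over $\mathbb{Z}$. The following Python sketch does so for one hypothetical integer matrix; the helper names `det` and `replace_column` and the data `A`, `beta` are assumptions for illustration. The determinant is evaluated by the permutation sum of Theorem 24.

```python
from itertools import permutations

def det(A):
    """Determinant by the permutation-sum formula of Theorem 24 (small n only)."""
    n = len(A)
    total = 0
    for s in permutations(range(n)):
        inversions = sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n))
        term = -1 if inversions % 2 else 1
        for j in range(n):
            term *= A[s[j]][j]
        total += term
    return total

def replace_column(A, i, B):
    """Copy of A with column i (0-based) replaced by the column vector B."""
    return [row[:i] + [B[r]] + row[i + 1:] for r, row in enumerate(A)]

A = [[2, 1, 0],
     [0, 1, 3],
     [1, 0, 1]]                 # hypothetical integer matrix
beta = [4, -1, 2]               # hypothetical coefficients beta_1, beta_2, beta_3
# B = beta_1 A_1 + beta_2 A_2 + beta_3 A_3, a combination of the columns of A
B = [sum(beta[j] * A[r][j] for j in range(3)) for r in range(3)]
# beta_i det A should equal det(A_1, ..., B, ..., A_n) with B in position i
checks = [beta[i] * det(A) == det(replace_column(A, i, B)) for i in range(3)]
```

Every entry of `checks` is `True` for this data. When $\det A$ is a unit, dividing by it recovers the familiar formula for the solution of $AX = B$.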

Corollary 27. If $R$ is an integral domain, then $\det A = 0$ for $A \in M_n(R)$ if and only
if the columns of $A$ are $R$-linearly dependent as elements of the free $R$-module of rank
$n$. Also, $\det A = 0$ if and only if the rows of $A$ are $R$-linearly dependent.

Proof: Since $\det A = \det A^t$ the first sentence implies the second.

Assume first that the columns of $A$ are linearly dependent and

\[ 0 = \beta_1 A_1 + \beta_2 A_2 + \cdots + \beta_n A_n \]

is a dependence relation on the columns of $A$ with, say, $\beta_i \ne 0$. By Cramer's Rule,
$\beta_i \det A = 0$. Since $R$ is an integral domain and $\beta_i \ne 0$, $\det A = 0$.

Conversely, assume the columns of $A$ are independent. Consider the integral domain
$R$ as embedded in its quotient field $F$ so that $M_{n \times n}(R)$ may be considered as
a subring of $M_{n \times n}(F)$ (and note that the determinant function on the subring is the
restriction of the determinant function from $M_{n \times n}(F)$). The columns of $A$ in this way
become elements of $F^n$. Any nonzero $F$-linear combination of the columns of $A$ which
is zero in $F^n$ gives, by multiplying the coefficients by a common denominator, a nonzero
$R$-linear dependence relation. The columns of $A$ must therefore be independent vectors
in $F^n$. Since $A$ has $n$ columns, these form a basis of $F^n$. Thus there are elements $\beta_{ij}$
of $F$ such that for each $i$, the $i^{\text{th}}$ basis vector $e_i$ in $F^n$ may be expressed as

\[ e_i = \beta_{1i}A_1 + \beta_{2i}A_2 + \cdots + \beta_{ni}A_n. \]

The $n \times n$ identity matrix is the one whose columns are $e_1, \dots, e_n$. By Proposition
23 (with $\varphi = \det$), the determinant of the identity matrix is some $F$-multiple of $\det A$.
Since the determinant of the identity matrix is 1, $\det A$ cannot be zero. This completes
the proof.

Theorem 28. For matrices $A, B \in M_{n \times n}(R)$, $\det AB = (\det A)(\det B)$.

Proof: Let $B = (\beta_{ij})$ and let $A_1, A_2, \dots, A_n$ be the columns of $A$. Then $C = AB$
is the $n \times n$ matrix whose $j^{\text{th}}$ column is $C_j = \beta_{1j}A_1 + \beta_{2j}A_2 + \cdots + \beta_{nj}A_n$. By
Proposition 23 applied to the multilinear function det we obtain

\[ \det C = \det(C_1, \dots, C_n) = \Bigl[\sum_{\sigma \in S_n} \epsilon(\sigma)\,\beta_{\sigma(1)1}\beta_{\sigma(2)2} \cdots \beta_{\sigma(n)n}\Bigr] \det(A_1, \dots, A_n). \]

The sum inside the brackets is the formula for $\det B$, hence $\det C = (\det B)(\det A)$, as
required ($R$ is commutative).

Definition. Let $A = (\alpha_{ij})$ be an $n \times n$ matrix. For each $i, j$, let $A_{ij}$ be the $(n-1) \times (n-1)$
matrix obtained from $A$ by deleting its $i^{\text{th}}$ row and $j^{\text{th}}$ column (an $(n-1) \times (n-1)$ minor
of $A$). Then $(-1)^{i+j} \det(A_{ij})$ is called the $ij$ cofactor of $A$.

Theorem 29. (The Cofactor Expansion Formula along the $i^{\text{th}}$ row) If $A = (\alpha_{ij})$ is an
$n \times n$ matrix, then for each fixed $i \in \{1, 2, \dots, n\}$ the determinant of $A$ can be computed
from the formula

\[ \det A = (-1)^{i+1}\alpha_{i1} \det A_{i1} + (-1)^{i+2}\alpha_{i2} \det A_{i2} + \cdots + (-1)^{i+n}\alpha_{in} \det A_{in}. \]


Proof: For each $A$ let $D(A)$ be the element of $R$ obtained from the cofactor expansion
formula described above. We prove that $D$ satisfies the axioms of a determinant
function, hence is the determinant function. Proceed by induction on $n$. If $n = 1$,
$D((\alpha)) = \alpha$, for all $1 \times 1$ matrices $(\alpha)$ and the result holds. Assume therefore that
$n \ge 2$. To show that $D$ is an alternating multilinear function of the columns, fix an
index $k$ and consider the $k^{\text{th}}$ column as varying and all other columns as fixed. If $j \ne k$,
$\alpha_{ij}$ does not depend on the $k^{\text{th}}$ column and $D(A_{ij})$ is linear in the $k^{\text{th}}$ column by induction. Also, as
the $k^{\text{th}}$ column varies linearly so does $\alpha_{ik}$, whereas $D(A_{ik})$ remains unchanged (the $k^{\text{th}}$
column has been deleted from $A_{ik}$). Thus each term in the formula for $D$ varies linearly
in the $k^{\text{th}}$ column. This proves $D$ is multilinear in the columns.

To prove $D$ is alternating assume columns $k$ and $k + 1$ of $A$ are equal. If $j \ne k$ or
$k + 1$, the two equal columns of $A$ become two equal columns in the matrix $A_{ij}$. By
induction $D(A_{ij}) = 0$. The formula for $D$ therefore has at most two nonzero terms:
when $j = k$ and when $j = k + 1$. The minor matrices $A_{ik}$ and $A_{i\,k+1}$ are identical and
$\alpha_{ik} = \alpha_{i\,k+1}$. Then the two remaining terms in the expansion for $D$, $(-1)^{i+k}\alpha_{ik}D(A_{ik})$
and $(-1)^{i+k+1}\alpha_{i\,k+1}D(A_{i\,k+1})$, are equal and appear with opposite signs, hence they
cancel. Thus $D(A) = 0$ if $A$ has two adjacent columns which are equal, i.e., $D$ is
alternating.

Finally, it follows easily from the formula and induction that $D(I) = 1$, where $I$ is
the identity matrix. This completes the induction.
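
The cofactor expansion is naturally recursive. The Python sketch below is one possible rendering for small matrices with integer entries; the names `minor` and `det_cofactor` are assumptions for this example, and indices are 0-based, which does not change the parity of $i + j$.

```python
def minor(A, i, j):
    """The matrix A_{ij}: delete row i and column j (0-based indices)."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(A) if r != i]

def det_cofactor(A, i=0):
    """Determinant by cofactor expansion along row i.

    Only the outermost call uses the chosen row; the recursive calls on the
    minors expand along their first row.
    """
    n = len(A)
    if n == 1:
        return A[0][0]
    return sum((-1) ** (i + j) * A[i][j] * det_cofactor(minor(A, i, j))
               for j in range(n))

A = [[2, 0, 1],
     [1, 3, 2],
     [1, 1, 4]]                                  # hypothetical sample matrix
values = [det_cofactor(A, i) for i in range(3)]  # expand along each row in turn
```

All three entries of `values` agree, as Theorem 29 asserts: the result does not depend on the row chosen.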


Theorem 30. (Cofactor Formula for the Inverse of a Matrix) Let $A = (\alpha_{ij})$ be an
$n \times n$ matrix and let $B$ be the transpose of its matrix of cofactors, i.e., $B = (\beta_{ij})$, where
$\beta_{ij} = (-1)^{i+j} \det A_{ji}$, $1 \le i, j \le n$. Then $AB = BA = (\det A)I$. Moreover, $\det A$ is
a unit in $R$ if and only if $A$ is a unit in $M_{n \times n}(R)$; in this case the matrix $\frac{1}{\det A}B$ is the
inverse of $A$.


Proof: The $i, j$ entry of $AB$ is $\alpha_{i1}\beta_{1j} + \alpha_{i2}\beta_{2j} + \cdots + \alpha_{in}\beta_{nj}$. By definition of the
entries of $B$ this equals

\[ \alpha_{i1}(-1)^{j+1}D(A_{j1}) + \alpha_{i2}(-1)^{j+2}D(A_{j2}) + \cdots + \alpha_{in}(-1)^{j+n}D(A_{jn}). \tag{11.7} \]

If $i = j$, this is the cofactor expansion for $\det A$ along the $i^{\text{th}}$ row. The diagonal entries
of $AB$ are thus all equal to $\det A$. If $i \ne j$, let $\bar{A}$ be the matrix $A$ with the $j^{\text{th}}$ row
replaced by the $i^{\text{th}}$ row, so $\det \bar{A} = 0$. By inspection $\bar{A}_{jk} = A_{jk}$ and $\alpha_{ik} = \bar{\alpha}_{jk}$ for every
$k \in \{1, 2, \dots, n\}$. By making these substitutions in equation (11.7) for each $k = 1, 2, \dots, n$
one sees that the $i, j$ entry in $AB$ equals $\bar{\alpha}_{j1}(-1)^{1+j}D(\bar{A}_{j1}) + \cdots + \bar{\alpha}_{jn}(-1)^{n+j}D(\bar{A}_{jn})$.
This expression is the cofactor expansion for $\det \bar{A}$ along the $j^{\text{th}}$ row. Since, as noted
above, $\det \bar{A} = 0$, this proves that all off diagonal terms of $AB$ are zero, which proves
that $AB = (\det A)I$.

It follows directly from the definition of $B$ that the pair $(A^t, B^t)$ satisfies the
same hypotheses as the pair $(A, B)$. By what has already been shown it follows that
$(BA)^t = A^t B^t = (\det A^t)I$. Since $\det A^t = \det A$ and the transpose of a diagonal matrix
is itself, we obtain $BA = (\det A)I$ as well.

If $d = \det A$ is a unit in $R$, then $d^{-1}B$ is a matrix with entries in $R$ whose product
with $A$ (on either side) is the identity, i.e., $A$ is a unit in $M_{n \times n}(R)$. Conversely, assume
that $A$ is a unit in $M_{n \times n}(R)$ with (2-sided) inverse matrix $C$. Since $\det C \in R$ and

\[ 1 = \det I = \det AC = (\det A)(\det C) = (\det C)(\det A), \]

it follows that $\det A$ has a 2-sided inverse in $R$, as needed. This completes all parts of
the proof.
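
The identity $AB = BA = (\det A)I$ holds over any commutative ring, so it can be tested with integer matrices and no division. The sketch below is an illustration under assumed names (`minor`, `det`, `adjugate`, `matmul`) with a hypothetical sample matrix; it builds $B$ exactly as in the statement of Theorem 30, with 0-based indices.

```python
def minor(A, i, j):
    """Delete row i and column j (0-based)."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(A) if r != i]

def det(A):
    """Cofactor expansion along the first row."""
    if len(A) == 1:
        return A[0][0]
    return sum((-1) ** j * A[0][j] * det(minor(A, 0, j)) for j in range(len(A)))

def adjugate(A):
    """Transpose of the matrix of cofactors: B[i][j] = (-1)^(i+j) det A_{ji}."""
    n = len(A)
    return [[(-1) ** (i + j) * det(minor(A, j, i)) for j in range(n)]
            for i in range(n)]

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

A = [[2, 1, 0],
     [0, 1, 3],
     [1, 0, 1]]                 # hypothetical integer matrix
B = adjugate(A)
d = det(A)
scalar = [[d if i == j else 0 for j in range(3)] for i in range(3)]   # (det A) I
```

Here both `matmul(A, B)` and `matmul(B, A)` equal `scalar`. Since $d = 5$ is not a unit in $\mathbb{Z}$, this particular $A$ has no inverse with integer entries, although it is invertible over $\mathbb{Q}$.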
EXERCISES


1. Formulate and prove the cofactor expansion formula along the $j^{\text{th}}$ column of a square
matrix $A$.

2. Let $F$ be a field and let $A_1, A_2, \dots, A_n$ be (column) vectors in $F^n$. Form the matrix $A$
whose $i^{\text{th}}$ column is $A_i$. Prove that these vectors form a basis of $F^n$ if and only if $\det A \ne 0$.

3. Let $R$ be any commutative ring with 1, let $V$ be an $R$-module and let $x_1, x_2, \dots, x_n \in V$.
Assume that for some $A \in M_{n \times n}(R)$,

\[ A \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = 0. \]

Prove that $(\det A)x_i = 0$, for all $i \in \{1, 2, \dots, n\}$.

4. (Computing Determinants of Matrices) This exercise outlines the use of Gauss-Jordan
elimination (cf. the exercises in Section 2) to compute determinants. This is the most
efficient general procedure for computing large determinants. Let $A$ be an $n \times n$ matrix.

(a) Prove that the elementary row operations have the following effect on determinants:

(i) interchanging two rows changes the sign of the determinant

(ii) adding a multiple of one row to another does not alter the determinant

(iii) multiplying any row by a nonzero element $u$ from $F$ multiplies the determinant
by $u$.

(b) Prove that $\det A$ is nonzero if and only if $A$ is row equivalent to the $n \times n$ identity
matrix. Suppose $A$ can be row reduced to the identity matrix using a total of $s$ row
interchanges as in (i) and by multiplying rows by the nonzero elements $u_1, u_2, \dots, u_t$
as in (iii). Prove that $\det A = (-1)^s (u_1 u_2 \cdots u_t)^{-1}$.

5. Compute the determinants of the following matrices using row reduction: 



6. (Minkowski's Criterion) Suppose $A$ is an $n \times n$ matrix with real entries such that the
diagonal elements are all positive, the off-diagonal elements are all negative and the row
sums are all positive. Prove that $\det A \ne 0$. [Consider the corresponding system of
equations $AX = 0$ and suppose there is a nontrivial solution $(x_1, \dots, x_n)$. If $x_i$ has the
largest absolute value show that the $i^{\text{th}}$ equation leads to a contradiction.]
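
The row-reduction method outlined in Exercise 4 can be sketched in code. The version below is a variant chosen for brevity, not the procedure of the exercise itself: it reduces only to upper triangular form over $\mathbb{Q}$, using row interchanges and additions of multiples of one row to another, and then multiplies the diagonal entries (the permutation-sum formula shows that the determinant of a triangular matrix is the product of its diagonal entries). The function name `det_by_elimination` is an assumption for this example.

```python
from fractions import Fraction

def det_by_elimination(A):
    """Determinant over Q via row reduction to upper triangular form.

    Row interchanges flip the sign; adding a multiple of one row to another
    leaves the determinant unchanged.
    """
    M = [[Fraction(x) for x in row] for row in A]
    n = len(M)
    sign = 1
    for c in range(n):
        pivot = next((r for r in range(c, n) if M[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)          # no pivot in this column: det A = 0
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            sign = -sign
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            M[r] = [a - factor * b for a, b in zip(M[r], M[c])]
    result = Fraction(sign)
    for c in range(n):
        result *= M[c][c]               # product of the diagonal entries
    return result
```

This uses on the order of $n^3$ arithmetic operations, compared with the $n!$ terms of the permutation sum, which is why elimination is preferred for large matrices.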


11.5 TENSOR ALGEBRAS, SYMMETRIC AND EXTERIOR ALGEBRAS 

In this section $R$ is any commutative ring with 1, and we assume the left and right
actions of $R$ on each $R$-module are the same. We shall primarily be interested in the
special case when $R = F$ is a field, but the basic constructions hold in general.

Suppose $M$ is an $R$-module. When tensor products were first introduced in Section
10.4 we spoke heuristically of forming "products" $m_1 m_2$ of elements of $M$, and we
constructed a new module $M \otimes M$ generated by such "products" $m_1 \otimes m_2$. The "value"
of this product is not in $M$, so this does not give a ring structure on $M$ itself. If, however,
we iterate this by taking the "products" $m_1 m_2 m_3$ and $m_1 m_2 m_3 m_4$, and all finite sums of
such products, we can construct a ring containing $M$ that is "universal" with respect to
rings containing $M$ (and, more generally, with respect to homomorphic images of $M$),
as we now show.

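For each integer $k \ge 1$, define

\[ \mathcal{T}^k(M) = M \otimes_R M \otimes_R \cdots \otimes_R M \quad (k \text{ factors}), \]

and set $\mathcal{T}^0(M) = R$. The elements of $\mathcal{T}^k(M)$ are called $k$-tensors. Define

\[ \mathcal{T}(M) = R \oplus \mathcal{T}^1(M) \oplus \mathcal{T}^2(M) \oplus \mathcal{T}^3(M) \oplus \cdots = \bigoplus_{k=0}^{\infty} \mathcal{T}^k(M). \]

Every element of $\mathcal{T}(M)$ is a finite linear combination of $k$-tensors for various $k \ge 0$.
We identify $M$ with $\mathcal{T}^1(M)$, so that $M$ is an $R$-submodule of $\mathcal{T}(M)$.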

Theorem 31. If $M$ is any $R$-module over the commutative ring $R$ then

(1) $\mathcal{T}(M)$ is an $R$-algebra containing $M$ with multiplication defined by mapping

\[ (m_1 \otimes \cdots \otimes m_i)(m'_1 \otimes \cdots \otimes m'_j) = m_1 \otimes \cdots \otimes m_i \otimes m'_1 \otimes \cdots \otimes m'_j \]

and extended to sums via the distributive laws. With respect to this multiplication
$\mathcal{T}^i(M)\mathcal{T}^j(M) \subseteq \mathcal{T}^{i+j}(M)$.

(2) (Universal Property) If $A$ is any $R$-algebra and $\varphi : M \to A$ is an $R$-module
homomorphism, then there is a unique $R$-algebra homomorphism $\Phi : \mathcal{T}(M) \to
A$ such that $\Phi|_M = \varphi$.

Proof: The map

\[ \underbrace{M \times M \times \cdots \times M}_{i \text{ factors}} \times \underbrace{M \times M \times \cdots \times M}_{j \text{ factors}} \to \mathcal{T}^{i+j}(M) \]

defined by

\[ (m_1, \dots, m_i, m'_1, \dots, m'_j) \mapsto m_1 \otimes \cdots \otimes m_i \otimes m'_1 \otimes \cdots \otimes m'_j \]

is $R$-multilinear, so induces a bilinear map $\mathcal{T}^i(M) \times \mathcal{T}^j(M)$ to $\mathcal{T}^{i+j}(M)$ which is
easily checked to give a well defined multiplication satisfying (1) (cf. the proof of
Proposition 21 in Section 10.4). To prove (2), assume that $\varphi : M \to A$ is an $R$-module
homomorphism. Then

\[ (m_1, m_2, \dots, m_k) \mapsto \varphi(m_1)\varphi(m_2) \cdots \varphi(m_k) \]

defines an $R$-multilinear map from $M \times \cdots \times M$ ($k$ times) to $A$. This in turn induces a
unique $R$-module homomorphism $\Phi$ from $\mathcal{T}^k(M)$ to $A$ (Corollary 16 of Section 10.4)
mapping $m_1 \otimes \cdots \otimes m_k$ to the element on the right hand side above. It is easy to check
from the definition of the multiplication in (1) that the resulting uniquely defined map
$\Phi : \mathcal{T}(M) \to A$ is an $R$-algebra homomorphism.

Definition. The ring $\mathcal{T}(M)$ is called the tensor algebra of $M$.


Proposition 32. Let $V$ be a finite dimensional vector space over the field $F$ with basis
$\mathcal{B} = \{v_1, \dots, v_n\}$. Then the $k$-tensors

\[ v_{i_1} \otimes v_{i_2} \otimes \cdots \otimes v_{i_k} \quad\text{with } v_{i_j} \in \mathcal{B} \]

are a vector space basis of $\mathcal{T}^k(V)$ over $F$ (with the understanding that the basis vector
is the element $1 \in F$ when $k = 0$). In particular, $\dim_F(\mathcal{T}^k(V)) = n^k$.

Proof: This follows immediately from Proposition 16 of Section 2.

Theorem 31 and Proposition 32 show that the space $\mathcal{T}(V)$ may be regarded as the
noncommutative polynomial algebra over $F$ in the (noncommuting) variables $v_1, \dots, v_n$.
The analogous result also holds for finitely generated free modules over any commutative
ring (using Corollary 19 in Section 10.4).
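
To make the "noncommutative polynomial" picture concrete, the basis of $\mathcal{T}^k(V)$ in Proposition 32 can be listed as ordered $k$-tuples of basis vectors, with the product of two basis tensors given by concatenation. The Python sketch below is an illustration only; the names `tensor_basis` and `multiply` and the labels `"v1"`, `"v2"`, `"v3"` are assumptions for this example.

```python
from itertools import product

def tensor_basis(basis, k):
    """Ordered k-tuples (v_{i_1}, ..., v_{i_k}) labelling the basis of T^k(V).

    For k = 0 this is the single empty tuple, standing for the element 1.
    """
    return list(product(basis, repeat=k))

def multiply(t1, t2):
    """Product of two basis tensors in T(V): concatenate the factors."""
    return t1 + t2

B = ["v1", "v2", "v3"]                       # hypothetical basis labels, n = 3
sizes = [len(tensor_basis(B, k)) for k in range(4)]   # 1, n, n^2, n^3
ab = multiply(("v1",), ("v2",))              # v1 (x) v2
ba = multiply(("v2",), ("v1",))              # v2 (x) v1, a different basis tensor
```

The list `sizes` matches $n^k$ for $k = 0, 1, 2, 3$, and `ab != ba` reflects the fact that the variables do not commute.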

Examples 

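(1) Let $R = \mathbb{Z}$ and let $M = \mathbb{Q}/\mathbb{Z}$. Then $(\mathbb{Q}/\mathbb{Z}) \otimes_{\mathbb{Z}} (\mathbb{Q}/\mathbb{Z}) = 0$ (Example 4 following
Corollary 12 in Section 10.4). Thus $\mathcal{T}(\mathbb{Q}/\mathbb{Z}) = \mathbb{Z} \oplus (\mathbb{Q}/\mathbb{Z})$, where addition is componentwise
and the multiplication is given by $(r, \bar{p})(s, \bar{q}) = (rs, \overline{rq + sp})$. The ring
$R/(x)$ of Exercise 4(d) in Section 9.3 is isomorphic to $\mathcal{T}(\mathbb{Q}/\mathbb{Z})$.

(2) Let $R = \mathbb{Z}$ and let $M = \mathbb{Z}/n\mathbb{Z}$. Then $(\mathbb{Z}/n\mathbb{Z}) \otimes_{\mathbb{Z}} (\mathbb{Z}/n\mathbb{Z}) \cong \mathbb{Z}/n\mathbb{Z}$ (Example 3
following Corollary 12 in Section 10.4). Thus $\mathcal{T}^i(M) \cong M$ for all $i > 0$ and so
$\mathcal{T}(\mathbb{Z}/n\mathbb{Z}) \cong \mathbb{Z} \oplus (\mathbb{Z}/n\mathbb{Z}) \oplus (\mathbb{Z}/n\mathbb{Z}) \oplus \cdots$. It follows easily that $\mathcal{T}(\mathbb{Z}/n\mathbb{Z}) \cong \mathbb{Z}[x]/(nx)$.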

Since $\mathcal{T}^i(M)\mathcal{T}^j(M) \subseteq \mathcal{T}^{i+j}(M)$, the tensor algebra $\mathcal{T}(M)$ has a natural "grading"
or "degree" structure reminiscent of a polynomial ring.

Definition. 

(1) A ring $S$ is called a graded ring if it is the direct sum of additive subgroups:
$S = S_0 \oplus S_1 \oplus S_2 \oplus \cdots$ such that $S_i S_j \subseteq S_{i+j}$ for all $i, j \ge 0$. The elements of
$S_k$ are said to be homogeneous of degree $k$, and $S_k$ is called the homogeneous
component of $S$ of degree $k$.

(2) An ideal $I$ of the graded ring $S$ is called a graded ideal if $I = \bigoplus_{k=0}^{\infty} (I \cap S_k)$.

(3) A ring homomorphism $\varphi : S \to T$ between two graded rings is called a
homomorphism of graded rings if it respects the grading structures on $S$ and $T$,
i.e., if $\varphi(S_k) \subseteq T_k$ for $k = 0, 1, 2, \dots$.

Note that $S_0 S_0 \subseteq S_0$, which implies that $S_0$ is a subring of the graded ring $S$ and
then $S$ is an $S_0$-module. If $S_0$ is in the center of $S$ and it contains an identity of $S$, then
$S$ is an $S_0$-algebra. Note also that the ideal $I$ is graded if whenever a sum $i_{k_1} + \cdots + i_{k_n}$
of homogeneous elements with distinct degrees $k_1, \dots, k_n$ is in $I$ then each of the
individual summands $i_{k_1}, \dots, i_{k_n}$ is itself in $I$.

Example

The polynomial ring $S = R[x_1, x_2, \dots, x_n]$ in $n$ variables over the commutative ring $R$ is
an example of a graded ring. Here $S_0 = R$ and the homogeneous component of degree $k$
is the subgroup of all $R$-linear combinations of monomials of degree $k$.

The ideal $I$ generated by $x_1, \dots, x_n$ is a graded ideal: every polynomial with zero
constant term may be written uniquely as a sum of homogeneous polynomials of degree
$k \ge 1$, and each of these has zero constant term hence lies in $I$. More generally, an ideal is
a graded ideal if and only if it can be generated by homogeneous polynomials (cf. Exercise
17 in Section 9.1).

Not every ideal of a graded ring need be a graded ideal. For example in the graded
ring $\mathbb{Z}[x]$ the principal ideal $J$ generated by $1 + x$ is not graded: $1 + x \in J$ and $1 \notin J$ so
$1 + x$ cannot be written as a sum of homogeneous polynomials each of which belongs to
$J$.

The next result shows that quotients of graded rings by graded ideals are again 
graded rings. 

Proposition 33. Let $S$ be a graded ring, let $I$ be a graded ideal in $S$ and let $I_k = I \cap S_k$
for all $k \ge 0$. Then $S/I$ is naturally a graded ring whose homogeneous component of
degree $k$ is isomorphic to $S_k/I_k$.

Proof: The map

\[ S = \bigoplus_{k=0}^{\infty} S_k \to \bigoplus_{k=0}^{\infty} (S_k/I_k), \qquad (\dots, s_k, \dots) \mapsto (\dots, s_k \bmod I_k, \dots) \]

is surjective with kernel $I = \bigoplus_{k=0}^{\infty} I_k$ and defines an isomorphism of graded rings. The
details are left for the exercises.

Symmetric Algebras 

The first application of Proposition 33 is in the construction of a commutative quotient
ring of $\mathcal{T}(M)$ through which $R$-module homomorphisms from $M$ to any commutative
$R$-algebra must factor. This gives an "abelianized" version of Theorem 31. The
construction is analogous to forming the commutator quotient $G/G'$ of a group (cf. Section
5.4).

Definition. The symmetric algebra of an $R$-module $M$ is the $R$-algebra obtained by
taking the quotient of the tensor algebra $\mathcal{T}(M)$ by the ideal $\mathcal{C}(M)$ generated by all
elements of the form $m_1 \otimes m_2 - m_2 \otimes m_1$, for all $m_1, m_2 \in M$. The symmetric algebra
$\mathcal{T}(M)/\mathcal{C}(M)$ is denoted by $\mathcal{S}(M)$.

The tensor algebra $\mathcal{T}(M)$ is generated as a ring by $R = \mathcal{T}^0(M)$ and $M = \mathcal{T}^1(M)$,
and these elements commute in the quotient ring $\mathcal{S}(M)$ by definition. It follows that
the symmetric algebra $\mathcal{S}(M)$ is a commutative ring. The ideal $\mathcal{C}(M)$ is generated by
homogeneous tensors of degree 2 and it follows easily that $\mathcal{C}(M)$ is a graded ideal.
Then by Proposition 33 the symmetric algebra is a graded ring whose homogeneous
component of degree $k$ is $\mathcal{S}^k(M) = \mathcal{T}^k(M)/\mathcal{C}^k(M)$. Since $\mathcal{C}(M)$ consists of $k$-tensors
with $k \ge 2$, we have $\mathcal{C}(M) \cap M = 0$ and so the image of $M = \mathcal{T}^1(M)$ in $\mathcal{S}(M)$
is isomorphic to $M$. Identifying $M$ with its image we see that $\mathcal{S}^1(M) = M$ and the
symmetric algebra contains $M$. In a similar way $\mathcal{S}^0(M) = R$, so the symmetric algebra
is also an $R$-algebra. The $R$-module $\mathcal{S}^k(M)$ is called the $k^{\text{th}}$ symmetric power of $M$.

The first part of the next theorem shows that the elements of the $k^{\text{th}}$ symmetric
power of $M$ can be considered as finite sums of simple tensors $m_1 \otimes \cdots \otimes m_k$ where
tensors with the order of the factors permuted are identified. Recall also from Section
4 that a $k$-multilinear map $\varphi : M \times \cdots \times M \to N$ is said to be symmetric
if $\varphi(m_1, \dots, m_k) = \varphi(m_{\sigma(1)}, \dots, m_{\sigma(k)})$ for all permutations $\sigma$ of $1, 2, \dots, k$. (The
definition is the same for modules over any commutative ring $R$ as for vector spaces.)

Theorem 34. Let $M$ be an $R$-module over the commutative ring $R$ and let $\mathcal{S}(M)$ be its
symmetric algebra.

(1) The $k^{\text{th}}$ symmetric power, $\mathcal{S}^k(M)$, of $M$ is equal to $M \otimes \cdots \otimes M$ ($k$ factors)
modulo the submodule generated by all elements of the form

\[ (m_1 \otimes m_2 \otimes \cdots \otimes m_k) - (m_{\sigma(1)} \otimes m_{\sigma(2)} \otimes \cdots \otimes m_{\sigma(k)}) \]

for all $m_i \in M$ and all permutations $\sigma$ in the symmetric group $S_k$.

(2) (Universal Property for Symmetric Multilinear Maps) If $\varphi : M \times \cdots \times M \to N$
is a symmetric $k$-multilinear map over $R$ then there is a unique $R$-module
homomorphism $\Phi : \mathcal{S}^k(M) \to N$ such that $\varphi = \Phi \circ \iota$, where

\[ \iota : M \times \cdots \times M \to \mathcal{S}^k(M) \]

is the map defined by

\[ \iota(m_1, \dots, m_k) = m_1 \otimes \cdots \otimes m_k \bmod \mathcal{C}(M). \]

(3) (Universal Property for maps to commutative R-algebras) If $A$ is any commutative
$R$-algebra and $\varphi : M \to A$ is an $R$-module homomorphism, then there
is a unique $R$-algebra homomorphism $\Phi : \mathcal{S}(M) \to A$ such that $\Phi|_M = \varphi$.

Proof: The $k$-tensors $\mathcal{C}^k(M)$ in the ideal $\mathcal{C}(M)$ are finite sums of elements of the
form

\[ m_1 \otimes \cdots \otimes m_{i-1} \otimes (m_i \otimes m_{i+1} - m_{i+1} \otimes m_i) \otimes m_{i+2} \otimes \cdots \otimes m_k \]

with $m_1, \dots, m_k \in M$ (where $k \ge 2$ and $1 \le i < k$). This product gives a difference
of two $k$-tensors which are equal except that two entries (in positions $i$ and $i + 1$) have
been transposed, i.e., gives the element in (1) of the theorem corresponding to the
transposition $(i \;\, i{+}1)$ in the symmetric group $S_k$. Conversely, since any permutation $\sigma$ in $S_k$
can be written as a product of such transpositions it is easy to see that every element in
(1) can be written as a sum of elements of the form above. This gives (1).

The proofs of (2) and (3) are very similar to the proofs of the corresponding "asymmetric"
results (Corollary 16 of Section 10.4 and Theorem 31) noting that $\mathcal{C}^k(M)$ is
contained in the kernel of any symmetric map from $\mathcal{T}^k(M)$ to $N$ by part (1).



Corollary 35. Let $V$ be an $n$-dimensional vector space over the field $F$. Then $\mathcal{S}(V)$ is
isomorphic as a graded $F$-algebra to the ring of polynomials in $n$ variables over $F$ (i.e.,
the isomorphism is also a vector space isomorphism from $\mathcal{S}^k(V)$ onto the space of all
homogeneous polynomials of degree $k$). In particular, $\dim_F(\mathcal{S}^k(V)) = \binom{k+n-1}{n-1}$.

Proof: Let $\mathcal{B} = \{v_1, \dots, v_n\}$ be a basis of $V$. By Proposition 32 there is a bijection
between a basis of $\mathcal{T}^k(V)$ and the set $\mathcal{B}^k$ of ordered $k$-tuples of elements from $\mathcal{B}$. Define
two $k$-tuples in $\mathcal{B}^k$ to be equivalent if there is some permutation of the entries of one
that gives the other; this is easily seen to be an equivalence relation on $\mathcal{B}^k$. Let $S(\mathcal{B}^k)$
denote the corresponding set of equivalence classes. Any symmetric $k$-multilinear
function from $V^k$ to a vector space over $F$ will be constant on all of the basis tensors
whose corresponding $k$-tuples lie in the same equivalence class; conversely, any function
from $S(\mathcal{B}^k)$ can be uniquely extended to a symmetric $k$-multilinear function on $V^k$. It
follows that the vector space over $F$ with basis $S(\mathcal{B}^k)$ satisfies the universal property
of $\mathcal{S}^k(V)$ in Theorem 34(2), hence is isomorphic to $\mathcal{S}^k(V)$. Each equivalence class has
a unique representative of the form $(v_1^{a_1}, v_2^{a_2}, \dots, v_n^{a_n})$, where $v_i^{a_i}$ denotes the sequence
$v_i, v_i, \dots, v_i$ taken $a_i$ times, each $a_i \ge 0$, and $a_1 + \cdots + a_n = k$. Thus there is a bijection
between the basis $S(\mathcal{B}^k)$ and the set $x_1^{a_1} \cdots x_n^{a_n}$ of monic monomials of degree $k$ in the
polynomial ring $F[x_1, \dots, x_n]$. This bijection extends to an isomorphism of graded
$F$-algebras, proving the first part of the corollary. The computation of the dimension
of $\mathcal{S}^k(V)$ (i.e., the number of monic monomials of degree $k$) is left as an exercise.
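
The count in Corollary 35 can be checked by brute force for small $n$ and $k$. The Python sketch below enumerates the exponent vectors $(a_1, \dots, a_n)$ with $a_1 + \cdots + a_n = k$, one for each equivalence class in the proof, and compares their number with the binomial coefficient $\binom{k+n-1}{n-1}$. The function name `symmetric_basis` is an assumption for this example, and the check is not a substitute for the exercise.

```python
from itertools import combinations_with_replacement
from math import comb

def symmetric_basis(n, k):
    """Exponent vectors (a_1, ..., a_n) with a_1 + ... + a_n = k.

    Each unordered choice of k indices from 0..n-1, repetition allowed, is one
    equivalence class of k-tuples of basis vectors, i.e. one monic monomial
    x_1^{a_1} ... x_n^{a_n} of degree k.
    """
    exponents = []
    for choice in combinations_with_replacement(range(n), k):
        exponents.append(tuple(choice.count(i) for i in range(n)))
    return exponents

# degree-2 monomials in 3 variables: x1^2, x1x2, x1x3, x2^2, x2x3, x3^2
quadratics = symmetric_basis(3, 2)
counts_agree = all(len(symmetric_basis(n, k)) == comb(k + n - 1, n - 1)
                   for n in range(1, 5) for k in range(0, 5))
```

For $n = 3$ and $k = 2$ this lists six exponent vectors, compared with the $3^2 = 9$ basis tensors of $\mathcal{T}^2(V)$.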

Exterior Algebras 

Recall from Section 4 that a multilinear map $\varphi : M \times \cdots \times M \to N$ is called alternating
if $\varphi(m_1, \dots, m_k) = 0$ whenever $m_i = m_{i+1}$ for some $i$. (The definition is the same for
any $R$-module as for vector spaces.) We saw that the determinant map was alternating,
and was uniquely determined by some additional constraints. We can apply Proposition
33 to construct an algebra through which alternating multilinear maps must factor in a
manner similar to the construction of the symmetric algebra (through which symmetric
multilinear maps factor).

Definition. The exterior algebra of an $R$-module $M$ is the $R$-algebra obtained by taking the quotient of the tensor algebra $\mathcal{T}(M)$ by the ideal $\mathcal{A}(M)$ generated by all elements of the form $m \otimes m$, for $m \in M$. The exterior algebra $\mathcal{T}(M)/\mathcal{A}(M)$ is denoted by $\bigwedge(M)$ and the image of $m_1 \otimes m_2 \otimes \cdots \otimes m_k$ in $\bigwedge(M)$ is denoted by $m_1 \wedge m_2 \wedge \cdots \wedge m_k$.

As with the symmetric algebra, the ideal $\mathcal{A}(M)$ is generated by homogeneous elements hence is a graded ideal. By Proposition 33 the exterior algebra is graded, with $k$th homogeneous component $\bigwedge^k(M) = \mathcal{T}^k(M)/\mathcal{A}^k(M)$. We can again identify $R$ with $\bigwedge^0(M)$ and $M$ with $\bigwedge^1(M)$ and so consider $M$ as an $R$-submodule of the $R$-algebra $\bigwedge(M)$. The $R$-module $\bigwedge^k(M)$ is called the $k$th exterior power of $M$.

The multiplication

$$(m_1 \wedge \cdots \wedge m_i) \wedge (m_1' \wedge \cdots \wedge m_j') = m_1 \wedge \cdots \wedge m_i \wedge m_1' \wedge \cdots \wedge m_j'$$

in the exterior algebra is called the wedge (or exterior) product. By definition of the quotient, this multiplication is alternating in the sense that the product $m_1 \wedge \cdots \wedge m_k$ is 0 in $\bigwedge(M)$ if $m_i = m_{i+1}$ for any $1 \leq i < k$. Then

$$\begin{aligned}
0 &= (m + m') \wedge (m + m') \\
  &= (m \wedge m) + (m \wedge m') + (m' \wedge m) + (m' \wedge m') \\
  &= (m \wedge m') + (m' \wedge m)
\end{aligned}$$

shows that the multiplication is also anticommutative on simple tensors:

$$m \wedge m' = -m' \wedge m \quad \text{for all } m, m' \in M.$$

This anticommutativity does not extend to arbitrary products, however, i.e., we need not have $ab = -ba$ for all $a, b \in \bigwedge(M)$ (cf. Exercise 4).

Theorem 36. Let $M$ be an $R$-module over the commutative ring $R$ and let $\bigwedge(M)$ be its exterior algebra.

(1) The $k$th exterior power, $\bigwedge^k(M)$, of $M$ is equal to $M \otimes \cdots \otimes M$ ($k$ factors) modulo the submodule generated by all elements of the form

$$m_1 \otimes m_2 \otimes \cdots \otimes m_k \quad \text{where } m_i = m_j \text{ for some } i \neq j.$$

In particular,

$$m_1 \wedge m_2 \wedge \cdots \wedge m_k = 0 \quad \text{if } m_i = m_j \text{ for some } i \neq j.$$

(2) (Universal Property for Alternating Multilinear Maps) If $\varphi : M \times \cdots \times M \to N$ is an alternating $k$-multilinear map then there is a unique $R$-module homomorphism $\Phi : \bigwedge^k(M) \to N$ such that $\varphi = \Phi \circ \iota$, where

$$\iota : M \times \cdots \times M \to \textstyle\bigwedge^k(M)$$

is the map defined by

$$\iota(m_1, \ldots, m_k) = m_1 \wedge \cdots \wedge m_k.$$


Remark: The exterior algebra also satisfies a universal property similar to (3) of Theorem 34, namely with respect to $R$-module homomorphisms from $M$ to $R$-algebras $A$ satisfying $a^2 = 0$ for all $a \in A$ (cf. Exercise 6).


Proof: The $k$-tensors $\mathcal{A}^k(M)$ in the ideal $\mathcal{A}(M)$ are finite sums of elements of the form

$$m_1 \otimes \cdots \otimes m_{i-1} \otimes (m \otimes m) \otimes m_{i+2} \otimes \cdots \otimes m_k$$

with $m_1, \ldots, m_k, m \in M$ (where $k \geq 2$ and $1 \leq i < k$), which is a $k$-tensor with two equal entries (in positions $i$ and $i+1$), so is of the form in (1). For the reverse inclusion, note that since

$$\begin{aligned}
m' \otimes m &= -m \otimes m' + [(m + m') \otimes (m + m') - m \otimes m - m' \otimes m'] \\
             &\equiv -m \otimes m' \mod \mathcal{A}(M),
\end{aligned}$$

interchanging any two consecutive entries and multiplying by $-1$ in a simple $k$-tensor gives an equivalent tensor modulo $\mathcal{A}^k(M)$. Using such a sequence of interchanges and sign changes we can arrange for the equal entries $m_i$ and $m_j$ of a simple tensor as in (1) to be adjacent, which gives an element of $\mathcal{A}^k(M)$. It follows that the generators in (1) are contained in $\mathcal{A}^k(M)$, which proves the first part of the theorem.

As in Theorem 34, the proof of (2) follows easily from the corresponding result for the tensor algebra in Theorem 31 since $\mathcal{A}^k(M)$ is contained in the kernel of any alternating map from $\mathcal{T}^k(M)$ to $N$.

Examples 

(1) Suppose $V$ is a one-dimensional vector space over $F$ with basis element $v$. Then $\bigwedge^k(V)$ consists of finite sums of elements of the form $\alpha_1 v \wedge \alpha_2 v \wedge \cdots \wedge \alpha_k v$, i.e., $\alpha_1 \alpha_2 \cdots \alpha_k (v \wedge v \wedge \cdots \wedge v)$ for $\alpha_1, \ldots, \alpha_k \in F$. Since $v \wedge v = 0$, it follows that $\bigwedge^0(V) = F$, $\bigwedge^1(V) = V$, and $\bigwedge^i(V) = 0$ for $i \geq 2$, so as a graded $F$-algebra we have

$$\textstyle\bigwedge(V) = F \oplus V \oplus 0 \oplus 0 \oplus \cdots.$$

(2) Suppose now that $V$ is a two-dimensional vector space over $F$ with basis $v, v'$. Here $\bigwedge^k(V)$ consists of finite sums of elements of the form $(\alpha_1 v + \alpha_1' v') \wedge \cdots \wedge (\alpha_k v + \alpha_k' v')$. Such an element is a sum of elements that are simple wedge products involving only $v$ and $v'$. For example, an element in $\bigwedge^2(V)$ is a sum of elements of the form

$$\begin{aligned}
(av + bv') \wedge (cv + dv') &= ac(v \wedge v) + ad(v \wedge v') + bc(v' \wedge v) + bd(v' \wedge v') \\
                             &= (ad - bc)\, v \wedge v'.
\end{aligned}$$

It follows that $\bigwedge^i(V) = 0$ for $i \geq 3$ since then at least one of $v, v'$ appears twice in such simple products.

We can see directly from $\bigwedge^2(V) = \mathcal{T}^2(V)/\mathcal{A}^2(V)$ that $v \wedge v' \neq 0$, as follows. The vector space $\mathcal{T}^2(V)$ is 4-dimensional with $v \otimes v$, $v \otimes v'$, $v' \otimes v$, $v' \otimes v'$ as basis (Proposition 16). The elements $v \otimes v$, $v \otimes v' + v' \otimes v$, $v' \otimes v'$ and $v \otimes v'$ are therefore also a basis for $\mathcal{T}^2(V)$. The subspace $\mathcal{A}^2(V)$ consists of all the 2-tensors in the ideal generated by the tensors

$$(av + bv') \otimes (av + bv') = a^2(v \otimes v) + ab(v \otimes v' + v' \otimes v) + b^2(v' \otimes v'),$$

from which it is clear that $\mathcal{A}^2(V)$ is contained in the 3-dimensional subspace having $v \otimes v$, $v \otimes v' + v' \otimes v$, and $v' \otimes v'$ as basis. In particular, the basis element $v \otimes v'$ of $\mathcal{T}^2(V)$ is not contained in $\mathcal{A}^2(V)$, i.e., $v \wedge v' \neq 0$ in $\bigwedge^2(V)$.

It follows that $\bigwedge^0(V) = F$, $\bigwedge^1(V) = V$, $\bigwedge^2(V) = F(v \wedge v')$, and $\bigwedge^i(V) = 0$ for $i \geq 3$, so as a graded $F$-algebra we have

$$\textstyle\bigwedge(V) = F \oplus V \oplus F(v \wedge v') \oplus 0 \oplus \cdots.$$

As the previous examples illustrate, unlike the tensor and symmetric algebras, for finite dimensional vector spaces the exterior algebra is finite dimensional:

Corollary 37. Let $V$ be a finite dimensional vector space over the field $F$ with basis $B = \{v_1, \ldots, v_n\}$. Then the vectors

$$v_{i_1} \wedge v_{i_2} \wedge \cdots \wedge v_{i_k} \quad \text{for } 1 \leq i_1 < i_2 < \cdots < i_k \leq n$$

are a basis of $\bigwedge^k(V)$, and $\bigwedge^k(V) = 0$ when $k > n$ (when $k = 0$ the basis vector is the element $1 \in F$). In particular, $\dim_F(\bigwedge^k(V)) = \binom{n}{k}$.

Proof: As the proof of Theorem 36 shows, modulo $\mathcal{A}^k(M)$, the order of the terms in any simple $k$-tensor can be rearranged up to introducing a sign change. It follows that the $k$-tensors in the corollary (which have been arranged with increasing subscripts on the $v_i$ and with no repeated entries) are generators for $\bigwedge^k(V)$. To show these vectors are linearly independent it suffices to exhibit an alternating $k$-multilinear function from $V^k$ to $F$ which is 1 on a given $v_{i_1} \wedge v_{i_2} \wedge \cdots \wedge v_{i_k}$ and zero on all other generators. Such a function $f$ is defined on the basis of $\mathcal{T}^k(V)$ in Proposition 32 by $f(v_{j_1} \otimes v_{j_2} \otimes \cdots \otimes v_{j_k}) = \epsilon(\sigma)$ if $\sigma$ is the unique permutation of $(j_1, j_2, \ldots, j_k)$ into $(i_1, i_2, \ldots, i_k)$, and $f$ is zero on every basis tensor whose $k$-tuple of indices cannot be permuted to $(i_1, i_2, \ldots, i_k)$ (where $\epsilon(\sigma)$ is the sign of $\sigma$). Note that $f$ is zero on any basis tensor with repeated entries. The value $\epsilon(\sigma)$ ensures that when $f$ is extended to all elements of $\mathcal{T}^k(V)$ it gives an alternating map, i.e., $f$ factors through $\mathcal{A}^k(V)$. Hence $f$ is the desired function. The computation of the dimension of $\bigwedge^k(V)$ (i.e., of the number of increasing sequences of $k$-tuples of indices) is left to the exercises.
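
The basis described in Corollary 37 lends itself to a small computational model. The sketch below is an illustration only: it assumes elements of the exterior algebra are stored as dictionaries from increasing index tuples to integer coefficients, and the helper names `wedge_basis`, `sort_with_sign` and `wedge` are hypothetical. Sorting by adjacent interchanges mirrors the sign changes used in the proof of Theorem 36.

```python
from itertools import combinations
from math import comb

def wedge_basis(n, k):
    """Increasing index tuples i_1 < ... < i_k labelling a basis of the k-th exterior power."""
    return list(combinations(range(1, n + 1), k))

def sort_with_sign(indices):
    """Sort indices by adjacent swaps; return (sign, sorted tuple).

    The sign is 0 when an index repeats, since such a wedge product vanishes.
    """
    idx = list(indices)
    if len(set(idx)) < len(idx):
        return 0, tuple(sorted(idx))
    sign = 1
    # Bubble sort: each adjacent interchange contributes a factor of -1.
    for i in range(len(idx)):
        for j in range(len(idx) - 1 - i):
            if idx[j] > idx[j + 1]:
                idx[j], idx[j + 1] = idx[j + 1], idx[j]
                sign = -sign
    return sign, tuple(idx)

def wedge(u, w):
    """Wedge product of elements stored as {increasing index tuple: coefficient}."""
    out = {}
    for a, ca in u.items():
        for b, cb in w.items():
            sign, key = sort_with_sign(a + b)
            if sign:
                out[key] = out.get(key, 0) + sign * ca * cb
    return {key: c for key, c in out.items() if c != 0}

# Dimensions of the exterior powers of a 4-dimensional space, k = 0, ..., 5.
print([len(wedge_basis(4, k)) for k in range(6)])  # [1, 4, 6, 4, 1, 0]

# (a v1 + b v2) ^ (c v1 + d v2) = (ad - bc) v1 ^ v2 with (a, b, c, d) = (2, 3, 5, 7).
print(wedge({(1,): 2, (2,): 3}, {(1,): 5, (2,): 7}))  # {(1, 2): -1}
```

The second line of output agrees with the formula $(ad - bc)\, v \wedge v'$ from the two-dimensional example, here with $ad - bc = 14 - 15 = -1$.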

The results in Corollary 37 are true for any free $R$-module of rank $n$. In particular if $M \cong R^n$ with $R$-module basis $m_1, \ldots, m_n$ then

$$\textstyle\bigwedge^n(M) = R(m_1 \wedge \cdots \wedge m_n)$$

is a free (rank 1) $R$-module with generator $m_1 \wedge \cdots \wedge m_n$ and

$$\textstyle\bigwedge^{n+1}(M) = \bigwedge^{n+2}(M) = \cdots = 0.$$


Example 

Let $R$ be the polynomial ring $\mathbb{Z}[x, y]$ in the variables $x$ and $y$. If $M = R$, then $\bigwedge^2(M) = 0$ so, for example, there are no nontrivial alternating bilinear maps on $R \times R$ by the universal property of $\bigwedge^2(R)$ with respect to such maps (Theorem 36).

Suppose now that $M = I$ is the ideal $(x, y)$ generated by $x$ and $y$ in $R$. Then $\bigwedge^2(I) \neq 0$. Perhaps the easiest way to see this is to construct a nontrivial alternating bilinear map on $I \times I$. The map

$$\varphi(ax + by, cx + dy) = (ad - bc) \mod (x, y)$$

is a well defined alternating $R$-bilinear map from $I \times I$ to $\mathbb{Z} = R/I$ (cf. Exercise 7). Since $\varphi(x, y) = 1$, it follows that $x \wedge y \in \bigwedge^2(I)$ is nonzero. Unlike the situation of free modules as in the examples following Theorem 36 (where arguments involving bases could be used), in this case it is not at all a trivial matter to give a direct verification that $x \wedge y \neq 0$ in $\bigwedge^2(I)$.

Remark: The ideal $I$ is an example of a rank 1 (but not free) $R$-module (the rank of a module over an integral domain is defined in Section 12.1), and this example shows that the results of Corollary 37 are not true in general if the $R$-module is not free over $R$.

Homomorphisms of Tensor Algebras

If $\varphi : M \to N$ is any $R$-module homomorphism, then there is an induced map on the $k$th tensor power

$$\mathcal{T}^k(\varphi) : m_1 \otimes m_2 \otimes \cdots \otimes m_k \mapsto \varphi(m_1) \otimes \varphi(m_2) \otimes \cdots \otimes \varphi(m_k).$$

It follows directly that this map sends the generators of each of the homogeneous components of the ideals $\mathcal{C}(M)$ and $\mathcal{A}(M)$ to generators of the corresponding homogeneous components of $\mathcal{C}(N)$ and $\mathcal{A}(N)$. Thus $\varphi$ induces $R$-module homomorphisms on the quotients:

$$\mathcal{S}^k(\varphi) : \mathcal{S}^k(M) \to \mathcal{S}^k(N) \quad \text{and} \quad \textstyle\bigwedge^k(\varphi) : \bigwedge^k(M) \to \bigwedge^k(N).$$

Moreover, each of these three maps is a ring homomorphism (hence they are graded $R$-algebra homomorphisms).

Of particular interest is the case when $M = V$ is an $n$-dimensional vector space over the field $F$ and $\varphi : V \to V$ is an endomorphism. In this case by Corollary 37, $\bigwedge^n(\varphi)$ maps the 1-dimensional space $\bigwedge^n(V)$ to itself. Let $v_1, \ldots, v_n$ be a basis of $V$, so that $v_1 \wedge \cdots \wedge v_n$ is a basis of $\bigwedge^n(V)$. Then

$$\textstyle\bigwedge^n(\varphi)(v_1 \wedge \cdots \wedge v_n) = \varphi(v_1) \wedge \cdots \wedge \varphi(v_n) = D(\varphi)\, v_1 \wedge \cdots \wedge v_n$$

for some scalar $D(\varphi) \in F$.

For any $n \times n$ matrix $A$ over $F$ we can define the associated endomorphism $\varphi$ (with respect to the given basis $v_1, \ldots, v_n$), which gives a map $D : M_{n \times n}(F) \to F$ where $D(A) = D(\varphi)$. It is easy to check that this map $D$ satisfies the three axioms for a determinant function in Section 4. Then the uniqueness statement of Theorem 24 gives:

Proposition 38. If $\varphi$ is an endomorphism on an $n$-dimensional vector space $V$, then $\bigwedge^n(\varphi)(w) = \det(\varphi)\, w$ for all $w \in \bigwedge^n(V)$.

Note that Proposition 38 characterizes the determinant of the endomorphism $\varphi$ as a certain naturally induced linear map on $\bigwedge^n(V)$. The fact that the determinant arises naturally when considering alternating multilinear maps also explains the source of the map $\varphi$ in the example above.
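
Proposition 38 can be tried out numerically by expanding $\varphi(v_1) \wedge \cdots \wedge \varphi(v_n)$ one factor at a time and reading off the coefficient of $v_1 \wedge \cdots \wedge v_n$. The following is a minimal sketch under stated assumptions: forms are stored as dictionaries from increasing 0-based index tuples to coefficients, the names `wedge_vector` and `det_via_top_wedge` are hypothetical, and the 3 x 3 integer matrix is an arbitrary illustrative choice.

```python
def wedge_vector(form, vec):
    """Wedge a form {increasing index tuple: coeff} on the right with a vector of coordinates."""
    out = {}
    for idx, c in form.items():
        for j, x in enumerate(vec):
            if x == 0 or j in idx:
                continue  # v_j ^ v_j = 0, so repeated indices contribute nothing
            # Moving v_j left past each larger index costs one sign change.
            swaps = sum(1 for i in idx if i > j)
            key = tuple(sorted(idx + (j,)))
            out[key] = out.get(key, 0) + (-1) ** swaps * c * x
    return out

def det_via_top_wedge(columns):
    """Coefficient D with phi(v_1) ^ ... ^ phi(v_n) = D * (v_1 ^ ... ^ v_n)."""
    n = len(columns)
    form = {(): 1}  # the element 1 in the degree-0 component
    for col in columns:
        form = wedge_vector(form, col)
    return form.get(tuple(range(n)), 0)

# Each inner list holds the coordinates of phi(v_j) in the basis v_1, v_2, v_3,
# i.e. the j-th column of the matrix [[9, 4, 5], [-4, 0, -3], [-6, -4, -2]].
A_columns = [[9, -4, -6], [4, 0, -4], [5, -3, -2]]
print(det_via_top_wedge(A_columns))  # 12
```

The value 12 agrees with the cofactor expansion $9(0 - 12) - 4(8 - 18) + 5(16 - 0)$ of the same matrix.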

As with the tensor product, the maps $\mathcal{S}^k(\varphi)$ and $\bigwedge^k(\varphi)$ induced from an injective map from $M$ to $N$ need not remain injective (so $\bigwedge(M)$ need not be a submodule of $\bigwedge(N)$ when $M$ is a submodule of $N$, for example).

Example 

The inclusion $\varphi : I \hookrightarrow R$ of the ideal $(x, y)$ into the ring $R = \mathbb{Z}[x, y]$, both considered as $R$-modules, induces a map

$$\textstyle\bigwedge^2(\varphi) : \bigwedge^2(I) \to \bigwedge^2(R).$$

Since $\bigwedge^2(R) = 0$ and $\bigwedge^2(I) \neq 0$, the map cannot be injective.

One can show that if $M$ is an $R$-module direct summand of $N$, then $\mathcal{T}(M)$ (respectively, $\mathcal{S}(M)$ and $\bigwedge(M)$) is an $R$-subalgebra of $\mathcal{T}(N)$ (respectively, $\mathcal{S}(N)$ and $\bigwedge(N)$) (cf. the exercises). When $R = F$ is a field then every subspace $M$ of $N$ is a direct summand of $N$ and so the corresponding algebra for $M$ is a subalgebra of the algebra for $N$.

Symmetric and Alternating Tensors 

The symmetric and exterior algebras can in some instances also be defined in terms of symmetric and alternating tensors (defined below), which identify these algebras as subalgebras of the tensor algebra rather than as quotient algebras.

For any $R$-module $M$ there is a natural left group action of the symmetric group $S_k$ on $M \times M \times \cdots \times M$ ($k$ factors) given by permuting the factors:

$$\sigma(m_1, m_2, \ldots, m_k) = (m_{\sigma^{-1}(1)}, m_{\sigma^{-1}(2)}, \ldots, m_{\sigma^{-1}(k)}) \quad \text{for each } \sigma \in S_k$$

(the reason for $\sigma^{-1}$ is to make this a left group action, cf. Exercise 8 of Section 5.1). This map is clearly $R$-multilinear, so there is a well defined $R$-linear left group action of $S_k$ on $\mathcal{T}^k(M)$ which is defined on simple tensors by

$$\sigma(m_1 \otimes m_2 \otimes \cdots \otimes m_k) = m_{\sigma^{-1}(1)} \otimes m_{\sigma^{-1}(2)} \otimes \cdots \otimes m_{\sigma^{-1}(k)} \quad \text{for each } \sigma \in S_k.$$


Definition. 

(1) An element $z \in \mathcal{T}^k(M)$ is called a symmetric $k$-tensor if $\sigma z = z$ for all $\sigma$ in the symmetric group $S_k$.

(2) An element $z \in \mathcal{T}^k(M)$ is called an alternating $k$-tensor if $\sigma z = \epsilon(\sigma) z$ for all $\sigma$ in the symmetric group $S_k$, where $\epsilon(\sigma)$ is the sign, $\pm 1$, of the permutation $\sigma$.

It is immediate from the definition that the collection of symmetric (respectively, alternating) $k$-tensors is an $R$-submodule of the module of all tensors.

Example

The elements $m \otimes m$ and $m_1 \otimes m_2 + m_2 \otimes m_1$ are symmetric 2-tensors. The element $m_1 \otimes m_2 - m_2 \otimes m_1$ is an alternating 2-tensor.

It is also clear from the definition that both $\mathcal{C}^k(M)$ and $\mathcal{A}^k(M)$ are stable under the action of $S_k$, hence there is an induced action on the quotients $\mathcal{S}^k(M)$ and $\bigwedge^k(M)$.

Proposition 39. Let $\sigma$ be an element in the symmetric group $S_k$ and let $\epsilon(\sigma)$ be the sign of the permutation $\sigma$. Then

(1) for every $w \in \mathcal{S}^k(M)$ we have $\sigma w = w$, and

(2) for every $w \in \bigwedge^k(M)$ we have $\sigma w = \epsilon(\sigma) w$.

Proof: The first statement is immediate from (1) in Theorem 34. We showed in the course of the proof of Theorem 36 that

$$m_1 \wedge \cdots \wedge m_i \wedge m_{i+1} \wedge \cdots \wedge m_k = -m_1 \wedge \cdots \wedge m_{i+1} \wedge m_i \wedge \cdots \wedge m_k,$$


Sec. 1 1 .5 Tensor Algebras, Symmetric and Exterior Algebras 



which shows that the formula in (2) is valid on simple products for the transposition $\sigma = (i \;\; i+1)$. Since these transpositions generate $S_k$ and $\epsilon$ is a group homomorphism it follows that (2) is valid for any $\sigma \in S_k$ on simple products $w$. Since both sides are $R$-linear in $w$, it follows that (2) holds for all $w \in \bigwedge^k(M)$.

By Proposition 39, the symmetric group $S_k$ acts trivially on both the submodule of symmetric $k$-tensors and the quotient module $\mathcal{S}^k(M)$, the $k$th symmetric power of $M$. Similarly, $S_k$ acts the same way on the submodule of alternating $k$-tensors as on $\bigwedge^k(M)$, the $k$th exterior power of $M$. We now show that when $k!$ is a unit in $R$ these respective submodules and quotient modules are isomorphic (where $k!$ is the sum of the 1 of $R$ with itself $k!$ times).

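For any $k$-tensor $z \in \mathcal{T}^k(M)$ define

$$\mathrm{Sym}(z) = \sum_{\sigma \in S_k} \sigma z$$

$$\mathrm{Alt}(z) = \sum_{\sigma \in S_k} \epsilon(\sigma)\, \sigma z.$$

For any $k$-tensor $z$, the $k$-tensor $\mathrm{Sym}(z)$ is symmetric and the $k$-tensor $\mathrm{Alt}(z)$ is alternating. For example, for any $\tau \in S_k$

$$\begin{aligned}
\tau\, \mathrm{Alt}(z) &= \sum_{\sigma \in S_k} \epsilon(\sigma)\, \tau\sigma z \\
  &= \sum_{\sigma' \in S_k} \epsilon(\tau^{-1}\sigma')\, \sigma' z \quad (\text{letting } \sigma' = \tau\sigma) \\
  &= \epsilon(\tau^{-1}) \sum_{\sigma' \in S_k} \epsilon(\sigma')\, \sigma' z = \epsilon(\tau)\, \mathrm{Alt}(z).
\end{aligned}$$

The tensor $\mathrm{Sym}(z)$ is sometimes called the symmetrization of $z$ and $\mathrm{Alt}(z)$ the skew-symmetrization of $z$.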
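
The maps Sym and Alt and the left action of $S_k$ can be modelled directly. The sketch below is illustrative only and rests on assumptions not in the text: tensors over a free module are stored as dictionaries from tuples of basis labels to integer coefficients, permutations are 0-based tuples `p` with `p[i]` the image of `i`, and all function names are hypothetical.

```python
from itertools import permutations

def sign(perm):
    """Sign of a permutation given as a tuple p with p[i] the image of i (0-based)."""
    s, seen = 1, set()
    for start in range(len(perm)):
        if start in seen:
            continue
        length, j = 0, start
        while j not in seen:
            seen.add(j)
            j = perm[j]
            length += 1
        if length % 2 == 0:
            s = -s  # a cycle of even length is an odd permutation
    return s

def act(perm, tensor):
    """Left action: sigma(m_1 x ... x m_k) has m_{sigma^{-1}(i)} in position i."""
    out = {}
    for idx, c in tensor.items():
        new = [None] * len(idx)
        for i, m in enumerate(idx):
            new[perm[i]] = m  # the entry in position i moves to position sigma(i)
        out[tuple(new)] = out.get(tuple(new), 0) + c
    return out

def signed_sum(tensor, k, weight):
    """Sum of weight(sigma) * sigma(tensor) over all sigma in S_k."""
    acc = {}
    for p in permutations(range(k)):
        for key, c in act(p, tensor).items():
            acc[key] = acc.get(key, 0) + weight(p) * c
    return {key: c for key, c in acc.items() if c != 0}

def sym(tensor, k):
    return signed_sum(tensor, k, lambda p: 1)

def alt(tensor, k):
    return signed_sum(tensor, k, sign)

z = {(1, 2): 1}  # the simple tensor v_1 (x) v_2
print(sym(z, 2))  # {(1, 2): 1, (2, 1): 1}
print(alt(z, 2))  # {(1, 2): 1, (2, 1): -1}
```

Applying `act` with the transposition `(1, 0)` to `alt(z, 2)` negates it, which is the case $\tau = (1\;2)$ of the identity $\tau\,\mathrm{Alt}(z) = \epsilon(\tau)\,\mathrm{Alt}(z)$.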

If $z$ is already a symmetric (respectively, alternating) tensor then $\mathrm{Sym}(z)$ (respectively, $\mathrm{Alt}(z)$) is just $k!\,z$. It follows that Sym (respectively, Alt) is an $R$-module endomorphism of $\mathcal{T}^k(M)$ whose image lies in the submodule of symmetric (respectively, alternating) tensors. In general these maps are not surjective, but if $k!$ is a unit in $R$ then

$$\frac{1}{k!}\mathrm{Sym}(z) = z \quad \text{for any symmetric tensor } z, \text{ and}$$

$$\frac{1}{k!}\mathrm{Alt}(z) = z \quad \text{for any alternating tensor } z$$

so that in this case the maps $(1/k!)\mathrm{Sym}$ and $(1/k!)\mathrm{Alt}$ give surjective $R$-module homomorphisms from $\mathcal{T}^k(M)$ to the submodule of symmetric (respectively, alternating) tensors.

Proposition 40. Suppose $k!$ is a unit in the ring $R$ and $M$ is an $R$-module. Then

(1) The map $(1/k!)\mathrm{Sym}$ induces an $R$-module isomorphism between the $k$th symmetric power of $M$ and the $R$-submodule of symmetric $k$-tensors:

$$\frac{1}{k!}\mathrm{Sym} : \mathcal{S}^k(M) \cong \{\text{symmetric } k\text{-tensors}\}.$$

(2) The map $(1/k!)\mathrm{Alt}$ induces an $R$-module isomorphism between the $k$th exterior power of $M$ and the $R$-submodule of alternating $k$-tensors:

$$\frac{1}{k!}\mathrm{Alt} : \textstyle\bigwedge^k(M) \cong \{\text{alternating } k\text{-tensors}\}.$$


Proof: We have seen that the respective maps are surjective $R$-homomorphisms from $\mathcal{T}^k(M)$ so to prove the proposition it suffices to check that their kernels are $\mathcal{C}^k(M)$ and $\mathcal{A}^k(M)$, respectively. We show the first and leave the second to the exercises. It is clear that Sym is 0 on any difference of two $k$-tensors which differ only in the order of their factors, so $\mathcal{C}^k(M)$ is contained in the kernel of $(1/k!)\mathrm{Sym}$ by (1) of Theorem 34. For the reverse inclusion, observe that

$$z - \frac{1}{k!}\mathrm{Sym}(z) = \frac{1}{k!}\sum_{\sigma \in S_k} (z - \sigma z)$$

for any $k$-tensor $z$. If $z$ is in the kernel of Sym then the left hand side of this equality is just $z$; and since $z - \sigma z \in \mathcal{C}^k(M)$ for every $\sigma \in S_k$ (again by (1) of Theorem 34), it follows that $z \in \mathcal{C}^k(M)$, completing the proof.


The maps $(1/k!)\mathrm{Sym}$ and $(1/k!)\mathrm{Alt}$ are projections (cf. Exercise 11 in Section 2) onto the submodules of symmetric and antisymmetric tensors, respectively. Equivalently, if $k!$ is a unit in $R$, we have $R$-module direct sums

$$\mathcal{T}^k(M) = \ker(\pi) \oplus \mathrm{image}(\pi)$$

for $\pi = (1/k!)\mathrm{Sym}$ or $\pi = (1/k!)\mathrm{Alt}$. In the former case the kernel consists of $\mathcal{C}^k(M)$ and the image is the collection of symmetric tensors (in which case $\mathcal{C}^k(M)$ is said to form an $R$-module complement to the symmetric tensors). In the latter case the kernel is $\mathcal{A}^k(M)$ and the image consists of the alternating tensors.

The $R$-linear left group action of $S_k$ on $\mathcal{T}^k(M)$ makes $\mathcal{T}^k(M)$ into a module over the group ring $RS_k$ (analogous to the formation of $F[x]$-modules described in Section 10.1). In terms of this module structure these projections give $RS_k$-submodule complements to the $RS_k$-submodules $\mathcal{C}^k(M)$ and $\mathcal{A}^k(M)$. The "averaging" technique used to construct these maps can be used to prove a very general result (Maschke's Theorem in Section 18.1) related to actions of finite groups on vector spaces (which is the subject of the "representation theory" of finite groups in Part VI).

If $k!$ is not invertible in $R$ then in general we do not have such $S_k$-invariant direct sum decompositions so it is not in general possible to identify, for example, the $k$th exterior power of $M$ with the alternating $k$-tensors of $M$.

Note also that when $k!$ is invertible it is possible to define the $k$th exterior power of $M$ as the collection of alternating $k$-tensors (this equivalent approach is sometimes found in the literature when the theory is developed over fields such as $\mathbb{R}$ and $\mathbb{C}$). In this case the multiplication of two alternating tensors $z$ and $w$ is defined by first taking the product $zw = z \otimes w$ in $\mathcal{T}(M)$ and then projecting the resulting tensor into the submodule of alternating tensors. Note that the simple product of two alternating tensors need not be alternating (for example, the square of an alternating tensor is a symmetric tensor).

Example 

Let $V$ be a vector space over a field $F$ in which $k! \neq 0$. There are many vector space complements to $\mathcal{A}^k(V)$ in $\mathcal{T}^k(V)$ (just extend a basis for the subspace $\mathcal{A}^k(V)$ to a basis for $\mathcal{T}^k(V)$, for example). These complements depend on choices of bases for $\mathcal{T}^k(V)$ and so are indistinguishable from each other from vector space considerations alone. The additional structure on $\mathcal{T}^k(V)$ given by the action of $S_k$ singles out a unique complement to $\mathcal{A}^k(V)$, namely the subspace of alternating tensors in Proposition 40.

Suppose that $k! \neq 0$ in $F$ for all $k \geq 2$ (i.e., the field $F$ has "characteristic 0," cf. Exercise 26 in Section 7.3), for example, $F = \mathbb{Q}$. Then the full exterior algebra $\bigwedge(V) = \bigoplus_{k \geq 0} \bigwedge^k(V)$ can be identified with the collection of tensors whose homogeneous components are alternating (with respect to the appropriate symmetric groups $S_k$).

Multiplication in $\bigwedge(V)$ in terms of alternating tensors is rather cumbersome, however. For example let $v_1, v_2, v_3$ be distinct basis vectors in $V$. The product of the two alternating tensors $z = v_1$ and $w = v_2 \otimes v_3 - v_3 \otimes v_2$ is obtained by first computing

$$z \otimes w = v_1 \otimes v_2 \otimes v_3 - v_1 \otimes v_3 \otimes v_2$$

in the full tensor algebra. This 3-tensor is not alternating; for example,

$$(1\;2)(z \otimes w) = v_2 \otimes v_1 \otimes v_3 - v_3 \otimes v_1 \otimes v_2 \neq -z \otimes w$$

and also $(1\;2\;3)(z \otimes w) = v_3 \otimes v_1 \otimes v_2 - v_2 \otimes v_1 \otimes v_3 \neq z \otimes w$. The multiplication requires that we project this tensor into the subspace of alternating tensors. This projection is given by $(1/3!)\mathrm{Alt}(z \otimes w)$ and an easy computation shows that

$$\begin{aligned}
\frac{1}{6}\mathrm{Alt}(z \otimes w) = \frac{1}{3}[\, & v_1 \otimes v_2 \otimes v_3 + v_2 \otimes v_3 \otimes v_1 + v_3 \otimes v_1 \otimes v_2 \\
 & - v_1 \otimes v_3 \otimes v_2 - v_2 \otimes v_1 \otimes v_3 - v_3 \otimes v_2 \otimes v_1 \,],
\end{aligned}$$

so the right hand side is the product of $z$ and $w$ in terms of alternating tensors. The same product in terms of the quotient algebra $\bigwedge(V)$ is simply

$$v_1 \wedge (2 v_2 \wedge v_3) = 2 v_1 \wedge v_2 \wedge v_3.$$
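
The "easy computation" in this example can be reproduced mechanically. The following sketch is illustrative, with assumed conventions that are not part of the text: a 3-tensor is a dictionary from index triples to coefficients (so `(1, 2, 3)` stands for $v_1 \otimes v_2 \otimes v_3$), permutations are 0-based tuples, and the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

def perm_sign(p):
    """Sign of a permutation p of range(len(p)), computed by counting inversions."""
    inversions = sum(1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1

def alt(tensor):
    """Alt(z) = sum over sigma of sign(sigma) * sigma(z), for z = {index tuple: coeff}."""
    out = {}
    for idx, c in tensor.items():
        k = len(idx)
        for p in permutations(range(k)):
            new = [None] * k
            for i in range(k):
                new[p[i]] = idx[i]  # the entry in position i moves to position sigma(i)
            key = tuple(new)
            out[key] = out.get(key, 0) + perm_sign(p) * c
    return {key: c for key, c in out.items() if c != 0}

# z (x) w = v1 (x) v2 (x) v3 - v1 (x) v3 (x) v2
zw = {(1, 2, 3): 1, (1, 3, 2): -1}

# Project with (1/3!) Alt, using exact rational arithmetic.
projected = {key: Fraction(c, 6) for key, c in alt(zw).items()}
for key in sorted(projected):
    print(key, projected[key])
```

Each of the six index triples appears with coefficient $\pm 1/3$, the sign being that of the corresponding permutation of $(1, 2, 3)$, in agreement with the displayed formula.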


EXERCISES 

In these exercises $R$ is a commutative ring with 1 and $M$ is an $R$-module; $F$ is a field and $V$ is a finite dimensional vector space over $F$.

1. Prove that if $M$ is a cyclic $R$-module then $\mathcal{T}(M) = \mathcal{S}(M)$, i.e., the tensor algebra $\mathcal{T}(M)$ is commutative.

2. Fill in the details for the proof of Proposition 33 that $S/I = \bigoplus_{k=0}^{\infty} S_k/I_k$. [Show first that $S_i I_j \subseteq I_{i+j}$. Use this to show that the multiplication $(S_i/I_i)(S_j/I_j) \subseteq S_{i+j}/I_{i+j}$ is well defined, and then check the ring axioms and verify the statements made in the proof of Proposition 33.]

3. Show that the image of the map $\mathrm{Sym}_2$ for the $\mathbb{Z}$-module $\mathbb{Z}$ consists of the 2-tensors $a(1 \otimes 1)$ where $a$ is an even integer. Conclude in particular that the symmetric tensor $1 \otimes 1$ in $\mathbb{Z} \otimes_{\mathbb{Z}} \mathbb{Z}$ is not contained in the image of the map $\mathrm{Sym}_2$.

4. Prove that $m \wedge n_1 \wedge n_2 \wedge \cdots \wedge n_k = (-1)^k (n_1 \wedge n_2 \wedge \cdots \wedge n_k \wedge m)$. In particular, $x \wedge (y \wedge z) = (y \wedge z) \wedge x$ for all $x, y, z \in M$.

5. Prove that if $M$ is a free $R$-module of rank $n$ then $\bigwedge^i(M)$ is a free $R$-module of rank $\binom{n}{i}$ for $i = 0, 1, 2, \ldots$.

6. If $A$ is any $R$-algebra in which $a^2 = 0$ for all $a \in A$ and $\varphi : M \to A$ is an $R$-module homomorphism, prove there is a unique $R$-algebra homomorphism $\Phi : \bigwedge(M) \to A$ such that $\Phi|_M = \varphi$.

7. Let $R = \mathbb{Z}[x, y]$ and $I = (x, y)$.

(a) Prove that if $ax + by = a'x + b'y$ in $R$ then $a' = a + yf$ and $b' = b - xf$ for some polynomial $f(x, y) \in R$.

(b) Prove that the map $\varphi(ax + by, cx + dy) = ad - bc \mod (x, y)$ in the example following Corollary 37 is a well defined alternating $R$-bilinear map from $I \times I$ to $\mathbb{Z} = R/I$.

8. Let $R$ be an integral domain and let $F$ be its field of fractions.

(a) Considering $F$ as an $R$-module, prove that $\bigwedge^2 F = 0$.

(b) Let $I$ be any $R$-submodule of $F$ (for example, any ideal in $R$). Prove that $\bigwedge^i I$ is a torsion $R$-module for $i \geq 2$ (i.e., for every $x \in \bigwedge^i I$ there is some nonzero $r \in R$ with $rx = 0$).

(c) Give an example of an integral domain $R$ and an $R$-module $I$ in $F$ with $\bigwedge^i I \neq 0$ for every $i \geq 0$ (cf. the example following Corollary 37).

9. Let $R = \mathbb{Z}[G]$ be the group ring of the group $G = \{1, \sigma\}$ of order 2. Let $M = \mathbb{Z}e_1 + \mathbb{Z}e_2$ be a free $\mathbb{Z}$-module of rank 2 with basis $e_1$ and $e_2$. Define $\sigma(e_1) = e_1 + 2e_2$ and $\sigma(e_2) = -e_2$. Prove that this makes $M$ into an $R$-module and that the $R$-module $\bigwedge^2 M$ is a group of order 2 with $e_1 \wedge e_2$ as generator.

10. Prove that $z - (1/k!)\mathrm{Alt}(z) = (1/k!)\sum_{\sigma \in S_k} (z - \epsilon(\sigma)\sigma z)$ for any $k$-tensor $z$ and use this to prove that the kernel of the $R$-module homomorphism $(1/k!)\mathrm{Alt}$ in Proposition 40 is $\mathcal{A}^k(M)$.

11. Prove that the image of $\mathrm{Alt}_k$ is the unique largest subspace of $\mathcal{T}^k(V)$ on which each permutation $\sigma$ in the symmetric group $S_k$ acts as multiplication by the scalar $\epsilon(\sigma)$.

12. (a) Prove that if $f(x, y)$ is an alternating bilinear map on $V$ (i.e., $f(x, x) = 0$ for all $x \in V$) then $f(x, y) = -f(y, x)$ for all $x, y \in V$.

(b) Suppose that $-1 \neq 1$ in $F$. Prove that $f(x, y)$ is an alternating bilinear map on $V$ (i.e., $f(x, x) = 0$ for all $x \in V$) if and only if $f(x, y) = -f(y, x)$ for all $x, y \in V$.

(c) Suppose that $-1 = 1$ in $F$. Prove that every alternating bilinear form $f(x, y)$ on $V$ is symmetric (i.e., $f(x, y) = f(y, x)$ for all $x, y \in V$). Prove that there is a symmetric bilinear map on $V$ that is not alternating. [One approach: show that $\mathcal{C}^2(V) \subseteq \mathcal{A}^2(V)$ and $\mathcal{C}^2(V) \neq \mathcal{A}^2(V)$ by counting dimensions. Alternatively, construct an explicit symmetric map that is not alternating.]

13. Let $F$ be any field in which $-1 \neq 1$ and let $V$ be a vector space over $F$. Prove that $V \otimes_F V = \mathcal{S}^2(V) \oplus \bigwedge^2(V)$, i.e., that every 2-tensor may be written uniquely as a sum of a symmetric and an alternating tensor.

14. Prove that if $M$ is an $R$-module direct factor of the $R$-module $N$ then $\mathcal{T}(M)$ (respectively, $\mathcal{S}(M)$ and $\bigwedge(M)$) is an $R$-subalgebra of $\mathcal{T}(N)$ (respectively, $\mathcal{S}(N)$ and $\bigwedge(N)$).

CHAPTER 12


Modules over
Principal Ideal Domains


The main purpose of this chapter is to prove a structure theorem for finitely generated modules over particularly nice rings, namely Principal Ideal Domains. This theorem is an example of the ideal structure of the ring (which is particularly simple for P.I.D.s) being reflected in the structure of its modules. If we apply this result in the case where the P.I.D. is the ring of integers $\mathbb{Z}$ then we obtain a proof of the Fundamental Theorem of Finitely Generated Abelian Groups (which we examined in Chapter 5 without proof). If instead we apply this structure theorem in the case where the P.I.D. is the ring $F[x]$ of polynomials in $x$ with coefficients in a field $F$ we shall obtain the basic results on the so-called rational and Jordan canonical forms for a matrix. Before proceeding to the proof we briefly discuss these two important applications.

We have already discussed in Chapter 5 the result that any finitely generated abelian group is isomorphic to the direct sum of cyclic abelian groups, either $\mathbb{Z}$ or $\mathbb{Z}/n\mathbb{Z}$ for some positive integer $n$. Recall also that an abelian group is the same thing as a $\mathbb{Z}$-module. Since the ideals of $\mathbb{Z}$ are precisely the trivial ideal $(0)$ and the principal ideals $(n) = n\mathbb{Z}$ generated by positive integers $n$, we see that the Fundamental Theorem of Finitely Generated Abelian Groups in the language of modules says that any finitely generated $\mathbb{Z}$-module is the direct sum of modules of the form $\mathbb{Z}/I$ where $I$ is an ideal of $\mathbb{Z}$ (these are the cyclic $\mathbb{Z}$-modules), together with a uniqueness statement when the direct sum is written in a particular form. Note the correspondence between the ideal structure of $\mathbb{Z}$ and the structure of its (finitely generated) modules, the finitely generated abelian groups.

The Fundamental Theorem of Finitely Generated Modules over a P.I.D. states that the same result holds when the Principal Ideal Domain $\mathbb{Z}$ is replaced by any P.I.D. In particular, we have seen in Chapter 10 that a module over the ring $F[x]$ of polynomials in $x$ with coefficients in the field $F$ is the same thing as a vector space $V$ together with a fixed linear transformation $T$ of $V$ (where the element $x$ acts on $V$ by the linear transformation $T$). The Fundamental Theorem in this case will say that such a vector space is the direct sum of modules of the form $F[x]/I$ where $I$ is an ideal of $F[x]$, hence is either the trivial ideal $(0)$ or a principal ideal $(f(x))$ generated by some nonzero polynomial $f(x)$ (these are the cyclic $F[x]$-modules), again with a uniqueness statement when the direct sum is written in a particular form. If this is translated back into the language of vector spaces and linear transformations we can obtain information on the linear transformation $T$.

For example, suppose $V$ is a vector space of dimension $n$ over $F$ and we choose a basis for $V$. Then giving a linear transformation $T$ of $V$ to itself is the same thing as giving an $n \times n$ matrix $A$ with coefficients in $F$ (and choosing a different basis for $V$ gives a different matrix $B$ for $T$ which is similar to $A$, i.e., is of the form $P^{-1}AP$ for some invertible matrix $P$ which defines the change of basis). We shall see that
the Fundamental Theorem in this situation implies (under the assumption that the field 
F contains all the “eigenvalues” for the given linear transformation T) that there is a 
basis for V so that the associated matrix for T is as close to being a diagonal matrix 
as possible and so has a particularly simple form. This is the Jordan canonical form. 
The rational canonical form is another simple form for the matrix for T (that does not 
require the eigenvalues for T to be elements of F). In this way we shall be able to give 
canonical forms for arbitrary n x n matrices over fields F, that is, find matrices which 
are similar to a given n x n matrix and which are particularly simple (almost diagonal, 
for example). 

Example 

Let $V = \mathbb{Q}^3 = \{(x, y, z) \mid x, y, z \in \mathbb{Q}\}$ be the usual 3-dimensional vector space of ordered 3-tuples with entries from the field $F = \mathbb{Q}$ of rational numbers and suppose $T$ is the linear transformation

$$T(x, y, z) = (9x + 4y + 5z,\; -4x - 3z,\; -6x - 4y - 2z), \quad x, y, z \in \mathbb{Q}.$$

If we take the standard basis $e_1 = (1, 0, 0)$, $e_2 = (0, 1, 0)$, $e_3 = (0, 0, 1)$ for $V$ then the matrix $A$ representing this linear transformation is

$$A = \begin{pmatrix} 9 & 4 & 5 \\ -4 & 0 & -3 \\ -6 & -4 & -2 \end{pmatrix}.$$

We shall see that the Jordan canonical form for this matrix $A$ is the much simpler matrix

$$B = \begin{pmatrix} 2 & 1 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}$$

obtained by taking instead the basis $f_1 = (2, -1, -2)$, $f_2 = (1, 0, -1)$, $f_3 = (3, -2, -2)$ for $V$, since in this case

$$\begin{aligned}
T(f_1) &= T(2, -1, -2) = (4, -2, -4) = 2 \cdot f_1 + 0 \cdot f_2 + 0 \cdot f_3 \\
T(f_2) &= T(1, 0, -1) = (4, -1, -4) = 1 \cdot f_1 + 2 \cdot f_2 + 0 \cdot f_3 \\
T(f_3) &= T(3, -2, -2) = (9, -6, -6) = 0 \cdot f_1 + 0 \cdot f_2 + 3 \cdot f_3,
\end{aligned}$$

so the columns of the matrix representing $T$ with respect to this basis are $(2, 0, 0)$, $(1, 2, 0)$ and $(0, 0, 3)$, i.e., $T$ has matrix $B$ with respect to this basis. In particular $A$ is similar to the simpler matrix $B$.
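
The change of basis in this example is easy to confirm by machine. The sketch below is only an illustration: the helper names `apply` and `coordinates` are assumptions, and exact rational arithmetic from the standard library is used in place of any particular linear algebra package. It computes the coordinates of each $T(f_j)$ in the basis $f_1, f_2, f_3$ and assembles them as the columns of a matrix.

```python
from fractions import Fraction

A = [[9, 4, 5], [-4, 0, -3], [-6, -4, -2]]
f = [(2, -1, -2), (1, 0, -1), (3, -2, -2)]  # the basis f_1, f_2, f_3

def apply(matrix, v):
    """Matrix-vector product."""
    return tuple(sum(row[j] * v[j] for j in range(len(v))) for row in matrix)

def coordinates(basis, target):
    """Solve sum_i c_i * basis[i] = target by Gauss-Jordan elimination over Q."""
    n = len(basis)
    # Augmented matrix whose first n columns are the basis vectors.
    rows = [[Fraction(basis[j][i]) for j in range(n)] + [Fraction(target[i])]
            for i in range(n)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [x / rows[col][col] for x in rows[col]]
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return [row[n] for row in rows]

# Column j of B holds the coordinates of T(f_j) in the basis f_1, f_2, f_3.
columns = [coordinates(f, apply(A, fj)) for fj in f]
B = [[int(columns[j][i]) for j in range(3)] for i in range(3)]
print(B)  # [[2, 1, 0], [0, 2, 0], [0, 0, 3]]
```

Equivalently, if $P$ is the matrix whose columns are $f_1, f_2, f_3$, this computes $P^{-1}AP$ one column at a time.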

In fact this linear transformation T cannot be diagonalized (i.e., there is no choice of 
basis for V for which the corresponding matrix is a diagonal matrix) so that the matrix B 
is as close to a diagonal matrix for T as is possible.

The first section below gives some general definitions and states and proves the
Fundamental Theorem over an arbitrary P.I.D., after which we return to the application 
to canonical forms (the application to abelian groups appears in Chapter 5). These 
applications can be read independently of the general proof. An alternate and compu- 
tationally useful proof valid for Euclidean Domains (so in particular for the rings Z and 
F[x]) along the lines of row and column operations is outlined in the exercises. 

12.1 THE BASIC THEORY 

We first describe some general finiteness conditions. Let $R$ be a ring and let $M$ be a left $R$-module.

Definition.

(1) The left $R$-module $M$ is said to be a Noetherian $R$-module or to satisfy the ascending chain condition on submodules (or A.C.C. on submodules) if there are no infinite increasing chains of submodules, i.e., whenever

$$M_1 \subseteq M_2 \subseteq M_3 \subseteq \cdots$$

is an increasing chain of submodules of $M$, then there is a positive integer $m$ such that for all $k \geq m$, $M_k = M_m$ (so the chain becomes stationary at stage $m$: $M_m = M_{m+1} = M_{m+2} = \cdots$).

(2) The ring $R$ is said to be Noetherian if it is Noetherian as a left module over itself, i.e., if there are no infinite increasing chains of left ideals in $R$.

One can formulate analogous notions of A.C.C. on right and on two-sided ideals in a (possibly noncommutative) ring $R$. For noncommutative rings these properties need not be related.

Theorem 1. Let R be a ring and let M be a left R -module. Then the following are 
equivalent: 

(1) M is a Noetherian R -module. 

(2) Every nonempty set of submodules of M contains a maximal element under 
inclusion. 

(3) Every submodule of M is finitely generated. 

Proof: [(1) implies (2)] Assume $M$ is Noetherian and let $\Sigma$ be any nonempty collection of submodules of $M$. Choose any $M_1 \in \Sigma$. If $M_1$ is a maximal element of $\Sigma$, (2) holds, so assume $M_1$ is not maximal. Then there is some $M_2 \in \Sigma$ such that $M_1 \subset M_2$. If $M_2$ is maximal in $\Sigma$, (2) holds, so we may assume there is an $M_3 \in \Sigma$ properly containing $M_2$. Proceeding in this way one sees that if (2) fails we can produce by the Axiom of Choice an infinite strictly increasing chain of elements of $\Sigma$, contrary to (1).

[(2) implies (3)] Assume (2) holds and let N be any submodule of M. Let $\Sigma$ be
the collection of all finitely generated submodules of N. Since $\{0\} \in \Sigma$, this collection
is nonempty. By (2) $\Sigma$ contains a maximal element $N'$. If $N' \ne N$, let $x \in N - N'$.
Since $N' \in \Sigma$, the submodule $N'$ is finitely generated by assumption, hence also the
submodule generated by $N'$ and x is finitely generated. This contradicts the maximality
of $N'$, so $N = N'$ is finitely generated.

[(3) implies (1)] Assume (3) holds and let $M_1 \subseteq M_2 \subseteq M_3 \subseteq \cdots$ be a chain of
submodules of M. Let

$$N = \bigcup_{i=1}^{\infty} M_i$$

and note that N is a submodule. By (3) N is finitely generated by, say, $a_1, a_2, \dots, a_n$.
Since $a_i \in N$ for all i, each $a_i$ lies in one of the submodules in the chain, say $M_{j_i}$.
Let $m = \max\{j_1, j_2, \dots, j_n\}$. Then $a_i \in M_m$ for all i so the module they generate is
contained in $M_m$, i.e., $N \subseteq M_m$. This implies $M_m = N = M_k$ for all $k \ge m$, which
proves (1).

Corollary 2. If R is a P.I.D. then every nonempty set of ideals of R has a maximal
element and R is a Noetherian ring.

Proof: The P.I.D. R satisfies condition (3) in the theorem with M = R. 
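
For R = Z the stabilization of ascending chains can be observed directly. The following is a minimal sketch, assuming R = Z and only Python's standard library; the function name ideal_chain and the sample generators are illustrative choices, not part of the text. Since every ideal of Z is principal, the ideal (a_1, ..., a_k) is generated by gcd(a_1, ..., a_k), so adjoining generators one at a time produces an ascending chain recorded by running gcds.

```python
from itertools import accumulate
from math import gcd


def ideal_chain(generators):
    """Positive generators g_k of the ideals (a_1, ..., a_k) of Z, k = 1, 2, ..."""
    return list(accumulate(generators, gcd))


chain = ideal_chain([360, 84, 90, 35, 1000, 77])

# Each ideal contains the previous one: the new generator divides the old one.
assert all(g % h == 0 for g, h in zip(chain, chain[1:]))

# The chain of ideals becomes stationary once the running gcd stops changing.
print(chain)
```

In this sample the chain reaches the unit ideal after four steps and is constant afterwards, as the ascending chain condition requires.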

Recall that even if M itself is a finitely generated R-module, submodules of M
need not be finitely generated, so the condition that M be a Noetherian R-module is in
general stronger than the condition that M be a finitely generated R-module.

We require a result on "linear dependence" before turning to the main results of
this chapter.

Proposition 3. Let R be an integral domain and let M be a free R-module of rank
$n < \infty$. Then any n + 1 elements of M are R-linearly dependent, i.e., for any
$y_1, y_2, \dots, y_{n+1} \in M$ there are elements $r_1, r_2, \dots, r_{n+1} \in R$, not all zero, such that

$$r_1 y_1 + r_2 y_2 + \cdots + r_{n+1} y_{n+1} = 0.$$


Proof: The quickest way of proving this is to embed R in its quotient field F (since
R is an integral domain) and observe that since $M \cong R \oplus R \oplus \cdots \oplus R$ (n times) we
obtain $M \subseteq F \oplus F \oplus \cdots \oplus F$. The latter is an n-dimensional vector space over F so
any n + 1 elements of M are F-linearly dependent. By clearing the denominators of the
scalars (by multiplying through by the product of all the denominators, for example),
we obtain an R-linear dependence relation among the n + 1 elements of M.

Alternatively, let $e_1, \dots, e_n$ be a basis of the free R-module M and let $y_1, \dots, y_{n+1}$
be any n + 1 elements of M. For $1 \le i \le n + 1$ write $y_i = a_{1i} e_1 + a_{2i} e_2 + \cdots + a_{ni} e_n$ in
terms of the basis $e_1, e_2, \dots, e_n$. Let A be the $(n + 1) \times (n + 1)$ matrix whose i, j entry
is $a_{ij}$, $1 \le i \le n$, $1 \le j \le n + 1$ and whose last row is zero, so certainly det A = 0.
Since R is an integral domain, Corollary 27 of Section 11.4 shows that the columns
of A are R-linearly dependent. Any dependence relation on the columns of A gives a
dependence relation on the $y_i$'s, completing the proof.
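
The first proof is effectively an algorithm when R = Z. The following is a minimal sketch, assuming R = Z with quotient field Q and Python 3.9 or later (for math.lcm); the function name integer_dependence is a hypothetical helper introduced here. It row reduces over Q with exact fractions, reads off one rational dependence, and clears denominators to obtain integer coefficients.

```python
from fractions import Fraction
from math import lcm


def integer_dependence(vectors):
    """Integers r_1..r_k, not all zero, with sum of r_i * vectors[i] equal to 0.

    Assumes the vectors lie in Z^n and that k > n, so a dependence exists.
    """
    k, n = len(vectors), len(vectors[0])
    # Column j of the n x k matrix A holds the coordinates of vectors[j].
    A = [[Fraction(vectors[j][i]) for j in range(k)] for i in range(n)]
    pivots, row = [], 0
    for col in range(k):
        p = next((r for r in range(row, n) if A[r][col] != 0), None)
        if p is None:
            continue                      # no pivot here: a free column
        A[row], A[p] = A[p], A[row]
        A[row] = [x / A[row][col] for x in A[row]]
        for r in range(n):
            if r != row and A[r][col] != 0:
                f = A[r][col]
                A[r] = [x - f * y for x, y in zip(A[r], A[row])]
        pivots.append(col)
        row += 1
    # Set one free variable to 1 and solve for the pivot variables over Q.
    free = next(c for c in range(k) if c not in pivots)
    sol = [Fraction(0)] * k
    sol[free] = Fraction(1)
    for r, c in enumerate(pivots):
        sol[c] = -A[r][free]
    # Clear denominators to land in Z.
    d = lcm(*(s.denominator for s in sol))
    return [int(s * d) for s in sol]


ys = [(2, 4), (1, 3), (5, 1)]             # three elements of Z^2
r = integer_dependence(ys)
assert any(r)
assert all(sum(ri * y[i] for ri, y in zip(r, ys)) == 0 for i in range(2))
print(r)
```

The coefficient of the chosen free column is always nonzero, so the relation returned is nontrivial.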

If R is any integral domain and M is any R-module recall that

$$\mathrm{Tor}(M) = \{x \in M \mid rx = 0 \text{ for some nonzero } r \in R\}$$

is a submodule of M (called the torsion submodule of M) and if N is any submodule of
Tor(M), N is called a torsion submodule of M (so the torsion submodule of M is the
union of all torsion submodules of M, i.e., is the maximal torsion submodule of M). If
Tor(M) = 0, the module M is said to be torsion free.

For any submodule N of M, the annihilator of N is the ideal of R defined by

$$\mathrm{Ann}(N) = \{r \in R \mid rn = 0 \text{ for all } n \in N\}.$$

Note that if N is not a torsion submodule of M then Ann(N) = (0). It is easy to see
that if N, L are submodules of M with $N \subseteq L$, then $\mathrm{Ann}(L) \subseteq \mathrm{Ann}(N)$. If R is a P.I.D.
and $N \subseteq L \subseteq M$ with Ann(N) = (a) and Ann(L) = (b), then $a \mid b$. In particular,
the annihilator of any element x of M divides the annihilator of M (this is implied by
Lagrange's Theorem when R = Z).
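
The divisibility of annihilators can be checked by brute force in a small case. The following is a minimal sketch, assuming R = Z and M = Z/12Z, so that Ann(M) = (12); the function name annihilator_generator is a hypothetical helper, and the formula n / gcd(n, x) for the additive order of x is the standard one for cyclic groups.

```python
from math import gcd


def annihilator_generator(x, n):
    """Positive generator of Ann(x) for the element x of the Z-module Z/nZ."""
    return n // gcd(n, x)


n = 12                                           # Ann(Z/12Z) = (12)
for x in range(n):
    a = annihilator_generator(x, n)
    assert (a * x) % n == 0                      # a annihilates x
    assert all((k * x) % n for k in range(1, a)) # no smaller positive integer does
    assert n % a == 0                            # the generator of Ann(x) divides 12

print({x: annihilator_generator(x, n) for x in range(n)})
```

Here the submodule Zx plays the role of N and M itself plays the role of L in the statement above.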

Definition. For any integral domain R the rank of an R-module M is the maximum
number of R-linearly independent elements of M.

The preceding proposition states that for a free R-module M over an integral domain
the rank of a submodule is bounded by the rank of M. This notion of rank agrees with
previous uses of the same term. If the ring R = F is a field, then the rank of an
R-module M is the dimension of M as a vector space over F and any maximal set
of F-linearly independent elements is a basis for M. For a general integral domain,
however, an R-module M of rank n need not have a "basis," i.e., need not be a free
R-module even if M is torsion free, so some care is necessary with the notion of rank,
particularly with respect to the torsion elements of M. Exercises 1 to 6 and 20 give
an alternate characterization of the rank and provide some examples of (torsion free)
R-modules (of rank 1) that are not free.

The next important result shows that if N is a submodule of a free module of finite
rank over a P.I.D. then N is again a free module of finite rank and furthermore it is
possible to choose generators for the two modules which are related in a simple way.

Theorem 4. Let R be a Principal Ideal Domain, let M be a free R-module of finite rank
n and let N be a submodule of M. Then

(1) N is free of rank m, $m \le n$ and

(2) there exists a basis $y_1, y_2, \dots, y_n$ of M so that $a_1 y_1, a_2 y_2, \dots, a_m y_m$ is a basis of
N where $a_1, a_2, \dots, a_m$ are nonzero elements of R with the divisibility relations

$$a_1 \mid a_2 \mid \cdots \mid a_m.$$


Proof: The theorem is trivial for N = {0}, so assume $N \ne \{0\}$. For each R-module
homomorphism $\varphi$ of M into R, the image $\varphi(N)$ of N is a submodule of R, i.e., an
ideal in R. Since R is a P.I.D. this ideal must be principal, say $\varphi(N) = (a_\varphi)$, for some
$a_\varphi \in R$. Let

$$\Sigma = \{(a_\varphi) \mid \varphi \in \operatorname{Hom}_R(M, R)\}$$

be the collection of the principal ideals in R obtained in this way from the R-module
homomorphisms of M into R. The collection $\Sigma$ is certainly nonempty since taking $\varphi$
to be the trivial homomorphism shows that $(0) \in \Sigma$. By Corollary 2, $\Sigma$ has at least
one maximal element, i.e., there is at least one homomorphism $\nu$ of M to R so that the
principal ideal $\nu(N) = (a_\nu)$ is not properly contained in any other element of $\Sigma$. Let
$a_1 = a_\nu$ for this maximal element and let $y \in N$ be an element mapping to the generator
$a_1$ under the homomorphism $\nu$: $\nu(y) = a_1$.

We now show the element $a_1$ is nonzero. Let $x_1, x_2, \dots, x_n$ be any basis of the free
module M and let $\pi_i \in \operatorname{Hom}_R(M, R)$ be the natural projection homomorphism onto
the $i$-th coordinate with respect to this basis. Since $N \ne \{0\}$, there exists an i such that
$\pi_i(N) \ne 0$, which in particular shows that $\Sigma$ contains more than just the trivial ideal
(0). Since $(a_1)$ is a maximal element of $\Sigma$ it follows that $a_1 \ne 0$.

We next show that this element $a_1$ divides $\varphi(y)$ for every $\varphi \in \operatorname{Hom}_R(M, R)$. To
see this let d be a generator for the principal ideal generated by $a_1$ and $\varphi(y)$. Then d is a
divisor of both $a_1$ and $\varphi(y)$ in R and $d = r_1 a_1 + r_2 \varphi(y)$ for some $r_1, r_2 \in R$. Consider
the homomorphism $\psi = r_1 \nu + r_2 \varphi$ from M to R. Then $\psi(y) = (r_1 \nu + r_2 \varphi)(y) =
r_1 a_1 + r_2 \varphi(y) = d$ so that $d \in \psi(N)$, hence also $(d) \subseteq \psi(N)$. But d is a divisor of
$a_1$ so we also have $(a_1) \subseteq (d)$. Then $(a_1) \subseteq (d) \subseteq \psi(N)$ and by the maximality of
$(a_1)$ we must have equality: $(a_1) = (d) = \psi(N)$. In particular $(a_1) = (d)$ shows that
$a_1 \mid \varphi(y)$ since d divides $\varphi(y)$.

If we apply this to the projection homomorphisms $\pi_i$ we see that $a_1$ divides $\pi_i(y)$
for all i. Write $\pi_i(y) = a_1 b_i$ for some $b_i \in R$, $1 \le i \le n$ and define

$$y_1 = \sum_{i=1}^{n} b_i x_i.$$

Note that $a_1 y_1 = y$. Since $a_1 = \nu(y) = \nu(a_1 y_1) = a_1 \nu(y_1)$ and $a_1$ is a nonzero element
of the integral domain R this shows

$$\nu(y_1) = 1.$$

We now verify that this element $y_1$ can be taken as one element in a basis for M
and that $a_1 y_1$ can be taken as one element in a basis for N, namely that we have

(a) $M = R y_1 \oplus \ker \nu$, and

(b) $N = R a_1 y_1 \oplus (N \cap \ker \nu)$.

To see (a) let x be an arbitrary element in M and write $x = \nu(x) y_1 + (x - \nu(x) y_1)$.
Since

$$\nu(x - \nu(x) y_1) = \nu(x) - \nu(x)\nu(y_1) = \nu(x) - \nu(x) \cdot 1 = 0$$

we see that $x - \nu(x) y_1$ is an element in the kernel of $\nu$. This shows that x can be written
as the sum of an element in $R y_1$ and an element in the kernel of $\nu$, so $M = R y_1 + \ker \nu$.
To see that the sum is direct, suppose $r y_1$ is also an element in the kernel of $\nu$. Then
$0 = \nu(r y_1) = r \nu(y_1) = r$ shows that this element is indeed 0.

For (b) observe that $\nu(x')$ is divisible by $a_1$ for every $x' \in N$ by the definition of $a_1$
as a generator for $\nu(N)$. If we write $\nu(x') = b a_1$ where $b \in R$ then the decomposition
we used in (a) above is $x' = \nu(x') y_1 + (x' - \nu(x') y_1) = b a_1 y_1 + (x' - b a_1 y_1)$ where
the second summand is in the kernel of $\nu$ and is an element of N. This shows that
$N = R a_1 y_1 + (N \cap \ker \nu)$. The fact that the sum in (b) is direct is a special case of the
directness of the sum in (a).

We now prove part (1) of the theorem by induction on the rank, m, of N. If m = 0,
then N is a torsion module, hence N = 0 since a free module is torsion free, so (1)
holds trivially. Assume then that m > 0. Since the sum in (b) above is direct we see
easily that $N \cap \ker \nu$ has rank m - 1 (cf. Exercise 3). By induction $N \cap \ker \nu$ is then
a free R-module of rank m - 1. Again by the directness of the sum in (b) we see that
adjoining $a_1 y_1$ to any basis of $N \cap \ker \nu$ gives a basis of N, so N is also free (of rank
m), which proves (1).

Finally, we prove (2) by induction on n, the rank of M. Applying (1) to the
submodule $\ker \nu$ shows that this submodule is free and because the sum in (a) is direct
it is free of rank n - 1. By the induction assumption applied to the module $\ker \nu$ (which
plays the role of M) and its submodule $\ker \nu \cap N$ (which plays the role of N), we see
that there is a basis $y_2, y_3, \dots, y_n$ of $\ker \nu$ such that $a_2 y_2, a_3 y_3, \dots, a_m y_m$ is a basis of
$N \cap \ker \nu$ for some elements $a_2, a_3, \dots, a_m$ of R with $a_2 \mid a_3 \mid \cdots \mid a_m$. Since the
sums (a) and (b) are direct, $y_1, y_2, \dots, y_n$ is a basis of M and $a_1 y_1, a_2 y_2, \dots, a_m y_m$ is
a basis of N. To complete the induction it remains to show that $a_1$ divides $a_2$. Define
a homomorphism $\varphi$ from M to R by defining $\varphi(y_1) = \varphi(y_2) = 1$ and $\varphi(y_i) = 0$, for
all i > 2, on the basis for M. Then for this homomorphism $\varphi$ we have $a_1 = \varphi(a_1 y_1)$
so $a_1 \in \varphi(N)$ hence also $(a_1) \subseteq \varphi(N)$. By the maximality of $(a_1)$ in $\Sigma$ it follows that
$(a_1) = \varphi(N)$. Since $a_2 = \varphi(a_2 y_2) \in \varphi(N)$ we then have $a_2 \in (a_1)$, i.e., $a_1 \mid a_2$. This
completes the proof of the theorem.
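
As a small illustration of Theorem 4 (an example chosen here, not one taken from the text), let R = Z, let M = Z^2 and let N be the submodule of pairs whose coordinates have the same parity. The following worked computation is a sketch of one possible choice of aligned bases; the basis $y_1, y_2$ is an assumption of the example, and other choices work equally well.

```latex
\[
N = \{(x, y) \in \mathbb{Z}^2 \mid x \equiv y \pmod 2\}, \qquad
y_1 = (1, 1), \quad y_2 = (0, 1).
\]
\[
(x, y) = x\,y_1 + (y - x)\,y_2 \quad \text{for all } (x, y) \in \mathbb{Z}^2,
\qquad \text{so } y_1, y_2 \text{ is a basis of } M.
\]
\[
(x, y) \in N \iff 2 \mid (y - x),
\qquad \text{so } N = \mathbb{Z}\,y_1 \oplus \mathbb{Z}\,(2 y_2),
\quad a_1 = 1, \; a_2 = 2, \; a_1 \mid a_2 .
\]
```

Here m = n = 2, and the quotient M/N is cyclic of order 2, in agreement with the aligned bases.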

Recall that the left R-module C is a cyclic R-module (for any ring R, not necessarily
commutative nor with 1) if there is an element $x \in C$ such that C = Rx. We can then
define an R-module homomorphism

$$\pi : R \to C$$

by $\pi(r) = rx$, which will be surjective by the assumption C = Rx. The First Isomorphism
Theorem gives an isomorphism of (left) R-modules

$$R/\ker \pi \cong C.$$

If R is a P.I.D., $\ker \pi$ is a principal ideal, (a), so we see that the cyclic R-modules
C are of the form R/(a) where (a) = Ann(C).

The cyclic modules are the simplest modules (since they require only one generator).
The existence portion of the Fundamental Theorem states that any finitely generated
module over a P.I.D. is isomorphic to the direct sum of finitely many cyclic modules.

Theorem 5. (Fundamental Theorem, Existence: Invariant Factor Form) Let R be a
P.I.D. and let M be a finitely generated R-module.

(1) Then M is isomorphic to the direct sum of finitely many cyclic modules. More
precisely,

$$M \cong R^r \oplus R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_m)$$

for some integer $r \ge 0$ and nonzero elements $a_1, a_2, \dots, a_m$ of R which are not
units in R and which satisfy the divisibility relations

$$a_1 \mid a_2 \mid \cdots \mid a_m.$$

(2) M is torsion free if and only if M is free.

(3) In the decomposition in (1),

$$\mathrm{Tor}(M) \cong R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_m).$$

In particular M is a torsion module if and only if r = 0 and in this case the
annihilator of M is the ideal $(a_m)$.

Proof: The module M can be generated by a finite set of elements by assumption
so let $x_1, x_2, \dots, x_n$ be a set of generators of M of minimal cardinality. Let $R^n$ be
the free R-module of rank n with basis $b_1, b_2, \dots, b_n$ and define the homomorphism
$\pi : R^n \to M$ by defining $\pi(b_i) = x_i$ for all i, which is automatically surjective
since $x_1, \dots, x_n$ generate M. By the First Isomorphism Theorem for modules we have
$R^n/\ker \pi \cong M$. Now, by Theorem 4 applied to $R^n$ and the submodule $\ker \pi$ we can
choose another basis $y_1, y_2, \dots, y_n$ of $R^n$ so that $a_1 y_1, a_2 y_2, \dots, a_m y_m$ is a basis of
$\ker \pi$ for some elements $a_1, a_2, \dots, a_m$ of R with $a_1 \mid a_2 \mid \cdots \mid a_m$. This implies

$$M \cong R^n/\ker \pi = (R y_1 \oplus R y_2 \oplus \cdots \oplus R y_n)/(R a_1 y_1 \oplus R a_2 y_2 \oplus \cdots \oplus R a_m y_m).$$

To identify the quotient on the right hand side we use the natural surjective R-module
homomorphism

$$R y_1 \oplus R y_2 \oplus \cdots \oplus R y_n \to R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_m) \oplus R^{n-m}$$

that maps $(\alpha_1 y_1, \dots, \alpha_n y_n)$ to $(\alpha_1 \bmod (a_1), \dots, \alpha_m \bmod (a_m), \alpha_{m+1}, \dots, \alpha_n)$. The
kernel of this map is clearly the set of elements where $a_i$ divides $\alpha_i$, $i = 1, 2, \dots, m$,
i.e., $R a_1 y_1 \oplus R a_2 y_2 \oplus \cdots \oplus R a_m y_m$ (cf. Exercise 7). Hence we obtain

$$M \cong R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_m) \oplus R^{n-m}.$$

If a is a unit in R then R/(a) = 0, so in this direct sum we may remove any of the
initial $a_i$ which are units. This gives the decomposition in (1) (with r = n - m).

Since R/(a) is a torsion R-module for any nonzero element a of R, (1) immediately
implies M is a torsion free module if and only if $M \cong R^r$, which is (2). Part (3) is
immediate from the definitions since the annihilator of R/(a) is evidently the ideal (a).
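
Theorem 5 is an existence statement, but for R = Z the decomposition can be computed by integer row and column operations on a matrix of relations. The following is a minimal sketch under stated assumptions: R = Z, and the rows of the integer matrix A generate ker π inside Z^n. The names smith_diagonal and cyclic_decomposition are hypothetical helpers introduced for illustration; this is one standard diagonalization procedure, not the argument used in the proof above.

```python
def smith_diagonal(A):
    """Nonzero diagonal entries d_1 | d_2 | ... of a Smith normal form of A over Z."""
    A = [list(row) for row in A]
    m, n = len(A), len(A[0])
    diag, t = [], 0
    while t < min(m, n):
        # Choose a nonzero entry of least absolute value in the block A[t:, t:].
        entries = [(abs(A[i][j]), i, j)
                   for i in range(t, m) for j in range(t, n) if A[i][j]]
        if not entries:
            break                                  # the remaining block is zero
        _, pi, pj = min(entries)
        A[t], A[pi] = A[pi], A[t]                  # row swap
        for row in A:                              # column swap
            row[t], row[pj] = row[pj], row[t]
        p, clean = A[t][t], True
        for i in range(t + 1, m):                  # reduce column t modulo p
            q = A[i][t] // p
            A[i] = [x - q * y for x, y in zip(A[i], A[t])]
            clean = clean and A[i][t] == 0
        for j in range(t + 1, n):                  # reduce row t modulo p
            q = A[t][j] // p
            for row in A:
                row[j] -= q * row[t]
            clean = clean and A[t][j] == 0
        if not clean:
            continue                               # a smaller remainder appeared
        bad = next((i for i in range(t + 1, m) for j in range(t + 1, n)
                    if A[i][j] % p), None)
        if bad is not None:
            # p must divide every later entry; mix in an offending row and retry.
            A[t] = [x + y for x, y in zip(A[t], A[bad])]
            continue
        diag.append(abs(p))
        t += 1
    return diag


def cyclic_decomposition(A):
    """(free rank r, invariant factors) of Z^n modulo the row space of A."""
    diag = smith_diagonal(A)
    return len(A[0]) - len(diag), [d for d in diag if d != 1]


print(cyclic_decomposition([[2, 4, 4], [-6, 6, 12], [10, -4, -16]]))
print(cyclic_decomposition([[2, 4]]))
```

Each retry strictly decreases the least absolute value in the active block, so the loop terminates. Diagonal entries equal to 1 are discarded, mirroring the removal of unit $a_i$ in the proof.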

We shall shortly prove the uniqueness of the decomposition in Theorem 5, namely
that if we have

$$M \cong R^{r'} \oplus R/(b_1) \oplus R/(b_2) \oplus \cdots \oplus R/(b_{m'})$$

for some integer $r' \ge 0$ and nonzero elements $b_1, b_2, \dots, b_{m'}$ of R which are not units
with

$$b_1 \mid b_2 \mid \cdots \mid b_{m'},$$

then $r = r'$, $m = m'$ and $(a_i) = (b_i)$ (so $a_i = b_i$ up to units) for all i. It is precisely the
divisibility condition $a_1 \mid a_2 \mid \cdots \mid a_m$ which gives this uniqueness.

Definition. The integer r in Theorem 5 is called the free rank or the Betti number of
M and the elements $a_1, a_2, \dots, a_m \in R$ (defined up to multiplication by units in R) are
called the invariant factors of M.

Note that until we have proved that the invariant factors of M are unique we should 
properly refer to a set of invariant factors for M (and similarly for the free rank), by 
which we mean any elements giving a decomposition for M as in (1) of the theorem 
above. 


Using the Chinese Remainder Theorem it is possible to decompose the cyclic 
modules in Theorem 5 further so that M is the direct sum of cyclic modules whose 
annihilators are as simple as possible (namely (0) or generated by powers of primes in 
R). This gives an alternate decomposition which we shall also see is unique and which 
we now describe. 

Suppose a is a nonzero element of the Principal Ideal Domain R. Then since R is
also a Unique Factorization Domain we can write

$$a = u\, p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_s^{\alpha_s}$$

where the $p_i$ are distinct primes in R and u is a unit. This factorization is unique
up to units, so the ideals $(p_i^{\alpha_i})$, $i = 1, \dots, s$ are uniquely defined. For $i \ne j$ we
have $(p_i^{\alpha_i}) + (p_j^{\alpha_j}) = R$ since the sum of these two ideals is generated by a greatest
common divisor, which is 1 for distinct primes $p_i$, $p_j$. Put another way, the ideals
$(p_i^{\alpha_i})$, $i = 1, \dots, s$, are comaximal in pairs. The intersection of all these ideals is the
ideal (a) since a is the least common multiple of $p_1^{\alpha_1}, p_2^{\alpha_2}, \dots, p_s^{\alpha_s}$. Then the Chinese
Remainder Theorem (Theorem 7.17) shows that

$$R/(a) \cong R/(p_1^{\alpha_1}) \oplus R/(p_2^{\alpha_2}) \oplus \cdots \oplus R/(p_s^{\alpha_s})$$

as rings and also as R-modules.
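
For R = Z this isomorphism is the familiar reduction map, and it can be verified exhaustively for a small modulus. The following is a minimal sketch, assuming R = Z and a = 360; the names prime_power_factors and crt_map are illustrative helpers, and trial division is used only because the numbers are small.

```python
def prime_power_factors(a):
    """Prime powers p**alpha exactly dividing the positive integer a."""
    factors, p = [], 2
    while p * p <= a:
        if a % p == 0:
            q = 1
            while a % p == 0:
                a //= p
                q *= p
            factors.append(q)
        p += 1
    if a > 1:
        factors.append(a)
    return factors


def crt_map(x, moduli):
    """Image of x mod a in the direct sum of the Z/(p_i**alpha_i)."""
    return tuple(x % q for q in moduli)


a = 360
moduli = prime_power_factors(a)
images = {crt_map(x, moduli) for x in range(a)}

# The map is injective on Z/(a); both sides have a elements, so it is bijective.
assert len(images) == a

# The map is additive, so it is an isomorphism of Z-modules.
for x, y in [(7, 100), (359, 359), (45, 315)]:
    expected = tuple((u + v) % q for u, v, q in
                     zip(crt_map(x, moduli), crt_map(y, moduli), moduli))
    assert crt_map((x + y) % a, moduli) == expected

print(moduli)
```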

Applying this to the modules in Theorem 5 allows us to write each of the direct
summands $R/(a_i)$ for the invariant factor $a_i$ of M as a direct sum of cyclic modules
whose annihilators are the prime power divisors of $a_i$. This proves:


Theorem 6. (Fundamental Theorem, Existence: Elementary Divisor Form) Let R be a
P.I.D. and let M be a finitely generated R-module. Then M is the direct sum of a finite
number of cyclic modules whose annihilators are either (0) or generated by powers of
primes in R, i.e.,

$$M \cong R^r \oplus R/(p_1^{\alpha_1}) \oplus R/(p_2^{\alpha_2}) \oplus \cdots \oplus R/(p_t^{\alpha_t})$$

where $r \ge 0$ is an integer and $p_1^{\alpha_1}, \dots, p_t^{\alpha_t}$ are positive powers of (not necessarily
distinct) primes in R.


We proved Theorem 6 by using the prime power factors of the invariant factors for
M. In fact we shall see that the decomposition of M into a direct sum of cyclic modules
whose annihilators are (0) or prime powers as in Theorem 6 is unique, i.e., the integer
r and the ideals $(p_1^{\alpha_1}), \dots, (p_t^{\alpha_t})$ are uniquely defined for M. These prime powers are
given a name:

Definition. Let R be a P.I.D. and let M be a finitely generated R-module as in Theorem 6.
The prime powers $p_1^{\alpha_1}, \dots, p_t^{\alpha_t}$ (defined up to multiplication by units in R) are
called the elementary divisors of M.

Suppose M is a finitely generated torsion module over the Principal Ideal Domain
R. If for the distinct primes $p_1, p_2, \dots, p_n$ occurring in the decomposition in Theorem 6
we group together all the cyclic factors corresponding to the same prime $p_i$ we see in
particular that M can be written as a direct sum

$$M = N_1 \oplus N_2 \oplus \cdots \oplus N_n$$

where $N_i$ consists of all the elements of M which are annihilated by some power of
the prime $p_i$. This result holds also for modules over R which may not be finitely
generated:

Theorem 7. (The Primary Decomposition Theorem) Let R be a P.I.D. and let M be a
nonzero torsion R-module (not necessarily finitely generated) with nonzero annihilator
a. Suppose the factorization of a into distinct prime powers in R is

$$a = u\, p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_n^{\alpha_n}$$

and let $N_i = \{x \in M \mid p_i^{\alpha_i} x = 0\}$, $1 \le i \le n$. Then $N_i$ is a submodule of M with
annihilator $p_i^{\alpha_i}$ and is the submodule of M of all elements annihilated by some power
of $p_i$. We have

$$M = N_1 \oplus N_2 \oplus \cdots \oplus N_n.$$

If M is finitely generated then each $N_i$ is the direct sum of finitely many cyclic modules
whose annihilators are divisors of $p_i^{\alpha_i}$.

Proof: We have already proved these results in the case where M is finitely generated
over R. In the general case it is clear that $N_i$ is a submodule of M with annihilator
dividing $p_i^{\alpha_i}$. Since R is a P.I.D. the ideals $(p_i^{\alpha_i})$ and $(p_j^{\alpha_j})$ are comaximal for $i \ne j$, so
the direct sum decomposition of M can be proved easily by modifying the argument in
the proof of the Chinese Remainder Theorem to apply it to modules. Using this direct
sum decomposition it is easy to see that the annihilator of $N_i$ is precisely $p_i^{\alpha_i}$.
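
The primary decomposition can be confirmed by enumeration for a small torsion Z-module. The following is a minimal sketch, assuming R = Z and the sample module M = Z/4 ⊕ Z/6 ⊕ Z/9, whose annihilator is generated by 36 = 2^2 · 3^2; the module and the helper names scale, add and killed_by are choices made here for illustration.

```python
from itertools import product

moduli = (4, 6, 9)                       # M = Z/4 + Z/6 + Z/9, annihilator (36)
M = list(product(*(range(a) for a in moduli)))


def scale(r, x):
    """The element r * x of M."""
    return tuple(r * xi % a for xi, a in zip(x, moduli))


def add(x, y):
    """The element x + y of M."""
    return tuple((xi + yi) % a for xi, yi, a in zip(x, y, moduli))


def killed_by(q):
    """Submodule of elements x of M with q * x = 0."""
    return [x for x in M if not any(scale(q, x))]


N1 = killed_by(4)                        # the 2-primary component
N2 = killed_by(9)                        # the 3-primary component
sums = {add(x, y) for x in N1 for y in N2}

# Every element of M is a sum x + y, and |N1| * |N2| = |M| forces uniqueness,
# so M is the direct sum of N1 and N2.
assert len(sums) == len(M)
assert len(N1) * len(N2) == len(M)
print(len(N1), len(N2), len(M))
```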

Definition. The submodule $N_i$ in the previous theorem is called the $p_i$-primary
component of M.

Notice that with this terminology the elementary divisors of a finitely generated
module M are just the invariant factors of the primary components of Tor(M).

We now prove the uniqueness statements regarding the decompositions in the Fun- 
damental Theorem. 

Note that if M is any module over a commutative ring R and a is an element of R
then $aM = \{am \mid m \in M\}$ is a submodule of M. Recall also that in a Principal Ideal
Domain R the nonzero prime ideals are maximal, hence the quotient of R by a nonzero
prime ideal is a field.

Lemma 8. Let R be a P.I.D. and let p be a prime in R. Let F denote the field R/(p).

(1) Let $M = R^r$. Then $M/pM \cong F^r$.

(2) Let M = R/(a) where a is a nonzero element of R. Then

$$M/pM \cong \begin{cases} F & \text{if } p \text{ divides } a \text{ in } R \\ 0 & \text{if } p \text{ does not divide } a \text{ in } R. \end{cases}$$

(3) Let $M = R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_k)$ where each $a_i$ is divisible by p.
Then $M/pM \cong F^k$.


Proof: (1) There is a natural map from $R^r$ to $(R/(p))^r$ defined by mapping
$(\alpha_1, \dots, \alpha_r)$ to $(\alpha_1 \bmod (p), \dots, \alpha_r \bmod (p))$. This is clearly a surjective R-module
homomorphism with kernel consisting of the r-tuples all of whose coordinates are
divisible by p, i.e., $pR^r$, so $R^r/pR^r \cong (R/(p))^r$, which is (1).

(2) This follows from the Isomorphism Theorems: note first that p(R/(a)) is the
image of the ideal (p) in the quotient R/(a), hence is ((p) + (a))/(a). The ideal (p) + (a)
is generated by a greatest common divisor of p and a, hence is (p) if p divides a and is
R = (1) otherwise. Hence pM = (p)/(a) if p divides a and is R/(a) = M otherwise.
If p divides a then $M/pM = (R/(a))/((p)/(a)) \cong R/(p)$, and if p does not divide
a then M/pM = M/M = 0, which proves (2).

(3) This follows from (2) as in the proof of part (1) of Theorem 5.
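
Parts (2) and (3) together say that for a direct sum of cyclic torsion modules, M/pM has dimension over F equal to the number of summands R/(a_i) with p dividing a_i. The following is a minimal brute-force check, assuming R = Z and a sample list of moduli chosen here; the function name index_of_pM is a hypothetical helper.

```python
from itertools import product
from math import prod


def index_of_pM(moduli, p):
    """Order of M/pM for the Z-module M = Z/a_1 + ... + Z/a_k."""
    elements = product(*(range(a) for a in moduli))
    pM = {tuple(p * x % a for x, a in zip(elt, moduli)) for elt in elements}
    return prod(moduli) // len(pM)


moduli = [4, 6, 9, 5]
for p in (2, 3, 5, 7):
    k = sum(1 for a in moduli if a % p == 0)   # summands with p dividing a_i
    assert index_of_pM(moduli, p) == p ** k    # M/pM is F^k with F = Z/(p)
    print(p, k)
```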


Theorem 9. (Fundamental Theorem, Uniqueness) Let R be a P.I.D.

(1) Two finitely generated R-modules $M_1$ and $M_2$ are isomorphic if and only if they
have the same free rank and the same list of invariant factors.

(2) Two finitely generated R-modules $M_1$ and $M_2$ are isomorphic if and only if they
have the same free rank and the same list of elementary divisors.

Proof: If $M_1$ and $M_2$ have the same free rank and list of invariant factors or the
same free rank and list of elementary divisors then they are clearly isomorphic.

Suppose that $M_1$ and $M_2$ are isomorphic. Any isomorphism between $M_1$ and $M_2$
maps the torsion in $M_1$ to the torsion in $M_2$ so we must have $\mathrm{Tor}(M_1) \cong \mathrm{Tor}(M_2)$. Then
$R^{r_1} \cong M_1/\mathrm{Tor}(M_1) \cong M_2/\mathrm{Tor}(M_2) \cong R^{r_2}$ where $r_1$ is the free rank of $M_1$ and $r_2$ is
the free rank of $M_2$. Let p be any nonzero prime in R. Then from $R^{r_1} \cong R^{r_2}$ we obtain
$R^{r_1}/pR^{r_1} \cong R^{r_2}/pR^{r_2}$. By (1) of the previous lemma, this implies $F^{r_1} \cong F^{r_2}$ where F
is the field R/pR. Hence we have an isomorphism of an $r_1$-dimensional vector space
over F with an $r_2$-dimensional vector space over F, so that $r_1 = r_2$ and $M_1$ and $M_2$
have the same free rank.

We are reduced to showing that $M_1$ and $M_2$ have the same lists of invariant factors
and elementary divisors. To do this we need only work with the isomorphic torsion
modules $\mathrm{Tor}(M_1)$ and $\mathrm{Tor}(M_2)$, i.e., we may as well assume that both $M_1$ and $M_2$ are
torsion R-modules.

We first show they have the same elementary divisors. It suffices to show that for
any fixed prime p the elementary divisors which are a power of p are the same for
both $M_1$ and $M_2$. If $M_1 \cong M_2$ then the p-primary submodule of $M_1$ (= the direct
sum of the cyclic factors whose elementary divisors are powers of p) is isomorphic to
the p-primary submodule of $M_2$, since these are the submodules of elements which are
annihilated by some power of p. We are therefore reduced to the case of proving that
if two modules $M_1$ and $M_2$ which have annihilator a power of p are isomorphic then
they have the same elementary divisors.

We proceed by induction on the power of p in the annihilator of $M_1$ (which is the
same as the annihilator of $M_2$ since $M_1$ and $M_2$ are isomorphic). If this power is 0,
then both $M_1$ and $M_2$ are 0 and we are done. Otherwise $M_1$ (and $M_2$) have nontrivial
elementary divisors. Suppose the elementary divisors of $M_1$ are given by

$$\text{elementary divisors of } M_1: \quad \underbrace{p, p, \dots, p}_{m \text{ times}},\ p^{\alpha_1}, p^{\alpha_2}, \dots, p^{\alpha_s},$$

where $2 \le \alpha_1 \le \alpha_2 \le \cdots \le \alpha_s$, i.e., $M_1$ is the direct sum of cyclic modules with
generators $x_1, x_2, \dots, x_m, x_{m+1}, \dots, x_{m+s}$, say, whose annihilators are $(p), (p), \dots, (p)$,
$(p^{\alpha_1}), \dots, (p^{\alpha_s})$, respectively. Then the submodule $pM_1$ has elementary divisors

$$\text{elementary divisors of } pM_1: \quad p^{\alpha_1 - 1}, p^{\alpha_2 - 1}, \dots, p^{\alpha_s - 1}$$

since $pM_1$ is the direct sum of the cyclic modules with generators $px_1, px_2, \dots, px_m$,
$px_{m+1}, \dots, px_{m+s}$ whose annihilators are $(1), (1), \dots, (1), (p^{\alpha_1 - 1}), \dots, (p^{\alpha_s - 1})$,
respectively. Similarly, if the elementary divisors of $M_2$ are given by

$$\text{elementary divisors of } M_2: \quad \underbrace{p, p, \dots, p}_{n \text{ times}},\ p^{\beta_1}, p^{\beta_2}, \dots, p^{\beta_t},$$

where $2 \le \beta_1 \le \beta_2 \le \cdots \le \beta_t$, then $pM_2$ has elementary divisors

$$\text{elementary divisors of } pM_2: \quad p^{\beta_1 - 1}, p^{\beta_2 - 1}, \dots, p^{\beta_t - 1}.$$

Since $M_1 \cong M_2$, also $pM_1 \cong pM_2$ and the power of p in the annihilator of $pM_1$ is
one less than the power of p in the annihilator of $M_1$. By induction, the elementary
divisors for $pM_1$ are the same as the elementary divisors for $pM_2$, i.e., s = t and
$\alpha_i - 1 = \beta_i - 1$ for $i = 1, 2, \dots, s$, hence $\alpha_i = \beta_i$ for $i = 1, 2, \dots, s$. Finally, since
also $M_1/pM_1 \cong M_2/pM_2$ we see from (3) of the lemma above that $F^{m+s} \cong F^{n+t}$,
which shows that m + s = n + t hence m = n since we have already seen s = t. This
proves that the set of elementary divisors for $M_1$ is the same as the set of elementary
divisors for $M_2$.

We now show that $M_1$ and $M_2$ must have the same invariant factors. Suppose
$a_1 \mid a_2 \mid \cdots \mid a_m$ are invariant factors for $M_1$. We obtain a set of elementary divisors for
$M_1$ by taking the prime power factors of these elements. Note that then the divisibility
relations on the invariant factors imply that $a_m$ is the product of the largest of the prime
powers among these elementary divisors, $a_{m-1}$ is the product of the largest prime powers
among these elementary divisors once the factors for $a_m$ have been removed, and so
on. If $b_1 \mid b_2 \mid \cdots \mid b_n$ are invariant factors for $M_2$ then we similarly obtain a set of
elementary divisors for $M_2$ by taking the prime power factors of these elements. But we
showed above that the elementary divisors for $M_1$ and $M_2$ are the same, and it follows
that the same is true of the invariant factors.

Corollary 10. Let R be a P.I.D. and let M be a finitely generated R-module.

(1) The elementary divisors of M are the prime power factors of the invariant factors
of M.

(2) The largest invariant factor of M is the product of the largest of the distinct prime
powers among the elementary divisors of M, the next largest invariant factor
is the product of the largest of the distinct prime powers among the remaining
elementary divisors of M, and so on.

Proof: The procedure in (1) gives a set of elementary divisors and since the ele- 
mentary divisors for M are unique by the theorem, it follows that the procedure in (1) 
gives the set of elementary divisors. Similarly for (2). 
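
The two procedures of Corollary 10 are easy to mechanize for R = Z. The following is a minimal sketch, assuming R = Z, torsion modules only, and cyclic orders greater than 1; the function names prime_powers, elementary_divisors and invariant_factors are hypothetical helpers, and trial division is used purely for illustration.

```python
from collections import defaultdict
from math import prod


def prime_powers(a):
    """Pairs (p, p**alpha) for the prime powers exactly dividing a > 1."""
    out, p = [], 2
    while p * p <= a:
        if a % p == 0:
            q = 1
            while a % p == 0:
                a //= p
                q *= p
            out.append((p, q))
        p += 1
    if a > 1:
        out.append((a, a))
    return out


def elementary_divisors(cyclic_orders):
    """Prime power factors of the orders of the cyclic summands, sorted."""
    return sorted(q for a in cyclic_orders for _, q in prime_powers(a))


def invariant_factors(elem):
    """Invariant factors a_1 | a_2 | ... from a list of elementary divisors."""
    if not elem:
        return []
    by_prime = defaultdict(list)
    for q in elem:
        (p, _), = prime_powers(q)        # q is a prime power: exactly one entry
        by_prime[p].append(q)
    columns = [sorted(qs, reverse=True) for qs in by_prime.values()]
    length = max(len(c) for c in columns)
    # The i-th largest invariant factor is the product of the i-th largest
    # power of each prime (when one remains).
    factors = [prod(c[i] for c in columns if i < len(c)) for i in range(length)]
    return factors[::-1]


print(elementary_divisors([2, 6, 12]))
print(invariant_factors([2, 2, 3, 3, 4]))
# Z/6 + Z/4 and Z/12 + Z/2 have the same elementary divisors, hence are isomorphic.
print(elementary_divisors([6, 4]) == elementary_divisors([12, 2]))
```

Comparing the sorted lists of elementary divisors (together with the free rank, which is 0 here) is exactly the isomorphism test provided by Theorem 9.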

Corollary 11. (The Fundamental Theorem of Finitely Generated Abelian Groups ) See 
Theorem 5.3 and Theorem 5.5. 

Proof: Take R = Z in Theorems 5, 6 and 9 (note however that the invariant factors 
are listed in reverse order in Chapter 5 for computational convenience). 

The procedure for passing between elementary divisors and invariant factors in 
Corollary 10 is described in some detail in Chapter 5 in the case of finitely generated 
abelian groups. 

Note also that if a finitely generated module M is written as a direct sum of cyclic
modules of the form R/(a) then the ideals (a) which occur are not in general unique
unless some additional conditions are imposed (such as the divisibility condition for
the invariant factors or the condition that a be the power of a prime in the case of the
elementary divisors). To decide whether two modules are isomorphic it is necessary to
first write them in such a standard (or canonical) form.

EXERCISES 

1. Let M be a module over the integral domain R.

(a) Suppose x is a nonzero torsion element in M. Show that x and 0 are "linearly
dependent." Conclude that the rank of Tor(M) is 0, so that in particular any torsion
R-module has rank 0.

(b) Show that the rank of M is the same as the rank of the (torsion free) quotient M/Tor(M).

2. Let M be a module over the integral domain R.

(a) Suppose that M has rank n and that $x_1, x_2, \dots, x_n$ is any maximal set of linearly
independent elements of M. Let $N = Rx_1 + \cdots + Rx_n$ be the submodule generated
by $x_1, x_2, \dots, x_n$. Prove that N is isomorphic to $R^n$ and that the quotient M/N is a
torsion R-module (equivalently, the elements $x_1, \dots, x_n$ are linearly independent and
for any $y \in M$ there is a nonzero element $r \in R$ such that ry can be written as a linear
combination $r_1 x_1 + \cdots + r_n x_n$ of the $x_i$).

(b) Prove conversely that if M contains a submodule N that is free of rank n (i.e., $N \cong
R^n$) such that the quotient M/N is a torsion R-module then M has rank n. [Let
$y_1, y_2, \dots, y_{n+1}$ be any n + 1 elements of M. Use the fact that M/N is torsion
to write $r_i y_i$ as a linear combination of a basis for N for some nonzero elements
$r_1, \dots, r_{n+1}$ of R. Use an argument as in the proof of Proposition 3 to see that the
$r_i y_i$, and hence also the $y_i$, are linearly dependent.]

3. Let R be an integral domain and let A and B be R-modules of ranks m and n, respectively.
Prove that the rank of $A \oplus B$ is m + n. [Use the previous exercise.]

4. Let R be an integral domain, let M be an R-module and let N be a submodule of M.
Suppose M has rank n, N has rank r and the quotient M/N has rank s. Prove that
n = r + s. [Let $x_1, x_2, \dots, x_s$ be elements of M whose images in M/N are a maximal
set of independent elements and let $x_{s+1}, x_{s+2}, \dots, x_{s+r}$ be a maximal set of independent
elements in N. Prove that $x_1, x_2, \dots, x_{s+r}$ are linearly independent in M and that for any
element $y \in M$ there is a nonzero element $r \in R$ such that ry is a linear combination of
these elements. Then use Exercise 2.]

5. Let R = Z[x] and let M = (2, x) be the ideal generated by 2 and x, considered as
a submodule of R. Show that {2, x} is not a basis of M. [Find a nontrivial R-linear
dependence between these two elements.] Show that the rank of M is 1 but that M is not
free of rank 1 (cf. Exercise 2).

6. Show that if R is an integral domain and M is any nonprincipal ideal of R then M is torsion
free of rank 1 but is not a free R-module.

7. Let R be any ring, let $A_1, A_2, \dots, A_m$ be R-modules and let $B_i$ be a submodule of $A_i$,
$1 \le i \le m$. Prove that

$$(A_1 \oplus A_2 \oplus \cdots \oplus A_m)/(B_1 \oplus B_2 \oplus \cdots \oplus B_m) \cong (A_1/B_1) \oplus (A_2/B_2) \oplus \cdots \oplus (A_m/B_m).$$

8. Let R be a P.I.D., let B be a torsion R-module and let p be a prime in R. Prove that if
pb = 0 for some nonzero $b \in B$, then $\mathrm{Ann}(B) \subseteq (p)$.

9. Give an example of an integral domain R and a nonzero torsion R-module M such that
Ann(M) = 0. Prove that if N is a finitely generated torsion R-module then $\mathrm{Ann}(N) \ne 0$.

10. For p a prime in the P.I.D. R and N an R-module prove that the p-primary component of
N is a submodule of N and prove that N is the direct sum of its p-primary components
(there need not be finitely many of them).

11. Let R be a P.I.D., let a be a nonzero element of R and let M = R/(a). For any prime p
of R prove that

$$p^{k-1}M/p^kM \cong \begin{cases} R/(p) & \text{if } k \le n \\ 0 & \text{if } k > n, \end{cases}$$

where n is the power of p dividing a in R.

12. Let R be a P.I.D. and let p be a prime in R.

(a) Let M be a finitely generated torsion R-module. Use the previous exercise to prove that
$p^{k-1}M/p^kM \cong F^{n_k}$ where F is the field R/(p) and $n_k$ is the number of elementary
divisors of M which are powers $p^\alpha$ with $\alpha \ge k$.

(b) Suppose $M_1$ and $M_2$ are isomorphic finitely generated torsion R-modules. Use (a) to
prove that, for every $k \ge 0$, $M_1$ and $M_2$ have the same number of elementary divisors
$p^\alpha$ with $\alpha \ge k$. Prove that this implies $M_1$ and $M_2$ have the same set of elementary
divisors.

13. If M is a finitely generated module over the P.I.D. R, describe the structure of M/Tor(M).

14. Let R be a P.I.D. and let M be a torsion R-module. Prove that M is irreducible (cf.
Exercises 9 to 11 of Section 10.3) if and only if M = Rm for any nonzero element $m \in M$
where the annihilator of m is a nonzero prime ideal (p).

15. Prove that if R is a Noetherian ring then $R^n$ is a Noetherian R-module. [Fix a basis of $R^n$.
If M is a submodule of $R^n$ show that the collection of first coordinates of elements of M
is a submodule of R hence is finitely generated. Let $m_1, m_2, \dots, m_k$ be elements of M
whose first coordinates generate this submodule of R. Show that any element of M can be
written as an R-linear combination of $m_1, m_2, \dots, m_k$ plus an element of M whose first
coordinate is 0. Prove that $M \cap R^{n-1}$ is a submodule of $R^{n-1}$ where $R^{n-1}$ is the set of
elements of $R^n$ with first coordinate 0 and then use induction on n.]

The following set of exercises outlines a proof of Theorem 5 in the special case where R is
a Euclidean Domain using a matrix argument involving row and column operations. This
applies in particular to the cases R = Z and R = F[x] of interest in the applications and is
computationally useful.

Let R be a Euclidean Domain and let M be an R-module.

16. Prove that M is finitely generated if and only if there is a surjective R-homomorphism
$\varphi : R^n \to M$ for some integer n (this is true for any ring R).

Suppose $\varphi : R^n \to M$ is a surjective R-module homomorphism. By Exercise 15, $\ker \varphi$ is
finitely generated. If $x_1, x_2, \dots, x_n$ is a basis for $R^n$ and $y_1, \dots, y_m$ are generators for $\ker \varphi$
we have

\[ y_i = a_{i1}x_1 + a_{i2}x_2 + \cdots + a_{in}x_n, \qquad i = 1, 2, \dots, m \]

with coefficients $a_{ij} \in R$. It follows that the homomorphism $\varphi$ (hence the module structure of
M) is determined by the choice of generators for $R^n$ and the matrix $A = (a_{ij})$. Such a matrix
A will be called a relations matrix.

17. (a) Show that interchanging $x_i$ and $x_j$ in the basis for $R^n$ interchanges the $i$th column
with the $j$th column in the corresponding relations matrix.

(b) Show that, for any $a \in R$, replacing the element $x_j$ by $x_j - ax_i$ in the basis for $R^n$
gives another basis for $R^n$ and that the corresponding relations matrix for this basis
is the same as the original relations matrix except that a times the $j$th column has
been added to the $i$th column. [Note that $\cdots + a_ix_i + \cdots + a_jx_j + \cdots = \cdots + (a_i + aa_j)x_i + \cdots + a_j(x_j - ax_i) + \cdots$.]

18. (a) Show that interchanging the generators $y_i$ and $y_j$ interchanges the $i$th row with the $j$th
row in the relations matrix.

(b) Show that, for any $a \in R$, replacing the element $y_j$ by $y_j - ay_i$ gives another set
of generators for $\ker \varphi$ and that the corresponding relations matrix for this choice of
generators is the same as the original relations matrix except that $-a$ times the $i$th row
has been added to the $j$th row.

19. By the previous two exercises we may perform elementary row and column operations on
a given relations matrix by choosing different generators for $R^n$ and $\ker \varphi$. If all relation
matrices are the zero matrix then $\ker \varphi = 0$ and $M \cong R^n$. Otherwise let $a_1$ be the (nonzero)
g.c.d. (recall R is a Euclidean Domain) of all the entries in a fixed initial relations matrix
for M.

(a) Prove that by elementary row and column operations we may assume $a_1$ occurs in a
relations matrix of the form

\[ \begin{pmatrix} a_1 & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix} \]

where $a_1$ divides $a_{ij}$, $i = 1, 2, \dots, m$, $j = 1, 2, \dots, n$.

(b) Prove that there is a relations matrix of the form

\[ \begin{pmatrix} a_1 & 0 & \cdots & 0 \\ 0 & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ 0 & a_{m2} & \cdots & a_{mn} \end{pmatrix} \]

where $a_1$ divides all the entries.




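(c) Let $a_2$ be a g.c.d. of all the entries except the element $a_1$ in the relations matrix in (b).
Prove that there is a relations matrix of the form

\[ \begin{pmatrix} a_1 & 0 & 0 & \cdots & 0 \\ 0 & a_2 & 0 & \cdots & 0 \\ 0 & 0 & a_{33} & \cdots & a_{3n} \\ \vdots & \vdots & \vdots & & \vdots \\ 0 & 0 & a_{m3} & \cdots & a_{mn} \end{pmatrix} \]

where $a_1$ divides $a_2$ and $a_2$ divides all the other entries of the matrix.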


(d) Prove that there is a relations matrix of the form

\[ \begin{pmatrix} D & 0 \\ 0 & 0 \end{pmatrix} \]

where D is a diagonal matrix with nonzero entries $a_1, a_2, \dots, a_k$, $k \le n$, satisfying

\[ a_1 \mid a_2 \mid \cdots \mid a_k. \]

Conclude that

\[ M \cong R/(a_1) \oplus R/(a_2) \oplus \cdots \oplus R/(a_k) \oplus R^{n-k}. \]


If n is not the minimal number of generators required for M then some of the initial
elements $a_1, a_2, \dots$ above will be units, so the corresponding direct summands above will be
0. If we remove these irrelevant factors we have produced the invariant factors of the module
M. Further, the image of the new generators for $R^n$ corresponding to the direct summands
above will then be a set of R-generators for the cyclic submodules of M in its invariant factor
decomposition (note that the image in M of the generators corresponding to factors with $a_i$ a
unit will be 0). The column operations performed in the relations matrix reduction correspond
to changing the basis used for $R^n$ as described in Exercise 17:

(a) Interchanging the $i$th column with the $j$th column corresponds to interchanging the $i$th and
$j$th elements in the basis for $R^n$.

(b) For any $a \in R$, adding a times the $j$th column to the $i$th column corresponds to subtracting
a times the $i$th basis element from the $j$th basis element.

Keeping track of the column operations performed and changing the initial choice of generators
for M in the same way therefore gives a set of R-generators for the cyclic submodules of M in
its invariant factor decomposition.

This process is quite fast computationally once an initial set of generators for M and initial
relations matrix are determined. The element $a_1$ is determined using the Euclidean Algorithm
as the g.c.d. of the elements in the initial relations matrix. Using the row and column operations
we can obtain the appropriate linear combination of the entries to produce this g.c.d. in the
(1,1) position of a new relations matrix. One then subtracts the appropriate multiple of the first
column and first row to obtain a matrix as in Exercise 19(b), then iterates this process. Some
examples of this procedure in a special case are given at the end of the following section.
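
The reduction outlined in Exercise 19 can be sketched in code for the special case R = Z. The following is a minimal Python sketch under stated assumptions, not the text's own procedure verbatim: the function name `smith_normal_form` and the pivot strategy (repeatedly moving a nonzero entry of least absolute value into the pivot position, rather than first computing the g.c.d. of all entries) are illustrative choices, and the bookkeeping of generators described above is omitted.

```python
def smith_normal_form(matrix):
    """Diagonalize an integer matrix by elementary row and column operations.

    Returns a new matrix whose only nonzero entries are positive diagonal
    entries d_1, d_2, ..., d_k with each d_i dividing d_{i+1}.
    """
    a = [row[:] for row in matrix]            # work on a copy
    m, n = len(a), len(a[0])
    t = 0                                     # current pivot position (t, t)
    while t < min(m, n):
        # Locate a nonzero entry of least absolute value in the remaining block.
        entries = [(abs(a[i][j]), i, j)
                   for i in range(t, m) for j in range(t, n) if a[i][j] != 0]
        if not entries:
            break                             # the remaining block is zero
        _, pi, pj = min(entries)
        a[t], a[pi] = a[pi], a[t]             # row interchange
        for row in a:                         # column interchange
            row[t], row[pj] = row[pj], row[t]
        pivot = a[t][t]
        clean = True
        # Subtract multiples of the pivot row from the rows below it.
        for i in range(t + 1, m):
            q = a[i][t] // pivot
            for j in range(n):
                a[i][j] -= q * a[t][j]
            if a[i][t] != 0:
                clean = False                 # a smaller remainder appeared
        # Subtract multiples of the pivot column from the columns to its right.
        for j in range(t + 1, n):
            q = a[t][j] // pivot
            for i in range(m):
                a[i][j] -= q * a[i][t]
            if a[t][j] != 0:
                clean = False
        if not clean:
            continue                          # choose a new, smaller pivot
        # Pivot row and column are cleared; the pivot must divide the rest.
        bad = [i for i in range(t + 1, m) for j in range(t + 1, n)
               if a[i][j] % pivot != 0]
        if bad:
            for j in range(n):
                a[t][j] += a[bad[0]][j]       # add an offending row to row t
            continue
        if pivot < 0:
            a[t] = [-v for v in a[t]]         # multiply the row by the unit -1
        t += 1
    return a


relations = [[2, 4, 4], [-6, 6, 12], [10, -4, -16]]
print(smith_normal_form(relations))           # diagonal entries 2, 6, 12
```

For this relations matrix the diagonal entries are 2, 6, 12, so the module it presents is isomorphic to Z/2Z ⊕ Z/6Z ⊕ Z/12Z.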

20. Let R be an integral domain with quotient field F and let M be any R-module. Prove that
the rank of M equals the dimension of the vector space $F \otimes_R M$ over F.

21. Prove that a finitely generated module over a P.I.D. is projective if and only if it is free.

22. Let R be a P.I.D. that is not a field. Prove that no finitely generated R-module is injective.
[Use Exercise 4, Section 10.5 to consider torsion and free modules separately.]


12.2 THE RATIONAL CANONICAL FORM 

We now apply our results on finitely generated modules in the special case where the
P.I.D. is the ring F[x] of polynomials in x with coefficients in a field F.

Let V be a finite dimensional vector space over F of dimension n and let T be
a fixed linear transformation of V (i.e., from V to itself). As we saw in Chapter 10
we can consider V as an F[x]-module where the element x acts on V as the linear
transformation T (and so any polynomial in x acts on V as the same polynomial in
T). Since V has finite dimension over F by assumption, it is by definition finitely
generated as an F-module, hence certainly finitely generated as an F[x]-module, so
the classification theorems of the preceding section apply.

Any nonzero free F[x]-module (being isomorphic to a direct sum of copies of
F[x]) is an infinite dimensional vector space over F, so if V has finite dimension over
F then it must in fact be a torsion F[x]-module (i.e., its free rank is 0). It follows from
the Fundamental Theorem that then V is isomorphic as an F[x]-module to the direct
sum of cyclic, torsion F[x]-modules. We shall see that this decomposition of V will
allow us to choose a basis for V with respect to which the matrix representation for
the linear transformation T is in a specific simple form. When we use the invariant
factor decomposition of V we obtain the rational canonical form for the matrix for T,
which we analyze in this section. When we use the elementary divisor decomposition
(and when F contains all the eigenvalues of T) we obtain the Jordan canonical form,
considered in the following section and mentioned earlier as the matrix representing T
which is as close to being a diagonal matrix as possible. The uniqueness portion of the
Fundamental Theorem ensures that the rational and Jordan canonical forms are unique
(which is why they are referred to as canonical).

One important use of these canonical forms is to classify the distinct linear trans- 
formations of V. In particular they allow us to determine when two matrices represent 
the same linear transformation, i.e., when two given n x n matrices are similar. 

Note that this will be another instance where the structure of the space being acted 
upon (the invariant factor decomposition of V for example) is used to obtain significant 
information on the algebraic objects (in this case the linear transformations) which 
are acting. This will be considered in the case of groups acting on vector spaces in 
Chapter 18 (and goes under the name of Representation Theory of Groups). 

Before describing the rational canonical form in detail we first introduce some 
linear algebra. 

Definition.

(1) An element $\lambda$ of F is called an eigenvalue of the linear transformation T if there
is a nonzero vector $v \in V$ such that $T(v) = \lambda v$. In this situation v is called an
eigenvector of T with corresponding eigenvalue $\lambda$.

(2) If A is an n x n matrix with coefficients in F, an element $\lambda$ is called an eigenvalue
of A with corresponding eigenvector v if v is a nonzero n x 1 column vector
such that $Av = \lambda v$.

(3) If $\lambda$ is an eigenvalue of the linear transformation T, the set $\{v \in V \mid T(v) = \lambda v\}$
is called the eigenspace of T corresponding to the eigenvalue $\lambda$. Similarly, if $\lambda$
is an eigenvalue of the n x n matrix A, the set of n x 1 matrices v with $Av = \lambda v$
is called the eigenspace of A corresponding to the eigenvalue $\lambda$.

Note that if we fix a basis B of V then any linear transformation T of V has an
associated n x n matrix A. Conversely, if A is any n x n matrix then the map T defined
by $T(v) = Av$ for $v \in V$, where the v on the right is the n x 1 vector consisting of
the coordinates of v with respect to the fixed basis B of V, is a linear transformation
of V. Then v is an eigenvector of T with corresponding eigenvalue $\lambda$ if and only if
the coordinate vector of v with respect to B is an eigenvector of A with eigenvalue
$\lambda$. In other words, the eigenvalues for the linear transformation T are the same as the
eigenvalues for the matrix A of T with respect to any fixed basis for V.

Definition. The determinant of a linear transformation from V to V is the determinant 
of any matrix representing the linear transformation (note that this does not depend on 
the choice of the basis used). 

Proposition 12. The following are equivalent:

(1) $\lambda$ is an eigenvalue of T

(2) $\lambda I - T$ is a singular linear transformation of V

(3) $\det(\lambda I - T) = 0$.

Proof: Since $\lambda$ is an eigenvalue of T with corresponding eigenvector v if and only
if v is a nonzero vector in the kernel of $\lambda I - T$, it follows that (1) and (2) are equivalent.
(2) and (3) are equivalent by our results on determinants.

Definition. Let x be an indeterminate over F. The polynomial $\det(xI - T)$ is called
the characteristic polynomial of T and will be denoted $c_T(x)$. If A is an n x n matrix
with coefficients in F, $\det(xI - A)$ is called the characteristic polynomial of A and
will be denoted $c_A(x)$.

It is easy to see by expanding the determinant that the characteristic polynomial 
of either T or A is a monic polynomial of degree n = dim V. Proposition 12 says 
that the set of eigenvalues of T (or A) is precisely the set of roots of the characteristic 
polynomial of T (of A, respectively). In particular, T has at most n distinct eigenvalues. 
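
As a small illustration of Proposition 12 and the remark on roots, the following Python sketch evaluates $\det(\lambda I - A)$ for one 2 x 2 matrix over Q and searches a short list of candidates for roots. The matrix, the helper names `det2` and `char_poly_value`, and the search range are all assumptions made for the example.

```python
from fractions import Fraction


def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def char_poly_value(a, lam):
    """Evaluate det(lam*I - A) for a 2 x 2 matrix A."""
    shifted = [[lam - a[0][0], -a[0][1]],
               [-a[1][0], lam - a[1][1]]]
    return det2(shifted)


A = [[Fraction(2), Fraction(1)],
     [Fraction(1), Fraction(2)]]

# Here det(xI - A) = (x - 2)^2 - 1 = (x - 1)(x - 3).
candidates = [Fraction(k) for k in range(-5, 6)]
roots = [lam for lam in candidates if char_poly_value(A, lam) == 0]
print(roots)

# Each root has a nonzero eigenvector: A(1, -1)^t = 1*(1, -1)^t and A(1, 1)^t = 3*(1, 1)^t.
eigenpairs = [(Fraction(1), [1, -1]), (Fraction(3), [1, 1])]
for lam, v in eigenpairs:
    image = [A[0][0] * v[0] + A[0][1] * v[1], A[1][0] * v[0] + A[1][1] * v[1]]
    assert image == [lam * v[0], lam * v[1]]
```

The matrix has degree-2 characteristic polynomial and exactly two eigenvalues, in line with the bound of at most n distinct eigenvalues.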

We have seen that V considered as a module over F[x] via the linear transformation
T is a torsion F[x]-module. Let $m(x) \in F[x]$ be the unique monic polynomial generating
the annihilator of V in F[x]. Equivalently, m(x) is the unique monic polynomial of
minimal degree annihilating V (i.e., such that m(T) is the 0 linear transformation), and
if $f(x) \in F[x]$ is any polynomial annihilating V, m(x) divides f(x). Since the ring of
all n x n matrices over F is isomorphic to the collection of all linear transformations of
V to itself (an isomorphism is obtained by choosing a basis for V), it follows that for
any n x n matrix A over F there is similarly a unique monic polynomial of minimal
degree with m(A) the zero matrix.

Definition. The unique monic polynomial which generates the ideal Ann(V) in F[x]
is called the minimal polynomial of T and will be denoted $m_T(x)$. The unique monic
polynomial of smallest degree which when evaluated at the matrix A is the zero matrix
is called the minimal polynomial of A and will be denoted $m_A(x)$.

It is easy to see (cf. Exercise 5) that the degrees of these minimal polynomials are
at most $n^2$ where n is the dimension of V. We shall shortly prove that the minimal
polynomial for T is a divisor of the characteristic polynomial for T (this is the
Cayley-Hamilton Theorem), and similarly for A, so in fact the degrees of these polynomials are
at most n.
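
The degree bound suggests a direct, if naive, way to compute a minimal polynomial over Q: the powers $I, A, A^2, \dots$ lie in the $n^2$-dimensional space of matrices, so the first power that is a linear combination of the earlier ones gives the monic polynomial of least degree vanishing at A. The Python sketch below follows this idea using exact rational arithmetic. It is an illustration under stated assumptions, not the method developed in the text: the names `solve` and `minimal_polynomial` are hypothetical, and the entries are assumed to be rational.

```python
from fractions import Fraction


def mat_mul(x, y):
    """Product of two square matrices of the same size."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def solve(columns, target):
    """Solve sum_j c_j * columns[j] = target over Q by Gaussian elimination.

    Assumes the columns are linearly independent.  Returns the list of
    coefficients c_j, or None when target is not in their span.
    """
    rows, k = len(target), len(columns)
    aug = [[columns[j][r] for j in range(k)] + [target[r]] for r in range(rows)]
    pivot_row = 0
    for col in range(k):
        p = next((r for r in range(pivot_row, rows) if aug[r][col] != 0), None)
        if p is None:
            return None                     # not reached for independent columns
        aug[pivot_row], aug[p] = aug[p], aug[pivot_row]
        lead = aug[pivot_row][col]
        aug[pivot_row] = [v / lead for v in aug[pivot_row]]
        for r in range(rows):
            if r != pivot_row and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[pivot_row])]
        pivot_row += 1
    if any(aug[r][k] != 0 for r in range(pivot_row, rows)):
        return None                         # inconsistent system
    return [aug[j][k] for j in range(k)]


def minimal_polynomial(a):
    """Coefficients [c_0, ..., c_{d-1}, 1] of the monic minimal polynomial of a."""
    n = len(a)
    a = [[Fraction(v) for v in row] for row in a]
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    powers = [identity]                     # I, A, A^2, ... found independent so far
    while True:
        nxt = mat_mul(powers[-1], a)
        flat = [[v for row in p for v in row] for p in powers]
        coeffs = solve(flat, [v for row in nxt for v in row])
        if coeffs is not None:
            # A^d = sum c_i A^i, so m(x) = x^d - sum c_i x^i.
            return [-c for c in coeffs] + [Fraction(1)]
        powers.append(nxt)


print(minimal_polynomial([[2, 0, 0], [0, 2, 0], [0, 0, 3]]))
```

For the diagonal matrix with entries 2, 2, 3 the result is the coefficient list of $x^2 - 5x + 6 = (x-2)(x-3)$, of degree 2 rather than 3.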

We now describe the rational canonical form of the linear transformation T
(respectively, of the n x n matrix A). By Theorem 5 we have an isomorphism

\[ V \cong F[x]/(a_1(x)) \oplus F[x]/(a_2(x)) \oplus \cdots \oplus F[x]/(a_m(x)) \tag{12.1} \]

of F[x]-modules where $a_1(x), a_2(x), \dots, a_m(x)$ are polynomials in F[x] of degree at
least one with the divisibility conditions

\[ a_1(x) \mid a_2(x) \mid \cdots \mid a_m(x). \]

These invariant factors $a_i(x)$ are only determined up to a unit in F[x] but since the units
of F[x] are precisely the nonzero elements of F (i.e., the nonzero constant polynomials),
we may make these polynomials unique by stipulating that they be monic.

Since the annihilator of V is the ideal $(a_m(x))$ (part (3) of Theorem 5), we
immediately obtain:

Proposition 13. The minimal polynomial $m_T(x)$ is the largest invariant factor of V.
All the invariant factors of V divide $m_T(x)$.

We shall see below how to calculate not only the minimal polynomial for T but 
also the other invariant factors. 

We now choose a basis for each of the direct summands for V in the decomposition
(12.1) above for which the matrix for T is quite simple. Recall that the linear transformation
T acting on the left side of (12.1) is the element x acting by multiplication on each of the
factors on the right side of the isomorphism in (12.1).

We have seen in the example following Proposition 1 of Chapter 11 that the elements
$1, \bar{x}, \bar{x}^2, \dots, \bar{x}^{k-1}$ give a basis for the vector space $F[x]/(a(x))$ where
$a(x) = x^k + b_{k-1}x^{k-1} + \cdots + b_1x + b_0$ is any monic polynomial in F[x] and
$\bar{x} = x \bmod (a(x))$. With respect to this basis the linear transformation of
multiplication by x acts in a simple manner:

\[
\begin{aligned}
1 &\mapsto \bar{x} \\
\bar{x} &\mapsto \bar{x}^2 \\
\bar{x}^2 &\mapsto \bar{x}^3 \\
&\;\;\vdots \\
\bar{x}^{k-2} &\mapsto \bar{x}^{k-1} \\
\bar{x}^{k-1} &\mapsto \bar{x}^k = -b_0 - b_1\bar{x} - \cdots - b_{k-1}\bar{x}^{k-1}
\end{aligned}
\]

where the last equality is because $\bar{x}^k + b_{k-1}\bar{x}^{k-1} + \cdots + b_1\bar{x} + b_0 = 0$ since $a(\bar{x}) = 0$ in
$F[x]/(a(x))$. With respect to this basis, the matrix for multiplication by x is therefore

\[ \begin{pmatrix} 0 & 0 & \cdots & 0 & -b_0 \\ 1 & 0 & \cdots & 0 & -b_1 \\ 0 & 1 & \cdots & 0 & -b_2 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & -b_{k-1} \end{pmatrix}. \]

Such matrices are given a name: 

Definition. Let $a(x) = x^k + b_{k-1}x^{k-1} + \cdots + b_1x + b_0$ be any monic polynomial
in F[x]. The companion matrix of a(x) is the k x k matrix with 1's down the first
subdiagonal, $-b_0, -b_1, \dots, -b_{k-1}$ down the last column and zeros elsewhere. The
companion matrix of a(x) will be denoted by $C_{a(x)}$.
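
The definition translates directly into code. The following Python sketch builds the companion matrix from the non-leading coefficients; the function name `companion_matrix` and the convention of passing the coefficients as the list `[b_0, b_1, ..., b_{k-1}]` are assumptions made for illustration.

```python
def companion_matrix(coeffs):
    """Companion matrix of x^k + b_{k-1} x^{k-1} + ... + b_1 x + b_0.

    `coeffs` lists the non-leading coefficients [b_0, b_1, ..., b_{k-1}].
    """
    k = len(coeffs)
    c = [[0] * k for _ in range(k)]
    for i in range(1, k):
        c[i][i - 1] = 1              # 1's down the first subdiagonal
    for i in range(k):
        c[i][k - 1] = -coeffs[i]     # -b_0, ..., -b_{k-1} down the last column
    return c


# Companion matrix of x^3 - 1, i.e. b_0 = -1, b_1 = 0, b_2 = 0.
for row in companion_matrix([-1, 0, 0]):
    print(row)
```

For $x^3 - 1$ this gives the permutation matrix sending the first basis vector to the second, the second to the third, and the third back to the first, which is how multiplication by x acts on $1, \bar{x}, \bar{x}^2$.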


We apply this to each of the cyclic modules on the right side of (12.1) above and let
$\mathcal{B}_i$ be the elements of V corresponding to the basis chosen above for the cyclic factor
$F[x]/(a_i(x))$ under the isomorphism in (12.1). Then by definition the linear transformation
T acts on $\mathcal{B}_i$ by the companion matrix for $a_i(x)$ since we have seen that this is how
multiplication by x acts. The union $\mathcal{B}$ of the $\mathcal{B}_i$'s gives a basis for V since the sum on
the right of (12.1) is direct and with respect to this basis the linear transformation T has as
matrix the direct sum of the companion matrices for the invariant factors, i.e.,

\[ \begin{pmatrix} C_{a_1(x)} & & & \\ & C_{a_2(x)} & & \\ & & \ddots & \\ & & & C_{a_m(x)} \end{pmatrix}. \tag{12.2} \]


Notice that this matrix is uniquely determined from the invariant factors of the
F[x]-module V and, by Theorem 9, the list of invariant factors uniquely determines the
module V up to isomorphism as an F[x]-module.


Definition.

(1) A matrix is said to be in rational canonical form if it is the direct sum of
companion matrices for monic polynomials $a_1(x), \dots, a_m(x)$ of degree at least
one with $a_1(x) \mid a_2(x) \mid \cdots \mid a_m(x)$. The polynomials $a_i(x)$ are called the
invariant factors of the matrix. Such a matrix is also said to be a block diagonal
matrix with blocks the companion matrices for the $a_i(x)$.

(2) A rational canonical form for a linear transformation T is a matrix representing
T which is in rational canonical form.


We have seen that any linear transformation T has a rational canonical form. We
now see that this rational canonical form is unique (hence is called the rational canonical
form for T). To see this note that the process we used to determine the matrix of T
from the direct sum decomposition is reversible. Suppose $b_1(x), b_2(x), \dots, b_t(x)$ are
monic polynomials in F[x] of degree at least one such that $b_i(x) \mid b_{i+1}(x)$ for all i and
suppose for some basis $\mathcal{E}$ of V, that the matrix of T with respect to the basis $\mathcal{E}$ is the
direct sum of the companion matrices of the $b_i(x)$. Then V must be a direct sum of
T-stable subspaces $D_i$, one for each $b_i(x)$, in such a way that the matrix of T on each $D_i$
is the companion matrix of $b_i(x)$. Let $\mathcal{E}_i$ be the corresponding (ordered) basis of $D_i$ (so
$\mathcal{E}$ is the union of the $\mathcal{E}_i$) and let $e_i$ be the first basis element in $\mathcal{E}_i$. Then it is easy to see
that $D_i$ is a cyclic F[x]-module with generator $e_i$ and that the annihilator of $D_i$ is $(b_i(x))$.
Thus the torsion F[x]-module V decomposes into a direct sum of cyclic F[x]-modules
in two ways, both of which satisfy the conditions of Theorem 5, i.e., both of which give
lists of invariant factors. Since the invariant factors are unique by Theorem 9, $a_i(x)$
and $b_i(x)$ must differ by a unit factor in F[x] and since the polynomials are monic by
assumption, we must have $a_i(x) = b_i(x)$ for all i. This proves the following result:

Theorem 14. (Rational Canonical Form for Linear Transformations) Let V be a finite
dimensional vector space over the field F and let T be a linear transformation of V.

(1) There is a basis for V with respect to which the matrix for T is in rational
canonical form, i.e., is a block diagonal matrix whose diagonal blocks are the
companion matrices for monic polynomials $a_1(x), a_2(x), \dots, a_m(x)$ of degree
at least one with $a_1(x) \mid a_2(x) \mid \cdots \mid a_m(x)$.

(2) The rational canonical form for T is unique.

The use of the word rational is to indicate that this canonical form is calculated 
entirely within the field F and exists for any linear transformation T. This is not the 
case for the Jordan canonical form (considered later), which only exists if the field F 
contains the eigenvalues for T (cf. also the remarks following Corollary 18). 

The following result translates the notion of similar linear transformations (i.e., the 
same linear transformation up to a change of basis) into the language of modules and 
relates this notion to rational canonical forms. 

Theorem 15. Let S and T be linear transformations of V. Then the following are
equivalent:

(1) S and T are similar linear transformations

(2) the F[x]-modules obtained from V via S and via T are isomorphic
F[x]-modules

(3) S and T have the same rational canonical form.

Proof: [(1) implies (2)] Assume there is a nonsingular linear transformation U such
that $S = UTU^{-1}$. The vector space isomorphism $U : V \to V$ is also an F[x]-module
homomorphism, where x acts on the first V via T and on the second via S, since for
example $U(xv) = U(Tv) = UT(v) = SU(v) = x(Uv)$. Hence this is an F[x]-module
isomorphism of the two modules in (2).

[(2) implies (3)] Assume (2) holds and denote by $V_1$ the vector space V made into
an F[x]-module via S and denote by $V_2$ the space V made into an F[x]-module via T.
Since $V_1 \cong V_2$ as F[x]-modules they have the same list of invariant factors. Thus S
and T have a common rational canonical form.

[(3) implies (1)] Assume (3) holds. Since S and T have the same matrix
representation with respect to some choice of (possibly different) bases of V by assumption,
they are, up to a change of basis, the same linear transformation of V, hence are similar.

Let A be any n x n matrix with entries from F. Let V be an n-dimensional vector
space over F. Recall we can then define a linear transformation T on V by choosing
a basis for V and setting $T(v) = Av$ where v on the right hand side means the n x 1
column vector of coordinates of v with respect to our chosen basis (this is just the usual
identification of linear transformations with matrices). Then (of course) the matrix for
this T with respect to this basis is the given matrix A. Put another way, any n x n matrix
A with entries from the field F arises as the matrix for some linear transformation T of
an n-dimensional vector space.

This dictionary between linear transformations of vector spaces and matrices allows 
us to state our previous two results in the language of matrices: 

Theorem 16. (Rational Canonical Form for Matrices) Let A be an n x n matrix over
the field F.

(1) The matrix A is similar to a matrix in rational canonical form, i.e., there is an
invertible n x n matrix P over F such that $P^{-1}AP$ is a block diagonal matrix
whose diagonal blocks are the companion matrices for monic polynomials
$a_1(x), a_2(x), \dots, a_m(x)$ of degree at least one with $a_1(x) \mid a_2(x) \mid \cdots \mid a_m(x)$.

(2) The rational canonical form for A is unique.

Definition. The invariant factors of an n x n matrix over a field F are the invariant 
factors of its rational canonical form. 

Theorem 17. Let A and B be n x n matrices over the field F. Then A and B are similar
if and only if A and B have the same rational canonical form.

If A is a matrix with entries from a field F and F is a subfield of a larger field K 
then we may also consider A as a matrix over K. The next result shows that the rational 
canonical form for A and questions of similarity do not depend on which field contains 
the entries of A. 

Corollary 18. Let A and B be two n x n matrices over a field F and suppose F is a
subfield of the field K.

(1) The rational canonical form of A is the same whether it is computed over K or
over F. The minimal and characteristic polynomials and the invariant factors
of A are the same whether A is considered as a matrix over F or as a matrix
over K.

(2) The matrices A and B are similar over K if and only if they are similar over
F, i.e., there exists an invertible n x n matrix P with entries from K such that
$B = P^{-1}AP$ if and only if there exists an (in general different) invertible n x n
matrix Q with entries from F such that $B = Q^{-1}AQ$.

Proof: (1) Let M be the rational canonical form of A when computed over the
smaller field F. Since M satisfies the conditions in the definition of the rational
canonical form over K, the uniqueness of the rational canonical form implies that M is also
the rational canonical form of A over K. Hence the invariant factors of A are the same
whether A is viewed over F or over K. In particular, since the minimal polynomial
is the largest invariant factor of A it also does not depend on the field over which A is
viewed. It is clear from the determinant definition of the characteristic polynomial of
A that this polynomial depends only on the entries of A (we shall see shortly that the
characteristic polynomial is the product of all the invariant factors for A, which will
give an alternate proof of this result).

(2) If A and B are similar over the smaller field F they are clearly similar over K.
Conversely, if A and B are similar over K, they have the same rational canonical form
over K. By (1) they have the same rational canonical form over F, hence are similar
over F by Theorem 17.

This corollary asserts in particular that the rational canonical form for an n x n 
matrix A is an n x n matrix with entries in the smallest field containing the entries 
of A. Further, this canonical form is the same matrix even if we allow conjugation of 
A by nonsingular matrices whose entries come from larger fields. This explains the 
terminology of rational canonical form. 

The next proposition gives the connection between the characteristic polynomial 
of a matrix (or of a linear transformation) and its invariant factors and is quite useful 
for determining these invariant factors (particularly for matrices of small size). 

Lemma 19. Let $a(x) \in F[x]$ be any monic polynomial.

(1) The characteristic polynomial of the companion matrix of a(x) is a(x).

(2) If M is the block diagonal matrix

\[ M = \begin{pmatrix} A_1 & 0 & \cdots & 0 \\ 0 & A_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & A_k \end{pmatrix} \]

given by the direct sum of matrices $A_1, A_2, \dots, A_k$ then the characteristic
polynomial of M is the product of the characteristic polynomials of $A_1, A_2, \dots, A_k$.

Proof: These are both straightforward exercises. 
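
Both parts of Lemma 19 can be checked numerically on small examples. The Python sketch below computes characteristic polynomials with the Faddeev-LeVerrier recursion, which is a technique swapped in here for convenience (it is not the determinant expansion used in the text, and it divides by 1, 2, ..., n, so it assumes characteristic 0; the entries are taken in Q). The helper names `char_poly`, `direct_sum` and `poly_mul` are hypothetical.

```python
from fractions import Fraction


def char_poly(a):
    """Coefficients [c_0, ..., c_n] of det(xI - A), lowest degree first."""
    n = len(a)
    a = [[Fraction(v) for v in row] for row in a]
    coeffs = [Fraction(0)] * n + [Fraction(1)]       # c_n = 1 (monic)
    m = [[Fraction(0)] * n for _ in range(n)]        # M_0 = 0
    for k in range(1, n + 1):
        # M_k = A * M_{k-1} + c_{n-k+1} * I
        am = [[sum(a[i][t] * m[t][j] for t in range(n)) for j in range(n)]
              for i in range(n)]
        m = [[am[i][j] + (coeffs[n - k + 1] if i == j else 0) for j in range(n)]
             for i in range(n)]
        # c_{n-k} = -trace(A * M_k) / k
        trace = sum(a[i][t] * m[t][i] for i in range(n) for t in range(n))
        coeffs[n - k] = -trace / k
    return coeffs


def direct_sum(blocks):
    """Block diagonal matrix with the given square blocks."""
    n = sum(len(b) for b in blocks)
    out = [[0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                out[offset + i][offset + j] = v
        offset += len(b)
    return out


def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out


# Companion matrices of x^3 - 2x^2 + 3x - 5, of x^2 - 5x + 6 and of x - 2.
C1 = [[0, 0, 5], [1, 0, -3], [0, 1, 2]]
C2 = [[0, -6], [1, 5]]
C3 = [[2]]

print(char_poly(C1))                                  # part (1)
print(char_poly(direct_sum([C2, C3])))                # part (2)
print(poly_mul(char_poly(C2), char_poly(C3)))
```

The first line printed is the coefficient list of $x^3 - 2x^2 + 3x - 5$, and the last two lines agree, both giving $(x^2 - 5x + 6)(x - 2) = x^3 - 7x^2 + 16x - 12$.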

Proposition 20. Let A be an n x n matrix over the field F. 

(1) The characteristic polynomial of A is the product of all the invariant factors of 
A. 

(2) (The Cayley-Hamilton Theorem) The minimal polynomial of A divides the
characteristic polynomial of A.

(3) The characteristic polynomial of A divides some power of the minimal poly- 
nomial of A. In particular these polynomials have the same roots, not counting 
multiplicities. 

The same statements are true if the matrix A is replaced by a linear transformation T
of an n-dimensional vector space over F.

Proof: Let B be the rational canonical form of A. By the previous lemma the block
diagonal form of B shows that the characteristic polynomial of B is the product of the
characteristic polynomials of the companion matrices of the invariant factors of A. By
the first part of the lemma above, the characteristic polynomial of the companion matrix
$C_{a(x)}$ for a(x) is just a(x), which implies that the characteristic polynomial for B is the
product of the invariant factors of A. Since A and B are similar, they have the same
characteristic polynomial, which proves (1). Assertion (2) is immediate from (1) since
the minimal polynomial for A is the largest invariant factor of A. The fact that all the
invariant factors divide the largest one immediately implies (3). The final assertion is
clear from the dictionary between linear transformations of vector spaces and matrices.

Note that part (2) of the proposition is the assertion that the matrix A satisfies its own
characteristic polynomial, i.e., $c_A(A) = 0$ as matrices, which is the usual formulation
for the Cayley-Hamilton Theorem. Note also that it implies that the minimal
polynomial for A has degree at most n, a result mentioned before.
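
For a 2 x 2 matrix the characteristic polynomial is $c_A(x) = x^2 - \mathrm{tr}(A)\,x + \det(A)$, so the statement $c_A(A) = 0$ can be checked by hand or by a few lines of code. The Python sketch below does this for the 2 x 2 case only; the function name `cayley_hamilton_residual` is an illustrative assumption.

```python
def cayley_hamilton_residual(a):
    """Return c_A(A) = A^2 - tr(A) A + det(A) I for a 2 x 2 matrix A."""
    (p, q), (r, s) = a
    trace = p + s
    det = p * s - q * r
    a_squared = [[p * p + q * r, p * q + q * s],
                 [r * p + s * r, r * q + s * s]]
    return [[a_squared[i][j] - trace * a[i][j] + (det if i == j else 0)
             for j in range(2)] for i in range(2)]


# A = [[1, 2], [3, 4]] has c_A(x) = x^2 - 5x - 2.
print(cayley_hamilton_residual([[1, 2], [3, 4]]))
```

The result is the zero matrix, as the Cayley-Hamilton Theorem predicts.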

The relations in Proposition 20 are frequently quite useful in the determination 
of the invariant factors for a matrix A, particularly for matrices of small degree (cf. 
Exercises 3 and 4 and the examples). The following result (which relies on Exercises 
16 to 19 in the previous section and whose proof we outline in the exercises) computes 
the invariant factors in general. 

Let A be an n x n matrix over the field F. Then $xI - A$ is an n x n matrix with
entries in F[x]. The three operations

(a) interchanging two rows or columns

(b) adding a multiple (in F[x]) of one row or column to another

(c) multiplying any row or column by a unit in F[x], i.e., by a nonzero element in F,

are called elementary row and column operations.

Theorem 21. Let A be an n x n matrix over the field F. Using the three elementary
row and column operations above, the n x n matrix $xI - A$ with entries from F[x] can
be put into the diagonal form (called the Smith Normal Form for A)

\[ \begin{pmatrix} 1 & & & & & & \\ & \ddots & & & & & \\ & & 1 & & & & \\ & & & a_1(x) & & & \\ & & & & a_2(x) & & \\ & & & & & \ddots & \\ & & & & & & a_m(x) \end{pmatrix} \]

with monic nonzero elements $a_1(x), a_2(x), \dots, a_m(x)$ of F[x] with degrees at least
one and satisfying $a_1(x) \mid a_2(x) \mid \cdots \mid a_m(x)$. The elements $a_1(x), \dots, a_m(x)$ are the
invariant factors of A.

Proof: cf. the exercises.

Invariant Factor Decomposition Algorithm: Converting to Rational
Canonical Form

As mentioned in the exercises near the end of the previous section, keeping track of
the operations necessary to diagonalize $xI - A$ will explicitly give a matrix P such
that $P^{-1}AP$ is in rational canonical form. Equivalently, if V is a given F[x]-module
with vector space basis $[e_1, e_2, \dots, e_n]$, then P defines the change of basis giving the
Invariant Factor Decomposition of V into a direct sum of cyclic F[x]-modules. In
particular, if A is the matrix of the linear transformation T of the F[x]-module V
defined by x (i.e., $T(e_j) = xe_j = \sum_{i=1}^{n} a_{ij}e_i$ where $A = (a_{ij})$), then the matrix P
defines the change of basis for V with respect to which the matrix for T is in rational
canonical form.

We first describe the algorithm in the general context of determining the Invariant
Factor Decomposition of a given F[x]-module V with vector space basis $[e_1, e_2, \dots, e_n]$
(the proof is outlined in the exercises). We then describe the algorithm to convert a given
n x n matrix A to rational canonical form (in which reference to an underlying vector
space and associated linear transformation is suppressed).

Explicit numerical examples of this algorithm are given in Examples 2 and 3 fol- 
lowing. 

Invariant Factor Decomposition Algorithm 

Let V be an F[x]-module with vector space basis $[e_1, e_2, \dots, e_n]$ (so in particular these
elements are generators for V as an F[x]-module). Let T be the linear transformation
of V to itself defined by x and let A be the n x n matrix associated to T and this choice
of basis for V, i.e.,

\[ T(e_j) = xe_j = \sum_{i=1}^{n} a_{ij}e_i \quad \text{where } A = (a_{ij}). \]

(1) Use the following three elementary row and column operations to diagonalize the
matrix $xI - A$ over F[x], keeping track of the row operations used:

(a) interchange two rows or columns (which will be denoted by $R_i \leftrightarrow R_j$ for the
interchange of the $i$th and $j$th rows and similarly by $C_i \leftrightarrow C_j$ for columns),

(b) add a multiple (in F[x]) of one row or column to another (which will be denoted
by $R_i + p(x)R_j \mapsto R_i$ if p(x) times the $j$th row is added to the $i$th row, and
similarly by $C_i + p(x)C_j \mapsto C_i$ for columns),

(c) multiply any row or column by a unit in F[x], i.e., by a nonzero element in
F (which will be denoted by $uR_i$ if the $i$th row is multiplied by $u \in F^\times$, and
similarly by $uC_i$ for columns).

(2) Beginning with the F[x]-module generators $[e_1, e_2, \dots, e_n]$, for each row operation
used in (1), change the set of generators by the following rules:

(a) If the $i$th row is interchanged with the $j$th row then interchange the $i$th and $j$th
generators.

(b) If p(x) times the $j$th row is added to the $i$th row then subtract p(x) times the
$i$th generator from the $j$th generator (note the indices).

(c) If the $i$th row is multiplied by the unit $u \in F$ then divide the $i$th generator by u.

(3) When $xI - A$ has been diagonalized to the form in Theorem 21 the generators
$[e_1, e_2, \dots, e_n]$ for V will be in the form of F[x]-linear combinations of
$e_1, e_2, \dots, e_n$. Use $xe_j = T(e_j) = \sum_{i=1}^{n} a_{ij}e_i$ to write these elements as
F-linear combinations of $e_1, e_2, \dots, e_n$. When $xI - A$ has been diagonalized, the
first $n - m$ of these linear combinations are 0 (providing a useful numerical check
on the computations) and the remaining m linear combinations are nonzero, i.e.,
the generators for V are in the form $[0, \dots, 0, f_1, \dots, f_m]$ corresponding precisely
to the diagonal elements in Theorem 21. The elements $f_1, \dots, f_m$ are a set of
F[x]-module generators for the cyclic factors in the invariant factor decomposition
of V (with annihilators $(a_1(x)), \dots, (a_m(x))$, respectively):

\[ V = F[x]f_1 \oplus F[x]f_2 \oplus \cdots \oplus F[x]f_m, \]

\[ F[x]f_i \cong F[x]/(a_i(x)), \qquad i = 1, 2, \dots, m, \]

giving the Invariant Factor Decomposition of the F[x]-module V.

(4) The corresponding vector space basis for each cyclic factor of V is then given by 
the elements f it Tf , T 2 f it .... 7 ^ *■(*)-’ /■. 

(5) Write the element of the vector space basis computed in (4) in terms of the 
original vector space basis [ci, C 2 . - > ■ » c«] and use the coordinates for the Ar* column 
of an n x n matrix P. Then P~ l A P is in rational canonical form (with diagonal 
blocks the companion matrices for the fl, (x)). This is the matrix for the linear 
transformation T with respect to the vector space basis in (4). 

We now describe the algorithm to convert a given $n \times n$ matrix $A$ to rational canonical
form, i.e., to determine an $n \times n$ matrix $P$ so that $P^{-1}AP$ is in rational canonical form.
This is nothing more than the algorithm above applied to the vector space $V = F^n$
of $n \times 1$ column vectors with standard basis $[e_1, e_2, \dots, e_n]$ (where $e_i$ is the column
vector with 1 in the $i$-th position and 0's elsewhere) and $T$ is the linear transformation
defined by $A$ and this choice of basis. Explicit references to this underlying vector space
and associated linear transformation are suppressed, so the algorithm is purely matrix
theoretic.

Converting an $n \times n$ Matrix to Rational Canonical Form

Let $A$ be an $n \times n$ matrix with entries in the field $F$.

(1) Use the following three elementary row and column operations to diagonalize the
matrix $xI - A$ over $F[x]$, keeping track of the row operations used:

(a) interchange two rows or columns (which will be denoted by $R_i \leftrightarrow R_j$ for the
interchange of the $i$-th and $j$-th rows and similarly by $C_i \leftrightarrow C_j$ for columns),

(b) add a multiple (in $F[x]$) of one row or column to another (which will be denoted
by $R_i + p(x)R_j \mapsto R_i$ if $p(x)$ times the $j$-th row is added to the $i$-th row, and
similarly by $C_i + p(x)C_j \mapsto C_i$ for columns),

(c) multiply any row or column by a unit in $F[x]$, i.e., by a nonzero element in
$F$ (which will be denoted by $uR_i$ if the $i$-th row is multiplied by $u \in F^{\times}$, and
similarly by $uC_i$ for columns).

Define $d_1, \dots, d_m$ to be the degrees of the monic nonconstant polynomials
$a_1(x), \dots, a_m(x)$ appearing on the diagonal, respectively.

(2) Beginning with the $n \times n$ identity matrix $P'$, for each row operation used in (1),
change the matrix $P'$ by the following rules:

(a) If $R_i \leftrightarrow R_j$ then interchange the $i$-th and $j$-th columns of $P'$ (i.e., $C_i \leftrightarrow C_j$ for
$P'$).

(b) If $R_i + p(x)R_j \mapsto R_i$ then subtract the product of the matrix $p(A)$ times the
$i$-th column of $P'$ from the $j$-th column of $P'$ (i.e., $C_j - p(A)C_i \mapsto C_j$ for $P'$;
note the indices).

(c) If $uR_i$ then divide the elements of the $i$-th column of $P'$ by $u$ (i.e., $u^{-1}C_i$ for
$P'$).

(3) When $xI - A$ has been diagonalized to the form in Theorem 21 the first $n - m$
columns of the matrix $P'$ are 0 (providing a useful numerical check on the computations)
and the remaining $m$ columns of $P'$ are nonzero. For each $i = 1, 2, \dots, m$,
multiply the $i$-th nonzero column of $P'$ successively by $A^0 = I, A^1, A^2, \dots, A^{d_i - 1}$,
where $d_i$ is the integer in (1) above and use the resulting column vectors (in this
order) as the next $d_i$ columns of an $n \times n$ matrix $P$. Then $P^{-1}AP$ is in rational
canonical form (whose diagonal blocks are the companion matrices for the
polynomials $a_1(x), \dots, a_m(x)$ in (1)).
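
Step (3) of this algorithm is mechanical enough to sketch in a few lines of Python. The sketch below is only an illustration under stated assumptions: the helper names `mat_vec`, `mat_mul` and `build_P` are hypothetical, exact arithmetic over Q is modelled with `fractions.Fraction`, and the data are those of the matrix A treated in Examples 1 and 2 of this section (nonzero columns of P' equal to (-7, 7, 1) and (-1, 1, 0), with degrees 1 and 2).

```python
from fractions import Fraction


def mat_vec(A, v):
    """Multiply the square matrix A (a list of rows) by the column vector v."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def mat_mul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def build_P(A, nonzero_cols, degrees):
    """Step (3): replace each nonzero column c of P' by c, Ac, ..., A^(d-1) c."""
    cols = []
    for c, d in zip(nonzero_cols, degrees):
        v = [Fraction(x) for x in c]
        for _ in range(d):
            cols.append(v)
            v = mat_vec(A, v)
    n = len(A)
    # cols lists the columns of P; transpose to store P as a list of rows.
    return [[cols[j][i] for j in range(n)] for i in range(n)]


A = [[2, -2, 14], [0, 3, -7], [0, 0, 2]]
P = build_P(A, [[-7, 7, 1], [-1, 1, 0]], [1, 2])
R = [[2, 0, 0], [0, 0, -6], [0, 1, 5]]  # block form expected for x-2, x^2-5x+6

# Checking AP = PR avoids inverting P; it is equivalent to P^{-1}AP = R
# once P is known to be invertible.
print(mat_mul(A, P) == mat_mul(P, R))
```

Comparing AP with PR rather than computing an inverse keeps the check in exact integer arithmetic.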


In the theory of canonical forms for linear transformations (or matrices) the characteristic
polynomial plays the role of the order of a finite abelian group and the minimal
polynomial plays the role of the exponent (after all, they are the same invariants, one
for modules over the Principal Ideal Domain $\mathbb{Z}$ and the other for modules over the
Principal Ideal Domain $F[x]$) so we can solve problems directly analogous to those
we considered for finite abelian groups in Chapter 5. In particular, this includes the
following:

(A) determine the rational canonical form of a given matrix (analogous to decomposing 
a finite abelian group as a direct product of cyclic groups) 

(B) determine whether two given matrices are similar (analogous to determining whether
two given finite abelian groups are isomorphic)

(C) determine all similarity classes of matrices over F with a given characteristic poly- 
nomial (analogous to determining all abelian groups of a given order) 

(D) determine all similarity classes of n x n matrices over F with a given minimal 
polynomial (analogous to determining all abelian groups of rank at most n of a 
given exponent). 


Examples 

(1) We find the rational canonical forms of the following matrices over $\mathbb{Q}$ and determine
if they are similar:

\[
A = \begin{pmatrix} 2 & -2 & 14 \\ 0 & 3 & -7 \\ 0 & 0 & 2 \end{pmatrix}, \quad
B = \begin{pmatrix} 0 & -4 & 85 \\ 1 & 4 & -30 \\ 0 & 0 & 3 \end{pmatrix}, \quad
C = \begin{pmatrix} 2 & 2 & 1 \\ 0 & 2 & -1 \\ 0 & 0 & 3 \end{pmatrix}.
\]

A direct computation shows that all three of these matrices have the same characteristic
polynomial: $c_A(x) = c_B(x) = c_C(x) = (x-2)^2(x-3)$. Since the minimal and characteristic
polynomials have the same roots, the only possibilities for the minimal polynomials
are $(x-2)(x-3)$ or $(x-2)^2(x-3)$. We quickly find that $(A-2I)(A-3I) = 0$,
$(B-2I)(B-3I) \ne 0$ (the 1,1-entry is nonzero) and $(C-2I)(C-3I) \ne 0$ (the
1,2-entry is nonzero). It follows that

\[ m_A(x) = (x-2)(x-3), \quad m_B(x) = m_C(x) = (x-2)^2(x-3). \]

It follows immediately that there are no additional invariant factors for $B$ and $C$.
Since the invariant factors for $A$ divide the minimal polynomial and have product
the characteristic polynomial, we see that $A$ has for invariant factors the polynomials
$x-2$, $(x-2)(x-3) = x^2 - 5x + 6$. (For $2 \times 2$ and $3 \times 3$ matrices the determination
of the characteristic and minimal polynomials determines all the invariant factors, cf.
Exercises 3 and 4.) We conclude that $B$ and $C$ are similar and neither is similar to $A$.
The rational canonical forms are (note $(x-2)^2(x-3) = x^3 - 7x^2 + 16x - 12$)

\[
\begin{pmatrix} 2 & 0 & 0 \\ 0 & 0 & -6 \\ 0 & 1 & 5 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 0 & 12 \\ 1 & 0 & -16 \\ 0 & 1 & 7 \end{pmatrix}, \quad
\begin{pmatrix} 0 & 0 & 12 \\ 1 & 0 & -16 \\ 0 & 1 & 7 \end{pmatrix}.
\]
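
The three matrix products quoted in Example 1 can be confirmed numerically. The following Python sketch is an illustration only: the helper names `mat_mul`, `shifted` and `is_zero` are hypothetical, and the entries of A, B and C are taken as read from the display in Example 1.

```python
def mat_mul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def shifted(M, c):
    """Return M - c*I for a square matrix M."""
    n = len(M)
    return [[M[i][j] - (c if i == j else 0) for j in range(n)] for i in range(n)]


def is_zero(M):
    return all(v == 0 for row in M for v in row)


A = [[2, -2, 14], [0, 3, -7], [0, 0, 2]]
B = [[0, -4, 85], [1, 4, -30], [0, 0, 3]]
C = [[2, 2, 1], [0, 2, -1], [0, 0, 3]]

for name, M in (("A", A), ("B", B), ("C", C)):
    quad = mat_mul(shifted(M, 2), shifted(M, 3))   # (M - 2I)(M - 3I)
    cubic = mat_mul(shifted(M, 2), quad)           # (M - 2I)^2 (M - 3I)
    print(name, "killed by (x-2)(x-3):", is_zero(quad),
          "| killed by (x-2)^2(x-3):", is_zero(cubic))
```

Only A is annihilated by the quadratic, while all three are annihilated by the cubic, in agreement with the minimal polynomials found above.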

(2) In the example above the rational canonical forms were obtained simply by determining
the characteristic and minimal polynomials for the matrices. As mentioned, this is
sufficient for $2 \times 2$ and $3 \times 3$ matrices since this information is sufficient to determine
all of the invariant factors. For larger matrices, however, this is in general not sufficient
(cf. the next example) and more work is required to determine the invariant factors. In
this example we again compute the rational canonical form for the matrix $A$ in Example
1 following the two algorithms outlined above. While this is computationally more
difficult for this small matrix (as will be apparent), it has the advantage even in this
case that it also explicitly computes a matrix $P$ with $P^{-1}AP$ in rational canonical
form.

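I. (Invariant Factor Decomposition) We use row and column operations (in $\mathbb{Q}[x]$) to
reduce the matrix

\[ xI - A = \begin{pmatrix} x-2 & 2 & -14 \\ 0 & x-3 & 7 \\ 0 & 0 & x-2 \end{pmatrix} \]

to diagonal form. As in the invariant factor decomposition algorithm, we shall use the
notation $R_i \leftrightarrow R_j$ to denote the interchange of the $i$-th and $j$-th rows, $R_i + aR_j \mapsto R_i$
if $a$ times the $j$-th row is added to the $i$-th row, simply $uR_i$ if the $i$-th row is multiplied
by $u$ (and similarly for columns, using $C$ instead of $R$). Note also that the first two
operations we perform below are rather ad hoc and were chosen simply to have integers
everywhere in the computation:

\[
\begin{aligned}
\begin{pmatrix} x-2 & 2 & -14 \\ 0 & x-3 & 7 \\ 0 & 0 & x-2 \end{pmatrix}
&\xrightarrow{R_1 + R_2 \mapsto R_1}
\begin{pmatrix} x-2 & x-1 & -7 \\ 0 & x-3 & 7 \\ 0 & 0 & x-2 \end{pmatrix}
\xrightarrow{C_1 - C_2 \mapsto C_1}
\begin{pmatrix} -1 & x-1 & -7 \\ -x+3 & x-3 & 7 \\ 0 & 0 & x-2 \end{pmatrix} \\
&\xrightarrow{-R_1}
\begin{pmatrix} 1 & -x+1 & 7 \\ -x+3 & x-3 & 7 \\ 0 & 0 & x-2 \end{pmatrix}
\xrightarrow{R_2 + (x-3)R_1 \mapsto R_2}
\begin{pmatrix} 1 & -x+1 & 7 \\ 0 & -x^2+5x-6 & 7(x-2) \\ 0 & 0 & x-2 \end{pmatrix} \\
&\xrightarrow{C_2 + (x-1)C_1 \mapsto C_2}
\begin{pmatrix} 1 & 0 & 7 \\ 0 & -x^2+5x-6 & 7(x-2) \\ 0 & 0 & x-2 \end{pmatrix}
\xrightarrow{C_3 - 7C_1 \mapsto C_3}
\begin{pmatrix} 1 & 0 & 0 \\ 0 & -x^2+5x-6 & 7(x-2) \\ 0 & 0 & x-2 \end{pmatrix} \\
&\xrightarrow{-C_2}
\begin{pmatrix} 1 & 0 & 0 \\ 0 & x^2-5x+6 & 7(x-2) \\ 0 & 0 & x-2 \end{pmatrix}
\xrightarrow{R_2 - 7R_3 \mapsto R_2}
\begin{pmatrix} 1 & 0 & 0 \\ 0 & x^2-5x+6 & 0 \\ 0 & 0 & x-2 \end{pmatrix} \\
&\xrightarrow{R_2 \leftrightarrow R_3,\ C_2 \leftrightarrow C_3}
\begin{pmatrix} 1 & 0 & 0 \\ 0 & x-2 & 0 \\ 0 & 0 & x^2-5x+6 \end{pmatrix}.
\end{aligned}
\]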

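This determines the invariant factors $x-2$, $x^2-5x+6$ for this matrix, which we
determined in Example 1 above. Let now $V$ be a 3-dimensional vector space over
$\mathbb{Q}$ with basis $e_1, e_2, e_3$ and let $T$ be the corresponding linear transformation (which
defines the action of $x$ on $V$), i.e.,

\[
\begin{aligned}
x e_1 &= T(e_1) = 2e_1 \\
x e_2 &= T(e_2) = -2e_1 + 3e_2 \\
x e_3 &= T(e_3) = 14e_1 - 7e_2 + 2e_3.
\end{aligned}
\]

The row operations used in the reduction above were

\[ R_1 + R_2 \mapsto R_1, \quad -R_1, \quad R_2 + (x-3)R_1 \mapsto R_2, \quad R_2 - 7R_3 \mapsto R_2, \quad R_2 \leftrightarrow R_3. \]

Starting with the basis $[e_1, e_2, e_3]$ for $V$ and changing it according to the rules given
in the text, we obtain

\[
\begin{aligned}
[e_1, e_2, e_3] &\to [e_1,\ e_2 - e_1,\ e_3] \to [-e_1,\ e_2 - e_1,\ e_3] \\
&\to [-e_1 - (x-3)(e_2 - e_1),\ e_2 - e_1,\ e_3] \\
&\to [-e_1 - (x-3)(e_2 - e_1),\ e_2 - e_1,\ e_3 + 7(e_2 - e_1)] \\
&\to [-e_1 - (x-3)(e_2 - e_1),\ e_3 + 7(e_2 - e_1),\ e_2 - e_1].
\end{aligned}
\]

Using the formulas above for the action of $x$, we see that these last elements are
the elements $[0,\ -7e_1 + 7e_2 + e_3,\ -e_1 + e_2]$ of $V$ corresponding to the elements
$1$, $x-2$ and $x^2-5x+6$ in the diagonalized form of $xI - A$, respectively. The
elements $f_1 = -7e_1 + 7e_2 + e_3$ and $f_2 = -e_1 + e_2$ are therefore $\mathbb{Q}[x]$-module
generators for the two cyclic factors of $V$ in its invariant factor decomposition as a
$\mathbb{Q}[x]$-module. The corresponding $\mathbb{Q}$-vector space bases for these two factors are then
$f_1$ and $f_2$, $x f_2 = T f_2$, i.e., $-7e_1 + 7e_2 + e_3$ and $-e_1 + e_2$, $T(-e_1 + e_2) = -4e_1 + 3e_2$.
Then the matrix

\[ P = \begin{pmatrix} -7 & -1 & -4 \\ 7 & 1 & 3 \\ 1 & 0 & 0 \end{pmatrix} \]

conjugates $A$ into its rational canonical form:

\[ P^{-1}AP = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 0 & -6 \\ 0 & 1 & 5 \end{pmatrix} \]

as one easily checks.

II. (Converting $A$ Directly to Rational Canonical Form) We use the row operations
involved in the diagonalization of $xI - A$ to determine the matrix $P'$ of the algorithm
above:

\[
\begin{aligned}
\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
&\xrightarrow{C_2 - C_1 \mapsto C_2}
\begin{pmatrix} 1 & -1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\xrightarrow{-C_1}
\begin{pmatrix} -1 & -1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\xrightarrow{C_1 - (A - 3I)C_2 \mapsto C_1}
\begin{pmatrix} 0 & -1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \\
&\xrightarrow{C_3 + 7C_2 \mapsto C_3}
\begin{pmatrix} 0 & -1 & -7 \\ 0 & 1 & 7 \\ 0 & 0 & 1 \end{pmatrix}
\xrightarrow{C_2 \leftrightarrow C_3}
\begin{pmatrix} 0 & -7 & -1 \\ 0 & 7 & 1 \\ 0 & 1 & 0 \end{pmatrix} = P'.
\end{aligned}
\]

Here we have $d_1 = 1$ and $d_2 = 2$, corresponding to the second and third columns
of $P'$ (its nonzero columns), respectively. The columns of $P$ are therefore given by

\[
\begin{pmatrix} -7 \\ 7 \\ 1 \end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix} -1 \\ 1 \\ 0 \end{pmatrix},\
A \begin{pmatrix} -1 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} -4 \\ 3 \\ 0 \end{pmatrix},
\]

respectively, which again gives the matrix $P$ above.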



(3) For the $3 \times 3$ matrix $A$ it was not necessary to perform the lengthy calculations
above merely to determine the rational canonical form (equivalently, the invariant
factors), as we saw in Example 1. For $n \times n$ matrices with $n \ge 4$, however, the
computation of the characteristic and minimal polynomials is in general not sufficient
for the determination of all the invariant factors, so the more extensive calculations of
the previous example may become necessary. For example, consider the matrix

\[ D = \begin{pmatrix} 1 & 2 & -4 & 4 \\ 2 & -1 & 4 & -8 \\ 1 & 0 & 1 & -2 \\ 0 & 1 & -2 & 3 \end{pmatrix}. \]

A short computation shows that the characteristic polynomial of $D$ is $(x-1)^4$. The
possible minimal polynomials are then $x-1$, $(x-1)^2$, $(x-1)^3$ and $(x-1)^4$. Clearly
$D - I \ne 0$ and another short computation shows that $(D - I)^2 = 0$, so the minimal
polynomial for $D$ is $(x-1)^2$. There are then two possible sets of invariant factors:

\[ x-1,\ x-1,\ (x-1)^2 \quad\text{and}\quad (x-1)^2,\ (x-1)^2. \]

To determine the invariant factors for $D$ we apply the procedure of the previous example
to the $4 \times 4$ matrix

\[ xI - D = \begin{pmatrix} x-1 & -2 & 4 & -4 \\ -2 & x+1 & -4 & 8 \\ -1 & 0 & x-1 & 2 \\ 0 & -1 & 2 & x-3 \end{pmatrix}. \]

The diagonal matrix obtained from this matrix by elementary row and column operations
is the matrix

\[ \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & (x-1)^2 & 0 \\ 0 & 0 & 0 & (x-1)^2 \end{pmatrix} \]

which shows that the invariant factors for $D$ are $(x-1)^2$, $(x-1)^2$ (one series of
elementary row and column operations which diagonalize $xI - D$ is $R_1 \leftrightarrow R_3$, $-R_1$,
$R_2 + 2R_1 \mapsto R_2$, $R_3 - (x-1)R_1 \mapsto R_3$, $C_3 + (x-1)C_1 \mapsto C_3$, $C_4 + 2C_1 \mapsto C_4$,
$R_2 \leftrightarrow R_4$, $-R_2$, $R_3 + 2R_2 \mapsto R_3$, $R_4 - (x+1)R_2 \mapsto R_4$, $C_3 + 2C_2 \mapsto C_3$,
$C_4 + (x-3)C_2 \mapsto C_4$).


I. (Invariant Factor Decomposition) If $e_1, e_2, e_3, e_4$ is a basis for $V$ in this case, then
using the row operations in this diagonalization as in the previous example we see that
the generators of $V$ corresponding to the factors above are $(x-1)e_1 - 2e_2 - e_3 = 0$,
$-2e_1 + (x+1)e_2 - e_4 = 0$, $e_1$, $e_2$. Hence a vector space basis for the two direct factors
in the invariant decomposition of $V$ in this case is given by $e_1, Te_1$ and $e_2, Te_2$ where
$T$ is the linear transformation defined by $D$, i.e., $e_1,\ e_1 + 2e_2 + e_3$ and $e_2,\ 2e_1 - e_2 + e_4$.
The corresponding matrix $P$ relating these bases is

\[ P = \begin{pmatrix} 1 & 1 & 0 & 2 \\ 0 & 2 & 1 & -1 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \]

so that $P^{-1}DP$ is in rational canonical form:

\[ P^{-1}DP = \begin{pmatrix} 0 & -1 & 0 & 0 \\ 1 & 2 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 2 \end{pmatrix} \]

as can easily be checked.
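
The check left to the reader can be carried out mechanically. The Python sketch below is an illustration under stated assumptions: the helper names `mat_mul`, `det` and `minus_identity` are hypothetical, D is read off from the matrix $xI - D$ displayed in this example, and R denotes the block matrix with two copies of the companion matrix of $(x-1)^2 = x^2 - 2x + 1$.

```python
def mat_mul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def det(M):
    """Determinant by cofactor expansion along the first row (fine for 4 x 4)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, a in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * a * det(minor)
    return total


def minus_identity(M):
    """Return M - I."""
    return [[M[i][j] - (1 if i == j else 0) for j in range(len(M))]
            for i in range(len(M))]


D = [[1, 2, -4, 4], [2, -1, 4, -8], [1, 0, 1, -2], [0, 1, -2, 3]]
P = [[1, 1, 0, 2], [0, 2, 1, -1], [0, 1, 0, 0], [0, 0, 0, 1]]
R = [[0, -1, 0, 0], [1, 2, 0, 0], [0, 0, 0, -1], [0, 0, 1, 2]]

N = minus_identity(D)
print("(D - I)^2 = 0:", all(v == 0 for row in mat_mul(N, N) for v in row))
print("det P =", det(P))                       # nonzero, so P is invertible
print("DP = PR:", mat_mul(D, P) == mat_mul(P, R))
```

Since P is invertible, the equality DP = PR is the same statement as $P^{-1}DP = R$.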


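II. (Converting $D$ Directly to Rational Canonical Form) As in Example 2 we determine
the matrix $P'$ of the algorithm from the row operations used in the diagonalization of
$xI - D$:

\[
\begin{aligned}
\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
&\xrightarrow{C_1 \leftrightarrow C_3}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
\xrightarrow{-C_1}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \\
&\xrightarrow{C_1 - 2C_2 \mapsto C_1}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ -2 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
\xrightarrow{C_1 + (D - I)C_3 \mapsto C_1}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \\
&\xrightarrow{C_2 \leftrightarrow C_4}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{pmatrix}
\xrightarrow{-C_2}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix} \\
&\xrightarrow{C_2 - 2C_3 \mapsto C_2}
\begin{pmatrix} 0 & -2 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}
\xrightarrow{C_2 + (D + I)C_4 \mapsto C_2}
\begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} = P'.
\end{aligned}
\]

Here we have $d_1 = 2$ and $d_2 = 2$, corresponding to the third and fourth columns
of $P'$ (its nonzero columns). The columns of $P$ are therefore given by

\[
\begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix},\
D \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 1 \\ 2 \\ 1 \\ 0 \end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix},\
D \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 2 \\ -1 \\ 0 \\ 1 \end{pmatrix},
\]

respectively, which again gives the matrix $P$ above.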


(4) In this example we determine all similarity classes of matrices $A$ with entries from $\mathbb{Q}$
with characteristic polynomial $(x^4 - 1)(x^2 - 1)$. First note that any matrix with a degree
6 characteristic polynomial must be a $6 \times 6$ matrix. The polynomial $(x^4 - 1)(x^2 - 1)$
factors into irreducibles in $\mathbb{Q}[x]$ as $(x-1)^2(x+1)^2(x^2+1)$. Since the minimal
polynomial $m_A(x)$ for $A$ has the same roots as $c_A(x)$ it follows that $(x-1)(x+1)(x^2+1)$
divides $m_A(x)$. Suppose $a_1(x), \dots, a_m(x)$ are the invariant factors of some $A$, so
$a_m(x) = m_A(x)$, $a_i(x) \mid a_{i+1}(x)$ (in particular, all the invariant factors divide $m_A(x)$)
and $a_1(x) a_2(x) \cdots a_m(x) = (x^4 - 1)(x^2 - 1)$. One easily sees that the only permissible
lists under these constraints are

(a) $(x-1)(x+1),\ (x-1)(x+1)(x^2+1)$

(b) $x-1,\ (x-1)(x+1)^2(x^2+1)$

(c) $x+1,\ (x-1)^2(x+1)(x^2+1)$

(d) $(x-1)^2(x+1)^2(x^2+1)$.

One can now easily write out the corresponding direct sums of companion matrices
to obtain representatives of the 4 similarity classes. We shall see in the next section
that there are still only 4 similarity classes even in $M_6(\mathbb{C})$.
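
The enumeration in Example 4 can also be organised by machine. The Python sketch below is one possible way to do so and is not the argument of the text: it assumes the standard fact that a chain $a_1 \mid a_2 \mid \cdots \mid a_m$ with prescribed product corresponds to choosing, for each irreducible factor of the characteristic polynomial, a partition of its exponent. The function names and the string labels for the irreducible factors are hypothetical.

```python
from itertools import product


def partitions(n, largest=None):
    """Yield the partitions of n as non-increasing tuples of positive parts."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest


def invariant_factor_lists(exponents):
    """exponents maps a label for each irreducible factor to its power in c_A(x).

    Returns every chain a_1 | a_2 | ... | a_m with product c_A(x); each a_i is
    recorded as a dict {label: power}, smallest factor first.
    """
    labels = list(exponents)
    chains = []
    for choice in product(*(list(partitions(exponents[p])) for p in labels)):
        m = max(len(parts) for parts in choice)
        chain = []
        for k in range(m):  # k = 0 collects the largest powers, i.e. a_m
            chain.append({p: parts[k]
                          for p, parts in zip(labels, choice) if k < len(parts)})
        chains.append(chain[::-1])
    return chains


lists = invariant_factor_lists({"x-1": 2, "x+1": 2, "x^2+1": 1})
for chain in lists:
    print(chain)
```

For the exponents 2, 2, 1 of this example the sketch produces four chains, matching the lists (a) to (d).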

(5) In this example we find all similarity classes of $3 \times 3$ matrices $A$ with entries from $\mathbb{Q}$
satisfying $A^6 = I$. For each such $A$, its minimal polynomial divides $x^6 - 1$ and in
$\mathbb{Q}[x]$ the complete factorization of this polynomial is

\[ x^6 - 1 = (x-1)(x+1)(x^2-x+1)(x^2+x+1). \]

Conversely, if $B$ is any $3 \times 3$ matrix whose minimal polynomial divides $x^6 - 1$, then
$B^6 = I$. The only restriction on the minimal polynomial for $B$ is that its degree is
at most 3 (by the Cayley-Hamilton Theorem). The only possibilities for the minimal
polynomial of such a matrix $A$ are therefore

(a) $x-1$
(b) $x+1$
(c) $x^2-x+1$
(d) $x^2+x+1$
(e) $(x-1)(x+1)$
(f) $(x-1)(x^2-x+1)$
(g) $(x-1)(x^2+x+1)$
(h) $(x+1)(x^2-x+1)$
(i) $(x+1)(x^2+x+1)$.

Under the constraints of the rational canonical form these give rise to the following
permissible lists of invariant factors:

(i) $x-1,\ x-1,\ x-1$
(ii) $x+1,\ x+1,\ x+1$
(iii) $x-1,\ (x-1)(x+1)$
(iv) $x+1,\ (x-1)(x+1)$
(v) $(x-1)(x^2-x+1)$
(vi) $(x-1)(x^2+x+1)$
(vii) $(x+1)(x^2-x+1)$
(viii) $(x+1)(x^2+x+1)$.

Note that it is impossible to have a suitable set of invariant factors if the minimal
polynomial is $x^2+x+1$ or $x^2-x+1$. One can now write out the corresponding
rational canonical forms; for example, (i) is $I$, (ii) is $-I$ and (iii) is

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}. \]

Note also that another way of phrasing this result is that any $3 \times 3$ matrix with entries
from $\mathbb{Q}$ whose order (multiplicatively, of course) divides 6 is similar to one of these 8
matrices, so this example determines all elements of orders 1, 2, 3 and 6 in the group
$GL_3(\mathbb{Q})$ (up to similarity).
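
As a sanity check on Example 5, the eight rational canonical forms can be built and raised to the sixth power. The Python sketch below is an illustration only; the helper names are hypothetical, polynomials are stored as coefficient lists with the constant term first, and the companion matrix follows the convention of Exercise 8 of this section (1's on the subdiagonal, negated coefficients in the last column).

```python
def poly_mul(p, q):
    """Multiply polynomials given as coefficient lists, constant term first."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def companion(p):
    """Companion matrix of the monic polynomial p (constant term first)."""
    n = len(p) - 1
    C = [[0] * n for _ in range(n)]
    for i in range(1, n):
        C[i][i - 1] = 1        # 1's on the subdiagonal
    for i in range(n):
        C[i][n - 1] = -p[i]    # last column: -a_0, ..., -a_{n-1}
    return C


def block_diag(blocks):
    """Direct sum of square matrices."""
    n = sum(len(b) for b in blocks)
    M = [[0] * n for _ in range(n)]
    k = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                M[k + i][k + j] = v
        k += len(b)
    return M


def mat_mul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def mat_pow(M, e):
    n = len(M)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(e):
        result = mat_mul(result, M)
    return result


xm1, xp1 = [-1, 1], [1, 1]               # x - 1, x + 1
qm, qp = [1, -1, 1], [1, 1, 1]           # x^2 - x + 1, x^2 + x + 1
factor_lists = [
    [xm1, xm1, xm1], [xp1, xp1, xp1],
    [xm1, poly_mul(xm1, xp1)], [xp1, poly_mul(xm1, xp1)],
    [poly_mul(xm1, qm)], [poly_mul(xm1, qp)],
    [poly_mul(xp1, qm)], [poly_mul(xp1, qp)],
]
forms = [block_diag([companion(a) for a in fl]) for fl in factor_lists]
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
print(all(mat_pow(M, 6) == identity for M in forms))
```

The third entry of `forms` is the matrix displayed for list (iii) above.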


EXERCISES 

1. Prove that similar linear transformations of $V$ (or $n \times n$ matrices) have the same
characteristic and the same minimal polynomial.

2. Let $M$ be as in Lemma 19. Prove that the minimal polynomial of $M$ is the least common
multiple of the minimal polynomials of $A_1, \dots, A_k$.

3. Prove that two $2 \times 2$ matrices over $F$ which are not scalar matrices are similar if and only
if they have the same characteristic polynomial.

4. Prove that two $3 \times 3$ matrices are similar if and only if they have the same characteristic
and same minimal polynomials. Give an explicit counterexample to this assertion for $4 \times 4$
matrices.

5. Prove directly from the fact that the collection of all linear transformations of an $n$
dimensional vector space $V$ over $F$ to itself form a vector space over $F$ of dimension $n^2$ that the
minimal polynomial of a linear transformation $T$ has degree at most $n^2$.

6. Prove that the constant term in the characteristic polynomial of the $n \times n$ matrix $A$ is
$(-1)^n \det A$ and that the coefficient of $x^{n-1}$ is the negative of the sum of the diagonal
entries of $A$ (the sum of the diagonal entries of $A$ is called the trace of $A$). Prove that $\det A$
is the product of the eigenvalues of $A$ and that the trace of $A$ is the sum of the eigenvalues
of $A$.

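7. Determine the eigenvalues of the matrix

\[ \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \end{pmatrix}. \]

8. Verify that the characteristic polynomial of the companion matrix

\[ \begin{pmatrix} 0 & 0 & 0 & \cdots & 0 & -a_0 \\ 1 & 0 & 0 & \cdots & 0 & -a_1 \\ 0 & 1 & 0 & \cdots & 0 & -a_2 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \cdots & 1 & -a_{n-1} \end{pmatrix} \]

is

\[ x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0. \]

9. Find the rational canonical forms of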


( 


0 

0 

-1 


-1 

0 

0 


\ ( c 

0 


/ 

j, 0 

c 

1 

and 

I l-l 

1 

c) 

\ 


-420 

840 


465 

-463 

930 

-155 


15 -30 \ 
-15 30 

32 -60 
-5 12/ 


10. Find all similarity classes of $6 \times 6$ matrices over $\mathbb{Q}$ with minimal polynomial $(x+2)^2(x-1)$
(it suffices to give all lists of invariant factors and write out some of their corresponding
matrices).

11. Find all similarity classes of $6 \times 6$ matrices over $\mathbb{C}$ with characteristic polynomial
$(x^4 - 1)(x^2 - 1)$.

12. Find all similarity classes of $3 \times 3$ matrices $A$ over $\mathbb{F}_2$ satisfying $A^6 = I$ (compare with
the answer we computed over $\mathbb{Q}$). Do the same for $4 \times 4$ matrices $B$ satisfying $B^{20} = I$.

13. Prove that the number of similarity classes of $3 \times 3$ matrices over $\mathbb{Q}$ with a given
characteristic polynomial in $\mathbb{Q}[x]$ is the same as the number of similarity classes over any extension
field of $\mathbb{Q}$. Give an example to show that this is not true in general for $4 \times 4$ matrices.

14. Determine all possible rational canonical forms for a linear transformation with
characteristic polynomial $x^2(x^2+1)^2$.

15. Determine up to similarity all $2 \times 2$ rational matrices (i.e., $\in M_2(\mathbb{Q})$) of precise order 4
(multiplicatively, of course). Do the same if the matrix has entries from $\mathbb{C}$.

16. Show that $x^5 - 1 = (x-1)(x^2-4x+1)(x^2+5x+1)$ in $\mathbb{F}_{19}[x]$. Use this to determine
up to similarity all $2 \times 2$ matrices with entries from $\mathbb{F}_{19}$ of (multiplicative) order 5.

17. Determine representatives for the conjugacy classes for $GL_3(\mathbb{F}_2)$. [Compare your answer
with Theorem 15 and Proposition 14 of Chapter 6.]

18. Let $V$ be a finite dimensional vector space over $\mathbb{Q}$ and suppose $T$ is a nonsingular linear
transformation of $V$ such that $T^{-1} = T^2 + T$. Prove that the dimension of $V$ is divisible
by 3. If the dimension of $V$ is precisely 3 prove that all such transformations $T$ are similar.

19. Let $V$ be the infinite dimensional real vector space

\[ \mathbb{R}^{\infty} = \{ (a_0, a_1, a_2, \dots) \mid a_0, a_1, a_2, \dots \in \mathbb{R} \}. \]

Define the map $T : V \to V$ by $T(a_0, a_1, a_2, \dots) = (0, a_0, a_1, a_2, \dots)$. Prove that $T$ has
no eigenvectors.

20. Let $\ell$ be a prime and let $\Phi_\ell(x) = \frac{x^\ell - 1}{x - 1} = x^{\ell-1} + x^{\ell-2} + \cdots + x + 1 \in \mathbb{Z}[x]$ be the
$\ell$-th cyclotomic polynomial, which is irreducible over $\mathbb{Q}$ (Example 4 following Corollary
9.14). This exercise determines the smallest degree of a factor of $\Phi_\ell(x)$ modulo $p$ for
any prime $p$ and so in particular determines when $\Phi_\ell(x)$ is irreducible modulo $p$. (This
actually determines the complete factorization of $\Phi_\ell(x)$ modulo $p$, cf. Exercise 8 of
Section 13.6.)

(a) Show that if $p = \ell$ then $\Phi_\ell(x)$ is divisible by $x - 1$ in $\mathbb{F}_\ell[x]$.

(b) Suppose $p \ne \ell$ and let $f$ denote the order of $p$ in $\mathbb{F}_\ell^{\times}$, i.e., $f$ is the smallest positive
exponent with $p^f \equiv 1 \bmod \ell$. Show that $m = f$ is the first value of $m$ for which the group
$GL_m(\mathbb{F}_p)$ contains an element $A$ of order $\ell$. [Use the formula for the order of this
group at the end of Section 11.1.]

(c) Show that $\Phi_\ell(x)$ is not divisible by any polynomial of degree smaller than $f$ in $\mathbb{F}_p[x]$
[consider the companion matrix for such a divisor and use (b)]. Let $m_A(x) \in \mathbb{F}_p[x]$
denote the minimal polynomial for the matrix $A$ in (b) and conclude that $m_A(x)$ is
irreducible of degree $f$ and divides $\Phi_\ell(x)$ in $\mathbb{F}_p[x]$.

(d) In particular, prove that $\Phi_\ell(x)$ is irreducible modulo $p$ if and only if $\ell - 1$ is the smallest
positive exponent $f$ with $p^f \equiv 1 \bmod \ell$, i.e., $p$ is a primitive root modulo $\ell$.

21. Prove that the first two elementary row and column operations described before Theorem 21
do not change the determinant of the matrix and the third elementary operation multiplies
the determinant by a unit. Conclude from Theorem 21 that the characteristic polynomial
of $A$ differs by a unit from the product of the invariant factors of $A$. Since both these
polynomials are monic by definition, conclude that they are equal (this gives an alternate
proof of Proposition 20).

The following exercises outline the proof of Theorem 21. They carry out explicitly the
construction described in Exercises 16 to 19 of the previous section for the Euclidean Domain
$F[x]$. Let $V$ be an $n$-dimensional vector space with basis $v_1, v_2, \dots, v_n$ and let $T$ be the
linear transformation of $V$ defined by the matrix $A$ and this choice of basis, i.e., $T$ is the linear
transformation with

\[ T(v_j) = \sum_{i=1}^{n} a_{ij} v_i, \quad j = 1, 2, \dots, n \]

where $A = (a_{ij})$. Let $F[x]^n$ be the free module of rank $n$ over $F[x]$ and let $\xi_1, \xi_2, \dots, \xi_n$
denote a basis. Then we have a natural surjective $F[x]$-module homomorphism

\[ \varphi : F[x]^n \to V \]

defined by mapping $\xi_i$ to $v_i$, $i = 1, 2, \dots, n$. As indicated in the exercises of the previous section
the invariant factors for the $F[x]$-module $V$ can be determined once we have determined a set
of generators and the corresponding relations matrix for $\ker \varphi$. Since by definition $x$ acts on $V$
by the linear transformation $T$, we have

\[ x(v_j) = \sum_{i=1}^{n} a_{ij} v_i, \quad j = 1, 2, \dots, n. \]

22. Show that the elements

\[ \nu_j = -a_{1j}\xi_1 - \cdots - a_{j-1\,j}\xi_{j-1} + (x - a_{jj})\xi_j - a_{j+1\,j}\xi_{j+1} - \cdots - a_{nj}\xi_n \]

for $j = 1, 2, \dots, n$ are elements of the kernel of $\varphi$.

23. (a) Show that $x\xi_j = \nu_j + f_j$ where $f_j \in F\xi_1 + \cdots + F\xi_n$ is an element in the $F$-vector
space spanned by $\xi_1, \dots, \xi_n$.

(b) Show that

\[ F[x]\xi_1 + \cdots + F[x]\xi_n = (F[x]\nu_1 + \cdots + F[x]\nu_n) + (F\xi_1 + \cdots + F\xi_n). \]

24. Show that $\nu_1, \nu_2, \dots, \nu_n$ generate the kernel of $\varphi$. [Use the previous result to show that
any element of $\ker \varphi$ is the sum of an element in the module generated by $\nu_1, \nu_2, \dots, \nu_n$
and an element of the form $b_1\xi_1 + \cdots + b_n\xi_n$ where the $b_i$ are elements of $F$. Then show
that such an element is in $\ker \varphi$ if and only if all the $b_i$ are 0 since $v_1, \dots, v_n$ are a basis
for $V$ over $F$.]

25. Show that the generators $\nu_1, \nu_2, \dots, \nu_n$ of $\ker \varphi$ have corresponding relations matrix

\[
\begin{pmatrix}
x - a_{11} & -a_{21} & \cdots & -a_{n1} \\
-a_{12} & x - a_{22} & \cdots & -a_{n2} \\
\vdots & \vdots & \ddots & \vdots \\
-a_{1n} & -a_{2n} & \cdots & x - a_{nn}
\end{pmatrix}
= xI - A^t
\]

where $A^t$ is the transpose of $A$. Conclude that Theorem 21 and the algorithm for
determining the invariant factors of $A$ follow by Exercises 16 to 19 in the previous section
(note that the row and column operations necessary to diagonalize this relations matrix are
the column and row operations necessary to diagonalize the matrix in Theorem 21, which
explains why the invariant factor algorithm keeps track of the row operations used).


12.3 THE JORDAN CANONICAL FORM 

We continue with the notation in the previous section: $F$ is a field, $F[x]$ is the ring of
polynomials in $x$ with coefficients in $F$, $V$ is a finite dimensional vector space over $F$
of dimension $n$, $T$ is a fixed linear transformation of $V$ by which we make $V$ into an
$F[x]$-module, and $A$ is an $n \times n$ matrix with coefficients in $F$. Recall that once a basis
for $V$ has been fixed any linear transformation $T$ defines a matrix $A$ and conversely any
matrix $A$ defines a linear transformation $T$.

In the previous section we used the invariant factor form of the Fundamental Theorem
for finitely generated modules over the Principal Ideal Domain $F[x]$ to obtain the
rational canonical form for such a linear transformation $T$ and the rational canonical
form for such an $n \times n$ matrix $A$. In this section we use the elementary divisor form
of the Fundamental Theorem to obtain the Jordan canonical form. We shall see that
matrices in this canonical form are as close to being diagonal matrices as possible, so
the matrices are simpler than in the rational canonical form (but we lose some of the
"rationality" results).

The elementary divisors of a module are the prime power divisors of its invariant
factors (this was Corollary 10). For the $F[x]$-module $V$ the invariant factors were
monic polynomials $a_1(x), a_2(x), \dots, a_m(x)$ of degree at least one (with $a_1(x) \mid a_2(x) \mid
\cdots \mid a_m(x)$), so the associated elementary divisors are the powers of the irreducible
polynomial factors of these polynomials. These polynomials are only defined up to
multiplication by a unit and, as in the case of the invariant factors, we can specify them
uniquely by requiring that they be monic.

To obtain the simplest possible elementary divisors we shall assume that the polynomials
$a_1(x), a_2(x), \dots, a_m(x)$ factor completely into linear factors, i.e., that the
elementary divisors of $V$ are powers $(x - \lambda)^k$ of linear polynomials. Since the product
of the elementary divisors is the characteristic polynomial, this is equivalent to the
assumption that the field $F$ contains all the eigenvalues of the linear transformation $T$
(equivalently, of the matrix $A$ representing the linear transformation $T$).

Under this assumption on $F$, it follows immediately from Theorem 6 that $V$ is the
direct sum of finitely many cyclic $F[x]$-modules of the form $F[x]/(x - \lambda)^k$ where
$\lambda \in F$ is one of the eigenvalues of $T$, corresponding to the elementary divisors of $V$.

We now choose a vector space basis for each of the direct summands corresponding
to the elementary divisors of $V$ for which the corresponding matrix for $T$ is particularly
simple. Recall that by definition of the $F[x]$-module structure the linear transformation
$T$ acting on $V$ is the element $x$ acting by multiplication on each of the direct summands
$F[x]/(x - \lambda)^k$.

Consider the elements

\[ (x - \lambda)^{k-1},\ (x - \lambda)^{k-2},\ \dots,\ x - \lambda,\ 1 \]

in the quotient $F[x]/(x - \lambda)^k$. Expanding each of these polynomials in $x$ we see that
the matrix relating these elements to the $F$-basis $x^{k-1}, x^{k-2}, \dots, x, 1$ of $F[x]/(x - \lambda)^k$
is upper triangular with 1's along the diagonal. Since this is an invertible matrix (having
determinant 1), it follows that the elements above are an $F$-basis for $F[x]/(x - \lambda)^k$.
With respect to this basis the linear transformation of multiplication by $x$ acts in a
particularly simple manner (note that $x = \lambda + (x - \lambda)$ and that $(x - \lambda)^k = 0$ in the
quotient):

\[
\begin{aligned}
(x - \lambda)^{k-1} &\mapsto \lambda \cdot (x - \lambda)^{k-1} + (x - \lambda)^{k} = \lambda \cdot (x - \lambda)^{k-1} \\
(x - \lambda)^{k-2} &\mapsto \lambda \cdot (x - \lambda)^{k-2} + (x - \lambda)^{k-1} \\
&\ \ \vdots \\
x - \lambda &\mapsto \lambda \cdot (x - \lambda) + (x - \lambda)^{2} \\
1 &\mapsto \lambda \cdot 1 + (x - \lambda).
\end{aligned}
\]

With respect to this basis, the matrix for multiplication by $x$ is therefore

\[
\begin{pmatrix}
\lambda & 1 & & & \\
 & \lambda & 1 & & \\
 & & \ddots & \ddots & \\
 & & & \lambda & 1 \\
 & & & & \lambda
\end{pmatrix}
\]

where the blank entries are all zero. Such matrices are given a name:

Definition. The $k \times k$ matrix with $\lambda$ along the main diagonal and 1 along the first
superdiagonal depicted above is called the $k \times k$ elementary Jordan matrix with eigenvalue
$\lambda$ or the Jordan block of size $k$ with eigenvalue $\lambda$.
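
A short Python sketch makes the definition concrete. It is an illustration only: the function names `jordan_block` and `mat_mul` are hypothetical, and the values $\lambda = 5$, $k = 3$ are arbitrary. The sketch builds a Jordan block $J$ and checks that $N = J - \lambda I$ satisfies $N^{k-1} \ne 0$ and $N^k = 0$, reflecting the relation $(x - \lambda)^k = 0$ in the quotient $F[x]/(x - \lambda)^k$.

```python
def jordan_block(lam, k):
    """k x k matrix with lam on the diagonal and 1 on the first superdiagonal."""
    return [[lam if i == j else (1 if j == i + 1 else 0) for j in range(k)]
            for i in range(k)]


def mat_mul(X, Y):
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


lam, k = 5, 3
J = jordan_block(lam, k)
N = [[J[i][j] - (lam if i == j else 0) for j in range(k)] for i in range(k)]

N2 = mat_mul(N, N)      # N^(k-1) for k = 3
N3 = mat_mul(N2, N)     # N^k

print(J)
print("N^2 nonzero:", any(v != 0 for row in N2 for v in row))
print("N^3 zero:", all(v == 0 for row in N3 for v in row))
```

The same check with other values of `lam` and `k` requires taking the corresponding powers of `N`.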

Applying this to each of the cyclic factors of $V$ in its elementary divisor decomposition
we obtain a vector space basis for $V$ with respect to which the linear transformation
$T$ has as matrix the direct sum of the Jordan blocks corresponding to the elementary
divisors of $V$, i.e., is block diagonal with Jordan blocks along the diagonal:

\[
\begin{pmatrix}
J_1 & & & \\
 & J_2 & & \\
 & & \ddots & \\
 & & & J_t
\end{pmatrix}
\]

Notice that this matrix is uniquely determined up to permutation of the blocks along the
diagonal by the elementary divisors of the $F[x]$-module $V$ and conversely, by Theorem
9, the list of elementary divisors uniquely determines the module $V$ up to $F[x]$-module
isomorphism.

Definition.

(1) A matrix is said to be in Jordan canonical form if it is a block diagonal matrix
with Jordan blocks along the diagonal.

(2) A Jordan canonical form for a linear transformation $T$ is a matrix representing
$T$ which is in Jordan canonical form.

We have proved that any linear transformation $T$ has a Jordan canonical form.
As in the case of the rational canonical form, it follows from the uniqueness of the
elementary divisors that the Jordan canonical form is unique up to a permutation of the
Jordan blocks along the diagonal (hence is called the Jordan canonical form for $T$). We
summarize this in the following theorem.


Theorem 22. (Jordan Canonical Form for Linear Transformations) Let V be a finite 
dimensional vector space over the field F and let T be a linear transformation of V. 
Assume F contains all the eigenvalues of T. 

(1) There is a basis for V with respect to which the matrix for T is in Jordan 
canonical form, i.e., is a block diagonal matrix whose diagonal blocks are the 
Jordan blocks for the elementary divisors of V. 

(2) The Jordan canonical form for T is unique up to a permutation of the Jordan 
blocks along the diagonal. 

As for the rational canonical form, the following theorem gives the corresponding 
statement for n x n matrices over F. 

Theorem 23. (Jordan Canonical Form for Matrices) Let A be an n x n matrix over the 
field F and assume F contains all the eigenvalues of A. 

(1) The matrix A is similar to a matrix in Jordan canonical form, i.e., there is an
invertible n x n matrix P over F such that $P^{-1}AP$ is a block diagonal matrix
whose diagonal blocks are the Jordan blocks for the elementary divisors of A.

(2) The Jordan canonical form for A is unique up to a permutation of the Jordan 
blocks along the diagonal. 

The Jordan canonical form differs from a diagonal matrix only by the possible
presence of some 1's along the first superdiagonal (and then only if there are Jordan
blocks of size greater than one), hence is close to being a diagonal matrix. The following
result shows in particular that the Jordan canonical form for a matrix A is as close to
being a diagonal matrix as possible.

Corollary 24. 

(1) If a matrix A is similar to a diagonal matrix D, then D is the Jordan canonical 
form of A. 

(2) Two diagonal matrices are similar if and only if their diagonal entries are the
same up to a permutation.


Proof: The first assertion is immediate from the uniqueness of Jordan canonical 
forms because a diagonal matrix is itself in Jordan form (with Jordan blocks of size 1). 
The uniqueness of the Jordan canonical form gives (2). 

The next corollary gives a criterion to determine when a matrix A can be diagonal- 
ized. 

Corollary 25. If A is an n x n matrix with entries from F and F contains all the
eigenvalues of A, then A is similar to a diagonal matrix over F if and only if the
minimal polynomial of A has no repeated roots.

Proof: Suppose A is similar to a diagonal matrix. The minimal polynomial of a
diagonal matrix has no repeated roots (its roots are precisely the distinct elements along
the diagonal). Since similar matrices have the same minimal polynomial it follows that
the minimal polynomial for A has no repeated roots.

Conversely, suppose the minimal polynomial for A has no repeated roots and let
B be the Jordan canonical form of A. The matrix B is a block diagonal matrix with
elementary Jordan matrices down the diagonal. By the exercises at the end of the
preceding section the minimal polynomial for B is the least common multiple of the
minimal polynomials of the Jordan blocks. It is easy to see directly that a Jordan
block of size k with eigenvalue $\lambda$ has minimal polynomial $(x - \lambda)^k$ (note that this is
immediate from the fact that each elementary Jordan matrix gives the action on a cyclic
F[x]-submodule whose annihilator is $((x - \lambda)^k)$). Since A and B have the same minimal
polynomial, the least common multiple of the $(x - \lambda)^k$ cannot have any repeated roots.
It follows that k must be 1, i.e., that each Jordan block must be of size one and B is a
diagonal matrix.
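
The key fact used in the converse direction, that a Jordan block of size k has minimal polynomial of degree exactly k, can be checked by direct computation. The following is a minimal sketch in plain Python; the helper names `jordan_block`, `mat_mul` and `nilpotency_index` are illustrative choices and not notation from the text. It finds the smallest m for which the m-th power of (J minus eigenvalue times identity) vanishes.

```python
def jordan_block(lam, k):
    """Return the k x k Jordan block: lam on the diagonal, 1's on the first superdiagonal."""
    return [[lam if i == j else 1 if j == i + 1 else 0 for j in range(k)]
            for i in range(k)]


def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][t] * b[t][j] for t in range(n)) for j in range(n)]
            for i in range(n)]


def nilpotency_index(lam, k):
    """Smallest m >= 1 with (J - lam*I)^m = 0, where J is the k x k Jordan block."""
    block = jordan_block(lam, k)
    shifted = [[block[r][c] - (lam if r == c else 0) for c in range(k)]
               for r in range(k)]
    power = shifted
    m = 1
    # The loop terminates because J - lam*I is nilpotent.
    while any(any(row) for row in power):
        power = mat_mul(power, shifted)
        m += 1
    return m
```

For a block of size k the function returns k, so the minimal polynomial has a repeated root as soon as k is greater than 1, in line with the argument above.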

Changing From One Canonical Form to Another 

We continue to assume that the field F contains all the eigenvalues of T (or A) so both 
the rational and Jordan canonical forms exist over F. The process of passing from one 
form to the other is exactly the same algorithm described in Section 5.2 for finite abelian 
groups (where the elementary divisors were determined from the list of invariant factors 
and vice versa). 

In brief summary, recall that the elementary divisors are the prime power divisors 
of the invariant factors. They are obtained from the invariant factors by writing each 
invariant factor as a product of distinct linear factors to powers; the resulting set of 
powers of linear polynomials is the set of elementary divisors. For example, if the 
invariant factors of T are 

$(x-1)(x-3)^3$, $(x-1)(x-2)(x-3)^3$, $(x-1)(x-2)^2(x-3)^3$

then the elementary divisors are

$(x-1)$, $(x-3)^3$, $(x-1)$, $(x-2)$, $(x-3)^3$, $(x-1)$, $(x-2)^2$, $(x-3)^3$.

The largest invariant factor is the product of the largest of the distinct prime powers 
among the elementary divisors, the next largest invariant factor is the product of the 
largest of the distinct prime powers among the remaining elementary divisors, and so 
on. Given a list of elementary divisors we can find the list of invariant factors by first 
arranging the elementary divisors into n separate lists, one for each eigenvalue. In each 
of these n lists arrange the polynomials in increasing (i.e., nondecreasing) degree. Next 
arrange for all n lists to have the same length by appending an appropriate number of 
the constant polynomial 1. Now form the i-th invariant factor by taking the product of
the i-th polynomial in each of these lists. For example, if the elementary divisors of T
are

$(x-1)^3$, $(x+4)$, $(x+4)^2$, $(x-5)^2$, $(x-1)^5$, $(x-1)^3$, $(x-5)^3$, $(x-1)^4$, $(x+4)^3$

then the intermediate lists are

(1) $(x-1)^3$, $(x-1)^3$, $(x-1)^4$, $(x-1)^5$

(2) $1$, $x+4$, $(x+4)^2$, $(x+4)^3$

(3) $1$, $1$, $(x-5)^2$, $(x-5)^3$

so the list of invariant factors is

$(x-1)^3$, $(x-1)^3(x+4)$, $(x-1)^4(x+4)^2(x-5)^2$, $(x-1)^5(x+4)^3(x-5)^3$.
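
The list-padding procedure just described is mechanical enough to sketch in code. The following illustration is not part of the text: it assumes each elementary divisor $(x-\lambda)^a$ is encoded as a pair `(lam, a)`, and the function name `invariant_factors` is a hypothetical choice. Each invariant factor is returned as a dictionary mapping an eigenvalue to its exponent.

```python
from collections import defaultdict


def invariant_factors(elementary_divisors):
    """Convert elementary divisors to invariant factors.

    elementary_divisors: non-empty list of (eigenvalue, exponent) pairs,
    one pair for each elementary divisor (x - eigenvalue)^exponent.
    Returns the invariant factors in divisibility order, each encoded as a
    dict {eigenvalue: exponent}.
    """
    # One list of exponents per eigenvalue.
    by_root = defaultdict(list)
    for root, exponent in elementary_divisors:
        by_root[root].append(exponent)

    # Sort each list in nondecreasing order and pad on the left with
    # exponent 0 (the constant polynomial 1) so all lists have equal length.
    length = max(len(exps) for exps in by_root.values())
    padded = {root: [0] * (length - len(exps)) + sorted(exps)
              for root, exps in by_root.items()}

    # The i-th invariant factor is the product of the i-th entries.
    return [{root: exps[i] for root, exps in padded.items() if exps[i] > 0}
            for i in range(length)]


# The example from the text: (x-1)^3, (x+4), (x+4)^2, (x-5)^2, (x-1)^5,
# (x-1)^3, (x-5)^3, (x-1)^4, (x+4)^3.
example = [(1, 3), (-4, 1), (-4, 2), (5, 2), (1, 5),
           (1, 3), (5, 3), (1, 4), (-4, 3)]
result = invariant_factors(example)
```

Here `result` encodes the four invariant factors listed above; for instance its last entry `{1: 5, -4: 3, 5: 3}` stands for $(x-1)^5(x+4)^3(x-5)^3$.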

Elementary Divisor Decomposition Algorithm: Converting to Jordan Canonical Forms

Theorem 21 indicates a computational procedure to determine the invariant factors of 
any given matrix A. Factorization of these invariant factors produces the elementary 
divisors of A, hence determines the Jordan canonical form for A as above. 

The Invariant Factor Decomposition Algorithm following Theorem 21 starts with
a basis $e_1, \dots, e_n$ for V and produces a set $f_1, \dots, f_m$ of elements of V which are
F[x]-module generators for the cyclic factors in the invariant factor decomposition of
V (with annihilators $(a_1(x)), \dots, (a_m(x))$, respectively). Since the elementary divisor
decomposition is obtained from the invariant factor decomposition by applying the
Chinese Remainder Theorem to the cyclic modules $F[x]/(a_i(x))$, this gives a set of
F[x]-module generators for the cyclic factors in the elementary divisor decomposition
of V. These elements then give rise to an explicit vector space basis for V with respect
to which the linear transformation corresponding to A is in Jordan canonical form
(equivalently, an explicit matrix P such that $P^{-1}AP$ is in Jordan canonical form). As
for the Invariant Factor Decomposition Algorithm we state the result first in the general
context of decomposing a vector space and then describe the algorithm to convert a
given n x n matrix A to Jordan canonical form.

Explicit numerical examples of this algorithm are given later in Examples 2 and 3. 

Elementary Divisor Decomposition Algorithm 

(1) to (3): The first three steps in the algorithm are those from the Invariant Factor 
Decomposition Algorithm following Theorem 21. 

(4) For each invariant factor a(x) computed for A write

\[ a(x) = (x - \lambda_1)^{\alpha_1} (x - \lambda_2)^{\alpha_2} \cdots (x - \lambda_s)^{\alpha_s} \]

where $\lambda_1, \dots, \lambda_s \in F$ are distinct. Let $f \in V$ be the F[x]-module generator for
the cyclic factor corresponding to the invariant factor a(x) computed in (3). Then
the elements

\[ \frac{a(x)}{(x - \lambda_1)^{\alpha_1}}\, f, \quad \frac{a(x)}{(x - \lambda_2)^{\alpha_2}}\, f, \quad \dots, \quad \frac{a(x)}{(x - \lambda_s)^{\alpha_s}}\, f \]

(note that the $\frac{a(x)}{(x - \lambda_i)^{\alpha_i}} \in F[x]$ are polynomials) are F[x]-module generators for
the cyclic factors of V corresponding to the elementary divisors

\[ (x - \lambda_1)^{\alpha_1}, \quad (x - \lambda_2)^{\alpha_2}, \quad \dots, \quad (x - \lambda_s)^{\alpha_s}, \]

respectively.

(5) If $g_i = \frac{a(x)}{(x - \lambda_i)^{\alpha_i}}\, f$ is the F[x]-module generator for the cyclic factor of V
corresponding to the elementary divisor $(x - \lambda_i)^{\alpha_i}$ then the corresponding vector space
basis for this cyclic factor of V is given by the elements

\[ (T - \lambda_i)^{\alpha_i - 1} g_i, \quad (T - \lambda_i)^{\alpha_i - 2} g_i, \quad \dots, \quad (T - \lambda_i) g_i, \quad g_i. \]

(6) Write the k-th element of the vector space basis computed in (5) in terms of the
original vector space basis $[e_1, e_2, \dots, e_n]$ for V and use the coordinates for the
k-th column of an n x n matrix P. Then $P^{-1}AP$ is in Jordan canonical form (with
Jordan blocks appearing in the order used in (5) for the cyclic factors of V).


Converting an n x n Matrix to Jordan Canonical Form


(1) to (2): The first two steps are those from the algorithm for Converting an n x n
matrix to Rational Canonical Form following Theorem 21.

(3) When xI - A has been diagonalized to the form in Theorem 21 the first n - m
columns of the matrix P' are 0 (providing a useful numerical check on the
computations) and the remaining m columns of P' are nonzero. For each successive
i = 1, 2, ..., m:

(a) Factor the i-th nonconstant diagonal element (which is of degree $d_i$):

\[ a(x) = (x - \lambda_1)^{\alpha_1} (x - \lambda_2)^{\alpha_2} \cdots (x - \lambda_s)^{\alpha_s} \]

where $\lambda_1, \dots, \lambda_s \in F$ are distinct (here $a(x) = a_i(x)$ is the i-th nonconstant
diagonal element and s depends on i).

(b) Multiply the i-th nonzero column of P' successively by the $d_i$ matrices:

\[
\begin{aligned}
&(A - \lambda_1 I)^{\alpha_1 - 1} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{\alpha_s} \\
&(A - \lambda_1 I)^{\alpha_1 - 2} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{\alpha_s} \\
&\qquad \vdots \\
&(A - \lambda_1 I)^{0} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{\alpha_s} \\[6pt]
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{\alpha_2 - 1} \cdots (A - \lambda_s I)^{\alpha_s} \\
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{\alpha_2 - 2} \cdots (A - \lambda_s I)^{\alpha_s} \\
&\qquad \vdots \\
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{0} \cdots (A - \lambda_s I)^{\alpha_s} \\
&\qquad \vdots \\
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{\alpha_s - 1} \\
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{\alpha_s - 2} \\
&\qquad \vdots \\
&(A - \lambda_1 I)^{\alpha_1} (A - \lambda_2 I)^{\alpha_2} \cdots (A - \lambda_s I)^{0}.
\end{aligned}
\]

(c) Use the column vectors resulting from (b) (in that order) as the next $d_i$ columns
of an n x n matrix P.

Then $P^{-1}AP$ is in Jordan canonical form (whose Jordan blocks correspond to the
ordering of the factors in (a)).


Examples 


We can use Jordan canonical forms to carry out the same analysis of matrices that we did 
as examples of the use of rational canonical forms. In some instances, when the field is 
enlarged, the number of similarity classes increases (the number of similarity classes can 
never decrease when we extend the field by Corollary 18(2)). 

(1) Let A, B and C be the matrices in Example 1 of the previous section and let $F = \mathbb{Q}$.
Note that $\mathbb{Q}$ contains all the eigenvalues for these matrices. Since we have already
determined the invariant factors of these matrices we can immediately obtain their
elementary divisors. The elementary divisors of A are x - 2, x - 2 and x - 3 and
the elementary divisors of B and C are $(x-2)^2$ and x - 3, so the Jordan
canonical form of A and the common Jordan canonical form of B and C are, respectively:

\[
\begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}
\qquad\text{and}\qquad
\begin{pmatrix} 2 & 1 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}.
\]

Notice that A is similar to a diagonal matrix but, by Corollary 25, B and C are not.

(2) For the matrix A, we determined in Example 2 of the previous section that
$f_1 = -7e_1 + 7e_2 + e_3$ and $f_2 = -e_1 + e_2$ were $\mathbb{Q}[x]$-module generators for the two cyclic
factors of V in its invariant factor decomposition, corresponding to the invariant factors
x - 2 and (x - 2)(x - 3), respectively. Using the first algorithm described above, the
elements $f_1$, $(x - 3)f_2$ and $(x - 2)f_2$ are therefore $\mathbb{Q}[x]$-module generators for the
three cyclic factors of V in its elementary divisor decomposition, corresponding to the
elementary divisors x - 2, x - 2, and x - 3. An easy computation shows that these
are the elements $-7e_1 + 7e_2 + e_3$, $-e_1$ and $-2e_1 + e_2$, respectively. Then the matrix

\[
P = \begin{pmatrix} -7 & -1 & -2 \\ 7 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix}
\]

conjugates A into its Jordan canonical form:

\[
P^{-1}AP = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}
\]

as one easily checks. 

The columns of this matrix can also be obtained following the second algorithm 
above, using the nonzero columns of the matrix P' computed in Example 2 of the 
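previous section:

\[
(A - 2I)^{0} \begin{pmatrix} -7 \\ 7 \\ 1 \end{pmatrix} = \begin{pmatrix} -7 \\ 7 \\ 1 \end{pmatrix}
\]

and

\[
(A - 2I)^{0}(A - 3I)^{1} \begin{pmatrix} -1 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} -1 \\ 0 \\ 0 \end{pmatrix},
\qquad
(A - 2I)^{1}(A - 3I)^{0} \begin{pmatrix} -1 \\ 1 \\ 0 \end{pmatrix} = \begin{pmatrix} -2 \\ 1 \\ 0 \end{pmatrix},
\]
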

respectively, which again gives the matrix P. 

(3) For the 4 x 4 matrix D of Example 3 of the previous section, the invariant factors
were $(x-1)^2$, $(x-1)^2$, with corresponding $\mathbb{Q}[x]$-module generators $f_1 = e_1$ and
$f_2 = e_2$, respectively. These are also the elementary divisors for this matrix. The
corresponding vector space bases for these two factors are given by $(T - 1)f_1$, $f_1$
and $(T - 1)f_2$, $f_2$, respectively. An easy computation shows these are the elements
$2e_2 + e_3$, $e_1$ and $2e_1 - 2e_2 + e_4$, $e_2$, respectively. Then the matrix

\[
P = \begin{pmatrix} 0 & 1 & 2 & 0 \\ 2 & 0 & -2 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix}
\]

conjugates D into its Jordan canonical form:

\[
P^{-1}DP = \begin{pmatrix} 1 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix}
\]


as can easily be checked. 

The columns of this matrix can also be obtained following the second algorithm 
above, using the nonzero columns of the matrix P' computed in Example 3 of the 
previous section: 



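\[
(D - I)^{1} \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 0 \\ 2 \\ 1 \\ 0 \end{pmatrix},
\qquad
(D - I)^{0} \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix},
\]

\[
(D - I)^{1} \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 2 \\ -2 \\ 0 \\ 1 \end{pmatrix},
\qquad
(D - I)^{0} \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} = \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix},
\]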


respectively, which again gives the matrix P. 
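
Both routes to P in this example can be verified with a few lines of exact integer arithmetic. The sketch below is an illustration only: it takes D to be the 4 x 4 matrix as it is listed in Exercise 46 of this section, and the helper names `mat_mul` and `mat_vec` are hypothetical. It rebuilds the columns of P by applying D - I to $e_1$ and $e_2$, and checks DP = PJ, which is equivalent to $P^{-1}DP = J$ because P is invertible.

```python
D = [[1, 2, -4, 4],
     [2, -1, 4, -8],
     [1, 0, 1, -2],
     [0, 1, -2, 3]]

P = [[0, 1, 2, 0],
     [2, 0, -2, 1],
     [1, 0, 0, 0],
     [0, 0, 1, 0]]

# Claimed Jordan form: two 2 x 2 Jordan blocks with eigenvalue 1.
J = [[1, 1, 0, 0],
     [0, 1, 0, 0],
     [0, 0, 1, 1],
     [0, 0, 0, 1]]


def mat_mul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_vec(a, v):
    """Product of a matrix with a column vector given as a flat list."""
    return [sum(a[i][t] * v[t] for t in range(len(v))) for i in range(len(a))]


D_minus_I = [[D[i][j] - (1 if i == j else 0) for j in range(4)] for i in range(4)]
e1 = [1, 0, 0, 0]
e2 = [0, 1, 0, 0]

# Columns in the order produced by the algorithm: (D-I)e1, e1, (D-I)e2, e2.
columns = [mat_vec(D_minus_I, e1), e1, mat_vec(D_minus_I, e2), e2]
P_from_columns = [[columns[c][r] for c in range(4)] for r in range(4)]

same_P = P_from_columns == P
conjugates = mat_mul(D, P) == mat_mul(P, J)
```

Both `same_P` and `conjugates` evaluate to `True` for these matrices.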

(4) The set of similarity classes of 6 x 6 matrices with entries from $\mathbb{C}$ with characteristic
polynomial $(x^4 - 1)(x^2 - 1)$ consists of the 4 classes represented by the rational
canonical forms in the preceding set of examples (there are no additional lists of
invariant factors over $\mathbb{C}$). Their Jordan canonical forms cannot all be written over $\mathbb{Q}$,
however. For instance, if the invariant factors are

\[ (x-1)(x+1) \quad\text{and}\quad (x-1)(x+1)(x^2+1) \]

then the elementary divisors are

\[ x-1, \quad x+1, \quad x-1, \quad x+1, \quad x-i, \quad x+i, \]

where i is a square root of -1 in $\mathbb{C}$, so the Jordan form for this matrix is a diagonal
matrix with diagonal entries 1, 1, -1, -1, i, -i.

(5) In contrast, the set of similarity classes of 3 x 3 matrices A over $\mathbb{C}$ satisfying $A^6 = I$
is considerably larger than that over $\mathbb{Q}$. If A is any such matrix, then $m_A(x) \mid x^6 - 1$,
and since the latter polynomial has no repeated roots in $\mathbb{C}$, the minimal polynomial of A
has no repeated roots. By Corollary 25 the Jordan canonical form of A is a diagonal
matrix. Since this diagonal matrix has the same minimal polynomial, its 6th power
is also the identity, and so each diagonal entry is a 6th root of unity. For each list
$\zeta_1, \zeta_2, \zeta_3$ of 6th roots of unity we obtain a Jordan canonical form, and two such forms
are the same (i.e., give rise to similar matrices) if and only if the lists are permuted
versions of each other. One finds that there are, up to similarity, 56 classes of such
A's.
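
The count of 56 is the number of unordered lists (multisets) of three 6th roots of unity. A minimal sketch of the count, with each root $e^{2\pi i k/6}$ represented by its exponent k (an encoding chosen here for illustration):

```python
from itertools import combinations_with_replacement
from math import comb

# Represent the 6th root of unity exp(2*pi*i*k/6) by its exponent k = 0, ..., 5.
roots = range(6)

# A similarity class corresponds to an unordered list of three diagonal entries.
classes = list(combinations_with_replacement(roots, 3))
number_of_classes = len(classes)

# Multisets of size 3 from 6 objects are counted by the binomial coefficient C(6+3-1, 3).
closed_form = comb(6 + 3 - 1, 3)
```

Both `number_of_classes` and `closed_form` equal 56.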

EXERCISES 

1. Suppose the vector space V is the direct sum of cyclic F[x]-modules whose annihilators
are $(x+1)^2$, $(x-1)(x^2+1)^2$, $(x^4-1)$ and $(x+1)(x^2-1)$. Determine the invariant
factors and elementary divisors for V.

2. Prove that if $\lambda_1, \dots, \lambda_n$ are the eigenvalues of the n x n matrix A then
$\lambda_1^k, \dots, \lambda_n^k$ are the eigenvalues of $A^k$ for any $k \ge 0$.

3. Use the method of Example 2 above to determine explicit matrices $P_1$ and $P_2$ with
$P_1^{-1}BP_1$ and $P_2^{-1}CP_2$ in Jordan canonical form. Use this to explicitly construct a
matrix Q which conjugates B into C (proving directly that these matrices are similar).

4. Prove that the Jordan canonical form for the matrix

\[
\begin{pmatrix} 9 & 4 & 5 \\ -4 & 0 & -3 \\ -6 & -4 & -2 \end{pmatrix}
\]

is that stated at the beginning of this chapter. Explicitly determine a matrix P which
conjugates this matrix to its Jordan canonical form. Explain why this matrix cannot be
diagonalized.

5. Compute the Jordan canonical form for the matrix

\[
\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & -2 \\ 0 & 1 & 3 \end{pmatrix}.
\]

6. Determine which of the following matrices are similar:

/-I 4 — 4\ / —3 -4 0\ / —3 2 — 4\ /-I 4 -4\ 

2 -1 3 2 30 2 1 0 0 -3 2 . 

VO -4 3/ V 8 8 1/ V 3 “! V \ 0 -4 3/ 

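7. Determine the Jordan canonical forms for the following matrices:

\[
\begin{pmatrix} 5 & 4 & 1 \\ -1 & 0 & 0 \\ -3 & -4 & 1 \end{pmatrix}
\qquad
\begin{pmatrix} 3 & 4 & 2 \\ -2 & -3 & -1 \\ -4 & -4 & -3 \end{pmatrix}
\]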

8. Prove that the matrices




/ 3 -1 2 

B = I -10 6 -14 

\ -6 3-7 


are similar. Prove that both A and B can be diagonalized and determine explicit matrices
$P_1$ and $P_2$ with $P_1^{-1}AP_1$ and $P_2^{-1}BP_2$ in diagonal form.

9. Prove that the matrices 


-8 -10 -1 
7 9 1 

3 2 0 


2 -4 
-1 4 

-2 5 


both have $(x-1)^2(x+1)$ as characteristic polynomial but that one can be diagonalized
and the other cannot. Determine the Jordan canonical form for both matrices.

10. Find all Jordan canonical forms of 2 x 2, 3 x 3 and 4 x 4 matrices over $\mathbb{C}$.

11. Verify that the characteristic polynomial of

\[
A = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ -2 & -2 & 0 & 1 \\ -2 & 0 & -1 & -2 \end{pmatrix}
\]

is a product of linear factors over $\mathbb{Q}$. Determine the rational and Jordan canonical forms
for A over $\mathbb{Q}$.


12. Determine the Jordan canonical form for the matrix

\[
\begin{pmatrix} 1 & 2 & 0 & 0 \\ 0 & 1 & 2 & 0 \\ 0 & 0 & 1 & 2 \\ 0 & 0 & 0 & 1 \end{pmatrix}.
\]

for the matrix 

0 

-2 

-3 

-8 

14 

-15 

-4 

7 

-7 

2 

-4 

3 


14. Prove that the matrices 

/ 2 
A _ -4 
2 

V — 2 

are similar. 

15. Prove that the matrices 


both have characteristic polynomial $(x-3)(x+1)^3$. Determine whether they are similar
and determine the Jordan canonical form for each matrix.

16. Determine the Jordan canonical form for the matrix

\[
\begin{pmatrix} 1 & 1 & 1 & 1 \\ 0 & 1 & 0 & -1 \\ 0 & 0 & 1 & 1 \\ 0 & 0 & 0 & 1 \end{pmatrix}
\]

and determine a matrix P which conjugates this matrix into its Jordan canonical form.

17. Prove that any matrix A is similar to its transpose $A^t$.

18. Determine all possible Jordan canonical forms for a linear transformation with
characteristic polynomial $(x-2)^3(x-3)^2$.

19. Prove that all n x n matrices with characteristic polynomial f(x) are similar if and only
if f(x) has no repeated factors in its unique factorization in F[x].

20. Show that the following matrices are similar in $M_p(\mathbb{F}_p)$ (p x p matrices with entries from
$\mathbb{F}_p$):

\[
\begin{pmatrix}
0 & 0 & 0 & \cdots & 0 & 1 \\
1 & 0 & 0 & \cdots & 0 & 0 \\
0 & 1 & 0 & \cdots & 0 & 0 \\
\vdots & & & \ddots & & \vdots \\
0 & 0 & 0 & \cdots & 1 & 0
\end{pmatrix}
\qquad\text{and}\qquad
\begin{pmatrix}
1 & 1 & 0 & \cdots & 0 & 0 \\
0 & 1 & 1 & \cdots & 0 & 0 \\
0 & 0 & 1 & \cdots & 0 & 0 \\
\vdots & & & \ddots & & \vdots \\
0 & 0 & 0 & \cdots & 1 & 1 \\
0 & 0 & 0 & \cdots & 0 & 1
\end{pmatrix}.
\]

21. Show that if $A^2 = A$ then A is similar to a diagonal matrix which has only 0's and 1's
along the diagonal.

22. Prove that an n x n matrix A with entries from $\mathbb{C}$ satisfying $A^3 = A$ can be diagonalized.
Is the same statement true over any field F?

23. Suppose A is a 2 x 2 matrix with entries from $\mathbb{Q}$ for which $A^3 = I$ but $A \ne I$. Write A in
rational canonical form and in Jordan canonical form viewed as a matrix over $\mathbb{C}$.

24. Prove there are no 3 x 3 matrices A over $\mathbb{Q}$ with $A^8 = I$ but $A^4 \ne I$.

25. Determine the Jordan canonical form for the n x n matrix over $\mathbb{Q}$ whose entries are all
equal to 1.

26. Determine the Jordan canonical form for the n x n matrix over $\mathbb{F}_p$ whose entries are all
equal to 1 (the answer depends on whether or not p divides n).

27. Determine the Jordan canonical form for the n x n matrix over $\mathbb{Q}$ whose entries are all
equal to 1 except that the entries along the main diagonal are all equal to 0.

28. Determine the Jordan canonical form for the n x n matrix over $\mathbb{F}_p$ whose entries are all
equal to 1 except that the entries along the main diagonal are all equal to 0.

The direct sum of the cyclic submodules of V corresponding to all the elementary divisors of
V which are powers of the same $x - \lambda$ is called the generalized eigenspace of T corresponding
to the eigenvalue $\lambda$. Note that this is the p-primary component of V for the prime $p = x - \lambda$
of F[x] and consists of the elements of V which are annihilated by some power of the linear
transformation $T - \lambda$. The matrix for T on the generalized eigenspace for $\lambda$ is the block diagonal
matrix of all Jordan blocks for T with the same eigenvalue $\lambda$.

29. Suppose $V_i$ is the generalized eigenspace of T corresponding to eigenvalue $\lambda_i$. For any
$k \ge 0$, prove that the nullity of $T - \lambda_i$ on the subspace $(T - \lambda_i)^k V_i$ is the same as the nullity
of $T - \lambda_i$ on $(T - \lambda_i)^k V$ and equals the number of Jordan blocks of T having eigenvalue
$\lambda_i$ and size greater than k (so for k = 0 this gives the number of Jordan blocks).

30. Let $\lambda$ be an eigenvalue of the linear transformation T on the finite dimensional vector space
V over the field F. Let $r_k = \dim_F (T - \lambda)^k V$ be the rank of the linear transformation
$(T - \lambda)^k$ on V. For any $k \ge 1$, prove that $r_{k-1} - 2r_k + r_{k+1}$ is the number of Jordan blocks
of T corresponding to $\lambda$ of size k [use Exercise 12 in Section 1]. (This gives an efficient
method for determining the Jordan canonical form for T by computing the ranks of the
matrices $(A - \lambda I)^k$ for a matrix A representing T, cf. Exercise 31(a) in Section 11.2.)

31. Let N be an n x n matrix with coefficients in the field F. The matrix N is said to be
nilpotent if some power of N is the zero matrix, i.e., $N^k = 0$ for some k. Prove that any
nilpotent matrix is similar to a block diagonal matrix whose blocks are matrices with 1's
along the first superdiagonal and 0's elsewhere.

32. Prove that if N is an n x n nilpotent matrix then in fact $N^n = 0$.

33. Let A be a strictly upper triangular n x n matrix (all entries on and below the main diagonal
are zero). Prove that A is nilpotent.

34. Prove that the trace of a nilpotent n x n matrix is 0 (recall the trace of a matrix is the sum
of the diagonal elements).

35. For $0 \le i \le n$, let $d_i$ be the g.c.d. of the determinants of all the i x i minors of xI - A,
for A as in Theorem 21 (take the 0 x 0 minor to be 1). Prove that the i-th element along
the diagonal of the Smith Normal Form for A is $d_i/d_{i-1}$. This gives the invariant factors
for A. [Show these g.c.d.s do not change under elementary row and column operations.]

36. Let $V = \mathbb{C}^n$ be the usual n-dimensional vector space of n-tuples $(a_1, a_2, \dots, a_n)$ of
complex numbers. Let T be the linear transformation defined by setting $T(a_1, a_2, \dots, a_n)$
equal to $(0, a_1, a_2, \dots, a_{n-1})$. Determine the Jordan canonical form for T.

37. Let J be a Jordan block of size n with eigenvalue $\lambda$ over $\mathbb{C}$.

(a) Prove that the Jordan canonical form for the matrix $J^2$ is the Jordan block of size n
with eigenvalue $\lambda^2$ if $\lambda \ne 0$.

(b) If $\lambda = 0$ prove that the Jordan canonical form for $J^2$ has two blocks (with eigenvalues
0) of size $\frac{n}{2}$, $\frac{n}{2}$ if n is even and of size $\frac{n-1}{2}$, $\frac{n+1}{2}$ if n is odd.

38. Determine necessary and sufficient conditions for a matrix $A \in M_n(\mathbb{C})$ to have a square
root, i.e., for there to exist another matrix $B \in M_n(\mathbb{C})$ such that $A = B^2$. [Suppose B is in
Jordan canonical form and consider the Jordan canonical form for $B^2$ using the previous
exercise.]

39. Let J be a Jordan block of size n with eigenvalue $\lambda$ over a field F of characteristic
2. Determine the Jordan canonical form for the matrix $J^2$. Determine necessary and
sufficient conditions for a matrix $A \in M_n(F)$ to have a square root, i.e., for there to exist
another matrix $B \in M_n(F)$ such that $A = B^2$.

The remaining exercises explore functions (power series) of a matrix and introduce some
applications of the Jordan canonical form to the theory of differential equations.

Throughout these exercises the matrices are assumed to be n x n matrices with entries
from the field K, where K is either the real or complex numbers. Let

\[ G(x) = \sum_{k=0}^{\infty} a_k x^k \]

be a power series with coefficients from K. Let $G_N(x) = \sum_{k=0}^{N} a_k x^k$ be the N-th partial sum
of G(x) and for each $A \in M_n(K)$ let $G_N(A)$ be the element of $M_n(K)$ obtained (as usual) by
substituting A in this polynomial. For each fixed i, j we obtain a sequence of real or complex
numbers $c_{ij}^N$, N = 0, 1, 2, ..., by taking $c_{ij}^N$ to be the i, j entry of the matrix $G_N(A)$. The series

\[ G(A) = \sum_{k=0}^{\infty} a_k A^k \]

is said to converge to the matrix C in $M_n(K)$ if for each $i, j \in \{1, 2, \dots, n\}$ the sequence $c_{ij}^N$,
N = 0, 1, 2, ..., converges to the i, j entry of C (in which case we write G(A) = C). Say
G(A) converges if there is some $C \in M_n(K)$ such that G(A) = C. If A is a 1 x 1 matrix, this
is the usual notion of convergence of a series in K.

For $A = (a_{ij}) \in M_n(K)$ define

\[ \|A\| = \sum_{i,j=1}^{n} |a_{ij}|, \]

i.e., $\|A\|$ is the sum of the absolute values of all the entries of A.


40. Prove that for all $A, B \in M_n(K)$ and all $\alpha \in K$

(a) $\|A + B\| \le \|A\| + \|B\|$

(b) $\|AB\| \le \|A\| \cdot \|B\|$

(c) $\|\alpha A\| = |\alpha| \cdot \|A\|$.


41. Let R be the radius of convergence of the real or complex power series G(x) (where $R = \infty$
if G(x) converges for all $x \in K$).

(a) Prove that if $\|A\| < R$ then G(A) converges.

(b) Deduce that for all matrices A the following power series converge:

\[
\begin{aligned}
\sin(A) &= A - \frac{A^3}{3!} + \frac{A^5}{5!} + \cdots + (-1)^k \frac{A^{2k+1}}{(2k+1)!} + \cdots \\
\cos(A) &= I - \frac{A^2}{2!} + \frac{A^4}{4!} + \cdots + (-1)^k \frac{A^{2k}}{(2k)!} + \cdots \\
\exp(A) &= I + A + \frac{A^2}{2!} + \frac{A^3}{3!} + \cdots + \frac{A^k}{k!} + \cdots
\end{aligned}
\]

where I is the n x n identity matrix.


In view of applications to the theory of differential equations we introduce a variable t at this
point, so that for $A \in M_n(K)$ the matrix At is obtained from A by multiplying each entry by
t (which is the same as multiplying A by the "scalar" matrix tI). We obtain a function from a
subset of K into $M_n(K)$ defined by $t \mapsto G(At)$ at all points t where the series G(At) converges.
In particular, sin(At), cos(At) and exp(At) converge for all $t \in K$.


42. Let P be a nonsingular n x n matrix.

(a) Prove that $PG(At)P^{-1} = G(PAtP^{-1}) = G(PAP^{-1}t)$. (This implies that, up to
a change of basis, it suffices to compute G(At) for matrices A in canonical form.)
[Take limits of partial sums to get the first equality. The second equality is immediate
because the matrix tI commutes with every matrix.]

(b) Prove that if A is the direct sum of matrices $A_1, A_2, \dots, A_m$, then G(At) is the direct
sum of the matrices $G(A_1t), G(A_2t), \dots, G(A_mt)$.

(c) Show that if Z is the diagonal matrix with entries $z_1, z_2, \dots, z_n$ then G(Zt) is the
diagonal matrix with entries $G(z_1t), G(z_2t), \dots, G(z_nt)$.

The matrix exp(A) defined in Exercise 41(b) is called the exponential of A and is often denoted
by $e^A$. The next three exercises lead to a formula for the matrix exp(Jt), where J is an
elementary Jordan matrix.

43. Prove that if A and B are commuting matrices then exp(A + B) = exp(A) exp(B). [Treat
A and B as commuting indeterminates and deduce this by comparing the power series on
the left hand side with the product of the two power series on the right hand side.]

44. Use the preceding exercise to show that if M is any matrix and $\lambda$ is any element of K then

\[ \exp(\lambda I t + M) = e^{\lambda t} \exp(M). \]


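45. Let N be the r x r matrix with 1's on the first superdiagonal and zeros elsewhere. Compute
the exponential of the following nilpotent r x r matrix:

\[
\text{if } Nt =
\begin{pmatrix}
0 & t &        &        &   \\
  & 0 & t      &        &   \\
  &   & \ddots & \ddots &   \\
  &   &        & 0      & t \\
  &   &        &        & 0
\end{pmatrix}
\text{ then } \exp(Nt) =
\begin{pmatrix}
1 & t & \frac{t^2}{2!} & \cdots & \frac{t^{r-1}}{(r-1)!} \\
  & 1 & t              & \ddots & \vdots \\
  &   & \ddots         & \ddots & \frac{t^2}{2!} \\
  &   &                & 1      & t \\
  &   &                &        & 1
\end{pmatrix}.
\]

Deduce that if J is the r x r elementary Jordan matrix with eigenvalue $\lambda$ then

\[
\exp(Jt) =
\begin{pmatrix}
e^{\lambda t} & te^{\lambda t} & \frac{t^2}{2!}e^{\lambda t} & \cdots & \frac{t^{r-1}}{(r-1)!}e^{\lambda t} \\
  & e^{\lambda t} & te^{\lambda t} & \ddots & \vdots \\
  &   & \ddots & \ddots & \frac{t^2}{2!}e^{\lambda t} \\
  &   &        & e^{\lambda t} & te^{\lambda t} \\
  &   &        &   & e^{\lambda t}
\end{pmatrix}.
\]

[To do the first part use the observation that since Nt is a nilpotent matrix, exp(Nt) is a
polynomial in Nt, i.e., all but a finite number of the terms in the power series are zero. To
compute the exponential of Jt write Jt as $\lambda It + Nt$ and use Exercise 44 with M = Nt.]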


Let $A \in M_n(K)$ and let P be a change of basis matrix such that $P^{-1}AP$ is in Jordan canonical
form. Suppose $P^{-1}AP$ is the direct sum of elementary Jordan matrices $J_1, \dots, J_m$. The preceding
exercises (with t = 1) show that exp(A) can easily be found by writing $E = \exp(P^{-1}AP)$ as
the direct sum of the matrices $\exp(J_1), \dots, \exp(J_m)$ and then changing the basis back again to
obtain $\exp(A) = PEP^{-1}$.
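
Since Nt is nilpotent, the series for exp(Nt) is a finite sum and can be evaluated exactly. The sketch below is an illustration under stated assumptions: t is taken to be a rational number so that Python's `Fraction` type gives exact arithmetic, and the names `exp_nilpotent` and `mat_mul` are hypothetical helpers, not notation from the text.

```python
from fractions import Fraction
from math import factorial


def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][s] * b[s][j] for s in range(n)) for j in range(n)]
            for i in range(n)]


def exp_nilpotent(r, t):
    """exp(N t) for the r x r matrix N with 1's on the first superdiagonal.

    Because (N t)^r = 0, the power series stops after the term of degree r - 1.
    """
    t = Fraction(t)
    n_t = [[t if j == i + 1 else Fraction(0) for j in range(r)] for i in range(r)]
    identity = [[Fraction(int(i == j)) for j in range(r)] for i in range(r)]
    total = [row[:] for row in identity]
    power = identity
    for k in range(1, r):
        power = mat_mul(power, n_t)  # now equal to (N t)^k
        total = [[total[i][j] + power[i][j] / factorial(k) for j in range(r)]
                 for i in range(r)]
    return total


E = exp_nilpotent(4, Fraction(3, 2))
```

The i, j entry of `E` is $t^{j-i}/(j-i)!$ for $j \ge i$ and 0 otherwise, here with t = 3/2. Multiplying every entry by $e^{\lambda t}$ would give exp(Jt) for the Jordan block with eigenvalue $\lambda$.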


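46. For the 4 x 4 matrices D and P given in Example 3 of this section:

\[
D = \begin{pmatrix} 1 & 2 & -4 & 4 \\ 2 & -1 & 4 & -8 \\ 1 & 0 & 1 & -2 \\ 0 & 1 & -2 & 3 \end{pmatrix}
\qquad
P = \begin{pmatrix} 0 & 1 & 2 & 0 \\ 2 & 0 & -2 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix}
\]

show that

\[
E = \begin{pmatrix} e & e & 0 & 0 \\ 0 & e & 0 & 0 \\ 0 & 0 & e & e \\ 0 & 0 & 0 & e \end{pmatrix}
\qquad\text{and}\qquad
\exp(D) = \begin{pmatrix} e & 2e & -4e & 4e \\ 2e & -e & 4e & -8e \\ e & 0 & e & -2e \\ 0 & e & -2e & 3e \end{pmatrix}.
\]
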
47. Compute the exponential of each of the following matrices:

(a) the matrix A in Example 2 of this section

(b) the matrix in Exercise 4 (where you computed the Jordan canonical form and a change
of basis matrix)

(c) the matrix in Exercise 16.

48. Show that exp(0) = I (here 0 is the zero matrix and I is the identity matrix). Deduce that
exp(A) is nonsingular with inverse exp(-A) for all matrices $A \in M_n(K)$.

49. Prove that $\det(\exp(A)) = e^{\operatorname{tr}(A)}$, where tr(A) is the trace of A (the sum of the diagonal
entries of A).

50. Fix any $A \in M_n(K)$. Prove that the map

\[ K \to GL_n(K) \quad\text{defined by}\quad t \mapsto \exp(At) \]

is a group homomorphism (here K is the additive group of the field). (Note how this
generalizes the familiar exponential map from K to $K^\times$, which is the n = 1 case. The subgroup
$\{\exp(At) \mid t \in K\}$ is called a 1-parameter subgroup of $GL_n(K)$. These subgroups and
the exponential map play an important role in the theory of Lie groups, $GL_n(K)$ being
a particular example of a Lie group.)

Let G(x) be a power series having an infinite radius of convergence and fix a matrix $A \in M_n(K)$.
The entries of the matrix G(At) are K-valued functions of the variable t that are defined for all
t. Let $c_{ij}(t)$ be the function of t in the i, j entry of G(At). The derivative of G(At) with respect
to t, denoted by $\frac{d}{dt}G(At)$, is the matrix whose i, j entry is $\frac{d}{dt}c_{ij}(t)$ obtained by differentiating
each of the entries of G(At). In other words, if we identify $M_n(K)$ with $K^{n^2}$ by considering
each n x n matrix as an $n^2$-tuple, then $t \mapsto G(At)$ is a map from K to $K^{n^2}$ (i.e., is a vector
valued function of t) whose derivative is just the usual (componentwise) derivative of this vector
valued function.

51. Establish the following properties of derivatives:

(a) If $G(x) = \sum_{k=0}^{\infty} a_k x^k$ then $\frac{d}{dt}G(At) = A \sum_{k=1}^{\infty} k a_k (At)^{k-1}$.

(b) If v is an n x 1 matrix with (constant) entries from K then

\[ \frac{d}{dt}\bigl(G(At)v\bigr) = \Bigl(\frac{d}{dt}G(At)\Bigr)v. \]

52. Deduce from part (a) of the preceding exercise that

\[ \frac{d}{dt}\exp(At) = A\exp(At). \]

Now let $y_1(t), \dots, y_n(t)$ be differentiable functions of the real variable t that are related by the
following linear system of first order differential equations with constant coefficients $a_{ij} \in K$:

\[
\begin{aligned}
y_1' &= a_{11}y_1 + a_{12}y_2 + \cdots + a_{1n}y_n \\
y_2' &= a_{21}y_1 + a_{22}y_2 + \cdots + a_{2n}y_n \\
     &\;\;\vdots \\
y_n' &= a_{n1}y_1 + a_{n2}y_2 + \cdots + a_{nn}y_n
\end{aligned}
\tag{$*$}
\]

(here the primes denote derivatives with respect to t). Let A be the matrix whose i, j entry is
$a_{ij}$, so that (*) may be written as

\[
\begin{pmatrix} y_1' \\ y_2' \\ \vdots \\ y_n' \end{pmatrix}
= A \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix}
\]

or, more succinctly, as $y' = Ay$, where y is the column vector of functions $y_1(t), \dots, y_n(t)$.
An n x n matrix whose entries are functions of t and whose columns are independent
solutions to the system (*) is called a fundamental matrix of (*). By the theory of differential
equations, the set of vectors y that are solutions to the system (*) form an n-dimensional vector
space over K and so the columns of a fundamental matrix are a basis for the vector space of
all solutions to (*).


53. Prove that exp(At) is a fundamental matrix of (*). Show also that if C is the n x 1 constant
vector whose entries are $y_1(0), \dots, y_n(0)$ then $y(t) = \exp(At)C$ is the particular solution
to the system (*) satisfying the initial condition y(0) = C. (Note how this generalizes
the 1-dimensional result that the single differential equation $y' = ay$ has $e^{at}$ as a basis for
the 1-dimensional space of solutions and the unique solution to this differential equation
satisfying the initial condition y(0) = c is $y = ce^{at}$.) [Use the preceding exercises.]

54. Prove that if M is a fundamental matrix of (*) and if Q is a nonsingular matrix in $M_n(K)$,
then MQ is also a fundamental matrix of (*). [The columns of MQ are linear combinations
of the columns of M.]


Now apply the preceding two exercises to solve some specific systems of differential equations
as follows: given the matrix A in a system (*), calculate a change of basis matrix P such that
$B = P^{-1}AP$ is in Jordan canonical form. Then $\exp(At) = P\exp(Bt)P^{-1}$ is a fundamental
matrix for (*). By the preceding exercise, $P\exp(Bt)$ is also a fundamental matrix for (*) and
exp(Bt) can be calculated by the method described in the discussion following Exercise 45 (in
particular, one does not have to find the inverse of the matrix P to obtain a fundamental matrix
for (*)). Thus, for example, if A = D and P are the matrices given in Exercise 46, then we
saw that the Jordan canonical form for A is the matrix $B = P^{-1}AP$ consisting of two 2 x 2
Jordan blocks with eigenvalues 1. A fundamental matrix for the system $y' = Ay$ is therefore


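$$P\exp(Bt) =
\begin{pmatrix} 0 & 1 & 2 & 0 \\ 2 & 0 & -2 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix}
\begin{pmatrix} e^t & te^t & 0 & 0 \\ 0 & e^t & 0 & 0 \\ 0 & 0 & e^t & te^t \\ 0 & 0 & 0 & e^t \end{pmatrix}
=
\begin{pmatrix} 0 & e^t & 2e^t & 2te^t \\ 2e^t & 2te^t & -2e^t & e^t(1-2t) \\ e^t & te^t & 0 & 0 \\ 0 & 0 & e^t & te^t \end{pmatrix}.$$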


Writing this out more explicitly, this shows that the general solution to the system of differential 
equations 


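$$\begin{aligned}
y_1' &= y_1 + 2y_2 - 4y_3 + 4y_4 \\
y_2' &= 2y_1 - y_2 + 4y_3 - 8y_4 \\
y_3' &= y_1 + y_3 - 2y_4 \\
y_4' &= y_2 - 2y_3 + 3y_4
\end{aligned}$$

is given by

$$\begin{pmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{pmatrix}
= a_1 \begin{pmatrix} 0 \\ 2e^t \\ e^t \\ 0 \end{pmatrix}
+ a_2 \begin{pmatrix} e^t \\ 2te^t \\ te^t \\ 0 \end{pmatrix}
+ a_3 \begin{pmatrix} 2e^t \\ -2e^t \\ 0 \\ e^t \end{pmatrix}
+ a_4 \begin{pmatrix} 2te^t \\ e^t(1-2t) \\ 0 \\ te^t \end{pmatrix}$$

where $a_1, \dots, a_4$ are arbitrary elements of the field $K$ (this describes the 4-dimensional vector
space of solutions).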
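
The claim that the columns of $P\exp(Bt)$ solve the system can be checked mechanically. The following is a minimal sketch, not part of the original text: it assumes each matrix entry has the form $(c_0 + c_1 t)e^t$ and stores it as the integer pair `(c0, c1)`; the names `A`, `M`, `derivative` and `times_A` are illustrative choices.

```python
# Each entry of the fundamental matrix has the form (c0 + c1*t) * e^t,
# stored here as the integer pair (c0, c1).
A = [
    [1, 2, -4, 4],
    [2, -1, 4, -8],
    [1, 0, 1, -2],
    [0, 1, -2, 3],
]

# M = P exp(Bt), written entry by entry in the (c0, c1) encoding.
M = [
    [(0, 0), (1, 0), (2, 0), (0, 2)],
    [(2, 0), (0, 2), (-2, 0), (1, -2)],
    [(1, 0), (0, 1), (0, 0), (0, 0)],
    [(0, 0), (0, 0), (1, 0), (0, 1)],
]


def derivative(entry):
    """d/dt of (c0 + c1*t) e^t is (c0 + c1 + c1*t) e^t."""
    c0, c1 = entry
    return (c0 + c1, c1)


def times_A(A, M):
    """Compute the product A*M on the (c0, c1) coefficient pairs."""
    n = len(A)
    return [
        [
            (
                sum(A[i][j] * M[j][k][0] for j in range(n)),
                sum(A[i][j] * M[j][k][1] for j in range(n)),
            )
            for k in range(n)
        ]
        for i in range(n)
    ]


M_prime = [[derivative(e) for e in row] for row in M]
columns_solve_system = (M_prime == times_A(A, M))  # M' = A M, column by column
M_at_zero = [[e[0] for e in row] for row in M]     # value of M at t = 0
```

Setting $t = 0$ recovers the matrix $P$, which is nonsingular, so the four columns are independent solutions.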

55. In each of Parts (a) to (c) find a fundamental matrix for the system (*), where the coefficient
matrix $A$ of (*) is specified.

(a) $A$ is the matrix in Part (a) of Exercise 47.

(b) $A$ is the matrix in Part (b) of Exercise 47.

(c) $A$ is the matrix in Part (c) of Exercise 47.

56. Consider the system (*) whose coefficient matrix A is the matrix D listed in Exercise 46 
and whose fundamental matrix was computed just before the preceding exercise. Find the 
particular solution to (*) that satisfies the initial condition $y_i(0) = 1$ for $i = 1, 2, 3, 4$.

Next we explore a special case of (*). Given the linear $n$th order differential equation with
constant coefficients

$$y^{(n)} + a_{n-1}y^{(n-1)} + \cdots + a_1 y' + a_0 y = 0 \qquad (**)$$

(where $y^{(k)}$ is the $k$th derivative of $y$ and $y^{(0)} = y$) one can form a system of linear first order
differential equations by letting $y_i = y^{(i-1)}$ for $1 \le i \le n$ (the coefficient matrix of this system
is described in the next exercise). A basis for the $n$-dimensional vector space of solutions to
the $n$th order equation (**) may then be obtained from a fundamental matrix for the linear system.
Specifically, in each of the $n \times 1$ columns of functions in a fundamental matrix for the system,
the 1, 1 entry is a solution to (**) and so the $n$ functions in the first row of the fundamental
matrix for the system form a basis for the solutions to (**).
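
The substitution $y_i = y^{(i-1)}$ can be sketched in code. This is only an illustration under stated assumptions: the function name `first_order_system` and the convention of passing the coefficients as the list `[a_0, a_1, ..., a_{n-1}]` are not from the text, and the sample equation $y'' + 3y' + 2y = 0$ is a made-up example.

```python
def first_order_system(a):
    """Coefficient matrix of the first order system obtained from
    y^(n) + a[n-1] y^(n-1) + ... + a[1] y' + a[0] y = 0 via y_i = y^(i-1).

    Rows 1..n-1 encode y_i' = y_{i+1}; the last row encodes
    y_n' = -a[0] y_1 - a[1] y_2 - ... - a[n-1] y_n.
    """
    n = len(a)
    rows = []
    for i in range(n - 1):
        rows.append([1 if j == i + 1 else 0 for j in range(n)])
    rows.append([-c for c in a])
    return rows


# Hypothetical example: y'' + 3y' + 2y = 0, i.e. a_0 = 2, a_1 = 3.
example = first_order_system([2, 3])
```

For the sample equation the system reads $y_1' = y_2$, $y_2' = -2y_1 - 3y_2$.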

57. Prove that the matrix, $A$, of coefficients of the system of $n$ first order equations obtained
from (**) is the transpose of the companion matrix of the polynomial
$x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$.

58. Use the above methods to find a basis for the vector space of solutions to the following 
differential equations 

(a) $y''' - 3y' + 2y = 0$

(b) $y'''' + 4y''' + 6y'' + 4y' + y = 0$.

A system of differential equations

$$\begin{aligned}
y_1' &= F_1(y_1, y_2, \dots, y_n) \\
y_2' &= F_2(y_1, y_2, \dots, y_n) \\
&\ \,\vdots \\
y_n' &= F_n(y_1, y_2, \dots, y_n)
\end{aligned}$$

where $F_1, F_2, \dots, F_n$ are functions of $n$ variables, is called an autonomous system and it
will be written more succinctly as $y' = F(y)$, where $F = (F_1, \dots, F_n)$. (The expression
autonomous means “independent of time” and it indicates that the variable $t$, which may
be thought of as a time variable, does not appear explicitly on the right hand side.) The
system (*) is the special type of autonomous system in which each $F_i$ is a linear function. In
many instances it is desirable to analyze the behavior of solutions to an autonomous system
of differential equations without explicitly finding these solutions (indeed, it is unlikely that it
will be possible to find explicit solutions for a given nonlinear system). This investigation falls
under the rubric “qualitative analysis” of autonomous differential equations and the rudiments
of this study are often treated in basic calculus courses for $1 \times 1$ systems. The first step in
a qualitative analysis of an $n \times n$ autonomous system is to find the steady states, namely the
constant solutions (these are called steady states since they do not change with $t$). Note that
a constant function $y = c$, where $c$ is the $n \times 1$ constant vector with entries $c_1, \dots, c_n$, is a
solution to $y' = F(y)$ if and only if

$$c_i' = 0 = F_i(c_1, \dots, c_n) \quad \text{for } i = 1, 2, \dots, n,$$

so the steady states are found by computing the zeros of $F$ (in the case of a nonlinear system
this may require numerical methods). Next, given the initial value of some solution, one wishes
to analyze the behavior of this solution as $t \to \infty$. This is called the asymptotic behavior of the
solution. Again, it may not be possible to find the solution explicitly, although by the general
theory of differential equations a solution to the initial value problem is unique provided the
functions $F_i$ are differentiable. A steady state $y = c$ is called globally asymptotically stable if
every solution tends to $c$ as $t \to \infty$, i.e., for any solution $y(t)$ we have
$\lim_{t \to \infty} y_i(t) = c_i$, for all $i = 1, 2, \dots, n$.

In the case of the linear autonomous system (*) the solutions form a vector space, so the 
only constant solution is the zero solution. The next exercise gives a sufficient condition for 
zero to be globally asymptotically stable and it gives one example of how the behavior of a 
linear system may be analyzed in terms of the eigenvalues of its coefficient matrix. Nonlinear
systems can be approximated by linear systems in some neighborhood of a steady state by
considering the system $y' = Ty$, where $T = \left(\dfrac{\partial F_i}{\partial y_j}\right)$ is the $n \times n$ Jacobian matrix of $F$ evaluated at the
steady state point. In this way the analysis of linear systems plays an important role in the local
analysis of general autonomous systems.
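
As a small illustration of analyzing a linear system through its eigenvalues, the following sketch treats only the $2 \times 2$ case, where the eigenvalues are the roots of $x^2 - \operatorname{tr}(A)x + \det(A)$. The function names and the sample matrices are assumptions made for this example; the test it applies is the sufficient condition stated in the next exercise (all eigenvalues have negative real part), so a `False` result only means that this particular condition does not apply.

```python
import cmath


def eigenvalues_2x2(m):
    """Roots of the characteristic polynomial x^2 - tr(m) x + det(m)."""
    (a, b), (c, d) = m
    trace = a + d
    det = a * d - b * c
    root = cmath.sqrt(trace * trace - 4 * det)
    return ((trace + root) / 2, (trace - root) / 2)


def satisfies_stability_condition(m):
    """True when every eigenvalue of the 2x2 matrix m has negative real part."""
    return all(ev.real < 0 for ev in eigenvalues_2x2(m))


# Hypothetical coefficient matrices used only for illustration.
stable = satisfies_stability_condition([[0, 1], [-2, -3]])    # eigenvalues -1, -2
saddle = satisfies_stability_condition([[0, 1], [1, 0]])      # eigenvalues 1, -1
rotation = satisfies_stability_condition([[0, 1], [-1, 0]])   # eigenvalues i, -i
```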

59. Prove that the solution of (*) given by $y_i(t) = 0$ for all $i \in \{1, \dots, n\}$ (i.e., the zero
solution) is globally asymptotically stable if all the eigenvalues of $A$ have negative real
parts. [For those unfamiliar with the behavior of the complex exponential function, assume
all eigenvalues are real (hence are negative real numbers). Use the explicit nature of the
solutions to show that they all tend to zero as $t \to \infty$.]
Part IV


FIELD THEORY AND GALOIS THEORY 


The previous sections have developed the theory of some of the basic algebraic structures
of groups, rings and fields. The next two chapters consider properties of fields,
particularly fields which arise from trying to solve equations (such as the simple equation
$x^2 + 1 = 0$), and fields which naturally arise in trying to perform “arithmetic” (adding,
subtracting, multiplying and dividing). The elegant and beautiful Galois Theory relates
the structure of fields to certain related groups and is one of the basic algebraic tools.
Applications include solutions of classical compass and straightedge construction questions,
finite fields and Abel’s famous theorem on the insolvability (by radicals) of the
general quintic polynomial.
CHAPTER 13


Field Theory 


13.1 BASIC THEORY OF FIELD EXTENSIONS 

Recall that a field $F$ is a commutative ring with identity in which every nonzero element
has an inverse. Equivalently, the set $F^{\times} = F - \{0\}$ of nonzero elements of $F$ is an
abelian group under multiplication.

One of the first invariants associated with any field $F$ is its characteristic, defined
as follows: If $1_F$ denotes the identity of $F$, then $F$ contains the elements $1_F$, $1_F + 1_F$,
$1_F + 1_F + 1_F, \dots$ of the additive subgroup of $F$ generated by $1_F$, which may not all
be distinct. For $n$ a positive integer, let $n \cdot 1_F = 1_F + \cdots + 1_F$ ($n$ times). Then two
possibilities arise: either all the elements $n \cdot 1_F$ are distinct, or else $n \cdot 1_F = 0$ for some
positive integer $n$.

Definition. The characteristic of a field $F$, denoted $\mathrm{ch}(F)$, is defined to be the smallest
positive integer $p$ such that $p \cdot 1_F = 0$ if such a $p$ exists and is defined to be 0 otherwise.

It is easy to see that

$$n \cdot 1_F + m \cdot 1_F = (m + n) \cdot 1_F \quad \text{and that} \quad (n \cdot 1_F)(m \cdot 1_F) = mn \cdot 1_F \qquad (13.1)$$

for positive integers $m$ and $n$. It follows that the characteristic of a field is either 0 or a
prime $p$ (hence the choice of $p$ in the definition above), since if $n = ab$ is composite
with $n \cdot 1_F = 0$, then $ab \cdot 1_F = (a \cdot 1_F)(b \cdot 1_F) = 0$ and since $F$ is a field, one of $a \cdot 1_F$
or $b \cdot 1_F$ is 0, so the smallest such integer is necessarily a prime. It also follows that if
$n \cdot 1_F = 0$, then $n$ is divisible by $p$.
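
The argument can be illustrated with the rings $\mathbb{Z}/m\mathbb{Z}$. The sketch below is an illustration only; the function names are hypothetical. It computes the smallest positive $n$ with $n \cdot 1 = 0$ by repeated addition and lists the zero divisor pairs. When $m$ is composite the factorization $m = ab$ produces nonzero elements whose product is 0, which is exactly what cannot happen in a field.

```python
def additive_order_of_one(m):
    """Smallest positive n with n * 1 = 0 in Z/mZ, found by repeated addition."""
    total, n = 1 % m, 1
    while total != 0:
        total = (total + 1) % m
        n += 1
    return n


def zero_divisor_pairs(m):
    """Pairs (a, b), a <= b, of nonzero residues with a * b = 0 in Z/mZ."""
    return [(a, b) for a in range(1, m) for b in range(a, m) if (a * b) % m == 0]


order_mod_7 = additive_order_of_one(7)   # Z/7Z is a field
pairs_mod_7 = zero_divisor_pairs(7)
order_mod_6 = additive_order_of_one(6)   # Z/6Z is not a field
pairs_mod_6 = zero_divisor_pairs(6)
```

In $\mathbb{Z}/6\mathbb{Z}$ the pair $(2, 3)$ reflects $6 \cdot 1 = (2 \cdot 1)(3 \cdot 1) = 0$ with neither factor zero.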

Proposition 1. The characteristic of a field $F$, $\mathrm{ch}(F)$, is either 0 or a prime $p$. If
$\mathrm{ch}(F) = p$ then for any $a \in F$,

$$p \cdot a = \underbrace{a + a + \cdots + a}_{p \text{ times}} = 0.$$


Proof: Only the second statement has not been proved, and this follows immediately
from the evident equality $p \cdot a = p \cdot (1_F a) = (p \cdot 1_F)(a)$ in $F$.

Remark: This notion of a characteristic makes sense also for any integral domain and
its characteristic will be the same as for its field of fractions.

Examples 

(1) The fields $\mathbb{Q}$ and $\mathbb{R}$ both have characteristic 0: $\mathrm{ch}(\mathbb{Q}) = \mathrm{ch}(\mathbb{R}) = 0$. The integral
domain $\mathbb{Z}$ also has characteristic 0.

(2) The (finite) field $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$ has characteristic $p$ for any prime $p$.

(3) The integral domain $\mathbb{F}_p[x]$ of polynomials in the variable $x$ with coefficients in the
field $\mathbb{F}_p$ has characteristic $p$, as does its field of fractions $\mathbb{F}_p(x)$ (the field of rational
functions in $x$ with coefficients in $\mathbb{F}_p$).

If we define $(-n) \cdot 1_F = -(n \cdot 1_F)$ for positive $n$ and $0 \cdot 1_F = 0$, then we have a
natural ring homomorphism (by equation (13.1))

$$\begin{aligned} \varphi : \mathbb{Z} &\longrightarrow F \\ n &\longmapsto n \cdot 1_F \end{aligned}$$

and we can interpret the characteristic of $F$ by noting that $\ker(\varphi) = \mathrm{ch}(F)\mathbb{Z}$. Taking
the quotient by the kernel gives us an injection of either $\mathbb{Z}$ or $\mathbb{Z}/p\mathbb{Z}$ into $F$ (depending
on whether $\mathrm{ch}(F) = 0$ or $\mathrm{ch}(F) = p$). Since $F$ is a field, we see that $F$ contains a
subfield isomorphic either to $\mathbb{Q}$ (the field of fractions of $\mathbb{Z}$) or to $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$ (the field
of fractions of $\mathbb{Z}/p\mathbb{Z}$) depending on the characteristic of $F$, and in either case this is the
smallest subfield of $F$ containing $1_F$ (the field generated by $1_F$ in $F$).

Definition. The prime subfield of a field $F$ is the subfield of $F$ generated by the
multiplicative identity $1_F$ of $F$. It is (isomorphic to) either $\mathbb{Q}$ (if $\mathrm{ch}(F) = 0$) or $\mathbb{F}_p$ (if
$\mathrm{ch}(F) = p$).

Remark: We shall usually denote the identity $1_F$ of a field $F$ simply by 1. Then in
a field of characteristic $p$, one has $p \cdot 1 = 0$, frequently written simply $p = 0$ (for
example, $2 = 0$ in a field of characteristic 2). It should be kept in mind, however, that
this is a shorthand statement: the element “$p$” is really $p \cdot 1_F$ and is not a distinct
element in $F$. This notation is useful in light of the second statement in Proposition 1.

Examples 

(1) The prime subfield of both $\mathbb{Q}$ and $\mathbb{R}$ is $\mathbb{Q}$.

(2) The prime subfield of the field $\mathbb{F}_p(x)$ is isomorphic to $\mathbb{F}_p$, given by the constant
polynomials.

Definition. If K is a field containing the subfield F, then K is said to be an extension 
field (or simply an extension) of F, denoted K/F or by the diagram 

K 

F 

In particular, every field F is an extension of its prime subfield. The field F is sometimes 
called the base field of the extension. 
The notation $K/F$ for a field extension is a shorthand for “$K$ over $F$” and is not
the quotient of $K$ by $F$.

If K/F is any extension of fields, then the multiplication defined in K makes K 
into a vector space over F. In particular every field F can be considered as a vector 
space over its prime field. 


Definition. The degree (or relative degree or index) of a field extension $K/F$, denoted
$[K : F]$, is the dimension of $K$ as a vector space over $F$ (i.e., $[K : F] = \dim_F K$). The
extension is said to be finite if $[K : F]$ is finite and is said to be infinite otherwise.


An important class of field extensions are those obtained by trying to solve equations
over a given field $F$. For example, if $F = \mathbb{R}$ is the field of real numbers, then the simple
equation $x^2 + 1 = 0$ does not have a solution in $F$. The question arises whether there is
some larger field containing $\mathbb{R}$ in which this equation does have a solution, and it was
this question that led Gauss to introduce the complex numbers $\mathbb{C} = \mathbb{R} + \mathbb{R}i$, where $i$ is
defined so that $i^2 + 1 = 0$. One then defines addition and multiplication in $\mathbb{C}$ by the
usual rules familiar from elementary algebra and checks that in fact $\mathbb{C}$ so defined is a
field, i.e., it is possible to find an inverse for every nonzero element of $\mathbb{C}$.

Given any field F and any polynomial p(x) e F[x] one can ask a similar question: 
does there exist an extension K of F containing a solution of the equation p(x) = 0 
(i.e., containing a root of p(x))? Note that we may assume here that the polynomial 
p(x) is irreducible in F[x] since a root of any factor of p(x) is certainly a root of 
p(x) itself. The answer is yes and follows almost immediately from our work on the 
polynomial ring $F[x]$. We first recall the following useful result on homomorphisms
of fields (Corollary 10 of Chapter 7) which follows from the fact that the only ideals of 
a field F are 0 and F. 


Proposition 2. Let $\varphi : F \to F'$ be a homomorphism of fields. Then $\varphi$ is either
identically 0 or is injective, so that the image of $\varphi$ is either 0 or isomorphic to $F$.


Theorem 3. Let $F$ be a field and let $p(x) \in F[x]$ be an irreducible polynomial. Then
there exists a field $K$ containing an isomorphic copy of $F$ in which $p(x)$ has a root.
Identifying $F$ with this isomorphic copy shows that there exists an extension of $F$ in
which $p(x)$ has a root.


Proof: Consider the quotient

$$K = F[x]/(p(x))$$

of the polynomial ring $F[x]$ by the ideal generated by $p(x)$. Since by assumption $p(x)$ is
an irreducible polynomial in the P.I.D. $F[x]$, the ideal $(p(x))$ is a maximal ideal. Hence
$K$ is actually a field (this is Proposition 12 of Chapter 7). The canonical projection $\pi$
of $F[x]$ to the quotient $F[x]/(p(x))$ restricted to $F \subset F[x]$ gives a homomorphism
$\varphi = \pi|_F : F \to K$ which is not identically 0 since it maps the identity 1 of $F$ to the
identity 1 of $K$. Hence by the proposition above, $\varphi(F) \cong F$ is an isomorphic copy
of $F$ contained in $K$. We identify $F$ with its isomorphic image in $K$ and view $F$ as a
subfield of $K$. If $\bar{x} = \pi(x)$ denotes the image of $x$ in the quotient $K$, then

$$\begin{aligned}
p(\bar{x}) &= \overline{p(x)} && \text{(since $\pi$ is a homomorphism)} \\
&= p(x) \pmod{p(x)} && \text{in } F[x]/(p(x)) \\
&= 0 && \text{in } F[x]/(p(x))
\end{aligned}$$

so that $K$ does indeed contain a root of the polynomial $p(x)$. Then $K$ is an extension
of $F$ in which the polynomial $p(x)$ has a root.

We shall use this result later to construct extensions of F containing all the roots 
of p(x) (this is the notion of a splitting field and one of the central objects of interest in 
Galois theory). 

To understand the field K = F[x]/(p(x)) constructed above more fully, it is useful 
to have a simple representation for the elements of this field. Since F is a subfield of 
$K$, we might in particular ask for a basis for $K$ as a vector space over $F$.

Theorem 4. Let $p(x) \in F[x]$ be an irreducible polynomial of degree $n$ over the field
$F$ and let $K$ be the field $F[x]/(p(x))$. Let $\theta = x \bmod (p(x)) \in K$. Then the elements

$$1, \theta, \theta^2, \dots, \theta^{n-1}$$

are a basis for $K$ as a vector space over $F$, so the degree of the extension is $n$, i.e.,
$[K : F] = n$. Hence

$$K = \{a_0 + a_1\theta + a_2\theta^2 + \cdots + a_{n-1}\theta^{n-1} \mid a_0, a_1, \dots, a_{n-1} \in F\}$$

consists of all polynomials of degree $< n$ in $\theta$.

Proof: Let $a(x) \in F[x]$ be any polynomial with coefficients in $F$. Since $F[x]$ is a
Euclidean Domain (this is Theorem 3 of Chapter 9), we may divide $a(x)$ by $p(x)$:

$$a(x) = q(x)p(x) + r(x) \qquad q(x), r(x) \in F[x] \text{ with } \deg r(x) < n.$$

Since $q(x)p(x)$ lies in the ideal $(p(x))$, it follows that $a(x) \equiv r(x) \bmod (p(x))$, which
shows that every residue class in $F[x]/(p(x))$ is represented by a polynomial of degree
less than $n$. Hence the images $1, \theta, \theta^2, \dots, \theta^{n-1}$ of $1, x, x^2, \dots, x^{n-1}$ in the quotient
span the quotient as a vector space over $F$. It remains to see that these elements are
linearly independent, so form a basis for the quotient over $F$.

If the elements $1, \theta, \theta^2, \dots, \theta^{n-1}$ were not linearly independent in $K$, then there
would be a linear combination

$$b_0 + b_1\theta + b_2\theta^2 + \cdots + b_{n-1}\theta^{n-1} = 0$$

in $K$, with $b_0, b_1, \dots, b_{n-1} \in F$, not all 0. This is equivalent to

$$b_0 + b_1 x + b_2 x^2 + \cdots + b_{n-1}x^{n-1} \equiv 0 \bmod (p(x))$$

i.e.,

$$p(x) \text{ divides } b_0 + b_1 x + b_2 x^2 + \cdots + b_{n-1}x^{n-1}$$

in $F[x]$. But this is impossible, since $p(x)$ is of degree $n$ and the degree of the nonzero
polynomial on the right is $< n$. This proves that $1, \theta, \theta^2, \dots, \theta^{n-1}$ are a basis for $K$
over $F$, so that $[K : F] = n$ by definition. The last statement of the theorem is clear.

This theorem provides an easy description of the elements of the field $F[x]/(p(x))$
as polynomials of degree $< n$ in $\theta$ where $\theta$ is an element (in $K$) with $p(\theta) = 0$. It
remains only to see how to add and multiply elements written in this form. The addition
in the quotient $F[x]/(p(x))$ is just usual addition of polynomials. The multiplication
of polynomials $a(x)$ and $b(x)$ in the quotient $F[x]/(p(x))$ is performed by finding the
product $a(x)b(x)$ in $F[x]$, then finding the representative of degree $< n$ for the coset
$a(x)b(x) + (p(x))$ (as in the proof above) by dividing $a(x)b(x)$ by $p(x)$ and finding
the remainder.

This can also be done easily in terms of $\theta$ as follows: We may suppose $p(x)$ is
monic (since its roots and the ideal it generates do not change by multiplying by a
constant), say $p(x) = x^n + p_{n-1}x^{n-1} + \cdots + p_1 x + p_0$. Then in $K$, since $p(\theta) = 0$,
we have

$$\theta^n = -(p_{n-1}\theta^{n-1} + \cdots + p_1\theta + p_0)$$

i.e., $\theta^n$ is a linear combination of lower powers of $\theta$. Multiplying both sides by $\theta$ and
replacing the $\theta^n$ on the right hand side by these lower powers again, we see that also
$\theta^{n+1}$ is a polynomial of degree $< n$ in $\theta$. Similarly, any positive power of $\theta$ can be
written as a polynomial of degree $< n$ in $\theta$, hence any polynomial in $\theta$ can be written
as a polynomial of degree $< n$ in $\theta$. Multiplication in $K$ is now easily performed: one
simply writes the product of two polynomials of degree $< n$ in $\theta$ as another polynomial
of degree $< n$ in $\theta$.
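
As a short worked instance of this reduction (an illustrative example, not taken from the text), take $F = \mathbb{Q}$ and $p(x) = x^2 + x + 1$, which has no rational roots and so is irreducible over $\mathbb{Q}$. Here $n = 2$ and the relation $p(\theta) = 0$ rewrites every power of $\theta$ as a polynomial of degree $< 2$ in $\theta$:

```latex
\begin{align*}
\theta^2 &= -(\theta + 1) = -\theta - 1,\\
\theta^3 &= \theta\cdot\theta^2 = -\theta^2 - \theta = (\theta + 1) - \theta = 1,\\
\theta^4 &= \theta\cdot\theta^3 = \theta .
\end{align*}
```

Each line multiplies the previous one by $\theta$ and then replaces $\theta^2$ by $-\theta - 1$, exactly as described above.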

We summarize this as: 

Corollary 5. Let $K$ be as in Theorem 4, and let $a(\theta), b(\theta) \in K$ be two polynomials of
degree $< n$ in $\theta$. Then addition in $K$ is defined simply by usual polynomial addition
and multiplication in $K$ is defined by

$$a(\theta)b(\theta) = r(\theta)$$

where $r(x)$ is the remainder (of degree $< n$) obtained after dividing the polynomial
$a(x)b(x)$ by $p(x)$ in $F[x]$.

By the results proved above, this definition of addition and multiplication on the
polynomials of degree $< n$ in $\theta$ makes $K$ into a field, so that one can also divide by
nonzero elements, which is not so immediately obvious from the definitions of
the operations.
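
The multiplication rule of Corollary 5 can be sketched for $F = \mathbb{Q}$ as follows. This is a minimal illustration under stated assumptions: polynomials are stored as coefficient lists with the lowest degree first, and the helper names `poly_mul`, `poly_rem` and `mul_mod` are hypothetical.

```python
from fractions import Fraction


def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists, lowest degree first."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += Fraction(ai) * Fraction(bj)
    return out


def poly_rem(a, p):
    """Remainder of a(x) on division by p(x), padded to length deg p."""
    r = [Fraction(c) for c in a]
    n = len(p) - 1                        # degree of p; p[-1] is assumed nonzero
    while len(r) > n:
        lead = r.pop() / Fraction(p[-1])  # coefficient of the current top term
        shift = len(r) - n                # subtract lead * x^shift * p(x)
        for k in range(n):
            r[shift + k] -= lead * p[k]
    return r + [Fraction(0)] * (n - len(r))


def mul_mod(a, b, p):
    """Multiplication in Q[x]/(p(x)): multiply in Q[x], then take the remainder."""
    return poly_rem(poly_mul(a, b), p)


# (1 + 2*theta)(3 + 4*theta) in Q[x]/(x^2 + 1)
product_mod_x2_plus_1 = mul_mod([1, 2], [3, 4], [1, 0, 1])
```

With $\theta^2 = -1$ the sample product is $3 + 10\theta + 8\theta^2 = -5 + 10\theta$, which is what the remainder computation returns.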

It is also important in Theorem 4 that the polynomial $p(x)$ be irreducible over $F$. In
general the addition and multiplication in Corollary 5 (which can be defined in the same
way for any polynomial $p(x)$) do not make the polynomials of degree $< n$ in $\theta$ into a
field if $p(x)$ is not irreducible. In fact, this set is not even an integral domain in general
(its structure is given by Proposition 16 of Chapter 9). To describe the field containing
a root $\theta$ of a general polynomial $f(x)$ over $F$, $f(x)$ is factored into irreducibles in $F[x]$
and the results above are applied to an irreducible factor $p(x)$ of $f(x)$ having $\theta$ as a
root. We shall consider this more in the following sections.
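
The role of irreducibility can be seen in the smallest possible case. The sketch below is illustrative only and its names are hypothetical: it works over the field $\mathbb{F}_2$ with elements $a + b\theta$ stored as pairs `(a, b)`, and compares the irreducible polynomial $x^2 + x + 1$ with the reducible polynomial $x^2 + 1 = (x + 1)^2$.

```python
def make_mul(c0, c1):
    """Multiplication on pairs (a, b) = a + b*theta over F_2,
    where theta^2 = c0 + c1*theta."""
    def mul(u, v):
        a, b = u
        c, d = v
        return ((a * c + b * d * c0) % 2, (a * d + b * c + b * d * c1) % 2)
    return mul


ELEMENTS = [(0, 0), (1, 0), (0, 1), (1, 1)]


def zero_divisors(mul):
    """Nonzero elements u for which some nonzero v gives u * v = 0."""
    nonzero = [e for e in ELEMENTS if e != (0, 0)]
    return [u for u in nonzero if any(mul(u, v) == (0, 0) for v in nonzero)]


mul_irreducible = make_mul(1, 1)   # p(x) = x^2 + x + 1, so theta^2 = theta + 1
mul_reducible = make_mul(1, 0)     # p(x) = x^2 + 1 = (x + 1)^2, so theta^2 = 1
```

For $x^2 + 1$ the element $1 + \theta$ satisfies $(1 + \theta)^2 = 0$, so the quotient is not an integral domain, while for $x^2 + x + 1$ no zero divisors appear.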
Examples

(1) If we apply this construction to the special case $F = \mathbb{R}$ and $p(x) = x^2 + 1$ then we
obtain the field

$$\mathbb{R}[x]/(x^2 + 1)$$

which is an extension of degree 2 of $\mathbb{R}$ in which $x^2 + 1$ has a root. The elements of
this field are of the form $a + b\theta$ for $a, b \in \mathbb{R}$. Addition is defined by

$$(a + b\theta) + (c + d\theta) = (a + c) + (b + d)\theta. \qquad (13.2a)$$

To multiply we use the fact that $\theta^2 + 1 = 0$, i.e., $\theta^2 = -1$ in $K$. (Alternatively, note
that $-1$ is also the remainder when $x^2$ is divided by $x^2 + 1$ in $\mathbb{R}[x]$.) Then

$$\begin{aligned}
(a + b\theta)(c + d\theta) &= ac + (ad + bc)\theta + bd\theta^2 \\
&= ac + (ad + bc)\theta + bd(-1) \\
&= (ac - bd) + (ad + bc)\theta. \qquad (13.2b)
\end{aligned}$$

These are, up to changing $\theta$ to $i$, the formulas for adding and multiplying in $\mathbb{C}$. Put
another way, the map

$$\begin{aligned} \varphi : \mathbb{R}[x]/(x^2 + 1) &\longrightarrow \mathbb{C} \\ a + bx &\longmapsto a + bi \end{aligned}$$

is a homomorphism. Since it is bijective (as a map of vector spaces over the reals, for
example), it is an isomorphism. Notice that instead of taking the existence of $\mathbb{C}$ for
granted (along with the fairly tedious verification that it is in fact a field), we could
have defined $\mathbb{C}$ by this isomorphism. Then the fact that it is a field is a consequence
of Theorem 4.

(2) Take now $F = \mathbb{Q}$ to be the field of rational numbers and again take $p(x) = x^2 + 1$ (still
irreducible over $\mathbb{Q}$, of course). Then the same construction, with the same addition and
multiplication formulas as (13.2a) and (13.2b) above, except that now $a$ and $b$ are elements
of $\mathbb{Q}$, defines a field extension $\mathbb{Q}(i)$ of $\mathbb{Q}$ of degree 2 containing a root $i$ of $x^2 + 1$.

(3) Take $F = \mathbb{Q}$ and $p(x) = x^2 - 2$, irreducible over $\mathbb{Q}$ by Eisenstein’s Criterion, for
example. Then we obtain a field extension of $\mathbb{Q}$ of degree 2 containing a square root
$\theta$ of 2, denoted $\mathbb{Q}(\theta)$. If we denote $\theta$ by $\sqrt{2}$, the elements of this field are of the form

$$a + b\sqrt{2}, \qquad a, b \in \mathbb{Q}$$

with addition defined by

$$(a + b\sqrt{2}) + (c + d\sqrt{2}) = (a + c) + (b + d)\sqrt{2}$$

and multiplication defined by

$$(a + b\sqrt{2})(c + d\sqrt{2}) = (ac + 2bd) + (ad + bc)\sqrt{2}.$$

(4) Let $F = \mathbb{Q}$ and $p(x) = x^3 - 2$, irreducible again by Eisenstein. Denoting a root of
$p(x)$ by $\theta$, we obtain the field

$$\mathbb{Q}[x]/(x^3 - 2) \cong \{a + b\theta + c\theta^2 \mid a, b, c \in \mathbb{Q}\}$$

with $\theta^3 = 2$, an extension of degree 3. To find the inverse of, say, $1 + \theta$ in this field,
we can proceed as follows: By the Euclidean Algorithm in $\mathbb{Q}[x]$ there are polynomials
$a(x)$ and $b(x)$ with

$$a(x)(1 + x) + b(x)(x^3 - 2) = 1$$

(since $p(x) = x^3 - 2$ is irreducible, it is relatively prime to every polynomial of smaller
degree). In the quotient field this equation implies that $a(\theta)$ is the inverse of $1 + \theta$.
In this case, a simple computation shows that we can take $a(x) = \frac{1}{3}(x^2 - x + 1)$ (and
$b(x) = -\frac{1}{3}$), so that

$$(1 + \theta)^{-1} = \frac{\theta^2 - \theta + 1}{3}.$$


(5) In general, if $\theta \in K$ is a root of the irreducible polynomial

$$p(x) = p_n x^n + p_{n-1}x^{n-1} + \cdots + p_1 x + p_0$$

we can compute $\theta^{-1} \in K$ from

$$\theta(p_n\theta^{n-1} + p_{n-1}\theta^{n-2} + \cdots + p_1) = -p_0$$

namely

$$\theta^{-1} = \frac{-1}{p_0}(p_n\theta^{n-1} + p_{n-1}\theta^{n-2} + \cdots + p_1) \in K$$

(note that $p_0 \neq 0$ since $p(x)$ is irreducible).


Remark: Determining inverses in extensions of this type may be familiar from elementary
algebra in the case of $\mathbb{C}$ or Example 3 under the name “rationalizing denominators.” The
last two examples indicate a procedure which is much more general than the ad hoc
procedures of elementary algebra.

(6) Take $F = \mathbb{F}_2$, the finite field with two elements, and $p(x) = x^2 + x + 1$, which we
have previously checked is irreducible over $\mathbb{F}_2$. Here we obtain a degree 2 extension
of $\mathbb{F}_2$

$$\mathbb{F}_2[x]/(x^2 + x + 1) \cong \{a + b\theta \mid a, b \in \mathbb{F}_2\}$$

where $\theta^2 = -\theta - 1 = \theta + 1$. Multiplication in this field $\mathbb{F}_2(\theta)$ (which contains four
elements) is defined by

$$\begin{aligned}
(a + b\theta)(c + d\theta) &= ac + (ad + bc)\theta + bd\theta^2 \\
&= ac + (ad + bc)\theta + bd(\theta + 1) \\
&= (ac + bd) + (ad + bc + bd)\theta.
\end{aligned}$$

(7) Let $F = k(t)$ be the field of rational functions in the variable $t$ over a field $k$ (for
example, $k = \mathbb{Q}$ or $k = \mathbb{F}_p$). Let $p(x) = x^2 - t \in F[x]$. Then $p(x)$ is irreducible
(it is Eisenstein at the prime $(t)$ in $k[t]$). If we denote a root by $\theta$, the corresponding
degree 2 field extension $F(\theta)$ consists of the elements

$$\{a(t) + b(t)\theta \mid a(t), b(t) \in F\}$$

where the coefficients $a(t)$ and $b(t)$ are rational functions in $t$ with coefficients in $k$
and where $\theta^2 = t$.
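
The inverse computation of Example 4 can be carried out by the extended Euclidean Algorithm in $\mathbb{Q}[x]$. The following is a minimal sketch under stated assumptions, not a definitive implementation: polynomials are coefficient lists with the lowest degree first, all helper names are hypothetical, and `inverse_mod` assumes that $p(x)$ is irreducible and that $a(x)$ is not divisible by $p(x)$.

```python
from fractions import Fraction


def trim(a):
    """Drop trailing zero coefficients (lists are lowest degree first)."""
    a = [Fraction(c) for c in a]
    while a and a[-1] == 0:
        a.pop()
    return a


def add(a, b):
    """Sum of two polynomials."""
    n = max(len(a), len(b))
    a = a + [Fraction(0)] * (n - len(a))
    b = b + [Fraction(0)] * (n - len(b))
    return trim([x + y for x, y in zip(a, b)])


def mul(a, b):
    """Product of two polynomials; the empty list is the zero polynomial."""
    if not a or not b:
        return []
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return trim(out)


def divmod_poly(a, b):
    """Quotient and remainder of a by a nonzero polynomial b in Q[x]."""
    a, b = trim(a), trim(b)
    q = [Fraction(0)] * max(len(a) - len(b) + 1, 0)
    while len(a) >= len(b):
        shift = len(a) - len(b)
        coef = a[-1] / b[-1]
        q[shift] = coef
        # subtract coef * x^shift * b(x) to cancel the leading term of a
        a = add(a, mul([Fraction(0)] * shift + [-coef], b))
    return trim(q), a


def inverse_mod(a, p):
    """Return s with s(x) a(x) = 1 mod p(x)."""
    r0, r1 = trim(p), trim(a)
    s0, s1 = [], [Fraction(1)]          # invariant: r_i = s_i * a  (mod p)
    while r1:
        q, r = divmod_poly(r0, r1)
        r0, r1 = r1, r
        s0, s1 = s1, add(s0, mul([-c for c in q], s1))
    # r0 is now a nonzero constant (the gcd); rescale so that it becomes 1
    return [c / r0[0] for c in s0]


# Inverse of 1 + theta in Q[x]/(x^3 - 2)
inverse_of_one_plus_theta = inverse_mod([1, 1], [-2, 0, 0, 1])
```

The result is the coefficient list of $\frac{1}{3}(\theta^2 - \theta + 1)$, in agreement with Example 4; applied to $\theta$ itself in $\mathbb{Q}[x]/(x^2 - 2)$ the same routine returns $\theta/2$, as the formula of Example 5 predicts.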


Suppose $F$ is a subfield of a field $K$ and $\alpha \in K$ is an element of $K$. Then the
collection of subfields of $K$ containing both $F$ and $\alpha$ is nonempty ($K$ is such a field, for
example). Since the intersection of subfields is again a subfield, it follows that there is a
unique minimal subfield of $K$ containing both $F$ and $\alpha$ (the intersection of all subfields
with this property). Similar remarks apply if $\alpha$ is replaced by a collection $\alpha, \beta, \dots$ of
elements of $K$.

Definition. Let $K$ be an extension of the field $F$ and let $\alpha, \beta, \dots \in K$ be a collection
of elements of $K$. Then the smallest subfield of $K$ containing both $F$ and the elements
$\alpha, \beta, \dots$, denoted $F(\alpha, \beta, \dots)$, is called the field generated by $\alpha, \beta, \dots$ over $F$.

Definition. If the field $K$ is generated by a single element $\alpha$ over $F$, $K = F(\alpha)$, then
$K$ is said to be a simple extension of $F$ and the element $\alpha$ is called a primitive element
for the extension.

We shall later characterize which extensions of a field F are simple. In particular we 
shall prove that every finite extension of a field of characteristic 0 is a simple extension. 

The connection between the simple extension $F(\alpha)$ generated by $\alpha$ over $F$ where
$\alpha$ is a root of some irreducible polynomial $p(x)$ and the field constructed in Theorem 3
is provided by the following:

Theorem 6. Let $F$ be a field and let $p(x) \in F[x]$ be an irreducible polynomial. Suppose
$K$ is an extension field of $F$ containing a root $\alpha$ of $p(x)$: $p(\alpha) = 0$. Let $F(\alpha)$ denote
the subfield of $K$ generated over $F$ by $\alpha$. Then

$$F(\alpha) \cong F[x]/(p(x)).$$


Remark: This theorem says that any field over $F$ in which $p(x)$ has a root contains
a subfield isomorphic to the extension of $F$ constructed in Theorem 3 and that this
field is (up to isomorphism) the smallest extension of $F$ containing such a root. The
difference between this result and Theorem 3 is that Theorem 6 assumes the existence
of a root $\alpha$ of $p(x)$ in some field $K$ and the major point of Theorem 3 is proving that
there exists such an extension field $K$.

Proof: There is a natural homomorphism

$$\begin{aligned} \varphi : F[x] &\longrightarrow F(\alpha) \subseteq K \\ a(x) &\longmapsto a(\alpha) \end{aligned}$$

obtained by mapping $F$ to $F$ by the identity map and sending $x$ to $\alpha$ and then extending
so that the map is a ring homomorphism (i.e., the polynomial $a(x)$ in $x$ maps to the
polynomial $a(\alpha)$ in $\alpha$). Since $p(\alpha) = 0$ by assumption, the element $p(x)$ is in the
kernel of $\varphi$, so we obtain an induced homomorphism (also denoted $\varphi$):

$$\varphi : F[x]/(p(x)) \longrightarrow F(\alpha).$$

But since $p(x)$ is irreducible, the quotient on the left is a field, and $\varphi$ is not the 0 map
(it is the identity on $F$, for example), hence $\varphi$ is an isomorphism of the field on the left
with its image. Since this image is then a subfield of $F(\alpha)$ containing $F$ and containing
$\alpha$, by the definition of $F(\alpha)$ the map must be surjective, proving the theorem.

Combined with Corollary 5, this determines the field $F(\alpha)$ when $\alpha$ is a root of an
irreducible polynomial $p(x)$:

Corollary 7. Suppose in Theorem 6 that $p(x)$ is of degree $n$. Then

$$F(\alpha) = \{a_0 + a_1\alpha + a_2\alpha^2 + \cdots + a_{n-1}\alpha^{n-1} \mid a_0, a_1, \dots, a_{n-1} \in F\} \subseteq K.$$


Describing fields generated by more than one element is more complicated and we 
shall return to this question in the following section. 


Examples 

(1) In Example 3 above, we have determined the field $\mathbb{Q}(\sqrt{2})$ generated over $\mathbb{Q}$ by the
element $\sqrt{2} \in \mathbb{R}$, having suggestively denoted the abstract solution $\theta$ of the equation
$x^2 - 2 = 0$ by the symbol $\sqrt{2}$, which has an independent meaning in the field $\mathbb{R}$
(namely the positive square root of 2 in $\mathbb{R}$).

(2) The equation $x^2 - 2 = 0$ has another solution in $\mathbb{R}$, namely $-\sqrt{2}$, the negative square
root of 2 in $\mathbb{R}$. The field generated over $\mathbb{Q}$ by this solution consists of the elements
$\{a + b(-\sqrt{2}) \mid a, b \in \mathbb{Q}\}$, and is again isomorphic to the field in Example 3 above
(hence also isomorphic to the field just considered, the isomorphism given explicitly
by $a + b\sqrt{2} \mapsto a - b\sqrt{2}$). As a subset of $\mathbb{R}$ this is the same set of elements as in
Example 1.

(3) Similarly, if we use the symbol $\sqrt[3]{2}$ to denote the (positive) cube root of 2 in $\mathbb{R}$, then
the field generated by $\sqrt[3]{2}$ over $\mathbb{Q}$ in $\mathbb{R}$ consists of the elements

$$\{a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2 \mid a, b, c \in \mathbb{Q}\}$$

and is isomorphic to the field constructed in Example 4 above.

(4) The equation $x^3 - 2 = 0$ has no further solutions in $\mathbb{R}$, but there are two additional
solutions in $\mathbb{C}$ given by $\sqrt[3]{2}\left(\dfrac{-1 + i\sqrt{3}}{2}\right)$ and $\sqrt[3]{2}\left(\dfrac{-1 - i\sqrt{3}}{2}\right)$ ($\sqrt{3}$ denoting the positive
real square root of 3) as can easily be checked. The fields generated by either of
these two elements over $\mathbb{Q}$ are subfields of $\mathbb{C}$ (but not of $\mathbb{R}$) and are both isomorphic
to the field constructed in the previous example (and to Example 4 earlier).


As Theorem 6 indicates, the roots of an irreducible polynomial $p(x)$ are algebraically
indistinguishable in the sense that the fields obtained by adjoining any root
of an irreducible polynomial are isomorphic. In the last two examples above, the fields
obtained by adjoining one of the three possible (complex) roots of $x^3 - 2 = 0$ to $\mathbb{Q}$
were all algebraically isomorphic. The fields were distinguished not by their algebraic
properties, but by whether their elements were real, which involves continuous
operations.
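
This can be observed numerically. In the sketch below (an illustration with hypothetical names and arbitrarily chosen sample elements) a product is first computed formally in $\mathbb{Q}[x]/(x^3 - 2)$ using only $\theta^3 = 2$, and the result is then compared, in floating point, with the product of the corresponding complex numbers for each of the three roots of $x^3 - 2$.

```python
import cmath


def reduce_mod_x3_minus_2(coeffs):
    """Rewrite a polynomial in theta (lowest degree first) using theta^3 = 2."""
    out = [0, 0, 0]
    for k, c in enumerate(coeffs):
        out[k % 3] += c * 2 ** (k // 3)
    return out


def formal_product(u, v):
    """Product of u0 + u1*theta + u2*theta^2 and v0 + v1*theta + v2*theta^2."""
    raw = [0] * 5
    for i, ui in enumerate(u):
        for j, vj in enumerate(v):
            raw[i + j] += ui * vj
    return reduce_mod_x3_minus_2(raw)


def evaluate(coeffs, alpha):
    """Value of the polynomial with the given coefficients at the number alpha."""
    return sum(c * alpha ** k for k, c in enumerate(coeffs))


omega = cmath.exp(2j * cmath.pi / 3)                 # primitive cube root of unity
roots = [2 ** (1 / 3) * omega ** k for k in range(3)]

u, v = [1, 2, 3], [4, -1, 5]                         # arbitrary illustrative elements
w = formal_product(u, v)
max_error = max(abs(evaluate(u, a) * evaluate(v, a) - evaluate(w, a)) for a in roots)
```

The same formal coefficients describe the product no matter which root is substituted for $\theta$; only the numerical values of the elements differ from root to root.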

The fact that different roots of the same irreducible polynomial have the same 
algebraic properties can be extended slightly, as follows: 

Let $\varphi : F \xrightarrow{\sim} F'$ be an isomorphism of fields. The map $\varphi$ induces a ring isomorphism
(also denoted $\varphi$)

$$\varphi : F[x] \xrightarrow{\sim} F'[x]$$

defined by applying $\varphi$ to the coefficients of a polynomial in $F[x]$. Let $p(x) \in F[x]$
be an irreducible polynomial and let $p'(x) \in F'[x]$ be the polynomial obtained by
applying the map $\varphi$ to the coefficients of $p(x)$, i.e., the image of $p(x)$ under $\varphi$. The
isomorphism $\varphi$ maps the maximal ideal $(p(x))$ to the ideal $(p'(x))$, so this ideal is also
maximal, which shows that $p'(x)$ is also irreducible in $F'[x]$. The following theorem
shows that the fields obtained by adjoining a root of $p(x)$ to $F$ and a root of $p'(x)$ to
$F'$ have the same algebraic structure (i.e., are isomorphic):


Theorem 8. Let $\varphi : F \xrightarrow{\sim} F'$ be an isomorphism of fields. Let $p(x) \in F[x]$ be an
irreducible polynomial and let $p'(x) \in F'[x]$ be the irreducible polynomial obtained
by applying the map $\varphi$ to the coefficients of $p(x)$. Let $\alpha$ be a root of $p(x)$ (in some
extension of $F$) and let $\beta$ be a root of $p'(x)$ (in some extension of $F'$). Then there is an
isomorphism

$$\sigma : F(\alpha) \xrightarrow{\sim} F'(\beta)$$

mapping $\alpha$ to $\beta$ and extending $\varphi$, i.e., such that $\sigma$ restricted to $F$ is the isomorphism $\varphi$.


Proof: As noted above, the isomorphism $\varphi$ induces a natural isomorphism from
$F[x]$ to $F'[x]$ which maps the maximal ideal $(p(x))$ to the maximal ideal $(p'(x))$.
Taking the quotients by these ideals, we obtain an isomorphism of fields

$$F[x]/(p(x)) \xrightarrow{\sim} F'[x]/(p'(x)).$$

By Theorem 6 the field on the left is isomorphic to $F(\alpha)$ and by the same theorem the
field on the right is isomorphic to $F'(\beta)$. Composing these isomorphisms, we obtain the
isomorphism $\sigma$. It is clear that the restriction of this isomorphism to $F$ is $\varphi$, completing
the proof.


This extension theorem will be of considerable use when we consider Galois Theory 
later. It can be represented pictorially by the diagram 

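$$\begin{array}{rcl}
\sigma : \ F(\alpha) & \xrightarrow{\ \sim\ } & F'(\beta) \\
| \quad & & \quad | \\
\varphi : \quad F \quad & \xrightarrow{\ \sim\ } & \quad F'
\end{array}$$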

EXERCISES 


1. Show that $p(x) = x^3 + 9x + 6$ is irreducible in $\mathbb{Q}[x]$. Let $\theta$ be a root of $p(x)$. Find the
inverse of $1 + \theta$ in $\mathbb{Q}(\theta)$.

2. Show that $x^3 - 2x - 2$ is irreducible over $\mathbb{Q}$ and let $\theta$ be a root. Compute $(1 + \theta)(1 + \theta + \theta^2)$
and $\dfrac{1 + \theta}{1 + \theta + \theta^2}$ in $\mathbb{Q}(\theta)$.

3. Show that $x^3 + x + 1$ is irreducible over $\mathbb{F}_2$ and let $\theta$ be a root. Compute the powers of $\theta$
in $\mathbb{F}_2(\theta)$.

4. Prove directly that the map $a + b\sqrt{2} \mapsto a - b\sqrt{2}$ is an isomorphism of $\mathbb{Q}(\sqrt{2})$ with itself.

5. Suppose $\alpha$ is a rational root of a monic polynomial in $\mathbb{Z}[x]$. Prove that $\alpha$ is an integer.

6. Show that if $\alpha$ is a root of $a_n x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$ then $a_n\alpha$ is a root of the
monic polynomial $x^n + a_{n-1}x^{n-1} + a_n a_{n-2}x^{n-2} + \cdots + a_n^{n-2}a_1 x + a_n^{n-1}a_0$.

7. Prove that $x^3 - nx + 2$ is irreducible for $n \neq -1, 3, 5$.

8. Prove that $x^5 - ax - 1 \in \mathbb{Z}[x]$ is irreducible unless $a = 0$, $2$ or $-1$. The first two correspond
to linear factors, the third corresponds to the factorization $(x^2 - x + 1)(x^3 + x^2 - 1)$.
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13.2 ALGEBRAIC EXTENSIONS 


Let F be a field and let K be an extension of F. 

Definition. The element $\alpha \in K$ is said to be algebraic over $F$ if $\alpha$ is a root of some
nonzero polynomial $f(x) \in F[x]$. If $\alpha$ is not algebraic over $F$ (i.e., is not the root of
any nonzero polynomial with coefficients in $F$) then $\alpha$ is said to be transcendental over
$F$. The extension $K/F$ is said to be algebraic if every element of $K$ is algebraic over
$F$.


Note that if $\alpha$ is algebraic over a field $F$ then it is algebraic over any extension field
$L$ of $F$ (if $f(x)$ having $\alpha$ as a root has coefficients in $F$ then it also has coefficients in
$L$).

Proposition 9. Let $\alpha$ be algebraic over $F$. Then there is a unique monic irreducible
polynomial $m_{\alpha,F}(x) \in F[x]$ which has $\alpha$ as a root. A polynomial $f(x) \in F[x]$ has $\alpha$
as a root if and only if $m_{\alpha,F}(x)$ divides $f(x)$ in $F[x]$.

Proof: Let $g(x) \in F[x]$ be a polynomial of minimal degree having $\alpha$ as a root.
Multiplying $g(x)$ by a constant, we may assume $g(x)$ is monic. Suppose $g(x)$ were
reducible in $F[x]$, say $g(x) = a(x)b(x)$ with $a(x), b(x) \in F[x]$ both of degree smaller
than the degree of $g(x)$. Then $g(\alpha) = a(\alpha)b(\alpha)$ in $K$, and since $K$ is a field, either
$a(\alpha) = 0$ or $b(\alpha) = 0$, contradicting the minimality of the degree of $g(x)$. It follows
that $g(x)$ is a monic irreducible polynomial having $\alpha$ as a root. Suppose now that
$f(x) \in F[x]$ is any polynomial having $\alpha$ as a root. By the Euclidean Algorithm in
$F[x]$ there are polynomials $q(x), r(x) \in F[x]$ such that

$$f(x) = q(x)g(x) + r(x) \quad \text{with } \deg r(x) < \deg g(x).$$

Then $f(\alpha) = q(\alpha)g(\alpha) + r(\alpha)$ in $K$ and since $\alpha$ is a root of both $f(x)$ and $g(x)$, we
obtain $r(\alpha) = 0$, which contradicts the minimality of $g(x)$ unless $r(x) = 0$. Hence
$g(x)$ divides any polynomial $f(x)$ in $F[x]$ having $\alpha$ as a root and, in particular, would
divide any other monic irreducible polynomial in $F[x]$ having $\alpha$ as a root. This proves
that $m_{\alpha,F}(x) = g(x)$ is unique and completes the proof of the proposition.
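
The divisibility criterion in Proposition 9 can be tried out numerically. The following is a minimal sketch, assuming $F = \mathbb{Q}$ and $\alpha = \sqrt{2}$ with minimal polynomial $x^2 - 2$; the helper name `poly_divmod` and the convention of listing coefficients from lowest degree to highest are choices made here for illustration, not notation from the text.

```python
from fractions import Fraction

def poly_divmod(f, g):
    """Divide f by g in Q[x] (coefficients listed lowest degree first).

    Returns (q, r) with f = q*g + r and deg r < deg g; r == [] means r = 0.
    Assumes the last entry of g (its leading coefficient) is nonzero.
    """
    r = [Fraction(c) for c in f]
    g = [Fraction(c) for c in g]
    q = [Fraction(0)] * max(len(r) - len(g) + 1, 1)
    while len(r) >= len(g) and any(r):
        shift = len(r) - len(g)
        coeff = r[-1] / g[-1]          # cancel the current leading term
        q[shift] = coeff
        for i, c in enumerate(g):
            r[i + shift] -= coeff * c
        r.pop()                        # the leading entry is now zero
    while r and r[-1] == 0:            # strip zeros so that [] means remainder 0
        r.pop()
    return q, r

m = [-2, 0, 1]                                   # x^2 - 2
q1, r1 = poly_divmod([-4, 0, 0, 0, 1], m)        # x^4 - 4 has sqrt(2) as a root
q2, r2 = poly_divmod([-2, 0, 0, 1], m)           # x^3 - 2 does not
print(r1 == [], r2)
```

Here $x^4 - 4$ leaves remainder $0$, in agreement with the proposition, while $x^3 - 2$ leaves the nonzero remainder $2x - 2$, whose value at $\sqrt{2}$ is $(\sqrt{2})^3 - 2 \neq 0$.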

Corollary 10. If $L/F$ is an extension of fields and $\alpha$ is algebraic over both $F$ and $L$,
then $m_{\alpha,L}(x)$ divides $m_{\alpha,F}(x)$ in $L[x]$.

Proof: This is immediate from the second statement in Proposition 9 applied to $L$,
since $m_{\alpha,F}(x)$ is a polynomial in $L[x]$ having $\alpha$ as a root.

Definition. The polynomial $m_{\alpha,F}(x)$ (or just $m_\alpha(x)$ if the field $F$ is understood) in
Proposition 9 is called the minimal polynomial for $\alpha$ over $F$. The degree of $m_\alpha(x)$ is
called the degree of $\alpha$.

Note that by the proposition, a monic polynomial over $F$ with $\alpha$ as a root is the
minimal polynomial for $\alpha$ over $F$ if and only if it is irreducible over $F$. Exercise 20
gives one method for computing the minimal polynomial for $\alpha$ over $F$, and the theory
of Gröbner bases can be used to compute the minimal polynomial for other elements
in $F(\alpha)$ (cf. Proposition 10 and Exercise 48 in Section 15.1).

Proposition 11. Let $\alpha$ be algebraic over the field $F$ and let $F(\alpha)$ be the field generated
by $\alpha$ over $F$. Then

$$F(\alpha) \cong F[x]/(m_\alpha(x))$$

so that in particular

$$[F(\alpha) : F] = \deg m_\alpha(x) = \deg \alpha,$$

i.e., the degree of $\alpha$ over $F$ is the degree of the extension it generates over $F$.

Proof: This follows immediately from Theorem 6.

Examples

(1) The minimal polynomial for $\sqrt{2}$ over $\mathbb{Q}$ is $x^2 - 2$ and $\sqrt{2}$ is of degree 2 over $\mathbb{Q}$:
$[\mathbb{Q}(\sqrt{2}) : \mathbb{Q}] = 2$.

(2) The minimal polynomial for $\sqrt[3]{2}$ over $\mathbb{Q}$ is $x^3 - 2$ and $\sqrt[3]{2}$ is of degree 3 over $\mathbb{Q}$:
$[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 3$.

(3) Similarly, for any $n > 1$, the polynomial $x^n - 2$ is irreducible over $\mathbb{Q}$ since it is
Eisenstein. Denoting a root of this polynomial by $\sqrt[n]{2}$ (where as usual we reserve this
symbol to denote the positive $n$th root of 2 if we want to view this root as an element
of $\mathbb{R}$, and where the symbol denotes any one of the algebraically indistinguishable
abstract solutions in general), we have $[\mathbb{Q}(\sqrt[n]{2}) : \mathbb{Q}] = n$.

(4) The minimal polynomial and the degree of an element $\alpha$ depend on the base field.
For example, over $\mathbb{R}$, the element $\sqrt{2}$ is of degree one, with minimal polynomial
$m_{\sqrt{2},\mathbb{R}}(x) = x - \sqrt{2}$.

(5) Consider the polynomial $p(x) = x^3 - 3x - 1$ over $\mathbb{Q}$, which is irreducible over $\mathbb{Q}$
since it is a cubic which has no rational root (cf. Proposition 11 of Chapter 9). Hence
$[\mathbb{Q}(\alpha) : \mathbb{Q}] = 3$ for any root $\alpha$ of $p(x)$. For future reference we note that a quick
sketch of the graph of this function over the real numbers shows that the graph crosses
the $x$-axis precisely once in the interval $[0, 2]$, i.e., there is precisely one real number
$\alpha$, $0 < \alpha < 2$, satisfying $\alpha^3 - 3\alpha - 1 = 0$.

Proposition 12. The element $\alpha$ is algebraic over $F$ if and only if the simple extension
$F(\alpha)/F$ is finite. More precisely, if $\alpha$ is an element of an extension of degree $n$ over $F$
then $\alpha$ satisfies a polynomial of degree at most $n$ over $F$ and if $\alpha$ satisfies a polynomial
of degree $n$ over $F$ then the degree of $F(\alpha)$ over $F$ is at most $n$.

Proof: If $\alpha$ is algebraic over $F$, then the degree of the extension $F(\alpha)/F$ is the
degree of the minimal polynomial for $\alpha$ over $F$. Hence the extension is finite, of degree
$\leq n$ if $\alpha$ satisfies a polynomial of degree $n$. Conversely, suppose $\alpha$ is an element of
an extension of degree $n$ over $F$ (for example, if $[F(\alpha) : F] = n$). Then the $n + 1$
elements

$$1, \alpha, \alpha^2, \dots, \alpha^n$$

of $F(\alpha)$ are linearly dependent over $F$, say

$$b_0 + b_1\alpha + b_2\alpha^2 + \cdots + b_n\alpha^n = 0$$

with $b_0, b_1, b_2, \dots, b_n \in F$ not all 0. Hence $\alpha$ is the root of a nonzero polynomial with
coefficients in $F$ (of degree $\leq n$), which proves $\alpha$ is algebraic over $F$ and also proves
the second statement of the proposition.
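
The linear dependence used in this proof can be found explicitly by elimination. The sketch below assumes $F = \mathbb{Q}$ and works in $\mathbb{Q}(\theta)$ with $\theta = \sqrt[3]{2}$, storing $a_0 + a_1\theta + a_2\theta^2$ as the list `[a0, a1, a2]`; the element $\alpha = 1 + \theta + \theta^2$ and the helper names `mul` and `dependence` are illustrative choices, not part of the text.

```python
from fractions import Fraction

def mul(u, v):
    """Multiply two elements of Q(theta), theta^3 = 2, in the basis 1, theta, theta^2."""
    prod = [Fraction(0)] * 5
    for i, a in enumerate(u):
        for j, b in enumerate(v):
            prod[i + j] += Fraction(a) * Fraction(b)
    # reduce with theta^3 = 2 and theta^4 = 2*theta
    return [prod[0] + 2 * prod[3], prod[1] + 2 * prod[4], prod[2]]

def dependence(vectors):
    """Return coefficients c, not all zero, with sum(c[k] * vectors[k]) = 0, or None."""
    rows = []  # entries (pivot index, reduced vector, combination of inputs giving it)
    for k, v in enumerate(vectors):
        v = [Fraction(x) for x in v]
        comb = [Fraction(int(i == k)) for i in range(len(vectors))]
        for p, w, cw in rows:
            if v[p] != 0:
                t = v[p] / w[p]
                v = [x - t * y for x, y in zip(v, w)]
                comb = [x - t * y for x, y in zip(comb, cw)]
        if not any(v):
            return comb                # v reduced to zero: comb is a dependence
        p = next(i for i, x in enumerate(v) if x != 0)
        rows.append((p, v, comb))
    return None

alpha = [1, 1, 1]                      # 1 + theta + theta^2
one = [1, 0, 0]
alpha2 = mul(alpha, alpha)
alpha3 = mul(alpha2, alpha)
relation = dependence([one, alpha, alpha2, alpha3])
print([str(c) for c in relation])
```

The returned coefficients $(-1, -3, -3, 1)$ say that $\alpha^3 - 3\alpha^2 - 3\alpha - 1 = 0$, a polynomial of degree $3 = [\mathbb{Q}(\theta) : \mathbb{Q}]$ satisfied by $\alpha$, as the proposition predicts.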

Corollary 13. If the extension $K/F$ is finite, then it is algebraic.

Proof: If $\alpha \in K$, then the subfield $F(\alpha)$ is in particular a subspace of the vector
space $K$ over $F$. Hence $[F(\alpha) : F] \leq [K : F]$ and so $\alpha$ is algebraic over $F$ by the
proposition.

Remark: We shall prove below a sort of converse to this result (Theorem 17), but note
that there are infinite algebraic extensions (we shall have an example later), so the literal
converse of this corollary is not true.

Example: (Quadratic Extensions over Fields of Characteristic $\neq 2$)

Let $F$ be a field of characteristic $\neq 2$ (for example, any field of characteristic 0, such as $\mathbb{Q}$)
and let $K$ be an extension of $F$ of degree 2, $[K : F] = 2$. Let $\alpha$ be any element of $K$ not
contained in $F$. By the proposition above, $\alpha$ satisfies an equation of degree at most 2 over
$F$. This equation cannot be of degree 1, since $\alpha$ is not an element of $F$ by assumption. It
follows that the minimal polynomial of $\alpha$ is a monic quadratic

$$m_\alpha(x) = x^2 + bx + c, \qquad b, c \in F.$$

Since $F \subset F(\alpha) \subseteq K$ and $F(\alpha)$ is already a vector space over $F$ of dimension 2, we have
$K = F(\alpha)$.

The roots of this quadratic equation can be determined by the quadratic formula, which
is valid over any field of characteristic $\neq 2$ (the formula is obtained as in elementary algebra
by completing the square):

$$\alpha = \frac{-b \pm \sqrt{b^2 - 4c}}{2}$$

(the reason for requiring the characteristic of $F$ not be 2 is that we must divide by 2). Here
$b^2 - 4c$ is not a square in $F$ since $\alpha$ is not an element of $F$ and the symbol $\sqrt{b^2 - 4c}$ denotes
a root of the equation $x^2 - (b^2 - 4c) = 0$ in $K$ (see the end of the next paragraph). Note
that here there is no natural choice of one of the roots analogous to choosing the positive
square root of 2 in $\mathbb{R}$; the roots are algebraically indistinguishable.

Now $F(\alpha) = F(\sqrt{b^2 - 4c})$ as follows: by the formula above, $\alpha$ is an element of the
field on the right, hence $F(\alpha) \subseteq F(\sqrt{b^2 - 4c})$. Conversely, $\sqrt{b^2 - 4c} = \mp(b + 2\alpha)$ shows
that $\sqrt{b^2 - 4c}$ is an element of $F(\alpha)$, which gives the reverse inclusion $F(\sqrt{b^2 - 4c}) \subseteq
F(\alpha)$ (and incidentally shows that the equation $x^2 - (b^2 - 4c) = 0$ does have a solution in
$K$).

It follows that any extension $K$ of $F$ of degree 2 is of the form $F(\sqrt{D})$ where $D$ is
an element of $F$ which is not a square in $F$, and conversely, every such extension is an
extension of degree 2 of $F$. For this reason, extensions of degree 2 of a field $F$ are called
quadratic extensions of $F$.
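
For reference, the completing-the-square step behind the quadratic formula in this example can be written out as follows. The numerical instance at the end ($F = \mathbb{Q}$ and the polynomial $x^2 - x - 1$) is an illustration chosen here, not one taken from the text.

```latex
\begin{align*}
x^2 + bx + c &= \Bigl(x + \tfrac{b}{2}\Bigr)^2 - \frac{b^2 - 4c}{4}, \\
\alpha^2 + b\alpha + c = 0 &\iff (2\alpha + b)^2 = b^2 - 4c . \\[4pt]
\text{For } x^2 - x - 1 \ (b = c = -1): \quad
b^2 - 4c &= 5, \qquad \alpha = \frac{1 \pm \sqrt{5}}{2}, \qquad
\mathbb{Q}(\alpha) = \mathbb{Q}(\sqrt{5}).
\end{align*}
```

Both displayed steps divide by 2 or by 4, which is where the hypothesis on the characteristic enters.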
Suppose that $F$ is a subfield of a field $K$ which in turn is a subfield of a field $L$.
Then there are three associated extension degrees: the dimension of $K$ and $L$ as vector
spaces over $F$, and the dimension of $L$ as a vector space over $K$.

Theorem 14. Let $F \subseteq K \subseteq L$ be fields. Then

$$[L : F] = [L : K][K : F],$$

i.e. extension degrees are multiplicative, where if one side of the equation is infinite,
the other side is also infinite. Pictorially,

$$\overbrace{F \;\underbrace{\subseteq}_{[K:F]}\; K \;\underbrace{\subseteq}_{[L:K]}\; L}^{[L:F]}$$

Proof: Suppose first that $[L : K] = m$ and $[K : F] = n$ are finite. Let
$\alpha_1, \alpha_2, \dots, \alpha_m$ be a basis for $L$ over $K$ and let $\beta_1, \beta_2, \dots, \beta_n$ be a basis for $K$ over $F$.
Then every element of $L$ can be written as a linear combination

$$a_1\alpha_1 + a_2\alpha_2 + \cdots + a_m\alpha_m$$

where $a_1, \dots, a_m$ are elements of $K$, hence are $F$-linear combinations of $\beta_1, \dots, \beta_n$:

$$a_i = b_{i1}\beta_1 + b_{i2}\beta_2 + \cdots + b_{in}\beta_n, \qquad i = 1, 2, \dots, m \tag{13.3}$$

where the $b_{ij}$ are elements of $F$. Substituting these expressions in for the coefficients
$a_i$ above, we see that every element of $L$ can be written as a linear combination

$$\sum_{\substack{i = 1, 2, \dots, m \\ j = 1, 2, \dots, n}} b_{ij}\,\alpha_i\beta_j$$

of the $mn$ elements $\alpha_i\beta_j$ with coefficients in $F$. Hence these elements span $L$ as a
vector space over $F$.

Suppose now that we had a linear relation in $L$

$$\sum_{\substack{i = 1, 2, \dots, m \\ j = 1, 2, \dots, n}} b_{ij}\,\alpha_i\beta_j = 0$$

with coefficients $b_{ij}$ in $F$. Then defining the elements $a_i \in K$ by equation (13.3) above,
this linear relation could be written

$$a_1\alpha_1 + a_2\alpha_2 + \cdots + a_m\alpha_m = 0.$$

Since the $\alpha_i$ are a basis for $L$ over $K$, it follows that all the coefficients $a_i$, $i = 1, 2, \dots, m$,
must be 0, i.e., that

$$b_{i1}\beta_1 + b_{i2}\beta_2 + \cdots + b_{in}\beta_n = 0, \qquad i = 1, 2, \dots, m$$

in $K$. Since now the $\beta_j$, $j = 1, 2, \dots, n$, form a basis for $K$ over $F$, this implies $b_{ij} = 0$
for all $i$ and $j$. Hence the elements $\alpha_i\beta_j$ are linearly independent over $F$, so form a
basis for $L$ over $F$ and $[L : F] = mn = [L : K][K : F]$, as claimed.

If $[K : F]$ is infinite, then there are infinitely many elements of $K$, hence of $L$,
which are linearly independent over $F$, so that $[L : F]$ is also infinite. Similarly, if
$[L : K]$ is infinite, there are infinitely many elements of $L$ linearly independent over $K$,
so certainly linearly independent over $F$, so again $[L : F]$ is infinite. Finally, if $[L : K]$
and $[K : F]$ are both finite, then the proof above shows $[L : F]$ is finite, so that $[L : F]$
infinite implies at least one of $[L : K]$ and $[K : F]$ is infinite, completing the proof.

Remark: Note the similarity of this result with the result on group orders proved in Part I. 
As with diagrams involving groups we shall frequently indicate the relative degrees of 
extensions in field diagrams. 

The multiplicativity of extension degrees is extremely useful in computations. A 
particular application is the following: 

Corollary 15. Suppose $L/F$ is a finite extension and let $K$ be any subfield of $L$
containing $F$, $F \subseteq K \subseteq L$. Then $[K : F]$ divides $[L : F]$.

Proof: This is immediate.

Examples

(1) The element $\sqrt{2}$ is not contained in the field $\mathbb{Q}(\alpha)$ where $\alpha$ is the real root of $x^3 - 3x - 1$
between 0 and 2, since we have already determined that $[\mathbb{Q}(\sqrt{2}) : \mathbb{Q}] = 2$ and
$[\mathbb{Q}(\alpha) : \mathbb{Q}] = 3$ and 2 does not divide 3. Note that it is not so easy to prove directly
that $\sqrt{2}$ cannot be written as a rational linear combination of $1, \alpha, \alpha^2$.

(2) Let as usual $\sqrt[6]{2}$ denote the positive real 6th root of 2. Then $[\mathbb{Q}(\sqrt[6]{2}) : \mathbb{Q}] = 6$. Since
$(\sqrt[6]{2})^3 = \sqrt{2}$ we have $\mathbb{Q}(\sqrt{2}) \subseteq \mathbb{Q}(\sqrt[6]{2})$ and by the multiplicativity of extension
degrees, $[\mathbb{Q}(\sqrt[6]{2}) : \mathbb{Q}(\sqrt{2})] = 3$. This gives us the field diagram

$$\overbrace{\mathbb{Q} \;\underbrace{\subseteq}_{2}\; \mathbb{Q}(\sqrt{2}) \;\underbrace{\subseteq}_{3}\; \mathbb{Q}(\sqrt[6]{2})}^{6}$$

In particular, this shows that the minimal polynomial for $\sqrt[6]{2}$ over $\mathbb{Q}(\sqrt{2})$ is of degree 3.
It is therefore the polynomial $x^3 - \sqrt{2}$. Note that showing directly that this polynomial
is irreducible over $\mathbb{Q}(\sqrt{2})$ is not completely trivial.
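
The basis constructed in the proof of Theorem 14 can be made explicit for the tower in the second example. Writing $\theta = \sqrt[6]{2}$ (a notation introduced here only for brevity), one may take $1, \sqrt{2} = \theta^3$ as a basis of $\mathbb{Q}(\sqrt{2})$ over $\mathbb{Q}$ and $1, \theta, \theta^2$ as a basis of $\mathbb{Q}(\theta)$ over $\mathbb{Q}(\sqrt{2})$; the products of these basis elements are then:

```latex
\[
\{1,\ \theta,\ \theta^2\} \cdot \{1,\ \theta^3\}
  = \{1,\ \theta,\ \theta^2,\ \theta^3,\ \theta^4,\ \theta^5\},
\qquad \theta = \sqrt[6]{2},
\]
```

which is the familiar basis of $\mathbb{Q}(\sqrt[6]{2})$ over $\mathbb{Q}$ coming from the minimal polynomial $x^6 - 2$, with $3 \cdot 2 = 6$ elements as the theorem requires.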

By Theorem 14 a finite extension of a finite extension is finite. The next results 
use this to show that an extension generated by a finite number of algebraic elements is 
finite (extending Proposition 12). 

Definition. An extension $K/F$ is finitely generated if there are elements $\alpha_1, \alpha_2, \dots, \alpha_k$
in $K$ such that $K = F(\alpha_1, \alpha_2, \dots, \alpha_k)$.

Recall that the field generated over $F$ by a collection of elements in a field $K$ is
the smallest subfield of $K$ containing these elements and $F$. The next lemma will show
that for finitely generated extensions this field can be obtained recursively by a series
of simple extensions.

Lemma 16. $F(\alpha, \beta) = (F(\alpha))(\beta)$, i.e., the field generated over $F$ by $\alpha$ and $\beta$ is the
field generated by $\beta$ over the field $F(\alpha)$ generated by $\alpha$.

Proof: This follows by the minimality of the fields in question. The field $F(\alpha, \beta)$
contains $F$ and $\alpha$, hence contains the field $F(\alpha)$, and since it also contains $\beta$, we have
the inclusion $(F(\alpha))(\beta) \subseteq F(\alpha, \beta)$ by the minimality of the field $(F(\alpha))(\beta)$. Since the
field $(F(\alpha))(\beta)$ contains $F$, $\alpha$ and $\beta$, by the minimality of $F(\alpha, \beta)$ we have the reverse
inclusion $F(\alpha, \beta) \subseteq (F(\alpha))(\beta)$, which proves the lemma.

By the lemma we have

$$K = F(\alpha_1, \alpha_2, \dots, \alpha_k) = (F(\alpha_1, \alpha_2, \dots, \alpha_{k-1}))(\alpha_k)$$

and so by iterating, we see that $K$ is obtained by taking the field $F_1$ generated over $F$
by $\alpha_1$, then the field $F_2$ generated over $F_1$ (this is important) by $\alpha_2$, and so on, with
$F_k = K$. This gives a sequence of fields:

$$F = F_0 \subseteq F_1 \subseteq F_2 \subseteq \cdots \subseteq F_k = K$$

where

$$F_{i+1} = F_i(\alpha_{i+1}), \qquad i = 0, 1, \dots, k - 1.$$

Suppose now that the elements $\alpha_1, \alpha_2, \dots, \alpha_k$ are algebraic over $F$ of degrees
$n_1, n_2, \dots, n_k$ (so a fortiori are algebraic over any extension of $F$). Then the extensions
in this sequence are simple extensions of the type considered in Proposition 11. The
relative extension degree $[F_{i+1} : F_i]$ is equal to the degree of the minimal polynomial
of $\alpha_{i+1}$ over $F_i$, which is at most $n_{i+1}$ (and equals $n_{i+1}$ if and only if the minimal
polynomial of $\alpha_{i+1}$ over $F$ remains irreducible over $F_i$). By the multiplicativity of
extension degrees, we see that

$$[K : F] = [F_k : F_{k-1}][F_{k-1} : F_{k-2}] \cdots [F_1 : F_0]$$

is also finite, and $\leq n_1 n_2 \cdots n_k$.

This also gives a description of the elements of $F(\alpha_1, \alpha_2, \dots, \alpha_k)$. For simplicity,
consider the case of the field $F(\alpha, \beta)$ where $\alpha$ and $\beta$ are algebraic over $F$. Then the
elements of this field are of the form

$$b_0 + b_1\beta + b_2\beta^2 + \cdots + b_{d-1}\beta^{d-1}$$

where $d = [F(\alpha, \beta) : F(\alpha)]$ is the degree of $\beta$ over $F(\alpha)$ (which may be strictly
smaller than the degree of $\beta$ over $F$), and where the coefficients $b_0, b_1, \dots, b_{d-1}$ are
elements of $F(\alpha)$. The coefficients $b_j \in F(\alpha)$, $j = 0, \dots, d - 1$, are of the form

$$a_{0j} + a_{1j}\alpha + a_{2j}\alpha^2 + \cdots + a_{n-1,j}\alpha^{n-1}$$

where $n = [F(\alpha) : F]$ is the degree of $\alpha$ over $F$ and the $a_{ij}$ are elements of $F$. Hence
the elements of $F(\alpha, \beta)$ are of the form

$$\sum_{\substack{i = 0, 1, \dots, n-1 \\ j = 0, 1, \dots, d-1}} a_{ij}\,\alpha^i\beta^j, \qquad a_{ij} \in F.$$

Since $[F(\alpha, \beta) : F] = [F(\alpha, \beta) : F(\alpha)][F(\alpha) : F] = dn$, the elements $\alpha^i\beta^j$ are in
fact an $F$-basis for $F(\alpha, \beta)$.

In practice the field $F(\alpha)$ generated by the algebraic $\alpha$ is obtained by adjoining
the element $\alpha$ to $F$ and then “closing” the resulting set with respect to addition and
multiplication, which amounts to adjoining the powers $\alpha^2, \alpha^3, \dots$ of $\alpha$ and taking linear
combinations (with coefficients from $F$) of these elements. The process terminates
when a power of $\alpha$ is a linear combination of lower powers of $\alpha$, which amounts to
knowing the minimal polynomial for $\alpha$. The previous discussion shows a similar process
gives the field $F(\alpha, \beta)$ generated by two elements, and by recursion, the field generated
by any finite number of algebraic elements. This shows in particular that “closing”
with respect to addition and multiplication also closes with respect to division for
algebraic elements (cf. Example 5 following Corollary 5 above). If the elements are not
algebraic, one must also “close” with respect to inverses. The difficulty in this procedure
is determining the degrees of the relative extensions, for example the degree $d$ for
$F(\alpha, \beta)$ over $F(\alpha)$ above, for which one has only an a priori upper bound (the degree
of $\beta$ over $F$).

This is the analogue of “closing” a set of elements in a group $G$ to determine the
subgroup they generate.
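
The way the “closing” process terminates can be seen by tabulating powers. The sketch below assumes $F = \mathbb{Q}$ and takes $\alpha$ to be a root of the cubic $x^3 - 3x - 1$ from the earlier examples; the function names `times_alpha` and `powers` are hypothetical helpers introduced for this illustration.

```python
from fractions import Fraction

def times_alpha(v, c):
    """Multiply v by alpha.

    v holds coordinates in the basis 1, alpha, ..., alpha^(n-1); c = [c_0, ..., c_(n-1)]
    lists the lower coefficients of the monic polynomial
    x^n + c_(n-1) x^(n-1) + ... + c_0 satisfied by alpha, so that
    alpha^n = -(c_0 + c_1 alpha + ... + c_(n-1) alpha^(n-1)).
    """
    top = v[-1]                                # coefficient of alpha^(n-1)
    shifted = [Fraction(0)] + list(v[:-1])     # raise every power by one, setting alpha^n aside
    return [s - top * ci for s, ci in zip(shifted, c)]

def powers(c, count):
    """Coordinates of alpha^0, alpha^1, ..., alpha^(count-1)."""
    v = [Fraction(1)] + [Fraction(0)] * (len(c) - 1)
    out = []
    for _ in range(count):
        out.append(v)
        v = times_alpha(v, c)
    return out

# alpha a root of x^3 - 3x - 1, i.e. c_0 = -1, c_1 = -3, c_2 = 0
table = powers([-1, -3, 0], 6)
for k, row in enumerate(table):
    print(k, [str(x) for x in row])
```

From $\alpha^3$ onward every power is rewritten in terms of $1, \alpha, \alpha^2$ (for instance $\alpha^3 = 1 + 3\alpha$ and $\alpha^5 = 3 + 9\alpha + \alpha^2$), so no new elements are needed beyond the three basis elements.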

Examples

(1) The extension $\mathbb{Q}(\sqrt{2}, \sqrt[6]{2})$ is simply the extension $\mathbb{Q}(\sqrt[6]{2})$ since $\sqrt{2}$ is already an
element of this field. Put another way, the degree $d$ of $\sqrt{2}$ over $\mathbb{Q}(\sqrt[6]{2})$ is 1, which
is strictly smaller than the degree of $\sqrt{2}$ over $\mathbb{Q}$. We shall later have less obvious
examples where this occurs.

(2) Consider the field $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ generated over $\mathbb{Q}$ by $\sqrt{2}$ and $\sqrt{3}$. Since $\sqrt{3}$ is of
degree 2 over $\mathbb{Q}$ the degree of the extension $\mathbb{Q}(\sqrt{2}, \sqrt{3})/\mathbb{Q}(\sqrt{2})$ is at most 2 and is
precisely 2 if and only if $x^2 - 3$ is irreducible over $\mathbb{Q}(\sqrt{2})$. Since this polynomial is of
degree 2, it is reducible only if it has a root, i.e., if and only if $\sqrt{3} \in \mathbb{Q}(\sqrt{2})$. Suppose
$\sqrt{3} = a + b\sqrt{2}$ with $a, b \in \mathbb{Q}$. Squaring this we obtain $3 = (a^2 + 2b^2) + 2ab\sqrt{2}$. If
$ab \neq 0$, then we can solve this equation for $\sqrt{2}$ in terms of $a$ and $b$ which implies that
$\sqrt{2}$ is rational, which it is not. If $b = 0$, then we would have that $\sqrt{3} = a$ is rational, a
contradiction. Finally, if $a = 0$, we have $\sqrt{3} = b\sqrt{2}$ and multiplying both sides by $\sqrt{2}$
we see that $\sqrt{6}$ would be rational, again a contradiction. This shows $\sqrt{3} \notin \mathbb{Q}(\sqrt{2})$,
proving

$$[\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}] = 4.$$

Elements in this field (by “closing” $1, \sqrt{2}, \sqrt{3}$) include $1, \sqrt{2}, \sqrt{3}, \sqrt{6}$ and by the
computations above, these form a basis for this field:

$$\mathbb{Q}(\sqrt{2}, \sqrt{3}) = \{a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6} \mid a, b, c, d \in \mathbb{Q}\}.$$
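
Arithmetic in this field can be carried out directly on coordinates with respect to the basis $1, \sqrt{2}, \sqrt{3}, \sqrt{6}$. The following is a minimal sketch under the assumption that an element $a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6}$ is stored as the tuple `(a, b, c, d)`; the function name `mul` and the sample elements are illustrative choices.

```python
from fractions import Fraction

def mul(u, v):
    """Multiply two elements of Q(sqrt2, sqrt3) given as tuples (a, b, c, d)
    standing for a + b*sqrt2 + c*sqrt3 + d*sqrt6."""
    a1, b1, c1, d1 = (Fraction(x) for x in u)
    a2, b2, c2, d2 = (Fraction(x) for x in v)
    return (
        a1 * a2 + 2 * b1 * b2 + 3 * c1 * c2 + 6 * d1 * d2,   # rational part
        a1 * b2 + b1 * a2 + 3 * (c1 * d2 + d1 * c2),         # sqrt3 * sqrt6 = 3 * sqrt2
        a1 * c2 + c1 * a2 + 2 * (b1 * d2 + d1 * b2),         # sqrt2 * sqrt6 = 2 * sqrt3
        a1 * d2 + d1 * a2 + b1 * c2 + c1 * b2,               # sqrt2 * sqrt3 = sqrt6
    )

s = (0, 1, 1, 0)        # sqrt2 + sqrt3
t = (0, -1, 1, 0)       # sqrt3 - sqrt2
square = mul(s, s)      # (sqrt2 + sqrt3)^2
product = mul(s, t)     # (sqrt2 + sqrt3)(sqrt3 - sqrt2)
print(square, product)
```

The first product is $5 + 2\sqrt{6}$ and the second is $1$, so $\sqrt{3} - \sqrt{2}$ is the inverse of $\sqrt{2} + \sqrt{3}$: the set of rational combinations of the four basis elements is closed under multiplication, and in this instance the inverse is again such a combination.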

We can now characterize the finite extensions of a field F: 

Theorem 17. The extension $K/F$ is finite if and only if $K$ is generated by a finite
number of algebraic elements over $F$. More precisely, a field generated over $F$ by a
finite number of algebraic elements of degrees $n_1, n_2, \dots, n_k$ is algebraic of degree
$\leq n_1 n_2 \cdots n_k$.


Proof: If $K/F$ is finite of degree $n$, let $\alpha_1, \alpha_2, \dots, \alpha_n$ be a basis for $K$ as a vector
space over $F$. By Corollary 15, $[F(\alpha_i) : F]$ divides $[K : F] = n$ for $i = 1, 2, \dots, n$, so
that Proposition 12 implies each $\alpha_i$ is algebraic over $F$. Since $K$ is obviously generated
over $F$ by $\alpha_1, \alpha_2, \dots, \alpha_n$, we see that $K$ is generated by a finite number of algebraic
elements over $F$. The converse was proved above. The second statement of the theorem
is immediate from Corollary 13 and the computation above.

The first example above shows that the inequality for the degree of the extension 
given in the theorem may be strict. We remark that information helpful in the determi- 
nation of this degree can often be obtained by determining subfields and then applying 
Corollary 15. 

Corollary 18. Suppose $\alpha$ and $\beta$ are algebraic over $F$. Then $\alpha \pm \beta$, $\alpha\beta$, $\alpha/\beta$ (for $\beta \neq 0$),
(in particular $\alpha^{-1}$ for $\alpha \neq 0$) are all algebraic.

Proof: All of these elements lie in the extension $F(\alpha, \beta)$, which is finite over $F$ by
the theorem, hence they are algebraic by Corollary 13.

Corollary 19. Let $L/F$ be an arbitrary extension. Then the collection of elements of
$L$ that are algebraic over $F$ form a subfield $K$ of $L$.

Proof: This is immediate from the previous corollary.

Examples

(1) Consider the extension $\mathbb{C}/\mathbb{Q}$ and let $\overline{\mathbb{Q}}$ denote the subfield of all elements in $\mathbb{C}$ that are
algebraic over $\mathbb{Q}$. In particular, the elements $\sqrt[n]{2}$ (the positive $n$th roots of 2 in $\mathbb{R}$) are
all elements of $\overline{\mathbb{Q}}$, so that $[\overline{\mathbb{Q}} : \mathbb{Q}] \geq n$ for all integers $n > 1$. Hence $\overline{\mathbb{Q}}$ is an infinite
algebraic extension of $\mathbb{Q}$, called the field of algebraic numbers.

(2) Consider the field $\overline{\mathbb{Q}} \cap \mathbb{R}$, the subfield of $\mathbb{R}$ consisting of elements algebraic over $\mathbb{Q}$.
The field $\mathbb{Q}$ is countable. The number of polynomials in $\mathbb{Q}[x]$ of any given degree
$n$ is therefore also countable (since such a polynomial is determined by specifying
$n + 1$ coefficients from $\mathbb{Q}$). Since these polynomials have at most $n$ roots in $\mathbb{R}$, the
number of algebraic elements of $\mathbb{R}$ of degree $n$ is countable. Finally, the collection of
all algebraic elements in $\mathbb{R}$ is the countable union (indexed by $n$) of countable sets,
hence is countable. Since $\mathbb{R}$ is uncountable, it follows that there exist (in fact many)
elements of $\mathbb{R}$ which are not algebraic, i.e., are transcendental, over $\mathbb{Q}$. In particular
the subfield $\overline{\mathbb{Q}} \cap \mathbb{R}$ of algebraic elements of $\mathbb{R}$ is a proper subfield of $\mathbb{R}$, so also $\overline{\mathbb{Q}}$ is a
proper subfield of $\mathbb{C}$.

It is extremely difficult in general to prove that a given real number is not algebraic.
For example, it is known (these are theorems) that $\pi = 3.14159\ldots$ and $e = 2.71828\ldots$
are transcendental elements of $\mathbb{R}$. Even the proofs that these elements are not rational
are not too easy.

Theorem 20. If $K$ is algebraic over $F$ and $L$ is algebraic over $K$, then $L$ is algebraic
over $F$.

Proof: Let $\alpha$ be any element of $L$. Then $\alpha$ is algebraic over $K$, so $\alpha$ satisfies some
polynomial equation

$$a_n\alpha^n + a_{n-1}\alpha^{n-1} + \cdots + a_1\alpha + a_0 = 0$$

where the coefficients $a_0, a_1, \dots, a_n$ are in $K$. Consider the field $F(\alpha, a_0, a_1, \dots, a_n)$
generated over $F$ by $\alpha$ and the coefficients of this polynomial. Since $K/F$ is algebraic,
the elements $a_0, a_1, \dots, a_n$ are algebraic over $F$, so the extension $F(a_0, a_1, \dots, a_n)/F$
is finite by Theorem 17. By the equation above, we see that $\alpha$ generates an extension
of this field of degree at most $n$, since its minimal polynomial over this field is a divisor
of the polynomial above. Therefore

$$[F(\alpha, a_0, \dots, a_n) : F] = [F(\alpha, a_0, \dots, a_n) : F(a_0, \dots, a_n)]\,[F(a_0, \dots, a_n) : F]$$

is also finite and $F(\alpha, a_0, a_1, \dots, a_n)/F$ is an algebraic extension. In particular the
element $\alpha$ is algebraic over $F$, which proves that $L$ is algebraic over $F$.

The subfield $F(\alpha_1, \alpha_2, \dots, \alpha_k)$ generated by a finite set of elements $\alpha_1, \alpha_2, \dots, \alpha_k$
of a field $K$ contains each of the fields $F(\alpha_i)$, $i = 1, 2, \dots, k$. By the definitions, it is
also the smallest subfield of $K$ containing these fields.

Definition. Let $K_1$ and $K_2$ be two subfields of a field $K$. Then the composite field
of $K_1$ and $K_2$, denoted $K_1K_2$, is the smallest subfield of $K$ containing both $K_1$ and
$K_2$. Similarly, the composite of any collection of subfields of $K$ is the smallest subfield
containing all the subfields.

Note that the composite $K_1K_2$ can also be described as the intersection of all the
subfields of $K$ containing both $K_1$ and $K_2$ and similarly for the composite of more than
two fields, analogous to the subgroup generated by a subset of a group (cf. Section 2.4).

Example

The composite of the two fields $\mathbb{Q}(\sqrt{2})$ and $\mathbb{Q}(\sqrt[3]{2})$ is the field $\mathbb{Q}(\sqrt[6]{2})$. This is because this
field contains both of these subfields ($(\sqrt[6]{2})^3 = \sqrt{2}$ and $(\sqrt[6]{2})^2 = \sqrt[3]{2}$) and conversely,
any field containing both $\sqrt{2}$ and $\sqrt[3]{2}$ contains their quotient, which is $\sqrt[6]{2}$.

Suppose now that $K_1$ and $K_2$ are finite extensions of $F$ in $K$. Let $\alpha_1, \alpha_2, \dots, \alpha_n$
be an $F$-basis for $K_1$ and let $\beta_1, \beta_2, \dots, \beta_m$ be an $F$-basis for $K_2$ (so that $[K_1 : F] = n$
and $[K_2 : F] = m$). Then it is clear that these give generators for the composite $K_1K_2$
over $F$:

$$K_1K_2 = F(\alpha_1, \alpha_2, \dots, \alpha_n, \beta_1, \beta_2, \dots, \beta_m).$$

Since $\alpha_1, \alpha_2, \dots, \alpha_n$ is an $F$-basis for $K_1$ any power $\alpha_i^k$ of one of the $\alpha$'s is a linear
combination with coefficients in $F$ of the $\alpha$'s and a similar statement holds for the $\beta$'s.
It follows that the collection of linear combinations

$$\sum_{\substack{i = 1, 2, \dots, n \\ j = 1, 2, \dots, m}} a_{ij}\,\alpha_i\beta_j$$

with coefficients in $F$ is closed under multiplication and addition since in a product
of two such elements any higher powers of the $\alpha$'s and $\beta$'s can be replaced by linear
expressions. Hence, the elements $\alpha_i\beta_j$ for $i = 1, 2, \dots, n$ and $j = 1, 2, \dots, m$ span
the composite extension $K_1K_2$ over $F$. In particular, $[K_1K_2 : F] \leq mn$. We summarize
this as:

Proposition 21. Let $K_1$ and $K_2$ be two finite extensions of a field $F$ contained in $K$.
Then

$$[K_1K_2 : F] \leq [K_1 : F][K_2 : F]$$

with equality if and only if an $F$-basis for one of the fields remains linearly independent
over the other field. If $\alpha_1, \alpha_2, \dots, \alpha_n$ and $\beta_1, \beta_2, \dots, \beta_m$ are bases for $K_1$ and $K_2$ over
$F$, respectively, then the elements $\alpha_i\beta_j$ for $i = 1, 2, \dots, n$ and $j = 1, 2, \dots, m$ span
$K_1K_2$ over $F$.

Proof: From $K_1K_2 = F(\alpha_1, \dots, \alpha_n, \beta_1, \beta_2, \dots, \beta_m) = K_1(\beta_1, \beta_2, \dots, \beta_m)$,
we see as above that $\beta_1, \beta_2, \dots, \beta_m$ span $K_1K_2$ over $K_1$. Hence $[K_1K_2 : K_1] \leq m =
[K_2 : F]$ with equality if and only if these elements are linearly independent over $K_1$.
Since $[K_1K_2 : F] = [K_1K_2 : K_1][K_1 : F]$ this proves the proposition.

By the proposition (and its proof), we have the following diagram:

$$\begin{array}{ccc}
 & K_1K_2 & \\
 \diagup & & \diagdown \\
 K_1 & & K_2 \\
 \diagdown & & \diagup \\
 & F &
\end{array}$$

We shall have examples shortly where the inequality in the proposition is strict.
One useful situation where one can be certain of equality is the following:

Corollary 22. Suppose that $[K_1 : F] = n$, $[K_2 : F] = m$ in Proposition 21, where $n$
and $m$ are relatively prime: $(n, m) = 1$. Then $[K_1K_2 : F] = [K_1 : F][K_2 : F] = nm$.

Proof: In general the extension degree $[K_1K_2 : F]$ is divisible by both $n$ and
$m$ since $K_1$ and $K_2$ are subfields of $K_1K_2$, hence is divisible by their least common
multiple. In this case, since $(n, m) = 1$, this means $[K_1K_2 : F]$ is divisible by $nm$,
which together with the inequality $[K_1K_2 : F] \leq nm$ of the proposition proves the
corollary.

Example

The composite of the two fields $\mathbb{Q}(\sqrt{2})$ and $\mathbb{Q}(\sqrt[3]{2})$ is of degree 6 over $\mathbb{Q}$, which we
determined earlier by actually computing the composite $\mathbb{Q}(\sqrt[6]{2})$.


EXERCISES 

1. Let $F$ be a finite field of characteristic $p$. Prove that $|F| = p^n$ for some positive integer $n$.

2. Let $g(x) = x^2 + x - 1$ and let $h(x) = x^3 - x + 1$. Obtain fields of 4, 8, 9 and 27 elements
by adjoining a root of $f(x)$ to the field $F$ where $f(x) = g(x)$ or $h(x)$ and $F = \mathbb{F}_2$ or $\mathbb{F}_3$.
Write down the multiplication tables for the fields with 4 and 9 elements and show that
the nonzero elements form a cyclic group.

3. Determine the minimal polynomial over $\mathbb{Q}$ for the element $1 + i$.

4. Determine the degree over $\mathbb{Q}$ of $2 + \sqrt{3}$ and of $1 + \sqrt[3]{2} + \sqrt[3]{4}$.

5. Let $F = \mathbb{Q}(i)$. Prove that $x^3 - 2$ and $x^3 - 3$ are irreducible over $F$.

6. Prove directly from the definitions that the field $F(\alpha_1, \alpha_2, \dots, \alpha_n)$ is the composite of the
fields $F(\alpha_1), F(\alpha_2), \dots, F(\alpha_n)$.

7. Prove that $\mathbb{Q}(\sqrt{2} + \sqrt{3}) = \mathbb{Q}(\sqrt{2}, \sqrt{3})$ [one inclusion is obvious, for the other consider
$(\sqrt{2} + \sqrt{3})^2$, etc.]. Conclude that $[\mathbb{Q}(\sqrt{2} + \sqrt{3}) : \mathbb{Q}] = 4$. Find an irreducible polynomial
satisfied by $\sqrt{2} + \sqrt{3}$.

8. Let $F$ be a field of characteristic $\neq 2$. Let $D_1$ and $D_2$ be elements of $F$, neither of which
is a square in $F$. Prove that $F(\sqrt{D_1}, \sqrt{D_2})$ is of degree 4 over $F$ if $D_1D_2$ is not a square
in $F$ and is of degree 2 over $F$ otherwise. When $F(\sqrt{D_1}, \sqrt{D_2})$ is of degree 4 over $F$ the
field is called a biquadratic extension of $F$.

9. Let $F$ be a field of characteristic $\neq 2$. Let $a, b$ be elements of the field $F$ with $b$ not a
square in $F$. Prove that a necessary and sufficient condition for $\sqrt{a + \sqrt{b}} = \sqrt{m} + \sqrt{n}$
for some $m$ and $n$ in $F$ is that $a^2 - b$ is a square in $F$. Use this to determine when the field
$\mathbb{Q}(\sqrt{a + \sqrt{b}})$ ($a, b \in \mathbb{Q}$) is biquadratic over $\mathbb{Q}$.

10. Determine the degree of the extension $\mathbb{Q}(\sqrt{3 + 2\sqrt{2}})$ over $\mathbb{Q}$.

11. (a) Let $\sqrt{3 + 4i}$ denote the square root of the complex number $3 + 4i$ that lies in the
first quadrant and let $\sqrt{3 - 4i}$ denote the square root of $3 - 4i$ that lies in the fourth
quadrant. Prove that $[\mathbb{Q}(\sqrt{3 + 4i} + \sqrt{3 - 4i}) : \mathbb{Q}] = 1$.

(b) Determine the degree of the extension $\mathbb{Q}(\sqrt{1 + \sqrt{-3}} + \sqrt{1 - \sqrt{-3}})$ over $\mathbb{Q}$.

12. Suppose the degree of the extension $K/F$ is a prime $p$. Show that any subfield $E$ of $K$
containing $F$ is either $K$ or $F$.

13. Suppose $F = \mathbb{Q}(\alpha_1, \alpha_2, \dots, \alpha_n)$ where $\alpha_i^2 \in \mathbb{Q}$ for $i = 1, 2, \dots, n$. Prove that $\sqrt[3]{2} \notin F$.

14. Prove that if $[F(\alpha) : F]$ is odd then $F(\alpha) = F(\alpha^2)$.

15. A field $F$ is said to be formally real if $-1$ is not expressible as a sum of squares in $F$. Let
$F$ be a formally real field, let $f(x) \in F[x]$ be an irreducible polynomial of odd degree and
let $\alpha$ be a root of $f(x)$. Prove that $F(\alpha)$ is also formally real. [Pick $\alpha$ a counterexample
of minimal degree. Show that $-1 + f(x)g(x) = (p_1(x))^2 + \cdots + (p_m(x))^2$ for some
$p_i(x), g(x) \in F[x]$ where $g(x)$ has odd degree $< \deg f$. Show that some root $\beta$ of $g$ has
odd degree over $F$ and $F(\beta)$ is not formally real, violating the minimality of $\alpha$.]

16. Let $K/F$ be an algebraic extension and let $R$ be a ring contained in $K$ and containing $F$.
Show that $R$ is a subfield of $K$ containing $F$.

17. Let $f(x)$ be an irreducible polynomial of degree $n$ over a field $F$. Let $g(x)$ be any
polynomial in $F[x]$. Prove that every irreducible factor of the composite polynomial
$f(g(x))$ has degree divisible by $n$.


18. Let $k$ be a field and let $k(x)$ be the field of rational functions in $x$ with coefficients from $k$.
Let $t \in k(x)$ be the rational function $\dfrac{P(x)}{Q(x)}$ with relatively prime polynomials $P(x), Q(x) \in k[x]$,
with $Q(x) \neq 0$. Then $k(x)$ is an extension of $k(t)$ and to compute its degree it is
necessary to compute the minimal polynomial with coefficients in $k(t)$ satisfied by $x$.

(a) Show that the polynomial $P(X) - tQ(X)$ in the variable $X$ and coefficients in $k(t)$
is irreducible over $k(t)$ and has $x$ as a root. [By Gauss' Lemma this polynomial is
irreducible in $(k(t))[X]$ if and only if it is irreducible in $(k[t])[X]$. Then note that
$(k[t])[X] = (k[X])[t]$.]

(b) Show that the degree of $P(X) - tQ(X)$ as a polynomial in $X$ with coefficients in $k(t)$
is the maximum of the degrees of $P(x)$ and $Q(x)$.

(c) Show that $[k(x) : k(t)] = \left[k(x) : k\!\left(\dfrac{P(x)}{Q(x)}\right)\right] = \max(\deg P(x), \deg Q(x))$.

19. Let $K$ be an extension of $F$ of degree $n$.

(a) For any $\alpha \in K$ prove that $\alpha$ acting by left multiplication on $K$ is an $F$-linear
transformation of $K$.

(b) Prove that $K$ is isomorphic to a subfield of the ring of $n \times n$ matrices over $F$, so the
ring of $n \times n$ matrices over $F$ contains an isomorphic copy of every extension of $F$
of degree $\leq n$.


20. Show that if the matrix of the linear transformation “multiplication by $\alpha$” considered in the
previous exercise is $A$ then $\alpha$ is a root of the characteristic polynomial for $A$. This gives
an effective procedure for determining an equation of degree $n$ satisfied by an element $\alpha$
in an extension of $F$ of degree $n$. Use this procedure to obtain the monic polynomial of
degree 3 satisfied by $\sqrt[3]{2}$ and by $1 + \sqrt[3]{2} + \sqrt[3]{4}$.


21. Let $K = \mathbb{Q}(\sqrt{D})$ for some squarefree integer $D$. Let $\alpha = a + b\sqrt{D}$ be an element of
$K$. Use the basis $1, \sqrt{D}$ for $K$ as a vector space over $\mathbb{Q}$ and show that the
linear transformation "multiplication by $\alpha$" on $K$ considered in the previous exercises
has the matrix $\begin{pmatrix} a & bD \\ b & a \end{pmatrix}$. Prove directly that the map
$a + b\sqrt{D} \mapsto \begin{pmatrix} a & bD \\ b & a \end{pmatrix}$ is an
isomorphism of the field $K$ with a subfield of the ring of $2 \times 2$ matrices with coefficients
in $\mathbb{Q}$.


22. Let $K_1$ and $K_2$ be two finite extensions of a field $F$ contained in the field $K$. Prove that
the $F$-algebra $K_1 \otimes_F K_2$ is a field if and only if $[K_1K_2 : F] = [K_1 : F][K_2 : F]$.
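
The procedure described in Exercise 20 can be sketched numerically. The following is a minimal illustration, not a solution to the exercise: it takes the field $\mathbb{Q}(t)$ with $t^3 = 2$ and the basis $1, t, t^2$, builds the matrix of "multiplication by $a + bt + ct^2$", and reads off the characteristic polynomial. The function names `mult_matrix` and `char_poly_3x3` and the sample element $1 + \sqrt[3]{2}$ are assumptions chosen for illustration.

```python
from fractions import Fraction


def mult_matrix(a, b, c):
    """Matrix of multiplication by a + b*t + c*t^2 on Q(t), where t^3 = 2.

    The basis is 1, t, t^2; column j holds the coordinates of alpha * t^j.
    """
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    return [
        [a, 2 * c, 2 * b],
        [b, a, 2 * c],
        [c, b, a],
    ]


def char_poly_3x3(m):
    """Coefficients [1, c2, c1, c0] of det(xI - m) = x^3 + c2 x^2 + c1 x + c0."""
    trace = m[0][0] + m[1][1] + m[2][2]
    # Sum of the three principal 2x2 minors.
    minors = (
        (m[0][0] * m[1][1] - m[0][1] * m[1][0])
        + (m[0][0] * m[2][2] - m[0][2] * m[2][0])
        + (m[1][1] * m[2][2] - m[1][2] * m[2][1])
    )
    det = (
        m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
        - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
        + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0])
    )
    return [Fraction(1), -trace, minors, -det]


# alpha = 1 + cube root of 2, i.e. (a, b, c) = (1, 1, 0)
coeffs = char_poly_3x3(mult_matrix(1, 1, 0))
```

For this sample element the result agrees with the direct computation $(\alpha - 1)^3 = 2$, that is, $\alpha^3 - 3\alpha^2 + 3\alpha - 3 = 0$.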


13.3 CLASSICAL STRAIGHTEDGE AND COMPASS CONSTRUCTIONS 

As a simple application of the results we have obtained on algebraic extensions, and in 
particular on the multiplicativity of extension degrees, we can answer (in the negative) 
the following geometric problems posed by the Greeks: 

I. (Doubling the Cube) Is it possible using only straightedge and compass to construct 
a cube with precisely twice the volume of a given cube? 

II. (Trisecting an Angle) Is it possible using only straightedge and compass to trisect
any given angle $\theta$?

III. (Squaring the Circle) Is it possible using only straightedge and compass to construct
a square whose area is precisely the area of a given circle?

To answer these questions we must translate the construction of lengths by compass 
and straightedge into algebraic terms. Let 1 denote a fixed given unit distance. Then 
any distance is determined by its length $a \in \mathbb{R}$, which allows us to view geometric
distances as elements of the real numbers $\mathbb{R}$. Using the given unit distance 1 to define
the scale on the axes, we can then construct the usual Cartesian plane $\mathbb{R}^2$ and view
all of our constructions as occurring in $\mathbb{R}^2$. A point $(x, y) \in \mathbb{R}^2$ is then constructible
starting with the given distance 1 if and only if its coordinates $x$ and $y$ are constructible
elements of $\mathbb{R}$. The problems above then amount to determining whether particular
lengths in $\mathbb{R}$ can be obtained by compass and straightedge constructions from a fixed
unit distance. The collection of such real numbers together with their negatives will be
called the constructible elements of $\mathbb{R}$, and we shall not distinguish between the lengths
that are constructible and the real numbers that are constructible.

Each straightedge and compass construction consists of a series of operations of 
the following four types: (1) connecting two given points by a straight line, (2) finding
a point of intersection of two straight lines, (3) drawing a circle with given radius and 
center, and (4) finding the point(s) of intersection of a straight line and a circle or the 
intersection of two circles. 

It is an elementary fact from geometry that if two lengths a and b are given one may 
construct using straightedge and compass the lengths $a \pm b$, $ab$ and $a/b$ (the first two
are clear and the latter two are given by the construction of parallel lines (Figure 1)). 



Fig. 1

It is also an elementary geometry construction to construct $\sqrt{a}$ if $a$ is given: construct
the circle with diameter $1 + a$ and erect the perpendicular to the diameter as indicated
in Figure 2. Then $\sqrt{a}$ is the length of this perpendicular.



It follows that straightedge and compass constructions give all the algebraic operations 
of addition, subtraction, multiplication and division (by nonzero elements) in the reals 
so the collection of constructible elements is a subfield of $\mathbb{R}$. One can also take square
roots of constructible elements. We shall now see that these are essentially the only 
operations possible. 

From the given length 1 it is possible to construct by these operations all the rational
numbers $\mathbb{Q}$. Hence we may construct all of the points $(x, y) \in \mathbb{R}^2$ whose coordinates
are rational. We may construct additional elements of $\mathbb{R}$ by taking square roots, so the
collection of elements of $\mathbb{R}$ constructible from 1 forms a field strictly larger than $\mathbb{Q}$.

The usual formula ("two point form") for the straight line connecting two points
with coordinates in some field $F$ gives an equation for the line of the form $ax + by - c = 0$
with $a, b, c \in F$. Solving two such equations simultaneously to determine the point of
intersection of two such lines gives solutions also in $F$. It follows that if the coordinates
of two points lie in the field $F$ then straightedge constructions alone will not produce
additional points whose coordinates are not also in $F$.

A compass construction (type (3) or (4) above) defines points obtained by the 
intersection of a circle with either a straight line or another circle. A circle with center 
$(h, k)$ and radius $r$ has equation

$$(x - h)^2 + (y - k)^2 = r^2$$

so when we consider the effect of compass constructions on elements of a field $F$
we are considering simultaneous solutions of such an equation with a linear equation
$ax + by - c = 0$ where $a, b, c, h, k, r \in F$, or the simultaneous solutions of two
quadratic equations.

In the case of a linear equation and the equation for the circle, solving for y, say, 
in the linear equation and substituting gives a quadratic equation for x (and y is given 
linearly in terms of x). Hence the coordinates of the point of intersection are at worst 
in a quadratic extension of F. 

In the case of the intersection of two circles, say 

$$(x - h)^2 + (y - k)^2 = r^2$$

and

$$(x - h')^2 + (y - k')^2 = r'^2,$$

subtraction of the second equation from the first shows that we have the same intersection
by considering the two equations

$$(x - h)^2 + (y - k)^2 = r^2$$

and

$$2(h' - h)x + 2(k' - k)y = r^2 - h^2 - k^2 - r'^2 + h'^2 + k'^2$$

which is the intersection of a circle and a straight line (the straight line connecting the 
two points of intersection, in fact) of the type just considered. 
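
The reduction of two circles to a circle and a line can be sketched in code. The following is a minimal floating-point illustration, with the hypothetical function name `circle_circle_intersections`; it forms the line obtained by subtracting the two circle equations and then solves a single quadratic, so only one square root is ever taken.

```python
import math


def circle_circle_intersections(h, k, r, h2, k2, r2):
    """Intersect (x-h)^2+(y-k)^2=r^2 with (x-h2)^2+(y-k2)^2=r2^2.

    Subtracting the equations gives the line a*x + b*y = c; the points are
    then found by intersecting that line with the first circle.
    """
    a = 2 * (h2 - h)
    b = 2 * (k2 - k)
    c = r**2 - h**2 - k**2 - r2**2 + h2**2 + k2**2
    norm = a * a + b * b
    if norm == 0:
        return []  # concentric circles: no line to intersect with

    # A point on the line and a direction vector along it.
    px, py = a * c / norm, b * c / norm
    dx, dy = -b, a

    # Substitute (px + t*dx, py + t*dy) into the first circle: A t^2 + B t + C = 0.
    ux, uy = px - h, py - k
    A = norm
    B = 2 * (dx * ux + dy * uy)
    C = ux * ux + uy * uy - r * r
    disc = B * B - 4 * A * C
    if disc < 0:
        return []  # the circles do not meet
    root = math.sqrt(disc)  # the only square root needed
    ts = {(-B + root) / (2 * A), (-B - root) / (2 * A)}
    return sorted((px + t * dx, py + t * dy) for t in ts)


# Unit circles centered at (0, 0) and (1, 0) meet at (1/2, +-sqrt(3)/2).
points = circle_circle_intersections(0, 0, 1, 1, 0, 1)
```

With rational input data every quantity before the call to `math.sqrt` is rational, which mirrors the statement that the coordinates lie in at worst a quadratic extension.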

It follows that if a collection of constructible elements is given, then one can construct
all the elements in the subfield $F$ of $\mathbb{R}$ generated by these elements and that any
straightedge and compass operation on elements of $F$ produces elements in at worst
a quadratic extension of $F$. Since quadratic extensions have degree 2 and extension
degrees are multiplicative, it follows that if $\alpha \in \mathbb{R}$ is obtained from elements in a field
$F$ by a (finite) series of straightedge and compass operations then $\alpha$ is an element of an
extension $K$ of $F$ of degree a power of 2: $[K : F] = 2^m$ for some $m$. Since $[F(\alpha) : F]$
divides this extension degree, it must also be a power of 2.

Proposition 23. If the element $\alpha \in \mathbb{R}$ is obtained from a field $F \subset \mathbb{R}$ by a series of
compass and straightedge constructions then $[F(\alpha) : F] = 2^k$ for some integer $k \ge 0$.

Theorem 24. None of the classical Greek problems: (I) Doubling the Cube, (II)
Trisecting an Angle, and (III) Squaring the Circle, is possible.

Proof: (I) Doubling the cube amounts to constructing $\sqrt[3]{2}$ in the reals starting with
the unit 1. Since $[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 3$ is not a power of 2, this is impossible.

(II) If an angle $\theta$ can be constructed, then determining the point at distance 1 from
the origin and angle $\theta$ from the positive $x$ axis in $\mathbb{R}^2$ shows that $\cos\theta$ (the $x$-coordinate
of this point) can be constructed (so then $\sin\theta$ can also be constructed). Conversely if
$\cos\theta$, and then $\sin\theta$, can be constructed, the point with those coordinates gives the angle $\theta$.

The problem of trisecting the angle $\theta$ is then equivalent to the problem: given $\cos\theta$
construct $\cos\theta/3$.

To see that this is not always possible (it is certainly occasionally possible, for
example for $\theta = 180°$), consider $\theta = 60°$. Then $\cos\theta = \frac{1}{2}$. By the triple angle formula
for cosines:

$$\cos\theta = 4\cos^3(\theta/3) - 3\cos(\theta/3),$$

substituting $\theta = 60°$, we see that $\beta = \cos 20°$ satisfies the equation

$$4\beta^3 - 3\beta - 1/2 = 0$$

or $8\beta^3 - 6\beta - 1 = 0$. This can be written $(2\beta)^3 - 3(2\beta) - 1 = 0$. Let $\alpha = 2\beta$. Then
$\alpha$ is a real number between 0 and 2 satisfying the equation

$$\alpha^3 - 3\alpha - 1 = 0.$$

But we considered this equation in the last section and determined $[\mathbb{Q}(\alpha) : \mathbb{Q}] = 3$, and
as before we see that $\alpha$ is not constructible.
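
The degree computation quoted here can be checked by hand or by machine: a cubic with rational coefficients is irreducible over the rationals as soon as it has no rational root. The sketch below, with the hypothetical helper name `rational_roots`, applies the rational root test to the cubic satisfied by twice the cosine of 20 degrees and also confirms numerically that this number is a root.

```python
import math
from fractions import Fraction


def rational_roots(coeffs):
    """Rational roots of an integer polynomial (leading coefficient first).

    Assumes the leading and constant coefficients are nonzero, so every
    rational root is +-p/q with p dividing the constant term and q dividing
    the leading coefficient.
    """
    lead, const = coeffs[0], coeffs[-1]

    def divisors(n):
        n = abs(n)
        return [d for d in range(1, n + 1) if n % d == 0]

    def evaluate(x):
        value = Fraction(0)
        for c in coeffs:  # Horner's rule
            value = value * x + c
        return value

    candidates = {
        Fraction(sign * p, q)
        for p in divisors(const)
        for q in divisors(lead)
        for sign in (1, -1)
    }
    return sorted(c for c in candidates if evaluate(c) == 0)


# x^3 - 3x - 1: the only candidates are +1 and -1, and neither is a root.
roots = rational_roots([1, 0, -3, -1])

# Numerical check that alpha = 2*cos(20 degrees) satisfies the cubic.
alpha = 2 * math.cos(math.radians(20))
residual = alpha**3 - 3 * alpha - 1
```

Since the list of rational roots is empty, the cubic has no linear factor over the rationals and is therefore irreducible, giving degree 3.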

(III) Squaring the circle is equivalent to determining whether the real number $\pi =
3.14159\ldots$ is constructible. As mentioned previously, it is a difficult problem even
to prove that this number is not rational. It is in fact transcendental (which we shall
assume without proof), so that $[\mathbb{Q}(\pi) : \mathbb{Q}]$ is not even finite, much less a power of 2,
showing the impossibility of squaring the circle by straightedge and compass.

Remark: The proof above shows that $\cos 20°$ and $\sin 20°$ cannot be constructed. The
question arises as to which integer angles (measured in degrees) are constructible. The
angles 1° and 2° are not constructible, since otherwise the addition formulae for sines
and cosines would give the constructibility for 20°. On the other hand, elementary
geometric constructions (of the regular 5-gon for an angle of 72° and the equilateral
triangle for an angle of 60°) together with the addition formulae and the half-angle
formulae show that $\cos 3°$ and $\sin 3°$ are constructible. It follows from this that the
trigonometric functions of an integer degree angle are constructible precisely when the
angle is a multiple of 3°. Explicitly,

$$\cos 3° = \frac{1}{8}(\sqrt{3} + 1)\sqrt{5 + \sqrt{5}} + \frac{1}{16}(\sqrt{6} - \sqrt{2})(\sqrt{5} - 1)$$

$$\sin 3° = \frac{1}{16}(\sqrt{6} + \sqrt{2})(\sqrt{5} - 1) - \frac{1}{8}(\sqrt{3} - 1)\sqrt{5 + \sqrt{5}}$$

showing that these are obtained from $\mathbb{Q}$ by successive extractions of square roots and
field operations.
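
The two closed forms are easy to sanity-check numerically. The following short sketch (the function names are chosen here for illustration) evaluates the nested square-root expressions for the cosine and sine of 3 degrees and compares them with the library values.

```python
import math


def cos3_closed_form():
    """(1/8)(sqrt3 + 1) sqrt(5 + sqrt5) + (1/16)(sqrt6 - sqrt2)(sqrt5 - 1)."""
    s2, s3, s5, s6 = (math.sqrt(n) for n in (2, 3, 5, 6))
    return (s3 + 1) * math.sqrt(5 + s5) / 8 + (s6 - s2) * (s5 - 1) / 16


def sin3_closed_form():
    """(1/16)(sqrt6 + sqrt2)(sqrt5 - 1) - (1/8)(sqrt3 - 1) sqrt(5 + sqrt5)."""
    s2, s3, s5, s6 = (math.sqrt(n) for n in (2, 3, 5, 6))
    return (s6 + s2) * (s5 - 1) / 16 - (s3 - 1) * math.sqrt(5 + s5) / 8


three_degrees = math.radians(3)
cos_error = abs(cos3_closed_form() - math.cos(three_degrees))
sin_error = abs(sin3_closed_form() - math.sin(three_degrees))
```

Both errors are at the level of floating-point rounding. Only square roots and field operations appear in the two functions, in line with the constructibility of these values.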

After discussing the cyclotomic fields in Section 14.5 we shall consider another
classical geometric question: “which regular n-gons can be constructed by straightedge 
and compass?” (cf. Proposition 14.29). 

We have been careful here to consider constructions using a straightedge rather 
than a ruler, the distinction being that a ruler has marks on it. If one uses a ruler, it is
possible to construct many additional algebraic elements. For example, suppose $\theta$ is a
given angle and the unit distance 1 is marked on the ruler. Draw a circle of radius 1 with
central angle $\theta$ as shown in Figure 3 and then slide the ruler until the distance between
points $A$ and $B$ on the circle is 1. Then some elementary geometry shows that (cf.
the exercises) the angle $\alpha$ indicated is $\theta/3$, i.e., this construction (due to Archimedes)
trisects $\theta$. In particular, the second classical problem in Theorem 24 (Trisecting an
Angle) can be solved with ruler and compass.


Fig. 3 



The first of the classical problems in Theorem 24 (Duplication of the Cube), which
amounts to the construction of $\sqrt[3]{2}$, can also be solved with ruler and compass. The
following gives a construction for $k^{1/3}$ for any given positive real $k$ which is less than
1. This construction was shown to us by J.H. Conway.

Drawing a circle of radius 1 and using the point $A = (k, 0)$ as center, construct
the point $B = (0, \sqrt{1 - k^2})$. Dividing this distance by 3, construct the point
$(0, -\frac{1}{3}\sqrt{1 - k^2})$ and draw the line connecting this point with $A$. Slide the ruler with
marked unit length 1 so that it passes through the point $B$ and so that the distance from
the intersection point $C$ to the intersection point $D$ with the $x$-axis is of length 1, as
indicated in Figure 4.

Then the distance between $A$ and $D$ is $2k^{1/3}$ and the distance between $B$ and $C$ is
$2k^{2/3}$ (cf. the exercises).



EXERCISES 


1. Prove that it is impossible to construct the regular 9-gon.

2. Prove that Archimedes' construction actually trisects the angle $\theta$. [Note the isosceles
triangles in Figure 5 to prove that $\beta = \gamma = 2\alpha$.]



Fig. 5

3. Prove that Conway's construction indicated in the text actually constructs $2k^{1/3}$ and $2k^{2/3}$.
[One method: let $(x, y)$ be the coordinates of the point $C$, $a$ the distance from $B$ to
$C$ and $b$ the distance from $A$ to $D$; use similar triangles to prove
(a) $\frac{y}{\sqrt{1 - k^2}} = \frac{1}{1 + a}$,
(b) $\frac{x}{a} = \frac{b + k}{1 + a}$,
(c) $\frac{y}{x - k} = \frac{\sqrt{1 - k^2}}{3k}$,
and also show that (d) $(1 - k^2) + (b + k)^2 = (1 + a)^2$;
solve these equations for $a$ and $b$.]


4. The construction of the regular 7-gon amounts to the constructibility of $\cos(2\pi/7)$. We
shall see later (Section 14.5 and Exercise 2 of Section 14.7) that $\alpha = 2\cos(2\pi/7)$ satisfies
the equation $x^3 + x^2 - 2x - 1 = 0$. Use this to prove that the regular 7-gon is not
constructible by straightedge and compass.

5. Use the fact that $\alpha = 2\cos(2\pi/5)$ satisfies the equation $x^2 + x - 1 = 0$ to conclude that
the regular 5-gon is constructible by straightedge and compass.


13.4 SPLITTING FIELDS AND ALGEBRAIC CLOSURES 

Let F be a field. 

If $f(x)$ is any polynomial in $F[x]$ then we have seen in Section 2 that there exists
a field $K$ which can (by identifying $F$ with an isomorphic copy of $F$) be considered
an extension of $F$ in which $f(x)$ has a root $\alpha$. This is equivalent to the statement that
$f(x)$ has a linear factor $x - \alpha$ in $K[x]$ (this is Proposition 9 of Chapter 9).

Definition. The extension field $K$ of $F$ is called a splitting field for the polynomial
$f(x) \in F[x]$ if $f(x)$ factors completely into linear factors (or splits completely) in
$K[x]$ and $f(x)$ does not factor completely into linear factors over any proper subfield
of $K$ containing $F$.

If $f(x)$ is of degree $n$, then $f(x)$ has at most $n$ roots in $F$ (Proposition 17 of
Chapter 9) and has precisely $n$ roots (counting multiplicities) in $F$ if and only if $f(x)$
splits completely in $F[x]$.

Theorem 25. For any field $F$, if $f(x) \in F[x]$ then there exists an extension $K$ of $F$
which is a splitting field for $f(x)$.

Proof: We first show that there is an extension $E$ of $F$ over which $f(x)$ splits
completely into linear factors by induction on the degree $n$ of $f(x)$. If $n = 1$, then take
$E = F$. Suppose now that $n > 1$. If the irreducible factors of $f(x)$ over $F$ are all of
degree 1, then $F$ is the splitting field for $f(x)$ and we may take $E = F$. Otherwise,
at least one of the irreducible factors, say $p(x)$ of $f(x)$ in $F[x]$ is of degree at least 2.
By Theorem 3 there is an extension $E_1$ of $F$ containing a root $\alpha$ of $p(x)$. Over $E_1$ the
polynomial $f(x)$ has the linear factor $x - \alpha$. The degree of the remaining factor $f_1(x)$
of $f(x)$ is $n - 1$, so by induction there is an extension $E$ of $E_1$ containing all the roots
of $f_1(x)$. Since $\alpha \in E$, $E$ is an extension of $F$ containing all the roots of $f(x)$. Now
let $K$ be the intersection of all the subfields of $E$ containing $F$ which also contain all
the roots of $f(x)$. Then $K$ is a field which is a splitting field for $f(x)$.

We shall see shortly that any two splitting fields for $f(x)$ are isomorphic (which
extends Theorem 8), so (by abuse) we frequently refer to the splitting field of a
polynomial.


Definition. If $K$ is an algebraic extension of $F$ which is the splitting field over $F$ for
a collection of polynomials $f(x) \in F[x]$ then $K$ is called a normal extension of $F$.

We shall generally use the term “splitting field” rather than “normal extension” (cf. 
also Section 14.9). 


Examples 

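(1) The splitting field for $x^2 - 2$ over $\mathbb{Q}$ is just $\mathbb{Q}(\sqrt{2})$, since the two roots are $\pm\sqrt{2}$ and
$-\sqrt{2} \in \mathbb{Q}(\sqrt{2})$.

(2) The splitting field for $(x^2 - 2)(x^2 - 3)$ is the field $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ generated over $\mathbb{Q}$ by
$\sqrt{2}$ and $\sqrt{3}$ since the roots of the polynomial are $\pm\sqrt{2}, \pm\sqrt{3}$. We have already seen
that this is an extension of degree 4 over $\mathbb{Q}$ and we have the following diagram of
known subfields:

$$\begin{array}{ccc}
 & \mathbb{Q}(\sqrt{2}, \sqrt{3}) & \\
\mathbb{Q}(\sqrt{2}) & \mathbb{Q}(\sqrt{6}) & \mathbb{Q}(\sqrt{3}) \\
 & \mathbb{Q} &
\end{array}$$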




(3) The splitting field of $x^3 - 2$ over $\mathbb{Q}$ is not just $\mathbb{Q}(\sqrt[3]{2})$ since as previously noted the
three roots of this polynomial in $\mathbb{C}$ are

$$\sqrt[3]{2}, \qquad \sqrt[3]{2}\left(\frac{-1 + i\sqrt{3}}{2}\right), \qquad \sqrt[3]{2}\left(\frac{-1 - i\sqrt{3}}{2}\right)$$

and the latter two roots are not elements of $\mathbb{Q}(\sqrt[3]{2})$ since the elements of this field
are of the form $a + b\sqrt[3]{2} + c\sqrt[3]{4}$ with rational $a, b, c$ and all such numbers are real.

The splitting field $K$ of this polynomial is obtained by adjoining all three of these
roots to $\mathbb{Q}$. Note that since $K$ contains the first two roots above, then it contains their
quotient $\frac{-1 + \sqrt{-3}}{2}$, hence $K$ contains the element $\sqrt{-3}$. On the other hand, any field
containing $\sqrt[3]{2}$ and $\sqrt{-3}$ contains all three of the roots above. It follows that

$$K = \mathbb{Q}(\sqrt[3]{2}, \sqrt{-3})$$

is the splitting field of $x^3 - 2$ over $\mathbb{Q}$. Since $\sqrt{-3}$ satisfies the equation $x^2 + 3 = 0$, the
degree of this extension over $\mathbb{Q}(\sqrt[3]{2})$ is at most 2, hence must be 2 since we observed
above that $\mathbb{Q}(\sqrt[3]{2})$ is not the splitting field. It follows that

$$[\mathbb{Q}(\sqrt[3]{2}, \sqrt{-3}) : \mathbb{Q}] = 6.$$

Note that we could have proceeded slightly differently at the end by noting that
$\mathbb{Q}(\sqrt{-3})$ is a subfield of $K$, so that the index $[\mathbb{Q}(\sqrt{-3}) : \mathbb{Q}] = 2$ divides $[K : \mathbb{Q}]$.
Since this extension degree is also divisible by 3 (because $\mathbb{Q}(\sqrt[3]{2}) \subset K$), the degree
is divisible by 6, hence must be 6.

This gives us the diagram of known subfields:

[Diagram of known subfields of $\mathbb{Q}(\sqrt[3]{2}, \sqrt{-3})$, labeled by the roots $\theta_1 = \sqrt[3]{2}, \ldots$ of $x^3 - 2$.]

(4) One must be careful in computing splitting fields. The splitting field for the polynomial
$x^4 + 4$ over $\mathbb{Q}$ is smaller than one might at first suspect. In fact this polynomial factors
over $\mathbb{Q}$:

$$x^4 + 4 = x^4 + 4x^2 + 4 - 4x^2 = (x^2 + 2)^2 - 4x^2 = (x^2 + 2x + 2)(x^2 - 2x + 2)$$

where these two factors are irreducible (Eisenstein again). Solving for the roots of the
two factors by the quadratic formula, we find the four roots

$$\pm 1 \pm i$$

so that the splitting field of this polynomial is just the field $\mathbb{Q}(i)$, an extension of degree
2 of $\mathbb{Q}$.
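
The factorization in Example 4 can be verified mechanically. The sketch below uses a small hypothetical helper, `poly_mul`, that multiplies polynomials given as coefficient lists (lowest degree first), and then checks that each of the four numbers obtained by choosing signs in $\pm 1 \pm i$ is a root of $x^4 + 4$.

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists, lowest degree first."""
    result = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            result[i + j] += a * b
    return result


# (x^2 + 2x + 2)(x^2 - 2x + 2), coefficients listed from the constant term up.
product = poly_mul([2, 2, 1], [2, -2, 1])

# The four candidate roots: every sign choice in +-1 +- i.
candidates = [complex(re, im) for re in (1, -1) for im in (1, -1)]
residuals = [abs(z * z * z * z + 4) for z in candidates]
```

Here `product` is the coefficient list of $x^4 + 4$, and every entry of `residuals` vanishes, so all four roots already lie in the field generated over the rationals by $i$.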


In general, if $f(x) \in F[x]$ is a polynomial of degree $n$, then adjoining one root of
$f(x)$ to $F$ generates an extension $F_1$ of degree at most $n$ (and equal to $n$ if and only if
$f(x)$ is irreducible). Over $F_1$ the polynomial $f(x)$ now has at least one linear factor,
so that any other root of $f(x)$ satisfies an equation of degree at most $n - 1$ over $F_1$.
Adjoining such a root to $F_1$ we therefore obtain an extension of degree at most $n - 1$
of $F_1$, etc. Using the multiplicativity of extension degrees, this proves

Proposition 26. A splitting field of a polynomial of degree n over F is of degree at 
most n! over F . 


As the examples above show, the degree of a splitting field may be smaller than n!. 
It will be proved later using Galois Theory that a “general” polynomial of degree n (in 
a well defined sense) over $\mathbb{Q}$ has a splitting field of degree $n!$, so this may be viewed
as the "generic" situation (although most of the interesting examples we shall consider
have splitting fields of smaller degree).

Example: (Splitting Field of $x^n - 1$: Cyclotomic Fields)

Consider the splitting field of the polynomial $x^n - 1$ over $\mathbb{Q}$. The roots of this polynomial
are called the $n$th roots of unity.

Recall that every nonzero complex number $a + bi \in \mathbb{C}$ can be written uniquely in the
form

$$re^{i\theta} = r(\cos\theta + i\sin\theta), \qquad r > 0,\ 0 \le \theta < 2\pi$$

which is simply representing the point $a + bi$ in the complex plane in terms of polar
coordinates: $r$ is the distance of $(a, b)$ from the origin and $\theta$ is the angle made with the real
positive axis.

Over $\mathbb{C}$ there are $n$ distinct solutions of the equation $x^n = 1$, namely the elements

$$e^{2\pi ki/n} = \cos\left(\frac{2\pi k}{n}\right) + i\sin\left(\frac{2\pi k}{n}\right)$$

for $k = 0, 1, \ldots, n - 1$. These points are given geometrically by $n$ equally spaced points
starting with the point $(1, 0)$ (corresponding to $k = 0$) on a circle of radius 1 in the complex
plane (see Figure 6). The fact that these are all $n$th roots of unity is immediate, since

$$(e^{2\pi ki/n})^n = e^{(2\pi ki/n)n} = e^{2\pi ki} = 1.$$

It follows that $\mathbb{C}$ contains a splitting field for $x^n - 1$ and we shall frequently view the
splitting field for $x^n - 1$ over $\mathbb{Q}$ as the field generated over $\mathbb{Q}$ in $\mathbb{C}$ by the numbers above.



In any abstract splitting field $K/\mathbb{Q}$ for $x^n - 1$ the collection of $n$th roots of unity form
a group under multiplication since if $\alpha^n = 1$ and $\beta^n = 1$ then $(\alpha\beta)^n = 1$, so this subset of
$K^\times$ is closed under multiplication. It follows that this is a cyclic group (Proposition 18 of
Chapter 9); we shall see that there are $n$ distinct roots in $K$ so it has order $n$.

Definition. A generator of the cyclic group of all $n$th roots of unity is called a primitive $n$th
root of unity.

Let $\zeta_n$ denote a primitive $n$th root of unity. The other primitive $n$th roots of unity are then the
elements $\zeta_n^a$ where $1 \le a < n$ is an integer relatively prime to $n$, since these are the other
generators for a cyclic group of order $n$. In particular there are precisely $\varphi(n)$ primitive $n$th
roots of unity, where $\varphi(n)$ denotes the Euler $\varphi$-function.

Over $\mathbb{C}$ we can see all of this directly by letting

$$\zeta_n = e^{2\pi i/n}$$

(the first $n$th root of unity counterclockwise from 1). Then all the other roots of unity are
powers of $\zeta_n$:

$$e^{2\pi ki/n} = \zeta_n^k$$

so that $\zeta_n$ is one possible generator for the multiplicative group of $n$th roots of unity. When
we view the roots of unity in $\mathbb{C}$ we shall usually use $\zeta_n$ to denote this choice of a primitive
$n$th root of unity. The primitive roots of unity in $\mathbb{C}$ for some small values of $n$ are

$$\begin{aligned}
\zeta_1 &= 1 \\
\zeta_2 &= -1 \\
\zeta_3 &= \frac{-1 + i\sqrt{3}}{2} \\
\zeta_4 &= i \\
\zeta_5 &= \frac{\sqrt{5} - 1}{4} + i\,\frac{\sqrt{10 + 2\sqrt{5}}}{4} \\
\zeta_6 &= \frac{1 + i\sqrt{3}}{2} \\
\zeta_8 &= \frac{\sqrt{2}}{2} + i\,\frac{\sqrt{2}}{2}
\end{aligned}$$

(these formulas follow from the elementary geometry of $n$-gons and in any case can be
verified directly by raising them to the appropriate power).
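
The count of primitive roots and the closed form for the primitive 5th root of unity can both be checked numerically. The sketch below works in floating point with the complex exponential; the function name `primitive_roots_of_unity` is chosen here for illustration.

```python
import cmath
import math


def primitive_roots_of_unity(n):
    """Primitive n-th roots of unity in C: zeta_n^a with gcd(a, n) = 1.

    The exponent a runs over 0, ..., n-1; gcd(0, n) = n, so a = 0 is kept
    only in the trivial case n = 1.
    """
    zeta = cmath.exp(2j * cmath.pi / n)
    return [zeta**a for a in range(n) if math.gcd(a, n) == 1]


def has_exact_order(z, n, tol=1e-9):
    """True if z^n = 1 and z^d != 1 for 0 < d < n (up to tolerance)."""
    if abs(z**n - 1) > tol:
        return False
    return all(abs(z**d - 1) > tol for d in range(1, n))


# Closed form for the primitive 5th root of unity given in the text.
zeta5_closed = complex((math.sqrt(5) - 1) / 4, math.sqrt(10 + 2 * math.sqrt(5)) / 4)
zeta5_exp = cmath.exp(2j * cmath.pi / 5)

roots12 = primitive_roots_of_unity(12)
```

For $n = 12$ the list has four entries, corresponding to the exponents 1, 5, 7, 11, and each entry has exact multiplicative order 12.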

The splitting field of $x^n - 1$ over $\mathbb{Q}$ is the field $\mathbb{Q}(\zeta_n)$ and this field is given a name:

Definition. The field $\mathbb{Q}(\zeta_n)$ is called the cyclotomic field of $n$th roots of unity.

Determining the degree of this extension requires some analysis of the minimal polynomial
of $\zeta_n$ over $\mathbb{Q}$ and will be postponed until later (Section 6). One important special case which
we have in fact already considered is when $n = p$ is a prime. In this case, we have the
factorization

$$x^p - 1 = (x - 1)(x^{p-1} + x^{p-2} + \cdots + x + 1)$$

and since $\zeta_p \neq 1$ it follows that $\zeta_p$ is a root of the polynomial

$$\Phi_p(x) = \frac{x^p - 1}{x - 1} = x^{p-1} + x^{p-2} + \cdots + x + 1$$

which we showed was irreducible in Section 9.4. It follows that $\Phi_p(x)$ is the minimal
polynomial of $\zeta_p$ over $\mathbb{Q}$, so that

$$[\mathbb{Q}(\zeta_p) : \mathbb{Q}] = p - 1.$$

We shall see later that in general $[\mathbb{Q}(\zeta_n) : \mathbb{Q}] = \varphi(n)$, where $\varphi(n)$ is the Euler phi-function
of $n$ (so that $\varphi(p) = p - 1$).

Example: (Splitting Field of $x^p - 2$, $p$ a prime)

Let $p$ be a prime and consider the splitting field of $x^p - 2$. If $\alpha$ is a root of this equation,
i.e., $\alpha^p = 2$, then $(\zeta\alpha)^p = 2$ where $\zeta$ is any $p$th root of unity. Hence the solutions of this
equation are

$$\zeta\sqrt[p]{2}, \qquad \zeta \text{ a } p\text{th root of unity}$$

where as usual the symbol $\sqrt[p]{2}$ denotes the positive real $p$th root of 2 if we wish to view
these elements as complex numbers, and denotes any one solution of $x^p = 2$ if we view
these roots abstractly. Since the ratio of the two solutions $\zeta_p\sqrt[p]{2}$ and $\sqrt[p]{2}$ for $\zeta_p$ a primitive
$p$th root of unity is just $\zeta_p$, the splitting field of $x^p - 2$ over $\mathbb{Q}$ contains $\mathbb{Q}(\sqrt[p]{2}, \zeta_p)$. On
the other hand, all the roots above lie in this field, so that the splitting field is precisely

$$\mathbb{Q}(\sqrt[p]{2}, \zeta_p).$$

This field contains the cyclotomic field of $p$th roots of unity and is generated over it by
$\sqrt[p]{2}$, hence is an extension of degree at most $p$. It follows that the degree of this extension
over $\mathbb{Q}$ is $\le p(p - 1)$. Since both $\mathbb{Q}(\sqrt[p]{2})$ and $\mathbb{Q}(\zeta_p)$ are subfields, the degree of the
extension over $\mathbb{Q}$ is divisible by $p$ and by $p - 1$. Since these two numbers are relatively
prime it follows that the extension degree is divisible by $p(p - 1)$ so that we must have

$$[\mathbb{Q}(\sqrt[p]{2}, \zeta_p) : \mathbb{Q}] = p(p - 1)$$

(this is Corollary 22). Note in particular that we have proved $x^p - 2$ remains irreducible
over $\mathbb{Q}(\zeta_p)$, which is not at all obvious. We have the following diagram of known subfields:

$$\begin{array}{ccc}
 & \mathbb{Q}(\sqrt[p]{2}, \zeta_p) & \\
\mathbb{Q}(\zeta_p) & & \mathbb{Q}(\sqrt[p]{2}) \\
 & \mathbb{Q} &
\end{array}$$

The special case $p = 3$ was Example 3 above, where we simply indicated the 3rd roots
of unity explicitly.

We now return to the problem of proving it makes no difference how the splitting 
field of a polynomial / (x) over a field F is constructed. As in Theorem 8 it is convenient 
to state the result for an arbitrary isomorphism $\varphi : F \xrightarrow{\sim} F'$ between two fields.

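Theorem 27. Let $\varphi : F \xrightarrow{\sim} F'$ be an isomorphism of fields. Let $f(x) \in F[x]$
be a polynomial and let $f'(x) \in F'[x]$ be the polynomial obtained by applying $\varphi$ to
the coefficients of $f(x)$. Let $E$ be a splitting field for $f(x)$ over $F$ and let $E'$ be a
splitting field for $f'(x)$ over $F'$. Then the isomorphism $\varphi$ extends to an isomorphism
$\sigma : E \xrightarrow{\sim} E'$, i.e., $\sigma$ restricted to $F$ is the isomorphism $\varphi$:

$$\begin{array}{rccc}
\sigma : & E & \xrightarrow{\ \sim\ } & E' \\
 & | & & | \\
\varphi : & F & \xrightarrow{\ \sim\ } & F'
\end{array}$$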

Proof: We shall proceed by induction on the degree $n$ of $f(x)$. As in the discussion
before Theorem 8, recall that an isomorphism $\varphi$ from one field $F$ to another field
$F'$ induces a natural isomorphism between the polynomial rings $F[x]$ and $F'[x]$. In
particular, if $f(x)$ and $f'(x)$ correspond to one another under this isomorphism then
the irreducible factors of $f(x)$ in $F[x]$ correspond to the irreducible factors of $f'(x)$ in
$F'[x]$.

If $f(x)$ has all its roots in $F$ then $f(x)$ splits completely in $F[x]$ and $f'(x)$ splits
completely in $F'[x]$ (with its linear factors being the images of the linear factors for
$f(x)$). Hence $E = F$ and $E' = F'$, and in this case we may take $\sigma = \varphi$. This shows
the result is true for $n = 1$ and in the case where all the irreducible factors of $f(x)$ have
degree 1.

Assume now by induction that the theorem has been proved for any field $F$, isomorphism
$\varphi$, and polynomial $f(x) \in F[x]$ of degree $< n$. Let $p(x)$ be an irreducible
factor of $f(x)$ in $F[x]$ of degree at least 2 and let $p'(x)$ be the corresponding irreducible
factor of $f'(x)$ in $F'[x]$. If $\alpha \in E$ is a root of $p(x)$ and $\beta \in E'$ is a root of $p'(x)$, then
by Theorem 8 we can extend $\varphi$ to an isomorphism $\sigma' : F(\alpha) \xrightarrow{\sim} F'(\beta)$:

$$\begin{array}{rccc}
\sigma' : & F(\alpha) & \xrightarrow{\ \sim\ } & F'(\beta) \\
 & | & & | \\
\varphi : & F & \xrightarrow{\ \sim\ } & F'.
\end{array}$$

Let $F_1 = F(\alpha)$, $F_1' = F'(\beta)$, so that we have the isomorphism $\sigma' : F_1 \xrightarrow{\sim} F_1'$. We have
$f(x) = (x - \alpha)f_1(x)$ over $F_1$ where $f_1(x)$ has degree $n - 1$ and $f'(x) = (x - \beta)f_1'(x)$.
The field $E$ is a splitting field for $f_1(x)$ over $F_1$: all the roots of $f_1(x)$ are in $E$ and if
they were contained in any smaller extension $L$ containing $F_1$, then, since $F_1$ contains
$\alpha$, $L$ would also contain all the roots of $f(x)$, which would contradict the minimality
of $E$ as the splitting field of $f(x)$ over $F$. Similarly $E'$ is a splitting field for $f_1'(x)$ over
$F_1'$. Since the degrees of $f_1(x)$ and $f_1'(x)$ are less than $n$, by induction there exists a
map $\sigma : E \xrightarrow{\sim} E'$ extending the isomorphism $\sigma' : F_1 \xrightarrow{\sim} F_1'$. This gives the extended
diagram:

$$\begin{array}{rccc}
\sigma : & E & \xrightarrow{\ \sim\ } & E' \\
 & | & & | \\
\sigma' : & F_1 & \xrightarrow{\ \sim\ } & F_1' \\
 & | & & | \\
\varphi : & F & \xrightarrow{\ \sim\ } & F'.
\end{array}$$

Then as the diagram indicates, $\sigma$ restricted to $F_1$ is the isomorphism $\sigma'$, so in particular
$\sigma$ restricted to $F$ is $\sigma'$ restricted to $F$, which is $\varphi$, showing that $\sigma$ is an extension of $\varphi$,
completing the proof.

Corollary 28. (Uniqueness of Splitting Fields) Any two splitting fields for a polynomial
$f(x) \in F[x]$ over a field $F$ are isomorphic.

Proof: Take $\varphi$ to be the identity mapping from $F$ to itself and $E$ and $E'$ to be two
splitting fields for $f(x)$ ($= f'(x)$).

As we mentioned before, this result justifies the terminology of the splitting field
for $f(x)$ over $F$, since any two are isomorphic. Splitting fields play a natural role in
the study of algebraic elements (if you are adjoining one root of a polynomial, why not
adjoin all the roots?) and so take a particularly important role in Galois Theory.

We end this section with a discussion of field extensions of F which contain all the 
roots of all polynomials over F. 

Definition. The field $\overline{F}$ is called an algebraic closure of $F$ if $\overline{F}$ is algebraic over $F$
and if every polynomial $f(x) \in F[x]$ splits completely over $\overline{F}$ (so that $\overline{F}$ can be said
to contain all the elements algebraic over $F$).

Definition. A field $K$ is said to be algebraically closed if every polynomial with
coefficients in $K$ has a root in $K$.

It is not obvious that algebraically closed fields exist nor that there exists an algebraic 
closure of a given field F (we shall prove this shortly). 

Note that if $K$ is algebraically closed, then in fact every $f(x) \in K[x]$ has all its
roots in $K$, since by definition $f(x)$ has a root $\alpha \in K$, hence has a factor $x - \alpha$ in
$K[x]$. The remaining factor of $f(x)$ then is a polynomial in $K[x]$, hence has a root, so
has a linear factor etc., so that $f(x)$ must split completely. Hence if $K$ is algebraically
closed, then $K$ itself is an algebraic closure of $K$ and the converse is obvious, so that
$\overline{K} = K$ if and only if $K$ is algebraically closed.

The next result shows that the process of "taking the algebraic closure" actually
stops after one step: taking the algebraic closure of an algebraic closure does not give
a larger field, since the field is already algebraically closed (notationally: $\overline{\overline{F}} = \overline{F}$).

Proposition 29. Let $\overline{F}$ be an algebraic closure of $F$. Then $\overline{F}$ is algebraically closed.

Proof: Let $f(x)$ be a polynomial in $\overline{F}[x]$ and let $\alpha$ be a root of $f(x)$. Then $\alpha$
generates an algebraic extension $\overline{F}(\alpha)$ of $\overline{F}$, and $\overline{F}$ is algebraic over $F$. By Theorem
20, $\overline{F}(\alpha)$ is algebraic over $F$ so in particular its element $\alpha$ is algebraic over $F$. But then
$\alpha \in \overline{F}$, showing $\overline{F}$ is algebraically closed.

Given a field $F$ we have already shown how to construct (finite) extensions of $F$
containing all the roots of any given polynomial $f(x) \in F[x]$. Intuitively, an algebraic
closure of $F$ is given by the field "generated" by all of these fields. The difficulty
with this is "generated" where?, since they are not all subfields of a given field. For a
finite collection of polynomials $f_1(x), \ldots, f_k(x)$, we can identify their splitting fields
as subfields of the splitting field of the product polynomial $f_1(x) \cdots f_k(x)$, but the same
idea used for an infinite number of polynomials requires numerous "bookkeeping"
identifications and an application of Zorn's Lemma.

We shall instead construct an algebraic closure of F by first constructing an al- 
gebraically closed field containing F. The proof uses a clever idea of Artin which 
very neatly solves the “bookkeeping” problem of constructing a field containing the 
appropriate roots of polynomials (which also ultimately relies on Zorn’s Lemma) by 
introducing a separate variable for every polynomial.

Proposition 30. For any field $F$ there exists an algebraically closed field $K$ containing $F$.

Proof: For every nonconstant monic polynomial $f = f(x)$ with coefficients in
$F$, let $x_f$ denote an indeterminate and consider the polynomial ring $F[\ldots, x_f, \ldots]$
generated over $F$ by the variables $x_f$. In this polynomial ring consider the ideal $I$
generated by the polynomials $f(x_f)$. If this ideal is not proper, then 1 is an element of
the ideal, hence we have a relation

$$g_1 f_1(x_{f_1}) + g_2 f_2(x_{f_2}) + \cdots + g_n f_n(x_{f_n}) = 1$$

where the $g_i$, $i = 1, 2, \ldots, n$, are polynomials in the $x_f$. For $i = 1, 2, \ldots, n$ let
$x_{f_i} = x_i$ and let $x_{n+1}, \ldots, x_m$ be the remaining variables occurring in the polynomials
$g_j$, $j = 1, 2, \ldots, n$. Then the relation above reads

$$g_1(x_1, x_2, \ldots, x_m) f_1(x_1) + \cdots + g_n(x_1, x_2, \ldots, x_m) f_n(x_n) = 1.$$

Let $F'$ be a finite extension of $F$ containing a root $\alpha_i$ of $f_i(x)$ for $i = 1, 2, \ldots, n$. Letting
$x_i = \alpha_i$, $i = 1, 2, \ldots, n$ and setting $x_{n+1} = \cdots = x_m = 0$, say, in the polynomial
equation above would imply that $0 = 1$ in $F'$, clearly impossible.

Since the ideal $I$ is a proper ideal, it is contained in a maximal ideal $\mathcal{M}$ (this is
where Zorn's Lemma is used). Then the quotient

$$K_1 = F[\ldots, x_f, \ldots]/\mathcal{M}$$

is a field containing (an isomorphic copy of) $F$. Each of the polynomials $f$ has a root
in $K_1$ by construction, namely the image of $x_f$, since $f(x_f) \in I \subseteq \mathcal{M}$. We have
constructed a field $K_1$ in which every polynomial with coefficients from $F$ has a root.
Performing the same construction with $K_1$ instead of $F$ gives a field $K_2$ containing
$K_1$ in which all polynomials with coefficients from $K_1$ have a root. Continuing in this
fashion we obtain a sequence of fields

$$F = K_0 \subseteq K_1 \subseteq K_2 \subseteq \cdots \subseteq K_j \subseteq K_{j+1} \subseteq \cdots$$

where every polynomial with coefficients in $K_j$ has a root in $K_{j+1}$, $j = 0, 1, \ldots$. Let

$$K = \bigcup_{j \ge 0} K_j$$

be the union of these fields. Then $K$ is clearly a field containing $F$. Since $K$ is the
union of the fields $K_j$, the coefficients of any polynomial $h(x)$ in $K[x]$ all lie in some
field $K_N$ for $N$ sufficiently large. But then $h(x)$ has a root in $K_{N+1}$, so has a root in $K$.
It follows that $K$ is algebraically closed, completing the proof.

We now use the algebraically closed field containing F to construct an algebraic 
closure of F: 

Proposition 31. Let $K$ be an algebraically closed field and let $F$ be a subfield of $K$.
Then the collection of elements $\overline{F}$ of $K$ that are algebraic over $F$ is an algebraic closure
of $F$. An algebraic closure of $F$ is unique up to isomorphism.

Proof: By definition, $\overline{F}$ is an algebraic extension of $F$. Every polynomial $f(x) \in \overline{F}[x]$
splits completely over $K$ into linear factors $x - \alpha$ (the same is true for every
polynomial even in $K[x]$). But each $\alpha$ is a root of $f(x)$, so is algebraic over $\overline{F}$, hence
algebraic over $F$ (since $\overline{F}$ is algebraic over $F$), and so is an element of $\overline{F}$. It follows
that all the linear factors $x - \alpha$ have coefficients in $\overline{F}$, i.e., $f(x)$ splits completely
in $\overline{F}[x]$ and $\overline{F}$ is an algebraic closure of $F$.

The uniqueness (up to isomorphism) of the algebraic closure is natural in light of
the uniqueness (up to isomorphism) of splitting fields. It is proved along the same
lines, together with an application of Zorn's Lemma, and will be omitted.

We shall prove later using Galois theory the following result (purely analytic proofs 
using complex analysis also exist). 

Theorem. (Fundamental Theorem of Algebra) The field $\mathbb{C}$ is algebraically closed.

By Proposition 31, we immediately obtain:

Corollary 32. The field $\mathbb{C}$ contains an algebraic closure for any of its subfields. In
particular, $\overline{\mathbb{Q}}$, the collection of complex numbers algebraic over $\mathbb{Q}$, is an algebraic
closure of $\mathbb{Q}$.

The point of these considerations is that all the computations involving elements
algebraic over a field $F$ may be viewed as taking place in one (large) field, namely
$\overline{F}$. Similarly, we can speak sensibly of the composite of any collection of algebraic
extensions by viewing them all as subfields of an algebraic closure. In the case of $\mathbb{Q}$ or
finite extensions of $\mathbb{Q}$ we may consider all of our computations as occurring in $\mathbb{C}$.

EXERCISES 

1. Determine the splitting field and its degree over $\mathbb{Q}$ for $x^4 - 2$.

2. Determine the splitting field and its degree over $\mathbb{Q}$ for $x^4 + 2$.

3. Determine the splitting field and its degree over $\mathbb{Q}$ for $x^4 + x^2 + 1$.

4. Determine the splitting field and its degree over $\mathbb{Q}$ for $x^6 - 4$.

5. Let $K$ be a finite extension of $F$. Prove that $K$ is a splitting field over $F$ if and only if
every irreducible polynomial in $F[x]$ that has a root in $K$ splits completely in $K[x]$. [Use
Theorems 8 and 27.]

6. Let $K_1$ and $K_2$ be finite extensions of $F$ contained in the field $K$, and assume both are
splitting fields over $F$.

(a) Prove that their composite $K_1 K_2$ is a splitting field over $F$.

(b) Prove that $K_1 \cap K_2$ is a splitting field over $F$. [Use the preceding exercise.]


13.5 SEPARABLE AND INSEPARABLE EXTENSIONS 

Let $F$ be a field and let $f(x) \in F[x]$ be a polynomial. Over a splitting field for $f(x)$
we have the factorization

$$f(x) = (x - \alpha_1)^{n_1} (x - \alpha_2)^{n_2} \cdots (x - \alpha_k)^{n_k}$$

where $\alpha_1, \alpha_2, \dots, \alpha_k$ are distinct elements of the splitting field and $n_i \ge 1$ for all $i$.
Recall that $\alpha_i$ is called a multiple root if $n_i > 1$ and is called a simple root if $n_i = 1$.
The integer $n_i$ is called the multiplicity of the root $\alpha_i$.

Definition. A polynomial over $F$ is called separable if it has no multiple roots (i.e.,
all its roots are distinct). A polynomial which is not separable is called inseparable.

Note that if a polynomial $f(x)$ has distinct roots in one splitting field then $f(x)$ has
distinct roots in any splitting field (since this is equivalent to $f(x)$ factoring into distinct
linear factors, and there is an isomorphism over $F$ between any two splitting fields of
$f(x)$ that is bijective on its roots), so that we need not specify the field containing all
the roots of $f(x)$.

Examples 

(1) The polynomial $x^2 - 2$ is separable over $\mathbb{Q}$ since its two roots $\pm\sqrt{2}$ are distinct. The
polynomial $(x^2 - 2)^n$ for any $n \ge 2$ is inseparable since it has the multiple roots $\pm\sqrt{2}$,
each with multiplicity $n$.

(2) The polynomial $x^2 - t$ $(= x^2 + t)$ over the field $F = \mathbb{F}_2(t)$ of rational functions in $t$
with coefficients from $\mathbb{F}_2$ is irreducible as we've seen before, but is not separable. If
$\sqrt{t}$ denotes a root in some extension field (note that $\sqrt{t} \notin F$), then

$$(x - \sqrt{t})^2 = x^2 - 2x\sqrt{t} + t = x^2 + t = x^2 - t$$

since $F$ is a field of characteristic 2. Hence this irreducible polynomial has only one
root (with multiplicity 2), so is not separable over $F$.

There is a simple criterion to check whether a polynomial has multiple roots. 

Definition. The derivative of the polynomial

$$f(x) = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0 \in F[x]$$

is defined to be the polynomial

$$D_x f(x) = n a_n x^{n-1} + (n-1) a_{n-1} x^{n-2} + \cdots + 2 a_2 x + a_1 \in F[x].$$


This formula is nothing but the usual formula for the derivative of a polynomial
familiar from calculus. It is purely algebraic and so can be applied to a polynomial
over an arbitrary field $F$, where the analytic notion of derivative (involving limits, a
continuous operation) may not exist.

The usual (calculus) formulas for derivatives hold for derivatives in this situation
as well, for example the formulas for the derivative of a sum and of a product:

$$D_x(f(x) + g(x)) = D_x f(x) + D_x g(x)$$

$$D_x(f(x)g(x)) = f(x) D_x g(x) + (D_x f(x)) g(x).$$

These formulas can be proved directly from the definition for polynomials, without
any limiting operations; the proofs are left as an exercise.
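Because the derivative is purely formal, it can be computed mechanically on coefficient lists. The following is a minimal sketch over $\mathbb{F}_p$, assuming a polynomial is stored as a list of integers with entry `i` the coefficient of $x^i$; the helper names `trim`, `add`, `mul` and `deriv` are illustrative choices, not notation from the text.

```python
def trim(f):
    """Drop trailing zero coefficients, keeping at least one entry."""
    f = list(f)
    while len(f) > 1 and f[-1] == 0:
        f.pop()
    return f


def add(f, g, p):
    """Sum of two polynomials over F_p."""
    n = max(len(f), len(g))
    f = list(f) + [0] * (n - len(f))
    g = list(g) + [0] * (n - len(g))
    return trim([(a + b) % p for a, b in zip(f, g)])


def mul(f, g, p):
    """Product of two polynomials over F_p."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % p
    return trim(out)


def deriv(f, p):
    """Formal derivative: the coefficient of x^i becomes i * a_i in degree i - 1."""
    return trim([(i * c) % p for i, c in enumerate(f)][1:] or [0])


# Over F_5 the polynomial x^5 + 2x + 1 has the constant derivative 2:
# the term 5x^4 vanishes, so the degree drops by more than one.
f = [1, 2, 0, 0, 0, 1]
g = [3, 0, 1]  # x^2 + 3
print(deriv(f, 5))
# The product rule holds for this pair:
print(deriv(mul(f, g, 5), 5) == add(mul(f, deriv(g, 5), 5), mul(deriv(f, 5), g, 5), 5))
```

Checking the product rule on examples is of course not a proof, but it shows that nothing beyond arithmetic on coefficients is involved.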

The next proposition shows that the separability of $f(x)$ can be determined by the
Euclidean Algorithm in the field where the coefficients of $f(x)$ lie, without passing to
a splitting field and factoring $f(x)$.

Proposition 33. A polynomial $f(x)$ has a multiple root $\alpha$ if and only if $\alpha$ is also a root
of $D_x f(x)$, i.e., $f(x)$ and $D_x f(x)$ are both divisible by the minimal polynomial for
$\alpha$. In particular, $f(x)$ is separable if and only if it is relatively prime to its derivative:
$(f(x), D_x f(x)) = 1$.

Proof: Suppose first that $\alpha$ is a multiple root of $f(x)$. Then over a splitting field,

$$f(x) = (x - \alpha)^n g(x)$$

for some integer $n \ge 2$ and some polynomial $g(x)$. Taking derivatives we obtain

$$D_x f(x) = n(x - \alpha)^{n-1} g(x) + (x - \alpha)^n D_x g(x)$$

which shows ($n \ge 2$) that $D_x f(x)$ has $\alpha$ as a root.

Conversely, suppose that $\alpha$ is a root of both $f(x)$ and $D_x f(x)$. Then write

$$f(x) = (x - \alpha) h(x)$$

for some polynomial $h(x)$ and take the derivative:

$$D_x f(x) = h(x) + (x - \alpha) D_x h(x).$$

Since $D_x f(\alpha) = 0$ by assumption, substituting $\alpha$ into the last equation shows that
$h(\alpha) = 0$. Hence $h(x) = (x - \alpha) h_1(x)$ for some polynomial $h_1(x)$, and

$$f(x) = (x - \alpha)^2 h_1(x)$$

showing that $\alpha$ is a multiple root of $f(x)$.

The equivalence with divisibility by the minimal polynomial for $\alpha$ follows from
Proposition 9. The last statement is then clear (let $\alpha$ denote any root of a common factor
of $f(x)$ and $D_x f(x)$).

Examples 

(1) The polynomial $x^{p^n} - x$ over $\mathbb{F}_p$ has derivative $p^n x^{p^n - 1} - 1 = -1$ since the field has
characteristic $p$. Since in this case the derivative has no roots at all, it follows that the
polynomial has no multiple roots, hence is separable.

(2) The polynomial $x^n - 1$ has derivative $n x^{n-1}$. Over any field of characteristic not
dividing $n$ (including characteristic 0) the derivative has only the root 0 (of multiplicity
$n - 1$), which is not a root of $x^n - 1$. Hence $x^n - 1$ is separable and there are $n$ distinct
$n^{\text{th}}$ roots of unity. We saw this directly over $\mathbb{Q}$ by exhibiting $n$ distinct solutions over
$\mathbb{C}$.

(3) If $F$ is of characteristic $p$ and $p$ divides $n$, then there are fewer than $n$ distinct $n^{\text{th}}$ roots
of unity over $F$: in this case the derivative is identically 0 since $n = 0$ in $F$. In fact
every root of $x^n - 1$ is multiple in this case.
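The criterion of Proposition 33 can be run as stated: compute the greatest common divisor of $f(x)$ and $D_x f(x)$ by the Euclidean Algorithm and check whether it is 1. The sketch below does this over $\mathbb{F}_p$ for the three examples above. It assumes coefficient lists with entry `i` the coefficient of $x^i$, a nonzero input polynomial, and Python 3.8 or later (for modular inverses via `pow(a, -1, p)`); the function names are hypothetical.

```python
def trim(f):
    """Drop trailing zero coefficients, keeping at least one entry."""
    f = list(f)
    while len(f) > 1 and f[-1] == 0:
        f.pop()
    return f


def deriv(f, p):
    """Formal derivative over F_p."""
    return trim([(i * c) % p for i, c in enumerate(f)][1:] or [0])


def poly_mod(f, g, p):
    """Remainder of f on division by the nonzero polynomial g over F_p."""
    f = trim([c % p for c in f])
    g = trim([c % p for c in g])
    inv = pow(g[-1], -1, p)  # inverse of the leading coefficient of g
    while len(f) >= len(g) and f != [0]:
        factor = (f[-1] * inv) % p
        shift = len(f) - len(g)
        for i, c in enumerate(g):
            f[shift + i] = (f[shift + i] - factor * c) % p
        f = trim(f)
    return f


def poly_gcd(f, g, p):
    """Monic greatest common divisor over F_p (f and g not both zero)."""
    f = trim([c % p for c in f])
    g = trim([c % p for c in g])
    while g != [0]:
        f, g = g, poly_mod(f, g, p)
    inv = pow(f[-1], -1, p)
    return [(c * inv) % p for c in f]


def is_separable(f, p):
    """True when f is relatively prime to its derivative over F_p."""
    return poly_gcd(f, deriv(f, p), p) == [1]


print(is_separable([0, -1] + [0] * 7 + [1], 3))    # x^9 - x over F_3
print(is_separable([-1, 0, 0, 0, 1], 5))           # x^4 - 1 over F_5
print(is_separable([-1, 0, 0, 0, 0, 0, 1], 3))     # x^6 - 1 over F_3, where 3 divides 6
```

No splitting field is ever constructed: the whole computation takes place in $\mathbb{F}_p[x]$, which is the point of the proposition.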

Corollary 34. Every irreducible polynomial over a field of characteristic 0 (for example,
$\mathbb{Q}$) is separable. A polynomial over such a field is separable if and only if it is the product
of distinct irreducible polynomials.

Proof: Suppose $F$ is a field of characteristic 0 and $p(x) \in F[x]$ is irreducible
of degree $n$. Then the derivative $D_x p(x)$ is a polynomial of degree $n - 1$. Up to
constant factors the only factors of $p(x)$ in $F[x]$ are 1 and $p(x)$, and $p(x)$ cannot divide
the nonzero polynomial $D_x p(x)$ of smaller degree, so $D_x p(x)$ must be
relatively prime to $p(x)$. This shows that any irreducible polynomial over a field of
characteristic 0 is separable. The second statement of the corollary is then clear since
distinct irreducibles never have zeros in common (by Proposition 9).

The point in the proof of the corollary that can fail in characteristic $p$ is the statement
that the derivative $D_x p(x)$ is of degree $n - 1$. In characteristic $p$ the derivative of any
power $x^{pm}$ of $x^p$ is identically 0:

$$D_x(x^{pm}) = pm\, x^{pm-1} = 0$$

so it is possible for the degree of the derivative to decrease by more than one. If the
derivative $D_x p(x)$ of the irreducible polynomial $p(x)$ is nonzero, however, then just as
before we conclude that $p(x)$ must be separable.

It is clear from the definition of the derivative that if $p(x)$ is a polynomial whose
derivative is 0, then every exponent of $x$ in $p(x)$ must be a multiple of $p$ where $p$ is the
characteristic of $F$:

$$p(x) = a_m x^{mp} + a_{m-1} x^{(m-1)p} + \cdots + a_1 x^p + a_0.$$

Letting

$$p_1(x) = a_m x^m + a_{m-1} x^{m-1} + \cdots + a_1 x + a_0$$

we see that $p(x)$ is a polynomial in $x^p$, namely $p(x) = p_1(x^p)$.

We now prove a simple but important result about raising to the $p^{\text{th}}$ power in a field
of characteristic $p$.

Proposition 35. Let $F$ be a field of characteristic $p$. Then for any $a, b \in F$,

$$(a + b)^p = a^p + b^p, \quad \text{and} \quad (ab)^p = a^p b^p.$$

Put another way, the $p^{\text{th}}$-power map defined by $\varphi(a) = a^p$ is an injective field
homomorphism from $F$ to $F$.

Proof: The Binomial Theorem for expanding $(a + b)^n$ for any positive integer $n$
holds (by the standard induction proof) over any commutative ring:

$$(a + b)^n = a^n + \binom{n}{1} a^{n-1} b + \cdots + \binom{n}{i} a^{n-i} b^i + \cdots + b^n.$$

It should be observed that the binomial coefficients

$$\binom{n}{i} = \frac{n!}{i!\,(n-i)!}$$

are integers (recall that $ma$ for $m \in \mathbb{Z}$ is defined for $a$ an element of any ring) and here
are elements of the prime field.

If $p$ is a prime, then the binomial coefficients $\binom{p}{i}$ for $i = 1, 2, \dots, p - 1$ are all
divisible by $p$ since for these values of $i$ the numbers $i!$ and $(p - i)!$ only involve factors
smaller than $p$, hence are relatively prime to $p$ and so cannot cancel the factor of $p$ in the
numerator of the expression $\dfrac{p!}{i!\,(p-i)!}$. It follows that over a field of characteristic $p$ all
the intermediate terms in the expansion of $(a + b)^p$ are 0, which gives the first equation
of the proposition. The second equation is trivial, as is the fact that $\varphi$ is injective.
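The divisibility of the intermediate binomial coefficients by $p$ is easy to inspect numerically. The sketch below (function names are illustrative, and `math.comb` assumes Python 3.8 or later) lists $\binom{n}{i} \bmod n$ for $0 < i < n$ and tests the identity $(a+b)^n = a^n + b^n$ in $\mathbb{Z}/n\mathbb{Z}$. It only exercises the prime field and the ring $\mathbb{Z}/n\mathbb{Z}$, not a general field of characteristic $p$.

```python
from math import comb


def middle_binomials_mod(n):
    """Return the list of C(n, i) mod n for i = 1, ..., n - 1."""
    return [comb(n, i) % n for i in range(1, n)]


def freshman_dream_holds(n):
    """Check (a + b)^n == a^n + b^n in Z/nZ for all residues a, b."""
    return all(
        pow(a + b, n, n) == (pow(a, n, n) + pow(b, n, n)) % n
        for a in range(n)
        for b in range(n)
    )


print(middle_binomials_mod(7))   # every entry vanishes mod the prime 7
print(middle_binomials_mod(4))   # C(4, 2) = 6 is not divisible by 4
print(freshman_dream_holds(5), freshman_dream_holds(4))
```

The composite modulus 4 shows where the argument in the proof breaks down: the factor 2 in $2!$ cancels part of the numerator $4!$.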
Definition. The map in Proposition 35 is called the Frobenius endomorphism of $F$.

Corollary 36. Suppose that $\mathbb{F}$ is a finite field of characteristic $p$. Then every element
of $\mathbb{F}$ is a $p^{\text{th}}$ power in $\mathbb{F}$ (notationally, $\mathbb{F} = \mathbb{F}^p$).

Proof: The injectivity of the Frobenius endomorphism of $\mathbb{F}$ implies that it is also
surjective when $\mathbb{F}$ is finite, which is the statement of the corollary.

We now prove the analogue of Corollary 34 for finite fields. 

Let $\mathbb{F}$ be a finite field and suppose that $p(x) \in \mathbb{F}[x]$ is an irreducible polynomial
with coefficients in $\mathbb{F}$. If $p(x)$ were inseparable then we have seen that $p(x) = q(x^p)$
for some polynomial $q(x) \in \mathbb{F}[x]$. Let

$$q(x) = a_m x^m + a_{m-1} x^{m-1} + \cdots + a_1 x + a_0.$$

By Corollary 36, each $a_i$, $i = 0, 1, \dots, m$, is a $p^{\text{th}}$ power in $\mathbb{F}$, say $a_i = b_i^p$. Then by
Proposition 35 we have

$$\begin{aligned}
p(x) = q(x^p) &= a_m (x^p)^m + a_{m-1} (x^p)^{m-1} + \cdots + a_1 x^p + a_0 \\
&= b_m^p (x^p)^m + b_{m-1}^p (x^p)^{m-1} + \cdots + b_1^p x^p + b_0^p \\
&= (b_m x^m)^p + (b_{m-1} x^{m-1})^p + \cdots + (b_1 x)^p + (b_0)^p \\
&= (b_m x^m + b_{m-1} x^{m-1} + \cdots + b_1 x + b_0)^p
\end{aligned}$$

which shows that $p(x)$ is the $p^{\text{th}}$ power of a polynomial in $\mathbb{F}[x]$, a contradiction to the
irreducibility of $p(x)$. This proves:

Proposition 37. Every irreducible polynomial over a finite field $\mathbb{F}$ is separable. A
polynomial in $\mathbb{F}[x]$ is separable if and only if it is the product of distinct irreducible
polynomials in $\mathbb{F}[x]$.

The important part of the proof of this result is the fact that every element in the
characteristic $p$ field $\mathbb{F}$ was a $p^{\text{th}}$ power in $\mathbb{F}$. This suggests the following definition:

Definition. A field $K$ of characteristic $p$ is called perfect if every element of $K$ is a
$p^{\text{th}}$ power in $K$, i.e., $K = K^p$. Any field of characteristic 0 is also called perfect.

With this definition, we see that we have proved that every irreducible polynomial 
over a perfect field is separable. It is not hard to see that if K is not perfect then there 
are inseparable irreducible polynomials. 

Example: (Existence and Uniqueness of Finite Fields) 

Let $n > 0$ be any positive integer and consider the splitting field of the polynomial $x^{p^n} - x$
over $\mathbb{F}_p$. We have already seen that this polynomial is separable, hence has precisely $p^n$
roots. Let $\alpha$ and $\beta$ be any two roots of this polynomial, so that $\alpha^{p^n} = \alpha$ and $\beta^{p^n} = \beta$.
Then $(\alpha\beta)^{p^n} = \alpha\beta$, $(\alpha^{-1})^{p^n} = \alpha^{-1}$ and by Proposition 35 also

$$(\alpha + \beta)^{p^n} = \alpha^{p^n} + \beta^{p^n} = \alpha + \beta.$$

Hence the set $\mathbb{F}$ consisting of the $p^n$ distinct roots of $x^{p^n} - x$ over $\mathbb{F}_p$ is closed under
addition, multiplication and inverses in its splitting field. It follows that $\mathbb{F}$ is a subfield,
hence in fact must be the splitting field. Since the number of elements is $p^n$, we have
$[\mathbb{F} : \mathbb{F}_p] = n$, which shows that there exist finite fields of degree $n$ over $\mathbb{F}_p$ for any $n > 0$.

Let now $\mathbb{F}$ be any finite field of characteristic $p$. If $\mathbb{F}$ is of dimension $n$ over its prime
subfield $\mathbb{F}_p$, then $\mathbb{F}$ has precisely $p^n$ elements. Since the multiplicative group $\mathbb{F}^\times$ is (in
fact cyclic) of order $p^n - 1$, we have $\alpha^{p^n - 1} = 1$ for every $\alpha \ne 0$ in $\mathbb{F}$, so that $\alpha^{p^n} = \alpha$ for
every $\alpha \in \mathbb{F}$. But this means $\alpha$ is a root of $x^{p^n} - x$, hence $\mathbb{F}$ is contained in a splitting field
for this polynomial. Since we have seen that the splitting field has order $p^n$ this shows that
$\mathbb{F}$ is a splitting field for $x^{p^n} - x$. Since splitting fields are unique up to isomorphism, this
proves that finite fields of any order $p^n$ exist and are unique up to isomorphism. We shall
denote the finite field of order $p^n$ by $\mathbb{F}_{p^n}$.
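For small cases the statement that every element of a field with $p^n$ elements is a root of $x^{p^n} - x$ can be checked by brute force. The sketch below models $\mathbb{F}_{p^n}$ as $\mathbb{F}_p[x]/(m(x))$ for a monic irreducible $m(x)$ of degree $n$; this model, the choice of moduli $x^2 + 1$ over $\mathbb{F}_3$ and $x^3 + x + 1$ over $\mathbb{F}_2$, and all function names are assumptions made for illustration, not the construction used in the text (which works inside a splitting field).

```python
from itertools import product


def mul_mod(a, b, modulus, p):
    """Multiply a and b in F_p[x]/(modulus).

    Elements are coefficient tuples of length n, low degree first;
    modulus is monic of degree n, given by its n + 1 coefficients.
    """
    n = len(modulus) - 1
    prod = [0] * (2 * n - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            prod[i + j] = (prod[i + j] + x * y) % p
    # Rewrite x^k for k >= n using x^n = -(modulus[0] + ... + modulus[n-1] x^(n-1)).
    for k in range(2 * n - 2, n - 1, -1):
        c = prod[k]
        if c:
            prod[k] = 0
            for i in range(n):
                prod[k - n + i] = (prod[k - n + i] - c * modulus[i]) % p
    return tuple(prod[:n])


def power(a, e, modulus, p):
    """Compute a^e by repeated multiplication (adequate for tiny fields)."""
    n = len(modulus) - 1
    result = (1,) + (0,) * (n - 1)
    for _ in range(e):
        result = mul_mod(result, a, modulus, p)
    return result


def all_fixed_by_q_power(modulus, p):
    """Check a^(p^n) == a for every element a of F_p[x]/(modulus)."""
    n = len(modulus) - 1
    q = p ** n
    return all(
        power(a, q, modulus, p) == a for a in product(range(p), repeat=n)
    )


print(all_fixed_by_q_power((1, 0, 1), 3))     # x^2 + 1 over F_3: a field with 9 elements
print(all_fixed_by_q_power((1, 1, 0, 1), 2))  # x^3 + x + 1 over F_2: a field with 8 elements
print(all_fixed_by_q_power((0, 0, 1), 3))     # x^2 is reducible: the quotient is not a field
```

In the last case the class of $x$ is nilpotent, so it cannot satisfy $a^9 = a$; the identity genuinely depends on the quotient being a field.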

We shall consider finite fields more later. 

We now investigate further the structure of inseparable irreducible polynomials over
fields of characteristic $p$. We have seen above that if $p(x)$ is an irreducible polynomial
which is not separable, then its derivative $D_x p(x)$ is identically 0, so that $p(x) = p_1(x^p)$
for some polynomial $p_1(x)$. The polynomial $p_1(x)$ may or may not itself be separable.
If not, then it too is a polynomial in $x^p$, $p_1(x) = p_2(x^p)$, so that $p(x)$ is a polynomial in
$x^{p^2}$: $p(x) = p_2(x^{p^2})$. Continuing in this fashion we see that there is a uniquely defined
power $p^k$ of $p$ such that $p(x) = p_k(x^{p^k})$ where $p_k(x)$ has nonzero derivative. It is clear
that $p_k(x)$ is irreducible since any factorization of $p_k(x)$ would, after replacing $x$ by
$x^{p^k}$, immediately imply a factorization of the irreducible $p(x)$. It follows that $p_k(x)$ is
separable. We summarize this as:

Proposition 38. Let $p(x)$ be an irreducible polynomial over a field $F$ of characteristic
$p$. Then there is a unique integer $k \ge 0$ and a unique irreducible separable polynomial
$p_{\text{sep}}(x) \in F[x]$ such that

$$p(x) = p_{\text{sep}}(x^{p^k}).$$


Definition. Let $p(x)$ be an irreducible polynomial over a field of characteristic $p$. The
degree of $p_{\text{sep}}(x)$ in the last proposition is called the separable degree of $p(x)$, denoted
$\deg_s p(x)$. The integer $p^k$ in the proposition is called the inseparable degree of $p(x)$,
denoted $\deg_i p(x)$.

From the definitions and the proposition we see that $p(x)$ is separable if and only
if its inseparability degree is 1 if and only if its degree is equal to its separable degree.
Also, computing degrees in the relation $p(x) = p_{\text{sep}}(x^{p^k})$ we see that

$$\deg p(x) = \deg_s p(x) \, \deg_i p(x).$$
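The integer $k$ of Proposition 38 depends only on which powers of $x$ occur: it is the largest $k$ with every occurring exponent divisible by $p^k$. The sketch below extracts $k$ and the exponents of $p_{\text{sep}}(x)$ from a dictionary mapping exponents to coefficients. The dictionary representation, the opaque string coefficients such as `"-t"`, and the function name are assumptions for illustration; the code does not test irreducibility, which the proposition requires.

```python
def separable_part(poly, p):
    """Return (k, psep) with poly(x) = psep(x^(p^k)) and k as large as possible.

    poly maps each exponent of x to its (nonzero) coefficient; coefficients
    are carried along unchanged and never inspected.
    """
    exps = [e for e in poly if e > 0]
    if not exps:
        raise ValueError("polynomial must be nonconstant")
    k = 0
    while all(e % p ** (k + 1) == 0 for e in exps):
        k += 1
    psep = {e // p ** k: c for e, c in poly.items()}
    return k, psep


# x^2 - t over F_2(t): separable degree 1, inseparable degree 2.
k, psep = separable_part({2: 1, 0: "-t"}, 2)
print(k, psep, "deg_s =", max(psep), "deg_i =", 2 ** k)
```

The product `max(psep) * p ** k` recovers the degree of the original polynomial, in line with the degree formula above.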


Examples 

(1) The polynomial $p(x) = x^2 - t$ over $F = \mathbb{F}_2(t)$ considered above has derivative
0, hence is not separable (as we determined earlier). Here $p_{\text{sep}}(x) = x - t$ with
inseparability degree 2.

(2) The polynomial $p(x) = x^{2^m} - t$ over $F = \mathbb{F}_2(t)$ is irreducible with the same separable
polynomial part, but with inseparability degree $2^m$.

(3) The polynomial $(x^{p^2} - t)(x^p - t)$ over $F = \mathbb{F}_p(t)$ has (two) inseparable irreducible
factors so is inseparable. This polynomial cannot be written in the form $f_{\text{sep}}(x^{p^k})$
where $f_{\text{sep}}(x)$ is separable, which is the reason we restricted to irreducible polynomials
above. This example also shows that there is no analogous factorization to define the
separable and inseparable degrees of a general polynomial.

The notion of separability carries over to the fields generated by the roots of these 
polynomials. 

Definition. The field $K$ is said to be separable (or separably algebraic) over $F$ if
every element of $K$ is the root of a separable polynomial over $F$ (equivalently, the
minimal polynomial over $F$ of every element of $K$ is separable). A field which is not
separable is inseparable.

We have seen that the issue of separability is straightforward for finite extensions 
of perfect fields since for these fields the minimal polynomial of an algebraic element 
is irreducible hence separable. 

Corollary 39. Every finite extension of a perfect field is separable. In particular, every 
finite extension of either Q or a finite field is separable. 


We shall consider separable and inseparable extensions more after developing some 
Galois Theory, in particular defining the separable and inseparable degree of the exten- 
sion K/F. 


EXERCISES 

1. Prove that the derivative $D_x$ of a polynomial satisfies $D_x(f(x) + g(x)) = D_x(f(x)) +
D_x(g(x))$ and $D_x(f(x)g(x)) = D_x(f(x))g(x) + D_x(g(x))f(x)$ for any two polynomials
$f(x)$ and $g(x)$.

2. Find all irreducible polynomials of degrees 1, 2 and 4 over $\mathbb{F}_2$ and prove that their product
is $x^{16} - x$.

3. Prove that $d$ divides $n$ if and only if $x^d - 1$ divides $x^n - 1$. [Note that if $n = qd + r$ then
$x^n - 1 = (x^{qd+r} - x^r) + (x^r - 1)$.]

4. Let $a > 1$ be an integer. Prove for any positive integers $n, d$ that $d$ divides $n$ if and only if
$a^d - 1$ divides $a^n - 1$ (cf. the previous exercise). Conclude in particular that $\mathbb{F}_{p^d} \subseteq \mathbb{F}_{p^n}$
if and only if $d$ divides $n$.

5. For any prime $p$ and any nonzero $a \in \mathbb{F}_p$ prove that $x^p - x + a$ is irreducible and separable
over $\mathbb{F}_p$. [For the irreducibility: One approach: prove first that if $\alpha$ is a root then $\alpha + 1$
is also a root. Another approach: suppose it's reducible and compute derivatives.]

6. Prove that $x^{p^n - 1} - 1 = \prod_{\alpha \in \mathbb{F}_{p^n}^\times} (x - \alpha)$. Conclude that $\prod_{\alpha \in \mathbb{F}_{p^n}^\times} \alpha = (-1)^{p^n}$ so the product
of the nonzero elements of a finite field is $+1$ if $p = 2$ and $-1$ if $p$ is odd. For $p$ odd and
$n = 1$ derive Wilson's Theorem: $(p - 1)! \equiv -1 \pmod{p}$.

7. Suppose $K$ is a field of characteristic $p$ which is not a perfect field: $K \ne K^p$. Prove there
exist irreducible inseparable polynomials over $K$. Conclude that there exist inseparable
finite extensions of $K$.

8. Prove that $f(x)^p = f(x^p)$ for any polynomial $f(x) \in \mathbb{F}_p[x]$.

9. Show that the binomial coefficient $\binom{pn}{pi}$ is the coefficient of $x^{pi}$ in the expansion of $(1 + x)^{pn}$.
Working over $\mathbb{F}_p$ show that this is the coefficient of $(x^p)^i$ in $(1 + x^p)^n$ and hence prove
that $\binom{pn}{pi} \equiv \binom{n}{i} \pmod{p}$.

10. Let $f(x_1, x_2, \dots, x_n) \in \mathbb{Z}[x_1, x_2, \dots, x_n]$ be a polynomial in the variables $x_1, x_2, \dots, x_n$
with integer coefficients. For any prime $p$ prove that the polynomial

$$f(x_1, x_2, \dots, x_n)^p - f(x_1^p, x_2^p, \dots, x_n^p) \in \mathbb{Z}[x_1, x_2, \dots, x_n]$$

has all its coefficients divisible by $p$.

11. Suppose $K[x]$ is a polynomial ring over the field $K$ and $F$ is a subfield of $K$. If $F$ is a
perfect field and $f(x) \in F[x]$ has no repeated irreducible factors in $F[x]$, prove that $f(x)$
has no repeated irreducible factors in $K[x]$.


13.6 CYCLOTOMIC POLYNOMIALS AND EXTENSIONS 


The purpose of this section is to prove that the cyclotomic extension

$$\mathbb{Q}(\zeta_n)/\mathbb{Q}$$

generated by the $n^{\text{th}}$ roots of unity over $\mathbb{Q}$ introduced in Section 4 is of degree $\varphi(n)$
where $\varphi$ denotes Euler's phi-function (= the number of integers $a$, $1 \le a < n$, relatively
prime to $n$ = the order of the group $(\mathbb{Z}/n\mathbb{Z})^\times$).

Definition. Let $\mu_n$ denote the group of $n^{\text{th}}$ roots of unity over $\mathbb{Q}$.


Then as we have already observed, $\mathbb{Z}/n\mathbb{Z} \cong \mu_n$ as groups (under multiplication
on the right, addition on the left), given explicitly by the map $a \mapsto (\zeta_n)^a$ for a fixed
primitive $n^{\text{th}}$ root of unity $\zeta_n$. The primitive $n^{\text{th}}$ roots of unity are given by the residue
classes prime to $n$ so there are precisely $\varphi(n)$ primitive $n^{\text{th}}$ roots of unity.

If $d$ is a divisor of $n$ and $\zeta$ is a $d^{\text{th}}$ root of unity, then $\zeta$ is also an $n^{\text{th}}$ root of unity
since $\zeta^n = (\zeta^d)^{n/d} = 1$. Hence

$$\mu_d \subseteq \mu_n \quad \text{for all } d \mid n.$$

Conversely, the order of any element of the group $\mu_n$ is a divisor of $n$ so that if $\zeta$ is an
$n^{\text{th}}$ root of unity which is also a $d^{\text{th}}$ root of unity for some smaller $d$ then $d \mid n$.


Definition. Define the $n^{\text{th}}$ cyclotomic polynomial $\Phi_n(x)$ to be the polynomial whose
roots are the primitive $n^{\text{th}}$ roots of unity:

$$\Phi_n(x) = \prod_{\substack{\zeta \in \mu_n \\ \zeta \text{ primitive}}} (x - \zeta) = \prod_{\substack{1 \le a < n \\ (a, n) = 1}} (x - \zeta_n^a)$$

(which is of degree $\varphi(n)$).

The roots of the polynomial $x^n - 1$ are precisely the $n^{\text{th}}$ roots of unity so we have
the factorization

$$x^n - 1 = \prod_{\substack{\zeta^n = 1 \\ \text{i.e. } \zeta \in \mu_n}} (x - \zeta).$$

If we group together the factors $(x - \zeta)$ where $\zeta$ is an element of order $d$ in $\mu_n$ (i.e., $\zeta$
is a primitive $d^{\text{th}}$ root of unity) we obtain

$$x^n - 1 = \prod_{d \mid n} \; \prod_{\substack{\zeta \in \mu_d \\ \zeta \text{ primitive}}} (x - \zeta).$$

The inner product is $\Phi_d(x)$ by definition so we have the factorization

$$x^n - 1 = \prod_{d \mid n} \Phi_d(x). \tag{13.4}$$

Note incidentally that comparing degrees gives the identity

$$n = \sum_{d \mid n} \varphi(d).$$

This factorization allows us to compute $\Phi_n(x)$ for any $n$ recursively: clearly
$\Phi_1(x) = x - 1$ and $\Phi_2(x) = x + 1$. Then

$$x^3 - 1 = \Phi_1(x)\Phi_3(x) = (x - 1)\Phi_3(x)$$

which gives

$$\Phi_3(x) = x^2 + x + 1.$$

Similarly

$$x^4 - 1 = \Phi_1(x)\Phi_2(x)\Phi_4(x) = (x - 1)(x + 1)\Phi_4(x)$$

gives

$$\Phi_4(x) = x^2 + 1$$

(in these cases these could also be obtained directly from the explicit roots of unity).
Continuing in this fashion we can compute $\Phi_n(x)$ for any $n$. Note also that for $p$ a
prime we recover our polynomial

$$\Phi_p(x) = x^{p-1} + x^{p-2} + \cdots + x + 1.$$

For some small values of $n$ the polynomials are

$$\begin{aligned}
\Phi_5(x) &= x^4 + x^3 + x^2 + x + 1 \\
\Phi_6(x) &= x^2 - x + 1 \\
\Phi_7(x) &= x^6 + x^5 + x^4 + x^3 + x^2 + x + 1 \\
\Phi_8(x) &= x^4 + 1 \\
\Phi_9(x) &= x^6 + x^3 + 1 \\
\Phi_{10}(x) &= x^4 - x^3 + x^2 - x + 1 \\
\Phi_{11}(x) &= x^{10} + x^9 + \cdots + x + 1 \\
\Phi_{12}(x) &= x^4 - x^2 + 1.
\end{aligned}$$
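The recursive procedure just described is mechanical: starting from $x^n - 1$, divide out $\Phi_d(x)$ for each proper divisor $d$ of $n$. The following sketch implements it with integer coefficient tuples (entry `i` is the coefficient of $x^i$). The function names and the representation are illustrative assumptions; the division routine relies on the divisor being monic and on the division being exact, both of which hold here by (13.4).

```python
from functools import lru_cache
from math import gcd


def poly_divexact(num, den):
    """Quotient of integer polynomials, assuming den is monic and divides num."""
    num = list(num)
    quot = [0] * (len(num) - len(den) + 1)
    for k in range(len(quot) - 1, -1, -1):
        c = num[k + len(den) - 1]  # current leading coefficient
        quot[k] = c
        for i, d in enumerate(den):
            num[k + i] -= c * d
    if any(num):
        raise ValueError("division was not exact")
    return quot


@lru_cache(maxsize=None)
def cyclotomic(n):
    """Coefficients of Phi_n(x), low degree first, via x^n - 1 = prod Phi_d(x)."""
    poly = [-1] + [0] * (n - 1) + [1]  # x^n - 1
    for d in range(1, n):
        if n % d == 0:
            poly = poly_divexact(poly, cyclotomic(d))
    return tuple(poly)


def euler_phi(n):
    """Number of integers a with 1 <= a <= n and gcd(a, n) = 1."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)


for n in (6, 8, 10, 12):
    print(n, cyclotomic(n), "degree", len(cyclotomic(n)) - 1, "phi", euler_phi(n))
```

Running the loop for larger $n$ is a quick way to look for coefficients outside $\{0, \pm 1\}$; with this sketch the first one shows up at $n = 105$, where $-2$ occurs.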
For all the values computed above, $\Phi_n(x)$ was a (monic) polynomial with integer
coefficients. This is always the case:

Lemma 40. The cyclotomic polynomial $\Phi_n(x)$ is a monic polynomial in $\mathbb{Z}[x]$ of degree
$\varphi(n)$.


Proof: It is clear that $\Phi_n(x)$ is monic and has degree $\varphi(n)$. We must show the
coefficients lie in $\mathbb{Z}$. We use induction on $n$. The result is true for $n = 1$ (and $n \le 12$).
Assume by induction that $\Phi_d(x) \in \mathbb{Z}[x]$ for all $1 \le d < n$. Then $x^n - 1 = f(x)\Phi_n(x)$
where $f(x) = \prod_{d \mid n,\ d < n} \Phi_d(x)$ is monic and has coefficients in $\mathbb{Z}$. Since $f(x)$ clearly
divides $x^n - 1$ in $F[x]$ where $F = \mathbb{Q}(\zeta_n)$ is the field of $n^{\text{th}}$ roots of unity and both
$f(x)$ and $x^n - 1$ have coefficients in $\mathbb{Q}$, $f(x)$ divides $x^n - 1$ in $\mathbb{Q}[x]$ by the Division
Algorithm (cf. the remark at the end of Section 9.2). By Gauss' Lemma, $f(x)$ divides
$x^n - 1$ in $\mathbb{Z}[x]$, hence $\Phi_n(x) \in \mathbb{Z}[x]$.

We remark in passing that while all the coefficients of $\Phi_n(x)$ in the examples
computed above were $0, \pm 1$, it is known that there are cyclotomic polynomials with
arbitrarily large coefficients.

Theorem 41. The cyclotomic polynomial $\Phi_n(x)$ is an irreducible monic polynomial in
$\mathbb{Z}[x]$ of degree $\varphi(n)$.


Proof: We must show that $\Phi_n(x)$ is irreducible. If not then we have a factorization

$$\Phi_n(x) = f(x)g(x) \quad \text{with } f(x), g(x) \text{ monic in } \mathbb{Z}[x]$$

where we take $f(x)$ to be an irreducible factor of $\Phi_n(x)$. Let $\zeta$ be a primitive $n^{\text{th}}$ root
of 1 which is a root of $f(x)$ (so then $f(x)$ is the minimal polynomial for $\zeta$ over $\mathbb{Q}$) and
let $p$ denote any prime not dividing $n$. Then $\zeta^p$ is again a primitive $n^{\text{th}}$ root of 1, hence
is a root of either $f(x)$ or $g(x)$.

Suppose $g(\zeta^p) = 0$. Then $\zeta$ is a root of $g(x^p)$ and since $f(x)$ is the minimal
polynomial for $\zeta$, $f(x)$ must divide $g(x^p)$ in $\mathbb{Z}[x]$, say

$$g(x^p) = f(x)h(x), \quad h(x) \in \mathbb{Z}[x].$$

If we reduce this equation mod $p$, we obtain

$$\bar{g}(x^p) = \bar{f}(x)\bar{h}(x) \quad \text{in } \mathbb{F}_p[x].$$

By the remarks of the last section,

$$\bar{g}(x^p) = (\bar{g}(x))^p$$

so we have the equation

$$(\bar{g}(x))^p = \bar{f}(x)\bar{h}(x)$$

in the U.F.D. $\mathbb{F}_p[x]$. It follows that $\bar{f}(x)$ and $\bar{g}(x)$ have a factor in common in $\mathbb{F}_p[x]$.

Now, from $\Phi_n(x) = f(x)g(x)$ we see by reducing mod $p$ that $\bar{\Phi}_n(x) = \bar{f}(x)\bar{g}(x)$,
and so by the above it follows that $\bar{\Phi}_n(x) \in \mathbb{F}_p[x]$ has a multiple root. But then also
$x^n - 1$ would have a multiple root over $\mathbb{F}_p$ since it has $\bar{\Phi}_n(x)$ as a factor. This is a
contradiction since we have seen in the last section that there are $n$ distinct roots of
$x^n - 1$ over any field of characteristic not dividing $n$.

Hence $\zeta^p$ must be a root of $f(x)$. Since this applies to every root $\zeta$ of $f(x)$,
it follows that $\zeta^a$ is a root of $f(x)$ for every integer $a$ relatively prime to $n$: write
$a = p_1 p_2 \cdots p_k$ as a product of (not necessarily distinct) primes not dividing $n$ so
that $\zeta^{p_1}$ is a root of $f(x)$, so also $(\zeta^{p_1})^{p_2}$ is a root of $f(x)$, etc. But this means that
every primitive $n^{\text{th}}$ root of unity is a root of $f(x)$, i.e., $f(x) = \Phi_n(x)$, showing $\Phi_n(x)$
is irreducible.

Corollary 42. The degree over $\mathbb{Q}$ of the cyclotomic field of $n^{\text{th}}$ roots of unity is $\varphi(n)$:

$$[\mathbb{Q}(\zeta_n) : \mathbb{Q}] = \varphi(n).$$


Proof: By the theorem, $\Phi_n(x)$ is the minimal polynomial for any primitive $n^{\text{th}}$ root
of unity $\zeta_n$.

Example 

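The cyclotomic field $\mathbb{Q}(\zeta_8)$ of the $8^{\text{th}}$ roots of unity is of degree $\varphi(8) = 4$ over $\mathbb{Q}$. This field
contains the $4^{\text{th}}$ roots of unity, i.e., $\mathbb{Q}(i) \subset \mathbb{Q}(\zeta_8)$, as well as the element $\zeta_8 + \zeta_8^7 = \sqrt{2}$
(recall the explicit roots of unity in Section 4). It follows that

$$\mathbb{Q}(\zeta_8) = \mathbb{Q}(i, \sqrt{2}).$$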
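The two relations used in this example can be sanity-checked in floating-point arithmetic. The sketch below takes $\zeta_8 = e^{2\pi i/8}$ as the chosen primitive root; the helper names `zeta` and `close` and the tolerance are assumptions for illustration, and a numerical check is of course no substitute for the exact identities.

```python
import cmath
import math


def zeta(n, k=1):
    """Floating-point value of exp(2*pi*i*k/n), an n-th root of unity."""
    return cmath.exp(2j * cmath.pi * k / n)


def close(z, w, tol=1e-12):
    """True when the complex numbers z and w agree to within tol."""
    return abs(z - w) < tol


print(close(zeta(8) + zeta(8, 7), math.sqrt(2)))       # zeta_8 + zeta_8^7 = sqrt(2)
print(close(zeta(8) ** 2, 1j))                          # zeta_8^2 = i
print(close(zeta(8), (math.sqrt(2) + math.sqrt(2) * 1j) / 2))
```

The last line reflects the reverse inclusion: $\zeta_8 = (\sqrt{2} + i\sqrt{2})/2$ lies in $\mathbb{Q}(i, \sqrt{2})$.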

One interesting number-theoretic application of the cyclotomic polynomials outlined
in the exercises is the proof that for any $n$ there are infinitely many primes which
are congruent to 1 modulo $n$. The complete factorization in $\mathbb{F}_p[x]$ of $\Phi_\ell(x)$ for a prime
$\ell$ (which is irreducible in $\mathbb{Z}[x]$) is described in Exercise 8 below.

We shall return to the example of cyclotomic fields after we have developed some 
Galois Theory. 


EXERCISES 

1. Suppose $m$ and $n$ are relatively prime positive integers. Let $\zeta_m$ be a primitive $m^{\text{th}}$ root of
unity and let $\zeta_n$ be a primitive $n^{\text{th}}$ root of unity. Prove that $\zeta_m \zeta_n$ is a primitive $mn^{\text{th}}$ root of
unity.

2. Let $\zeta_n$ be a primitive $n^{\text{th}}$ root of unity and let $d$ be a divisor of $n$. Prove that $\zeta_n^d$ is a primitive
$(n/d)^{\text{th}}$ root of unity.

3. Prove that if a field contains the $n^{\text{th}}$ roots of unity for $n$ odd then it also contains the $2n^{\text{th}}$
roots of unity.

4. Prove that if $n = p^k m$ where $p$ is a prime and $m$ is relatively prime to $p$ then there are
precisely $m$ distinct $n^{\text{th}}$ roots of unity over a field of characteristic $p$.

5. Prove there are only a finite number of roots of unity in any finite extension $K$ of $\mathbb{Q}$.

6. Prove that for $n$ odd, $n > 1$, $\Phi_{2n}(x) = \Phi_n(-x)$.

7. Use the Möbius Inversion formula indicated in Section 14.3 to prove

$$\Phi_n(x) = \prod_{d \mid n} (x^d - 1)^{\mu(n/d)}.$$

8. Let $\ell$ be a prime and let $\Phi_\ell(x) = \dfrac{x^\ell - 1}{x - 1} = x^{\ell-1} + x^{\ell-2} + \cdots + x + 1 \in \mathbb{Z}[x]$ be the
$\ell^{\text{th}}$ cyclotomic polynomial, which is irreducible over $\mathbb{Z}$ by Theorem 41. This exercise
determines the factorization of $\Phi_\ell(x)$ modulo $p$ for any prime $p$. Let $\zeta$ denote any fixed
primitive $\ell^{\text{th}}$ root of unity.

(a) Show that if $p = \ell$ then $\Phi_\ell(x) = (x - 1)^{\ell-1} \in \mathbb{F}_\ell[x]$.

(b) Suppose $p \ne \ell$ and let $f$ denote the order of $p$ mod $\ell$, i.e., $f$ is the smallest power of
$p$ with $p^f \equiv 1 \bmod \ell$. Use the fact that $\mathbb{F}_{p^n}^\times$ is a cyclic group to show that $n = f$ is
the smallest power $p^n$ of $p$ with $\zeta \in \mathbb{F}_{p^n}$. Conclude that the minimal polynomial of
$\zeta$ over $\mathbb{F}_p$ has degree $f$.

(c) Show that $\mathbb{F}_p(\zeta) = \mathbb{F}_p(\zeta^a)$ for any integer $a$ not divisible by $\ell$. [One inclusion is
obvious. For the other, note that $\zeta = (\zeta^a)^b$ where $b$ is the multiplicative inverse of
$a$ mod $\ell$.] Conclude using (b) that, in $\mathbb{F}_p[x]$, $\Phi_\ell(x)$ is the product of $(\ell - 1)/f$ distinct
irreducible polynomials of degree $f$.

(d) In particular, prove that, viewed in $\mathbb{F}_p[x]$, $\Phi_7(x) = x^6 + x^5 + \cdots + x + 1$ is $(x - 1)^6$ for
$p = 7$, a product of distinct linear factors for $p \equiv 1 \bmod 7$, a product of 3 irreducible
quadratics for $p \equiv 6 \bmod 7$, a product of 2 irreducible cubics for $p \equiv 2, 4 \bmod 7$,
and is irreducible for $p \equiv 3, 5 \bmod 7$.


9. Suppose $A$ is an $n \times n$ matrix over $\mathbb{C}$ for which $A^k = I$ for some integer $k \ge 1$. Show that
$A$ can be diagonalized. Show that the matrix $A = \begin{pmatrix} 1 & \alpha \\ 0 & 1 \end{pmatrix}$ where $\alpha$ is an element of a
field of characteristic $p$ satisfies $A^p = I$ and cannot be diagonalized if $\alpha \ne 0$.


10. Let $\varphi$ denote the Frobenius map $x \mapsto x^p$ on the finite field $\mathbb{F}_{p^n}$. Prove that $\varphi$ gives an
isomorphism of $\mathbb{F}_{p^n}$ to itself (such an isomorphism is called an automorphism). Prove that
$\varphi^n$ is the identity map and that no lower power of $\varphi$ is the identity.

11. Let $\varphi$ denote the Frobenius map $x \mapsto x^p$ on the finite field $\mathbb{F}_{p^n}$ as in the previous
exercise. Determine the rational canonical form over $\mathbb{F}_p$ for $\varphi$ considered as an $\mathbb{F}_p$-linear
transformation of the $n$-dimensional $\mathbb{F}_p$-vector space $\mathbb{F}_{p^n}$.

12. Let $\varphi$ denote the Frobenius map $x \mapsto x^p$ on the finite field $\mathbb{F}_{p^n}$ as in the previous exercise.
Determine the Jordan canonical form (over a field containing all the eigenvalues) for $\varphi$
considered as an $\mathbb{F}_p$-linear transformation of the $n$-dimensional $\mathbb{F}_p$-vector space $\mathbb{F}_{p^n}$.

13. (Wedderburn's Theorem on Finite Division Rings) This exercise outlines a proof (following
Witt) of Wedderburn's Theorem that a finite division ring $D$ is a field (i.e., is commutative).

(a) Let $Z$ denote the center of $D$ (i.e., the elements of $D$ which commute with every
element of $D$). Prove that $Z$ is a field containing $\mathbb{F}_p$ for some prime $p$. If $Z = \mathbb{F}_q$
prove that $D$ has order $q^n$ for some integer $n$ [$D$ is a vector space over $Z$].

(b) The nonzero elements $D^\times$ of $D$ form a multiplicative group. For any $x \in D^\times$ show
that the elements of $D$ which commute with $x$ form a division ring which contains $Z$.
Show that this division ring is of order $q^m$ for some integer $m$ and that $m < n$ if $x$ is
not an element of $Z$.

(c) Show that the class equation (Theorem 4.7) for the group $D^\times$ is

$$q^n - 1 = (q - 1) + \sum_{i=1}^{r} \frac{q^n - 1}{|C_{D^\times}(x_i)|}$$

where $x_1, x_2, \dots, x_r$ are representatives of the distinct conjugacy classes in $D^\times$ not
contained in the center of $D^\times$. Conclude from (b) that for each $i$, $|C_{D^\times}(x_i)| = q^{m_i} - 1$
for some $m_i < n$.

(d) Prove that since $\dfrac{q^n - 1}{q^{m_i} - 1}$ is an integer (namely, the index $|D^\times : C_{D^\times}(x_i)|$) then $m_i$
divides $n$ (cf. Exercise 4 of Section 5). Conclude that $\Phi_n(x)$ divides $(x^n - 1)/(x^{m_i} - 1)$
and hence that the integer $\Phi_n(q)$ divides $(q^n - 1)/(q^{m_i} - 1)$ for $i = 1, 2, \dots, r$.

(e) Prove that (c) and (d) imply that $\Phi_n(q) = \prod_{\zeta \text{ primitive}} (q - \zeta)$ divides $q - 1$. Prove
that $|q - \zeta| > q - 1$ (complex absolute value) for any root of unity $\zeta \ne 1$ [note that
1 is the closest point on the unit circle in $\mathbb{C}$ to the point $q$ on the real line]. Conclude
that $n = 1$, i.e., that $D = Z$ is a field.


The following exercises provide a proof that for any positive integer m there are infinitely man y 
primes p with p = 1 (mod m). This is a special case of Dirichlet’s Theorem on Primes in 
Arithmetic Progressions which states more generally that there are infinitely many primes p 
with p = a (mod m) for any a relatively prime to tm. 

14. Given any monic polynomial $P(x) \in \mathbb{Z}[x]$ of degree at least one show that there are
infinitely many distinct prime divisors of the integers

$$P(1), P(2), P(3), \dots, P(n), \dots.$$

[Suppose $p_1, p_2, \dots, p_k$ are the only primes dividing the values P(n), n = 1, 2, .... Let
N be an integer with $P(N) = a \neq 0$. Show that $Q(x) = a^{-1} P(N + a p_1 p_2 \cdots p_k x)$ is an
element of $\mathbb{Z}[x]$ and that $Q(n) \equiv 1 \pmod{p_1 p_2 \cdots p_k}$ for n = 1, 2, .... Conclude that
there is some integer M such that Q(M) has a prime factor different from $p_1, p_2, \dots, p_k$
and hence that $P(N + a p_1 p_2 \cdots p_k M)$ has a prime factor different from $p_1, p_2, \dots, p_k$.]

15. Let p be an odd prime not dividing m and let $\Phi_m(x)$ be the m-th cyclotomic polynomial.
Suppose $a \in \mathbb{Z}$ satisfies $\Phi_m(a) \equiv 0 \pmod{p}$. Prove that a is relatively prime to p and
that the order of a in $(\mathbb{Z}/p\mathbb{Z})^\times$ is precisely m. [Since

$$x^m - 1 = \prod_{d \mid m} \Phi_d(x) = \Phi_m(x) \prod_{d \mid m,\ d < m} \Phi_d(x)$$

we see first that $a^m - 1 \equiv 0 \pmod{p}$, i.e., $a^m \equiv 1 \pmod{p}$. If the order of a mod p were
less than m, then $a^d \equiv 1 \pmod{p}$ for some d dividing m, so then $\Phi_d(a) \equiv 0 \pmod{p}$ for
some d < m. But then $x^m - 1$ would have a as a multiple root mod p, a contradiction.]

16. Let $a \in \mathbb{Z}$. Show that if p is an odd prime dividing $\Phi_m(a)$ then either p divides m or $p \equiv 1 \pmod{m}$.

17. Prove there are infinitely many primes p with $p \equiv 1 \pmod{m}$.
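
The statement of Exercise 16 can be tested numerically before attempting a proof. The following is only a sketch: the function names are illustrative choices, and the value of the cyclotomic polynomial is computed from the standard Moebius-inversion formula $\Phi_m(x) = \prod_{d \mid m} (x^d - 1)^{\mu(m/d)}$, which is assumed here rather than taken from the exercises. A finite check of this kind is evidence, not a proof.

```python
from math import prod


def divisors(n):
    """All positive divisors of n."""
    return [d for d in range(1, n + 1) if n % d == 0]


def mobius(n):
    """Moebius function mu(n), by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n has a square factor
            result = -result
        p += 1
    return -result if n > 1 else result


def cyclotomic_value(m, a):
    """Phi_m(a) for an integer a >= 2, via prod over d | m of (a^d - 1)^mu(m/d)."""
    num = prod(a**d - 1 for d in divisors(m) if mobius(m // d) == 1)
    den = prod(a**d - 1 for d in divisors(m) if mobius(m // d) == -1)
    return num // den  # exact, because Phi_m has integer coefficients


def prime_factors(n):
    """Set of prime divisors of a positive integer n."""
    factors, p = set(), 2
    while p * p <= n:
        while n % p == 0:
            factors.add(p)
            n //= p
        p += 1
    if n > 1:
        factors.add(n)
    return factors


def check_exercise_16(max_m=12, max_a=10):
    """True if every odd prime p dividing Phi_m(a) has p | m or p = 1 (mod m)."""
    for m in range(1, max_m + 1):
        for a in range(2, max_a + 1):
            for p in prime_factors(cyclotomic_value(m, a)):
                if p != 2 and m % p != 0 and (p - 1) % m != 0:
                    return False
    return True
```

For instance, `cyclotomic_value(6, 2)` is 3, a prime dividing m = 6, while `cyclotomic_value(12, 10)` is 9901, a prime congruent to 1 mod 12; these are the two alternatives in Exercise 16.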
CHAPTER 14


Galois Theory 


14.1 BASIC DEFINITIONS 

In the previous chapter we proved the existence of a finite extension of a field F which
contains all the roots of a given polynomial f(x) whose coefficients are in F. The
main idea of Galois Theory (named for Evariste Galois, 1811-1832) is to consider the
relation of the group of permutations of the roots of f(x) to the algebraic structure of 
its splitting field. The connection is given by the Fundamental Theorem of the next 
section. It can be viewed as another (extremely elegant) application of the important 
idea in mathematics that one (in our case algebraic) object acting on another provides 
structural information about both. 

In this section we introduce the terminology and basic properties of the objects of
interest. Let K be a field.

Definition.

(1) An isomorphism $\sigma$ of K with itself is called an automorphism of K. The
collection of automorphisms of K is denoted Aut(K). If $\alpha \in K$ we shall write
$\sigma\alpha$ for $\sigma(\alpha)$.

(2) An automorphism $\sigma \in \mathrm{Aut}(K)$ is said to fix an element $\alpha \in K$ if $\sigma\alpha = \alpha$. If
F is a subset of K (for example, a subfield), then an automorphism $\sigma$ is said to
fix F if it fixes all the elements of F, i.e., $\sigma a = a$ for all $a \in F$.

Note that any field has at least one automorphism, the identity map, denoted by 1 
and sometimes called the trivial automorphism. 

The prime field of K is generated by $1 \in K$ and since any automorphism $\sigma$ takes
1 to 1 (and 0 to 0), i.e., $\sigma(1) = 1$, it follows that $\sigma a = a$ for all a in the prime field.
Hence any automorphism of a field K fixes its prime subfield. In particular we see that
$\mathbb{Q}$ and $\mathbb{F}_p$ have only the trivial automorphism: $\mathrm{Aut}(\mathbb{Q}) = \{1\}$ and $\mathrm{Aut}(\mathbb{F}_p) = \{1\}$.

Definition. Let K/F be an extension of fields. Let Aut(K/F) be the collection of
automorphisms of K which fix F.


Note that if F is the prime subfield of K then Aut(K) = Aut(K/F) since every
automorphism of K automatically fixes F.
If $\sigma$ and $\tau$ are automorphisms of K then the composite $\sigma\tau$ (and also the composite
$\tau\sigma$, which may not be the same) is defined and is again an automorphism of K.

Proposition 1. Aut(K) is a group under composition and Aut(K/F) is a subgroup.

Proof: It is clear that Aut(K) is a group. If $\sigma$ and $\tau$ are automorphisms of K which
fix F then also $\sigma\tau$ and $\sigma^{-1}$ are the identity on F, which shows that Aut(K/F) is a
subgroup.

The following proposition is extremely useful for determining the automorphisms 
of algebraic extensions. 

Proposition 2. Let K/F be a field extension and let $\alpha \in K$ be algebraic over F.
Then for any $\sigma \in \mathrm{Aut}(K/F)$, $\sigma\alpha$ is a root of the minimal polynomial for $\alpha$ over F, i.e.,
Aut(K/F) permutes the roots of irreducible polynomials. Equivalently, any polynomial
with coefficients in F having $\alpha$ as a root also has $\sigma\alpha$ as a root.

Proof: Suppose $\alpha$ satisfies the equation

$$\alpha^n + a_{n-1}\alpha^{n-1} + \cdots + a_1\alpha + a_0 = 0$$

where $a_0, a_1, \dots, a_{n-1}$ are elements of F. Applying the automorphism $\sigma$ we obtain
(using the fact that $\sigma$ is an additive homomorphism)

$$\sigma(\alpha^n) + \sigma(a_{n-1}\alpha^{n-1}) + \cdots + \sigma(a_1\alpha) + \sigma(a_0) = \sigma(0) = 0.$$

Using the fact that $\sigma$ is also a multiplicative homomorphism this becomes

$$(\sigma(\alpha))^n + \sigma(a_{n-1})(\sigma(\alpha))^{n-1} + \cdots + \sigma(a_1)(\sigma(\alpha)) + \sigma(a_0) = 0.$$

By assumption, $\sigma$ fixes all the elements of F, so $\sigma(a_i) = a_i$, i = 0, 1, ..., n - 1. Hence

$$(\sigma\alpha)^n + a_{n-1}(\sigma\alpha)^{n-1} + \cdots + a_1(\sigma\alpha) + a_0 = 0.$$

But this says precisely that $\sigma\alpha$ is a root of the same polynomial over F as $\alpha$. This
proves the proposition.
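
The proposition can be illustrated computationally in the field $\mathbb{Q}(\sqrt{2})$ with the conjugation map $a + b\sqrt{2} \mapsto a - b\sqrt{2}$. The sketch below is not part of the proof: the class name `QSqrt2`, the helper `poly_eval` and the sample element $\alpha = 1 + \sqrt{2}$ are illustrative assumptions. It checks that conjugation respects sums and products and sends a root of $x^2 - 2x - 1$ to another root of the same polynomial.

```python
from fractions import Fraction


class QSqrt2:
    """Element a + b*sqrt(2) of Q(sqrt 2), with a and b rational."""

    def __init__(self, a, b=0):
        self.a, self.b = Fraction(a), Fraction(b)

    def __add__(self, other):
        return QSqrt2(self.a + other.a, self.b + other.b)

    def __mul__(self, other):
        # (a + b r)(c + d r) = (ac + 2bd) + (ad + bc) r, since r^2 = 2
        return QSqrt2(self.a * other.a + 2 * self.b * other.b,
                      self.a * other.b + self.b * other.a)

    def __eq__(self, other):
        return (self.a, self.b) == (other.a, other.b)

    def __repr__(self):
        return f"{self.a} + {self.b}*sqrt(2)"


def sigma(x):
    """Conjugation: sqrt(2) -> -sqrt(2), fixing Q."""
    return QSqrt2(x.a, -x.b)


def poly_eval(coeffs, x):
    """Horner evaluation of a polynomial with rational coefficients
    (highest degree first) at an element x of Q(sqrt 2)."""
    acc = QSqrt2(0)
    for c in coeffs:
        acc = acc * x + QSqrt2(c)
    return acc


alpha = QSqrt2(1, 1)          # 1 + sqrt(2)
minimal_poly = [1, -2, -1]    # x^2 - 2x - 1, which has alpha as a root
```

Here `poly_eval(minimal_poly, alpha)` and `poly_eval(minimal_poly, sigma(alpha))` both evaluate to zero, as the proposition predicts for any polynomial over $\mathbb{Q}$ having $\alpha$ as a root.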

Examples 

(1) Let $K = \mathbb{Q}(\sqrt{2})$. If $\tau \in \mathrm{Aut}(\mathbb{Q}(\sqrt{2})) = \mathrm{Aut}(\mathbb{Q}(\sqrt{2})/\mathbb{Q})$, then $\tau(\sqrt{2}) = \pm\sqrt{2}$
since these are the two roots of the minimal polynomial for $\sqrt{2}$. Since $\tau$ fixes $\mathbb{Q}$, this
determines $\tau$ completely:

$$\tau(a + b\sqrt{2}) = a \pm b\sqrt{2}.$$

The map $\sqrt{2} \mapsto \sqrt{2}$ is just the identity automorphism 1 of $\mathbb{Q}(\sqrt{2})$. The map
$\sigma : \sqrt{2} \mapsto -\sqrt{2}$ is the isomorphism considered in Example 2 following Corollary
13.7. Hence $\mathrm{Aut}(\mathbb{Q}(\sqrt{2})) = \mathrm{Aut}(\mathbb{Q}(\sqrt{2})/\mathbb{Q}) = \{1, \sigma\}$ is a cyclic group of order 2
generated by $\sigma$.

(2) Let $K = \mathbb{Q}(\sqrt[3]{2})$. As before, if $\tau \in \mathrm{Aut}(K/\mathbb{Q})$, then $\tau$ is completely determined by
its action on $\sqrt[3]{2}$ since

$$\tau(a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2) = a + b\,\tau\sqrt[3]{2} + c(\tau\sqrt[3]{2})^2.$$

Since $\tau\sqrt[3]{2}$ must be a root of $x^3 - 2$ and the other two roots of this equation are not
elements of K (recall the splitting field of this polynomial is degree 6 over $\mathbb{Q}$), the
only possibility is $\tau\sqrt[3]{2} = \sqrt[3]{2}$, i.e., $\tau = 1$. Hence $\mathrm{Aut}(\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q}) = 1$ is the trivial
group.

In general, if K is generated over F by some collection of elements, then any
automorphism $\sigma \in \mathrm{Aut}(K/F)$ is completely determined by what it does to the generators.
If K/F is finite then K is finitely generated over F by algebraic elements so by the
proposition the number of automorphisms of K fixing F is finite, i.e., Aut(K/F) is a
finite group. In particular, the automorphisms of a finite extension can be considered
as permutations of the roots of a finite number of equations (not every permutation
gives rise to an automorphism, however, as Example 2 above illustrates). It was the
investigation of permutations of the roots of equations that led Galois to the theory we
are describing.

We have associated to each field extension K/F (equivalently, with a subfield F of
K) a group, Aut(K/F), the group of automorphisms of K which fix F. One can also
reverse this process and associate to each group of automorphisms a field extension.

Proposition 3. Let $H \le \mathrm{Aut}(K)$ be a subgroup of the group of automorphisms of K.
Then the collection F of elements of K fixed by all the elements of H is a subfield of K.

Proof: Let $h \in H$ and let $a, b \in F$. Then by definition h(a) = a, h(b) = b so that
$h(a \pm b) = h(a) \pm h(b) = a \pm b$, $h(ab) = h(a)h(b) = ab$ and $h(a^{-1}) = h(a)^{-1} = a^{-1}$,
so that F is closed, hence a subfield of K.

Note that it is not important in this proposition that H actually be a subgroup of
Aut(K): the collection of elements of K fixed by all the elements of a subset of
Aut(K) is also a subfield of K.

Definition. If H is a subgroup of the group of automorphisms of K, the subfield of
K fixed by all the elements of H is called the fixed field of H.

Proposition 4. The association of groups to fields and fields to groups defined above
is inclusion reversing, namely

(1) if $F_1 \subseteq F_2 \subseteq K$ are two subfields of K then $\mathrm{Aut}(K/F_2) \le \mathrm{Aut}(K/F_1)$, and

(2) if $H_1 \le H_2 \le \mathrm{Aut}(K)$ are two subgroups of automorphisms with associated
fixed fields $F_1$ and $F_2$, respectively, then $F_2 \subseteq F_1$.

Proof: Any automorphism of K that fixes $F_2$ also fixes its subfield $F_1$, which gives
(1). The second assertion is proved similarly.

Examples 

(1) Suppose $K = \mathbb{Q}(\sqrt{2})$ as in Example 1 above. Then the fixed field of
$\mathrm{Aut}(\mathbb{Q}(\sqrt{2})) = \mathrm{Aut}(\mathbb{Q}(\sqrt{2})/\mathbb{Q}) = \{1, \sigma\}$ will be the set of elements of $\mathbb{Q}(\sqrt{2})$ with

$$\sigma(a + b\sqrt{2}) = a + b\sqrt{2}$$

since everything is fixed by the identity automorphism. This is the equation

$$a - b\sqrt{2} = a + b\sqrt{2},$$

which is equivalent to b = 0, so the fixed field of $\mathrm{Aut}(\mathbb{Q}(\sqrt{2})/\mathbb{Q})$ is just $\mathbb{Q}$.

(2) Suppose now that $K = \mathbb{Q}(\sqrt[3]{2})$ as in Example 2 above. In this case Aut(K) = 1, so
that every element of K is fixed, i.e., the fixed field of $\mathrm{Aut}(\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q})$ is $\mathbb{Q}(\sqrt[3]{2})$.

Given a subfield F of K, the associated group is the collection of automorphisms
of K which fix F. Given a group of automorphisms of K, the associated extension is
defined by taking F to be the fixed field of the automorphisms. In the first example
above, starting with the subfield $\mathbb{Q}$ of $\mathbb{Q}(\sqrt{2})$ one obtains the group $\{1, \sigma\}$ and starting
with the group $\{1, \sigma\}$ one obtains the subfield $\mathbb{Q}$, so there is a “duality” between the two.
In the second example, however, starting with the subfield $\mathbb{Q}$ of $\mathbb{Q}(\sqrt[3]{2})$ one obtains only
the trivial group and starting with the trivial group one obtains the full field $\mathbb{Q}(\sqrt[3]{2})$.

An examination of the two examples suggests that for the second example there
are “not enough” automorphisms to force the fixed field to be $\mathbb{Q}$ rather than the full
$\mathbb{Q}(\sqrt[3]{2})$. This in turn seems to be due to the fact that the other roots of $x^3 - 2$, which are
the only possible images of $\sqrt[3]{2}$ under an automorphism, are not elements of $\mathbb{Q}(\sqrt[3]{2})$.
(Although even if they were we would need to check that the additional maps we could
define were automorphisms.) We now make precise the notion of fields with “enough”
automorphisms (leading to the definition of a Galois extension). As one might suspect
even from these two examples (and we prove in the next section) these are related to
splitting fields.

We first investigate the size of the automorphism group in the case of splitting fields. 

Let F be a field and let E be the splitting field over F of $f(x) \in F[x]$. The main
tool is Theorem 13.27 on the existence of extensions of isomorphisms, which states
that any isomorphism $\varphi : F \xrightarrow{\sim} F'$ of F with F' can be extended to an isomorphism
$\sigma : E \xrightarrow{\sim} E'$ between E and the splitting field E' for $f'(x) = \varphi(f(x)) \in F'[x]$.

We now show by induction on [E : F] that the number of such extensions is at
most [E : F], with equality if f(x) is separable over F. If [E : F] = 1 then E = F,
E' = F', $\sigma = \varphi$ and the number of extensions is 1. If [E : F] > 1 then f(x) has at
least one irreducible factor p(x) of degree > 1 with corresponding irreducible factor
p'(x) of f'(x). Let $\alpha$ be a fixed root of p(x). If $\sigma$ is any extension of $\varphi$ to E, then $\sigma$
restricted to the subfield $F(\alpha)$ of E is an isomorphism $\tau$ of $F(\alpha)$ with some subfield of
E'. The isomorphism $\tau$ is completely determined by its action on $\alpha$, i.e., by $\tau\alpha$, since
$\alpha$ generates $F(\alpha)$ over F. Just as in Proposition 2, we see that $\tau\alpha$ must be some root
$\beta$ of p'(x). Then we have a diagram

$$\begin{array}{rccc}
\sigma : & E & \xrightarrow{\ \sim\ } & E' \\
 & | & & | \\
\tau : & F(\alpha) & \xrightarrow{\ \sim\ } & F'(\beta) \\
 & | & & | \\
\varphi : & F & \xrightarrow{\ \sim\ } & F'
\end{array}$$

Conversely, for any $\beta$ a root of p'(x) there are extensions $\tau$ and $\sigma$ giving such a diagram
(this is Theorem 13.8 and Theorem 13.27). Hence to count the number of extensions
$\sigma$ we need only count the possible number of these diagrams.

The number of extensions of $\varphi$ to an isomorphism $\tau$ is equal to the number of distinct
roots $\beta$ of p'(x). Since the degrees of p(x) and p'(x) are both equal to $[F(\alpha) : F]$, we
see that the number of extensions of $\varphi$ to a $\tau$ is at most $[F(\alpha) : F]$, with equality if the
roots of p(x) are distinct.

Since E is also the splitting field of f(x) over $F(\alpha)$, E' is the splitting field of f'(x)
over $F'(\beta)$, and $[E : F(\alpha)] < [E : F]$, we may apply our induction hypothesis to these
field extensions. By induction, the number of extensions of $\tau$ to $\sigma$ is $\le [E : F(\alpha)]$,
with equality if f(x) has distinct roots.

From $[E : F] = [E : F(\alpha)][F(\alpha) : F]$ it follows that the number of extensions of
$\varphi$ to $\sigma$ is $\le [E : F]$. We have equality if p(x) and f(x) have distinct roots, which is
equivalent to f(x) having distinct roots since p(x) is a factor of f(x), completing the
proof by induction.

In the particular case when F = F' and $\varphi$ is the identity map we have f(x) = f'(x)
and E = E' so the isomorphisms of E to E' restricting to $\varphi$ on F are the automorphisms
of E fixing F. We state this as follows:

Proposition 5. Let E be the splitting field over F of the polynomial $f(x) \in F[x]$.
Then

$$|\mathrm{Aut}(E/F)| \le [E : F]$$

with equality if f(x) is separable over F.

Remark: While we were primarily interested in counting the automorphisms of E
which fix F (which is the situation of F = F', $\varphi = 1$ above), it would still have been
necessary to consider the situation of more general $\varphi$ (and different fields F') because
of the induction step in the proof (which involves the fields $F(\alpha)$ and $F(\beta)$ for two
roots of the same polynomial p(x)).

One can modify the proof above to show more generally that $|\mathrm{Aut}(K/F)| \le [K : F]$
for any finite extension K/F (we shall prove this in the next section from a slightly
different point of view). This gives us a notion of field extensions with “enough”
automorphisms.

Definition. Let K/F be a finite extension. Then K is said to be Galois over F and
K/F is a Galois extension if $|\mathrm{Aut}(K/F)| = [K : F]$. If K/F is Galois the group of
automorphisms Aut(K/F) is called the Galois group of K/F, denoted Gal(K/F).

Remark: The Galois group of an extension K/F is sometimes defined to be the group
of automorphisms Aut(K/F) for all K/F. We have chosen the definition above so
that the notation Gal(K/F) will emphasize that the extension K/F has the maximal
number of automorphisms.

Corollary 6. If K is the splitting field over F of a separable polynomial f(x) then
K/F is Galois.

We shall see in the next section that the converse is also true, which will completely
characterize Galois extensions.

Note also that Corollary 6 implies that the splitting field of any polynomial over $\mathbb{Q}$
is Galois, since the splitting field of f(x) is clearly the same as the splitting field of the
product of the irreducible factors of f(x) (i.e., the polynomial obtained by removing
multiple factors), which is separable (Corollary 13.34).

Definition. If f(x) is a separable polynomial over F, then the Galois group of f(x)
over F is the Galois group of the splitting field of f(x) over F.

Examples 

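(1) The extension $\mathbb{Q}(\sqrt{2})/\mathbb{Q}$ is Galois with Galois group $\mathrm{Gal}(\mathbb{Q}(\sqrt{2})/\mathbb{Q}) = \{1, \sigma\} \cong \mathbb{Z}/2\mathbb{Z}$
where $\sigma$ is the automorphism

$$\sigma : \mathbb{Q}(\sqrt{2}) \to \mathbb{Q}(\sqrt{2}), \qquad a + b\sqrt{2} \mapsto a - b\sqrt{2}.$$

(2) More generally, any quadratic extension K of any field F of characteristic different
from 2 is Galois. This follows from the discussion of quadratic extensions following
Corollary 13.13, which shows that any extension K of degree 2 of F (where the
characteristic of F is not 2) is of the form $F(\sqrt{D})$ for some D hence is the splitting
field of $x^2 - D$ (since if $\sqrt{D} \in K$ then also $-\sqrt{D} \in K$).

(3) The extension $\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q}$ is not Galois since its group of automorphisms is only of
order 1.

(4) The extension $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ is Galois over $\mathbb{Q}$ since it is the splitting field of the
polynomial $(x^2 - 2)(x^2 - 3)$. Any automorphism $\sigma$ is completely determined by its action
on the generators $\sqrt{2}$ and $\sqrt{3}$, which must be mapped to $\pm\sqrt{2}$ and $\pm\sqrt{3}$, respectively.
Hence the only possibilities for automorphisms are the maps

$$\begin{cases} \sqrt{2} \mapsto \sqrt{2} \\ \sqrt{3} \mapsto \sqrt{3} \end{cases} \qquad
\begin{cases} \sqrt{2} \mapsto -\sqrt{2} \\ \sqrt{3} \mapsto \sqrt{3} \end{cases} \qquad
\begin{cases} \sqrt{2} \mapsto \sqrt{2} \\ \sqrt{3} \mapsto -\sqrt{3} \end{cases} \qquad
\begin{cases} \sqrt{2} \mapsto -\sqrt{2} \\ \sqrt{3} \mapsto -\sqrt{3}. \end{cases}$$

Since the Galois group is of order 4, all these elements are in fact automorphisms of
$\mathbb{Q}(\sqrt{2}, \sqrt{3})$ over $\mathbb{Q}$.

Define the automorphisms $\sigma$ and $\tau$ by

$$\sigma : \begin{cases} \sqrt{2} \mapsto -\sqrt{2} \\ \sqrt{3} \mapsto \sqrt{3} \end{cases} \qquad
\tau : \begin{cases} \sqrt{2} \mapsto \sqrt{2} \\ \sqrt{3} \mapsto -\sqrt{3} \end{cases}$$

or, more explicitly, by

$$\sigma : a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6} \mapsto a - b\sqrt{2} + c\sqrt{3} - d\sqrt{6}$$
$$\tau : a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6} \mapsto a + b\sqrt{2} - c\sqrt{3} - d\sqrt{6}$$

(since, for example,

$$\sigma(\sqrt{6}) = \sigma(\sqrt{2}\sqrt{3}) = \sigma(\sqrt{2})\sigma(\sqrt{3}) = (-\sqrt{2})(\sqrt{3}) = -\sqrt{6}\ ).$$

Then $\sigma^2(\sqrt{2}) = \sigma(\sigma\sqrt{2}) = \sigma(-\sqrt{2}) = \sqrt{2}$ and clearly $\sigma^2(\sqrt{3}) = \sqrt{3}$. Hence
$\sigma^2 = 1$ is the identity automorphism. Similarly, $\tau^2 = 1$. The automorphism $\sigma\tau$ can
be easily computed:

$$\sigma\tau(\sqrt{2}) = \sigma(\tau(\sqrt{2})) = \sigma(\sqrt{2}) = -\sqrt{2}$$

and

$$\sigma\tau(\sqrt{3}) = \sigma(\tau(\sqrt{3})) = \sigma(-\sqrt{3}) = -\sqrt{3}$$

so that $\sigma\tau$ is the remaining nontrivial automorphism in the Galois group. Since this
automorphism also evidently has order 2 in the Galois group, we have

$$\mathrm{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{3})/\mathbb{Q}) = \{1, \sigma, \tau, \sigma\tau\}$$

i.e., the Galois group is isomorphic to the Klein 4-group.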

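Associated to each subgroup of $\mathrm{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{3})/\mathbb{Q})$ is the corresponding fixed
subfield of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$. For example, the subfield corresponding to $\{1, \sigma\tau\}$ is the set
of elements fixed by the map

$$\sigma\tau : a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6} \mapsto a - b\sqrt{2} - c\sqrt{3} + d\sqrt{6}$$

which is the set of elements $a + d\sqrt{6}$, i.e., the field $\mathbb{Q}(\sqrt{6})$. One can similarly determine
the fixed fields for the other subgroups of the Galois group:

subgroup                                  fixed field

$\{1\}$                                   $\mathbb{Q}(\sqrt{2}, \sqrt{3})$
$\{1, \sigma\}$                           $\mathbb{Q}(\sqrt{3})$
$\{1, \sigma\tau\}$                       $\mathbb{Q}(\sqrt{6})$
$\{1, \tau\}$                             $\mathbb{Q}(\sqrt{2})$
$\{1, \sigma, \tau, \sigma\tau\}$         $\mathbb{Q}$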


(5) The splitting field of $x^3 - 2$ over $\mathbb{Q}$ is Galois of degree 6. The roots of this equation
are $\sqrt[3]{2}$, $\rho\sqrt[3]{2}$, $\rho^2\sqrt[3]{2}$ where $\rho = \zeta_3 = \frac{-1 + \sqrt{-3}}{2}$ is a primitive cube root of unity.
Hence the splitting field can be written $\mathbb{Q}(\sqrt[3]{2}, \rho\sqrt[3]{2})$. Any automorphism maps each
of these two elements to one of the roots of $x^3 - 2$, giving 9 possibilities, but since
the Galois group has order 6 not every such map is an automorphism of the field.

To determine the Galois group we use a more convenient set of generators, namely
$\sqrt[3]{2}$ and $\rho$. Then any automorphism $\sigma$ maps $\sqrt[3]{2}$ to one of $\sqrt[3]{2}$, $\rho\sqrt[3]{2}$, $\rho^2\sqrt[3]{2}$ and maps
$\rho$ to $\rho$ or $\rho^2 = \frac{-1 - \sqrt{-3}}{2}$ since these are the roots of the cyclotomic polynomial
$\Phi_3(x) = x^2 + x + 1$. Since $\sigma$ is completely determined by its action on these two
elements this gives only 6 possibilities and so each of these possibilities is actually an
automorphism. To give these automorphisms explicitly, let $\sigma$ and $\tau$ be the
automorphisms defined by

$$\sigma : \begin{cases} \sqrt[3]{2} \mapsto \rho\sqrt[3]{2} \\ \rho \mapsto \rho \end{cases} \qquad
\tau : \begin{cases} \sqrt[3]{2} \mapsto \sqrt[3]{2} \\ \rho \mapsto \rho^2 = -1 - \rho. \end{cases}$$

As before, these can be given explicitly on the elements of $\mathbb{Q}(\sqrt[3]{2}, \rho)$, which are linear
combinations of the basis $\{1, \sqrt[3]{2}, (\sqrt[3]{2})^2, \rho, \rho\sqrt[3]{2}, \rho(\sqrt[3]{2})^2\}$. For example

$$\sigma(\rho\sqrt[3]{2}) = (\rho)(\rho\sqrt[3]{2}) = \rho^2\sqrt[3]{2} = (-1 - \rho)\sqrt[3]{2} = -\sqrt[3]{2} - \rho\sqrt[3]{2}$$

and we may similarly determine the action of $\sigma$ on the other basis elements. This
gives

$$\sigma : a + b\sqrt[3]{2} + c\sqrt[3]{4} + d\rho + e\rho\sqrt[3]{2} + f\rho\sqrt[3]{4} \mapsto
a - e\sqrt[3]{2} + (f - c)\sqrt[3]{4} + d\rho + (b - e)\rho\sqrt[3]{2} - c\rho\sqrt[3]{4}. \qquad (14.1)$$

The other elements of the Galois group are

$$1 : \begin{cases} \sqrt[3]{2} \mapsto \sqrt[3]{2} \\ \rho \mapsto \rho \end{cases} \qquad
\sigma^2 : \begin{cases} \sqrt[3]{2} \mapsto \rho^2\sqrt[3]{2} \\ \rho \mapsto \rho \end{cases} \qquad
\tau\sigma : \begin{cases} \sqrt[3]{2} \mapsto \rho^2\sqrt[3]{2} \\ \rho \mapsto \rho^2 \end{cases} \qquad
\tau\sigma^2 : \begin{cases} \sqrt[3]{2} \mapsto \rho\sqrt[3]{2} \\ \rho \mapsto \rho^2. \end{cases}$$

Computing $\sigma\tau$ we have

$$\sigma\tau : \begin{cases} \sqrt[3]{2} \mapsto \rho\sqrt[3]{2} \\ \rho \mapsto \rho^2 \end{cases}$$

so that $\sigma\tau = \tau\sigma^2$. Similarly one computes that $\sigma^3 = \tau^2 = 1$. Hence

$$\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2}, \rho\sqrt[3]{2})/\mathbb{Q}) = \langle \sigma, \tau \rangle \cong S_3$$

is the symmetric group on 3 letters. Alternatively (and less computationally), since
$G = \mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2}, \zeta_3)/\mathbb{Q})$ acts as permutations of the 3 roots of $x^3 - 2$, G is a subgroup
of $S_3$, hence must be $S_3$ since it is of order 6. The computations above explicitly
identify the automorphisms in G and give an explicit isomorphism of G with $S_3$.

As in the previous example we can determine the fixed fields for any of the
subgroups of the Galois group. For example, consider the fixed field of the subgroup
$\{1, \sigma, \sigma^2\}$ generated by $\sigma$. These are just the elements fixed by $\sigma$ (given explicitly in
equation (1)) since if an element is fixed by $\sigma$ then it is also fixed by $\sigma^2$. (In general, the
fixed field of some subgroup is the field fixed by a set of generators for the subgroup.)
The elements fixed by $\sigma$ are those with

$$a = a \qquad b = -e \qquad c = f - c \qquad d = d \qquad e = b - e \qquad f = -c$$

which is equivalent to b = c = f = e = 0. Hence the fixed field of $\{1, \sigma, \sigma^2\}$ is the
field $\mathbb{Q}(\rho)$.


Remark: This example shows that some care must be exercised in determining Galois
groups from the actions on generators. As mentioned, not every map taking $\sqrt[3]{2}$ and $\rho\sqrt[3]{2}$
to roots of $x^3 - 2$ gives rise to an automorphism of the field (for example, the map

$$\sqrt[3]{2} \mapsto \rho\sqrt[3]{2}, \qquad \rho\sqrt[3]{2} \mapsto \rho\sqrt[3]{2}$$

clearly cannot be an automorphism since it is evidently not an injection). The point is
that there may be (sometimes very subtle) algebraic relations among the generators and
these relations must be respected by an automorphism. For example, the quotient of the
generators here is $\rho$, which is mapped to 1 and not to a root of the minimal polynomial for
$\rho$. Put another way, the quotient of these generators satisfies a quadratic equation and this
map does not respect that property.

For another (less trivial) example, compare with the discussion of the splitting field of
$x^8 - 2$ in Section 2.

(6) As in Example 3, the field $\mathbb{Q}(\sqrt[4]{2})$ is not Galois over $\mathbb{Q}$ since any automorphism is
determined by where it sends $\sqrt[4]{2}$ and of the four possibilities $\{\pm\sqrt[4]{2}, \pm i\sqrt[4]{2}\}$, only
two are elements of the field (the two real roots).

Note that we have a chain of total degree 4,

$$\mathbb{Q} \overset{2}{\subset} \mathbb{Q}(\sqrt{2}) \overset{2}{\subset} \mathbb{Q}(\sqrt[4]{2}),$$

where $\mathbb{Q}(\sqrt{2})/\mathbb{Q}$ and $\mathbb{Q}(\sqrt[4]{2})/\mathbb{Q}(\sqrt{2})$ are both Galois extensions by Example 2 since
both are quadratic extensions. This shows that a Galois extension of a Galois extension
is not necessarily Galois.

(7) The extension of finite fields $\mathbb{F}_{p^n}/\mathbb{F}_p$ constructed after Proposition 13.37 is Galois
by Corollary 6 since $\mathbb{F}_{p^n}$ is the splitting field over $\mathbb{F}_p$ of the separable polynomial
$x^{p^n} - x$. It follows that the group of automorphisms for this extension is of order n.
The injective homomorphism

$$\sigma : \mathbb{F}_{p^n} \to \mathbb{F}_{p^n}, \qquad \alpha \mapsto \alpha^p$$

of Proposition 13.35 is surjective in this case since $\mathbb{F}_{p^n}$ is finite, hence is an
isomorphism. This gives an automorphism of $\mathbb{F}_{p^n}$, called the Frobenius automorphism, which
we shall denote by $\sigma_p$. Iterating $\sigma_p$ we have $\sigma_p^2(\alpha) = \sigma_p(\sigma_p(\alpha)) = (\alpha^p)^p = \alpha^{p^2}$.
Similarly we have

$$\sigma_p^i(\alpha) = \alpha^{p^i}, \qquad i = 0, 1, 2, \dots$$

Since $\alpha^{p^n} = \alpha$, we see that $\sigma_p^n = 1$ is the identity automorphism. No lower power of
$\sigma_p$ can be the identity, since this would imply $\alpha^{p^i} = \alpha$ for all $\alpha \in \mathbb{F}_{p^n}$ for some i < n,
which is impossible since there are only $p^i$ roots of this equation. It follows that $\sigma_p$
is of order n in the Galois group, which means that $\mathrm{Gal}(\mathbb{F}_{p^n}/\mathbb{F}_p)$ is cyclic of order n,
with the Frobenius automorphism $\sigma_p$ as generator.

(8) The inseparable extension $\mathbb{F}_2(x)$ over $\mathbb{F}_2(t)$ where $x^2 - t = 0$ considered in Section
13.5 is not Galois. Any automorphism of this degree 2 extension is determined by its
action on x, which must be sent to a root of the equation $x^2 - t$. We have already seen
that there is only one root of this equation (with multiplicity 2) since we are in a field
of characteristic 2. Hence the extension has only the trivial automorphism. Note that
$\mathbb{F}_2(x)$ is the splitting field for $x^2 - t$ over $\mathbb{F}_2(t)$, so this example shows the separability
condition in Corollary 6 is necessary.
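
The Frobenius automorphism of Example 7 can be checked by direct computation in a small case. The sketch below assumes the model $\mathbb{F}_8 = \mathbb{F}_2[x]/(x^3 + x + 1)$, with elements stored as 3-bit integers whose bits are the coefficients of 1, x, $x^2$; this encoding and the function names are choices made for the illustration, not constructions from the text.

```python
MOD = 0b1011  # the polynomial x^3 + x + 1, irreducible over F_2
N = 3         # degree of the extension F_8 / F_2


def gf_mul(a, b):
    """Product of a and b in F_2[x]/(x^3 + x + 1)."""
    result = 0
    while b:
        if b & 1:
            result ^= a      # add the current multiple of a (addition is XOR)
        b >>= 1
        a <<= 1              # multiply a by x
        if a & (1 << N):
            a ^= MOD         # reduce modulo x^3 + x + 1
    return result


def frobenius(a):
    """The map a -> a^p with p = 2."""
    return gf_mul(a, a)


def frobenius_order():
    """Smallest k >= 1 such that the k-fold Frobenius map is the identity on F_8."""
    elements = list(range(1 << N))
    images, k = elements, 0
    while True:
        images = [frobenius(a) for a in images]
        k += 1
        if images == elements:
            return k


# Elements left fixed by Frobenius: these form the prime field F_2.
fixed_by_frobenius = [a for a in range(1 << N) if frobenius(a) == a]
```

In this model `frobenius_order()` returns 3, matching the degree n = 3, and `fixed_by_frobenius` is `[0, 1]`, so the fixed field of the group generated by Frobenius is $\mathbb{F}_2$.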


EXERCISES 

1. (a) Show that if the field K is generated over F by the elements $\alpha_1, \dots, \alpha_n$ then an
automorphism $\sigma$ of K fixing F is uniquely determined by $\sigma(\alpha_1), \dots, \sigma(\alpha_n)$. In
particular show that an automorphism fixes K if and only if it fixes a set of generators
for K.

(b) Let $G \le \mathrm{Gal}(K/F)$ be a subgroup of the Galois group of the extension K/F and
suppose $\sigma_1, \dots, \sigma_k$ are generators for G. Show that the subfield E/F is fixed by G if
and only if it is fixed by the generators $\sigma_1, \dots, \sigma_k$.

2. Let $\tau$ be the map $\tau : \mathbb{C} \to \mathbb{C}$ defined by $\tau(a + bi) = a - bi$ (complex conjugation). Prove
that $\tau$ is an automorphism of $\mathbb{C}$.

3. Determine the fixed field of complex conjugation on $\mathbb{C}$.

4. Prove that $\mathbb{Q}(\sqrt{2})$ and $\mathbb{Q}(\sqrt{3})$ are not isomorphic.

5. Determine the automorphisms of the extension $\mathbb{Q}(\sqrt[4]{2})/\mathbb{Q}(\sqrt{2})$ explicitly.

6. Let k be a field.

(a) Show that the mapping $\varphi : k[t] \to k[t]$ defined by $\varphi(f(t)) = f(at + b)$ for fixed
$a, b \in k$, $a \neq 0$ is an automorphism of k[t] which is the identity on k.

(b) Conversely, let $\varphi$ be an automorphism of k[t] which is the identity on k. Prove that
there exist $a, b \in k$ with $a \neq 0$ such that $\varphi(f(t)) = f(at + b)$ as in (a).

7. This exercise determines $\mathrm{Aut}(\mathbb{R}/\mathbb{Q})$.

(a) Prove that any $\sigma \in \mathrm{Aut}(\mathbb{R}/\mathbb{Q})$ takes squares to squares and takes positive reals to
positive reals. Conclude that a < b implies $\sigma a < \sigma b$ for every $a, b \in \mathbb{R}$.

(b) Prove that $-\frac{1}{m} < a - b < \frac{1}{m}$ implies $-\frac{1}{m} < \sigma a - \sigma b < \frac{1}{m}$ for every positive integer
m. Conclude that $\sigma$ is a continuous map on $\mathbb{R}$.

(c) Prove that any continuous map on $\mathbb{R}$ which is the identity on $\mathbb{Q}$ is the identity map,
hence $\mathrm{Aut}(\mathbb{R}/\mathbb{Q}) = 1$.

8. Prove that the automorphisms of the rational function field k(t) which fix k are precisely the
fractional linear transformations determined by $t \mapsto \frac{at + b}{ct + d}$ for $a, b, c, d \in k$, $ad - bc \neq 0$
(so $f(t) \in k(t)$ maps to $f\left(\frac{at + b}{ct + d}\right)$) (cf. Exercise 18 of Section 13.2).

9. Determine the fixed field of the automorphism $t \mapsto t + 1$ of k(t).

10. Let K be an extension of the field F. Let $\varphi : K \to K'$ be an isomorphism of K with a
field K' which maps F to the subfield F' of K'. Prove that the map $\sigma \mapsto \varphi\sigma\varphi^{-1}$ defines
a group isomorphism $\mathrm{Aut}(K/F) \to \mathrm{Aut}(K'/F')$.


14.2 THE FUNDAMENTAL THEOREM OF GALOIS THEORY 

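In the Galois extension $\mathbb{Q}(\sqrt{2}, \sqrt{3})/\mathbb{Q}$ considered in the previous section, there
was a strong similarity between the diagram of subgroups of the Galois group (each
inclusion shown is of index 2):

$$\begin{array}{ccc}
 & \{1\} & \\
\diagup & | & \diagdown \\
\{1, \tau\} & \{1, \sigma\tau\} & \{1, \sigma\} \\
\diagdown & | & \diagup \\
 & \{1, \sigma, \tau, \sigma\tau\} &
\end{array}$$

and the diagram of corresponding fixed fields

$$\begin{array}{ccc}
 & \mathbb{Q}(\sqrt{2}, \sqrt{3}) & \\
\diagup & | & \diagdown \\
\mathbb{Q}(\sqrt{2}) & \mathbb{Q}(\sqrt{6}) & \mathbb{Q}(\sqrt{3}) \\
\diagdown & | & \diagup \\
 & \mathbb{Q} &
\end{array}$$

(we have inverted the lattice of subgroups because of the inclusion-reversing nature of
the correspondence).

Note that this is also the diagram of all known subfields of the extension and that
in this case each of the subfields is also a Galois extension of $\mathbb{Q}$.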
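
The correspondence between subgroups and fixed fields for $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ can be reproduced mechanically. In the sketch below each automorphism is recorded by the signs it applies to the basis $1, \sqrt{2}, \sqrt{3}, \sqrt{6}$, as in the explicit formulas for $\sigma$ and $\tau$ of the previous section; the dictionary `AUTS` and the function names are illustrative assumptions. Because every automorphism acts diagonally on this basis, an element is fixed exactly when its coefficients on the negated basis vectors vanish.

```python
BASIS = ("1", "sqrt2", "sqrt3", "sqrt6")

# Sign applied by each automorphism to the basis 1, sqrt2, sqrt3, sqrt6.
AUTS = {
    "1":         (1,  1,  1,  1),
    "sigma":     (1, -1,  1, -1),   # sqrt2 -> -sqrt2, sqrt3 ->  sqrt3
    "tau":       (1,  1, -1, -1),   # sqrt2 ->  sqrt2, sqrt3 -> -sqrt3
    "sigma*tau": (1, -1, -1,  1),   # sqrt2 -> -sqrt2, sqrt3 -> -sqrt3
}


def compose(s, t):
    """Sign vector of the composite of the automorphisms named s and t."""
    return tuple(x * y for x, y in zip(AUTS[s], AUTS[t]))


def fixed_basis(subgroup):
    """Basis vectors fixed by every automorphism in the subgroup;
    they span the fixed field as a vector space over Q."""
    return [BASIS[i] for i in range(4)
            if all(AUTS[name][i] == 1 for name in subgroup)]


SUBGROUPS = [
    ["1"],
    ["1", "sigma"],
    ["1", "tau"],
    ["1", "sigma*tau"],
    ["1", "sigma", "tau", "sigma*tau"],
]

# For each subgroup: (order of the subgroup, Q-basis of its fixed field).
lattice = [(len(h), fixed_basis(h)) for h in SUBGROUPS]
```

In every row of `lattice` the order of the subgroup times the number of fixed basis vectors is 4, the degree of the extension, which is the numerical pattern that the Fundamental Theorem makes precise.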

In a similar way there is a strong similarity between the diagram 



of subgroups of the Galois group and the diagram of known subfields for the splitting 
field of x 3 — 2: 



where the subfields in the second diagram are precisely the fixed fields of the subgroups 
in the first diagram. 

Note in this pair of diagrams only the subgroup $\langle \sigma \rangle$ generated by $\sigma$ is normal in
$S_3$ and that the subfield $\mathbb{Q}(\rho)$ is the only subfield Galois over $\mathbb{Q}$.
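
The group-theoretic claims about the splitting field of $x^3 - 2$ can also be verified by machine. The following sketch encodes an automorphism by a pair (i, j), meaning $\sqrt[3]{2} \mapsto \rho^i\sqrt[3]{2}$ and $\rho \mapsto \rho^j$, and lets it act on coefficient vectors with respect to the basis $1, \sqrt[3]{2}, \sqrt[3]{4}, \rho, \rho\sqrt[3]{2}, \rho\sqrt[3]{4}$ used in the previous section. The encoding and all function names are assumptions made for this illustration.

```python
# A vector (a, b, c, d, e, f) stands for
# a + b*t + c*t^2 + d*r + e*r*t + f*r*t^2, with t = cbrt(2) and r = rho.
UNIT = [tuple(int(i == k) for i in range(6)) for k in range(6)]

# An automorphism is encoded by (i, j): t -> r^i * t and r -> r^j.
GROUP = [(i, j) for i in (0, 1, 2) for j in (1, 2)]
IDENTITY, SIGMA, TAU = (0, 1), (1, 1), (0, 2)


def apply(aut, v):
    """Image of the coefficient vector v under the automorphism aut."""
    i, j = aut
    out = [0] * 6
    for a in (0, 1):            # exponent of r in the basis element
        for b in (0, 1, 2):     # exponent of t in the basis element
            coeff = v[3 * a + b]
            k = (j * a + i * b) % 3      # r^a t^b is sent to r^k t^b
            if k == 0:
                out[b] += coeff
            elif k == 1:
                out[3 + b] += coeff
            else:                        # rewrite r^2 as -1 - r
                out[b] -= coeff
                out[3 + b] -= coeff
    return tuple(out)


def compose(s, u):
    """Code (i, j) of the composite 's after u' (apply u first, then s)."""
    target = [apply(s, apply(u, e)) for e in UNIT]
    return next(g for g in GROUP if [apply(g, e) for e in UNIT] == target)


def inverse(g):
    """Code of the inverse automorphism, found by search in the group."""
    return next(h for h in GROUP if compose(g, h) == IDENTITY)


def is_normal(subgroup):
    """True if g h g^{-1} lies in the subgroup for all g in GROUP, h in subgroup."""
    return all(compose(compose(g, h), inverse(g)) in subgroup
               for g in GROUP for h in subgroup)


SIGMA2 = compose(SIGMA, SIGMA)
```

With these definitions `apply(SIGMA, (a, b, c, d, e, f))` reproduces the formula (14.1), `compose(SIGMA, TAU)` equals `compose(TAU, SIGMA2)`, and `is_normal` holds for `[IDENTITY, SIGMA, SIGMA2]` but fails for `[IDENTITY, TAU]`.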

The Fundamental Theorem of Galois Theory states that the relations observed in
the two examples above are not coincidental and hold for any Galois extension. Before
proving this we first develop some preliminary results on group characters, of which
field automorphisms give particular examples.

Definition. A character$^1$ $\chi$ of a group G with values in a field L is a homomorphism
from G to the multiplicative group of L:

$$\chi : G \to L^\times$$

i.e., $\chi(g_1 g_2) = \chi(g_1)\chi(g_2)$ for all $g_1, g_2 \in G$ and $\chi(g)$ is a nonzero element of L for
all $g \in G$.

Definition. The characters $\chi_1, \chi_2, \dots, \chi_n$ of G are said to be linearly independent
over L if they are linearly independent as functions on G, i.e., if there is no nontrivial
relation

$$a_1\chi_1 + a_2\chi_2 + \cdots + a_n\chi_n = 0 \qquad (a_1, \dots, a_n \in L \text{ not all } 0) \qquad (14.2)$$

as a function on G (that is, $a_1\chi_1(g) + a_2\chi_2(g) + \cdots + a_n\chi_n(g) = 0$ for all $g \in G$).

Theorem 7. (Linear Independence of Characters) If $\chi_1, \chi_2, \dots, \chi_n$ are distinct
characters of G with values in L then they are linearly independent over L.
Proof: Suppose the characters were linearly dependent. Among all the linear
dependence relations (2) above, choose one with the minimal number m of nonzero
coefficients $a_i$. We may suppose (by renumbering, if necessary) that the m nonzero
coefficients are $a_1, a_2, \dots, a_m$:

$$a_1\chi_1 + a_2\chi_2 + \cdots + a_m\chi_m = 0.$$

Then for any $g \in G$ we have

$$a_1\chi_1(g) + a_2\chi_2(g) + \cdots + a_m\chi_m(g) = 0. \qquad (14.3)$$

Let $g_0$ be an element with $\chi_1(g_0) \neq \chi_m(g_0)$ (which exists, since $\chi_1 \neq \chi_m$). Since (3)
holds for every element of G, in particular we have

$$a_1\chi_1(g_0 g) + a_2\chi_2(g_0 g) + \cdots + a_m\chi_m(g_0 g) = 0$$

i.e.,

$$a_1\chi_1(g_0)\chi_1(g) + a_2\chi_2(g_0)\chi_2(g) + \cdots + a_m\chi_m(g_0)\chi_m(g) = 0. \qquad (14.4)$$

Multiplying equation (3) by $\chi_m(g_0)$ and subtracting from equation (4) we obtain

$$[\chi_m(g_0) - \chi_1(g_0)]a_1\chi_1(g) + [\chi_m(g_0) - \chi_2(g_0)]a_2\chi_2(g) + \cdots
+ [\chi_m(g_0) - \chi_{m-1}(g_0)]a_{m-1}\chi_{m-1}(g) = 0,$$

which holds for all $g \in G$. But the first coefficient is nonzero and this is a relation with
fewer nonzero coefficients, a contradiction.
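
Theorem 7 can be checked numerically in a small case. The sketch below takes $G = \mathbb{F}_7^\times$ and $L = \mathbb{F}_7$ with the characters $\chi_k(g) = g^k$ for k = 0, 1, ..., 5, which are distinct because G is cyclic of order 6. This choice of group, field and characters is an assumption made for the illustration. Linear independence over L is the statement that the table of values has full rank modulo 7.

```python
def character_table(p):
    """Rows are the characters chi_k(g) = g^k mod p of the group (Z/pZ)^x,
    for k = 0, ..., p - 2; columns are indexed by g = 1, ..., p - 1."""
    return [[pow(g, k, p) for g in range(1, p)] for k in range(p - 1)]


def rank_mod_p(rows, p):
    """Rank of an integer matrix over the field Z/pZ, by Gaussian elimination."""
    rows = [[x % p for x in r] for r in rows]
    rank = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], p - 2, p)   # inverse by Fermat's little theorem
        rows[rank] = [(x * inv) % p for x in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                factor = rows[r][col]
                rows[r] = [(x - factor * y) % p
                           for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return rank


table = character_table(7)
rank_distinct = rank_mod_p(table, 7)                 # six distinct characters
rank_repeated = rank_mod_p(table + [table[0]], 7)    # chi_0 listed twice
```

Here `rank_distinct` is 6, so the six distinct characters admit no nontrivial relation, while repeating a character leaves the rank at 6 for seven rows, which shows why the hypothesis that the characters be distinct cannot be dropped.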

Consider now an injective homomorphism $\sigma$ of a field K into a field L, called an
embedding of K into L. Then in particular $\sigma$ is a homomorphism of the multiplicative
group $G = K^\times$ into the multiplicative group $L^\times$, so $\sigma$ may be viewed as a character of
$K^\times$ with values in L. Note also that this character contains all of the useful information
about the values of $\sigma$ viewed simply as a function on K, since the only point of K not
considered in $K^\times$ is 0, and we know $\sigma$ maps 0 to 0.

$^1$ This is the definition of a linear character. More general characters will be studied in Chapter 18.

Corollary 8. If $\sigma_1, \sigma_2, \dots, \sigma_n$ are distinct embeddings of a field K into a field L, then
they are linearly independent as functions on K. In particular distinct automorphisms
of a field K are linearly independent as functions on K.

We now use Corollary 8 to prove the fundamental relation between the orders of
subgroups of the automorphism group of a field K and the degrees of the extensions
over their fixed fields.

Theorem 9. Let $G = \{\sigma_1 = 1, \sigma_2, \dots, \sigma_n\}$ be a subgroup of automorphisms of a field
K and let F be the fixed field. Then

$$[K : F] = n = |G|.$$


Proof: Suppose first that n > [K : F] and let $\omega_1, \omega_2, \dots, \omega_m$ be a basis for K over
F (m = [K : F]). Then the system

$$\begin{aligned}
\sigma_1(\omega_1)x_1 + \sigma_2(\omega_1)x_2 + \cdots + \sigma_n(\omega_1)x_n &= 0 \\
&\ \,\vdots \\
\sigma_1(\omega_m)x_1 + \sigma_2(\omega_m)x_2 + \cdots + \sigma_n(\omega_m)x_n &= 0
\end{aligned}$$

of m equations in n unknowns $x_1, x_2, \dots, x_n$ has a nontrivial solution $\beta_1, \beta_2, \dots, \beta_n$ in
K since by assumption there are more unknowns than equations.

Let $a_1, a_2, \dots, a_m$ be m arbitrary elements of F. The field F is by definition
fixed by $\sigma_1, \dots, \sigma_n$ so each of these elements is fixed by every $\sigma_i$, i.e., $\sigma_i(a_j) = a_j$,
i = 1, 2, ..., n, j = 1, 2, ..., m. Multiplying the first equation above by $a_1$, the second
by $a_2$, ..., the last by $a_m$ then gives the system of equations

$$\begin{aligned}
\sigma_1(a_1\omega_1)\beta_1 + \sigma_2(a_1\omega_1)\beta_2 + \cdots + \sigma_n(a_1\omega_1)\beta_n &= 0 \\
&\ \,\vdots \\
\sigma_1(a_m\omega_m)\beta_1 + \sigma_2(a_m\omega_m)\beta_2 + \cdots + \sigma_n(a_m\omega_m)\beta_n &= 0.
\end{aligned}$$

Adding these equations we see that there are elements $\beta_1, \dots, \beta_n$ in K, not all 0,
satisfying

$$\sigma_1(a_1\omega_1 + a_2\omega_2 + \cdots + a_m\omega_m)\beta_1 + \cdots + \sigma_n(a_1\omega_1 + a_2\omega_2 + \cdots + a_m\omega_m)\beta_n = 0$$

for all choices of $a_1, \dots, a_m$ in F. Since $\omega_1, \dots, \omega_m$ is an F-basis for K, every $\alpha \in K$
is of the form $a_1\omega_1 + a_2\omega_2 + \cdots + a_m\omega_m$, so the previous equation means

$$\sigma_1(\alpha)\beta_1 + \cdots + \sigma_n(\alpha)\beta_n = 0$$

for all $\alpha \in K$. But this means the distinct automorphisms $\sigma_1, \dots, \sigma_n$ are linearly
dependent over K, contradicting Corollary 8.

We have proved $n \le [K : F]$. Note that we have so far not used the fact that
$\sigma_1, \sigma_2, \dots, \sigma_n$ are the elements of a group.

Suppose now that n < [K : F]. Then there are more than n T-linearly independent 
elements of K, say a,, . . . , a n+1 . The system 

C\ (orO*! -f or, (a 2 )x 2 H \-a y (<*„+! )*n+i = 0 


cr n (ai)x 1 -I- <J n (a 2 )x 2 -\ 1- cr n (or„ + i)x„ +1 = 0 


(14.5) 
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of n equations in n 4- 1 unknowns xi , . . . , x„ + i has a solution Pi , . . . , #,+t in K where 

not all the i = 1,2 n + 1 are 0. If all the elements of the solution Pi , . . . , #,+i 

were elements of F then the first equation (recall <jj = 1 is the identity automorphism) 
would contradict the linear independence over F of ct \ , 0 L 2 , . . . , a n+ j. Hence at least 
one fy, i = 1, 2, . . . , n + 1, is not an element of F. 

Among all the nontrivial solutions $(\beta_1, \dots, \beta_{n+1})$ of the system (5) choose one with
the minimal number $r$ of nonzero $\beta_i$. By renumbering if necessary we may assume
$\beta_1, \dots, \beta_r$ are nonzero. Dividing the equations by $\beta_r$ we may also assume $\beta_r = 1$. We
have already seen that at least one of $\beta_1, \dots, \beta_{r-1}, 1$ is not an element of $F$ (which
shows in particular that $r > 1$), say $\beta_1 \notin F$. Then our system of equations reads

$$\sigma_1(\alpha_1)\beta_1 + \cdots + \sigma_1(\alpha_{r-1})\beta_{r-1} + \sigma_1(\alpha_r) = 0$$
$$\vdots$$
$$\sigma_n(\alpha_1)\beta_1 + \cdots + \sigma_n(\alpha_{r-1})\beta_{r-1} + \sigma_n(\alpha_r) = 0 \tag{14.6}$$

or more briefly

$$\sigma_i(\alpha_1)\beta_1 + \cdots + \sigma_i(\alpha_{r-1})\beta_{r-1} + \sigma_i(\alpha_r) = 0, \qquad i = 1, 2, \dots, n. \tag{14.7}$$


Since $\beta_1 \notin F$, there is an automorphism $\sigma_{k_0}$ ($k_0 \in \{1, 2, \dots, n\}$) with $\sigma_{k_0}\beta_1 \ne \beta_1$.
If we apply the automorphism $\sigma_{k_0}$ to the equations in (6), we obtain the system of
equations

$$\sigma_{k_0}\sigma_j(\alpha_1)\sigma_{k_0}(\beta_1) + \cdots + \sigma_{k_0}\sigma_j(\alpha_{r-1})\sigma_{k_0}(\beta_{r-1}) + \sigma_{k_0}\sigma_j(\alpha_r) = 0 \tag{14.8}$$

for $j = 1, 2, \dots, n$. But the elements

$$\sigma_{k_0}\sigma_1, \ \sigma_{k_0}\sigma_2, \ \dots, \ \sigma_{k_0}\sigma_n$$

are the same as the elements

$$\sigma_1, \ \sigma_2, \ \dots, \ \sigma_n$$

in some order since these elements form a group. In other words, if we define the index
$i$ by $\sigma_{k_0}\sigma_j = \sigma_i$ then $i$ and $j$ both run over the set $\{1, 2, \dots, n\}$. Hence the equations in
(8) can be written

$$\sigma_i(\alpha_1)\sigma_{k_0}(\beta_1) + \cdots + \sigma_i(\alpha_{r-1})\sigma_{k_0}(\beta_{r-1}) + \sigma_i(\alpha_r) = 0. \tag{14.8$'$}$$

If we now subtract the equations in (8$'$) from those in (7) we obtain the system

$$\sigma_i(\alpha_1)[\beta_1 - \sigma_{k_0}(\beta_1)] + \cdots + \sigma_i(\alpha_{r-1})[\beta_{r-1} - \sigma_{k_0}(\beta_{r-1})] = 0$$

for $i = 1, 2, \dots, n$. But this is a solution to the system of equations (5) with

$$x_1 = \beta_1 - \sigma_{k_0}(\beta_1) \ne 0$$

(by the choice of $k_0$), hence is nontrivial and has fewer than $r$ nonzero $x_i$. This is a
contradiction and completes the proof.

Our first use of this result is to prove that the inequality of Proposition 5 holds for
any finite extension $K/F$.

Corollary 10. Let $K/F$ be any finite extension. Then

$$|\mathrm{Aut}(K/F)| \le [K : F]$$

with equality if and only if $F$ is the fixed field of $\mathrm{Aut}(K/F)$. Put another way, $K/F$ is
Galois if and only if $F$ is the fixed field of $\mathrm{Aut}(K/F)$.

Proof: Let $F_1$ be the fixed field of $\mathrm{Aut}(K/F)$, so that

$$F \subseteq F_1 \subseteq K.$$

By Theorem 9, $[K : F_1] = |\mathrm{Aut}(K/F)|$. Hence $[K : F] = |\mathrm{Aut}(K/F)|\,[F_1 : F]$,
which proves the corollary.
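
As a small worked instance of the strict inequality (a sketch using only standard facts about $\mathbb{Q}(\sqrt[3]{2})$, not a computation taken from this proof): any automorphism of $K = \mathbb{Q}(\sqrt[3]{2})$ must send $\sqrt[3]{2}$ to a root of $x^3 - 2$ lying in $K$, and the only such root is $\sqrt[3]{2}$ itself because the other two roots are not real. So $\mathrm{Aut}(K/\mathbb{Q})$ is trivial, its fixed field is $F_1 = K$, and the degree formula in the proof reads

```latex
\[
  |\mathrm{Aut}(K/\mathbb{Q})| = 1 < 3 = [K : \mathbb{Q}],
  \qquad
  [K : \mathbb{Q}] = |\mathrm{Aut}(K/\mathbb{Q})| \, [F_1 : \mathbb{Q}] = 1 \cdot 3 .
\]
```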

Corollary 11. Let $G$ be a finite subgroup of automorphisms of a field $K$ and let $F$
be the fixed field. Then every automorphism of $K$ fixing $F$ is contained in $G$, i.e.,
$\mathrm{Aut}(K/F) = G$, so that $K/F$ is Galois, with Galois group $G$.

Proof: By definition $F$ is fixed by all the elements of $G$ so we have $G \le \mathrm{Aut}(K/F)$
(and the question is whether there are any automorphisms of $K$ fixing $F$ not in $G$, i.e.,
whether this containment is proper). Hence $|G| \le |\mathrm{Aut}(K/F)|$. By the theorem we
have $|G| = [K : F]$ and by the previous corollary $|\mathrm{Aut}(K/F)| \le [K : F]$. This gives

$$[K : F] = |G| \le |\mathrm{Aut}(K/F)| \le [K : F]$$

and it follows that we must have equalities throughout, proving the corollary.

Corollary 12. If $G_1 \ne G_2$ are distinct finite subgroups of automorphisms of a field $K$
then their fixed fields are also distinct.

Proof: Suppose $F_1$ is the fixed field of $G_1$ and $F_2$ is the fixed field of $G_2$. If $F_1 = F_2$
then by definition $F_1$ is fixed by $G_2$. By the previous corollary any automorphism fixing
$F_1$ is contained in $G_1$, hence $G_2 \le G_1$. Similarly $G_1 \le G_2$ and so $G_1 = G_2$.

By the corollaries above we see that taking the fixed fields for distinct finite subgroups
of $\mathrm{Aut}(K)$ gives distinct subfields of $K$ over which $K$ is Galois. Further, the
degrees of the extensions are given by the orders of the subgroups. We saw this explicitly
for the fields $K = \mathbb{Q}(\sqrt{2}, \sqrt{3})$ and $K = \mathbb{Q}(\sqrt[3]{2}, \rho)$ above. A portion of the
Fundamental Theorem states that these are all the subfields of $K$.

The next result provides the converse of Proposition 5 and characterizes Galois 
extensions. 

Theorem 13. The extension $K/F$ is Galois if and only if $K$ is the splitting field of some
separable polynomial over $F$. Furthermore, if this is the case then every irreducible
polynomial with coefficients in $F$ which has a root in $K$ is separable and has all its roots
in $K$ (so in particular $K/F$ is a separable extension).

Proof: Proposition 5 proves that the splitting field of a separable polynomial is
Galois.

We now show that if $K/F$ is Galois then every irreducible polynomial $p(x)$ in $F[x]$
having a root in $K$ splits completely in $K$. Set $G = \mathrm{Gal}(K/F)$. Let $\alpha \in K$ be a root
of $p(x)$ and consider the elements

$$\alpha, \ \sigma_2(\alpha), \ \dots, \ \sigma_n(\alpha) \in K \tag{14.9}$$

where $\{1, \sigma_2, \dots, \sigma_n\}$ are the elements of $\mathrm{Gal}(K/F)$. Let

$$\alpha, \ \alpha_2, \ \alpha_3, \ \dots, \ \alpha_r$$

denote the distinct elements in (9). If $\tau \in G$ then since $G$ is a group the elements
$\{\tau, \tau\sigma_2, \dots, \tau\sigma_n\}$ are the same as the elements $\{1, \sigma_2, \dots, \sigma_n\}$ in some order. It follows
that applying $\tau \in G$ to the elements in (9) simply permutes them, so in particular
applying $\tau$ to $\alpha, \alpha_2, \dots, \alpha_r$ also permutes these elements. The polynomial

$$f(x) = (x - \alpha)(x - \alpha_2) \cdots (x - \alpha_r)$$

therefore has coefficients which are fixed by all the elements of $G$ since the elements of
$G$ simply permute the factors. Hence the coefficients lie in the fixed field of $G$, which
by Corollary 10 is the field $F$. Hence $f(x) \in F[x]$.

Since $p(x)$ is irreducible and has $\alpha$ as a root, $p(x)$ is the minimal polynomial for $\alpha$
over $F$, hence divides any polynomial with coefficients in $F$ having $\alpha$ as a root (this is
Proposition 13.9). It follows that $p(x)$ divides $f(x)$ in $F[x]$ and since $f(x)$ obviously
divides $p(x)$ in $K[x]$ by Proposition 2, we have

$$p(x) = f(x).$$

In particular, this shows that $p(x)$ is separable and that all its roots lie in $K$ (in fact they
are among the elements $\alpha, \sigma_2\alpha, \dots, \sigma_n\alpha$), proving the last statement of the theorem.

To complete the proof, suppose $K/F$ is Galois and let $\omega_1, \omega_2, \dots, \omega_n$ be a basis for
$K/F$. Let $p_i(x)$ be the minimal polynomial for $\omega_i$ over $F$, $i = 1, 2, \dots, n$. Then by
what we have just proved, $p_i(x)$ is separable and has all its roots in $K$. Let $g(x)$ be the
polynomial obtained by removing any multiple factors in the product $p_1(x) \cdots p_n(x)$
(the "squarefree part"). Then the splitting field of the two polynomials is the same and
this field is $K$ (all the roots lie in $K$, so $K$ contains the splitting field, but $\omega_1, \omega_2, \dots, \omega_n$
are among the roots, so the splitting field contains $K$). Hence $K$ is the splitting field of
the separable polynomial $g(x)$.

Definition. Let $K/F$ be a Galois extension. If $\alpha \in K$ the elements $\sigma\alpha$ for $\sigma$ in
$\mathrm{Gal}(K/F)$ are called the conjugates (or Galois conjugates) of $\alpha$ over $F$. If $E$ is a
subfield of $K$ containing $F$, the field $\sigma(E)$ is called the conjugate field of $E$ over $F$.

The proof of the theorem shows that in a Galois extension $K/F$ the other roots
of the minimal polynomial over $F$ of any element $\alpha \in K$ are precisely the distinct
conjugates of $\alpha$ under the Galois group of $K/F$.

The second statement in this theorem also shows that K is not Galois over F if we 
can find even one irreducible polynomial over F having a root in K but not having all its 
roots in K. This justifies in a very strong sense the intuition from earlier examples that 
Galois extensions are extensions with “enough” distinct roots of irreducible polynomials 
(namely, if it contains one root then it contains all the roots).

Finally, notice that we now have 4 characterizations of Galois extensions $K/F$:

(1) splitting fields of separable polynomials over $F$

(2) fields where $F$ is precisely the set of elements fixed by $\mathrm{Aut}(K/F)$ (in general, the
fixed field may be larger than $F$)

(3) fields with $[K : F] = |\mathrm{Aut}(K/F)|$ (the original definition)

(4) finite, normal and separable extensions.

Theorem 14. (Fundamental Theorem of Galois Theory) Let $K/F$ be a Galois extension
and set $G = \mathrm{Gal}(K/F)$. Then there is a bijection

$$
\left\{ \begin{array}{c} \text{subfields } E \\ \text{of } K \\ \text{containing } F \end{array}
\quad \begin{array}{c} K \\ | \\ E \\ | \\ F \end{array} \right\}
\longleftrightarrow
\left\{ \begin{array}{c} \text{subgroups } H \\ \text{of } G \end{array}
\quad \begin{array}{c} 1 \\ | \\ H \\ | \\ G \end{array} \right\}
$$

given by the correspondences

$$E \longrightarrow \{ \text{the elements of } G \text{ fixing } E \}$$
$$\{ \text{the fixed field of } H \} \longleftarrow H$$

which are inverse to each other. Under this correspondence,

(1) (inclusion reversing) If $E_1, E_2$ correspond to $H_1, H_2$, respectively, then $E_1 \subseteq E_2$
if and only if $H_2 \le H_1$

(2) $[K : E] = |H|$ and $[E : F] = |G : H|$, the index of $H$ in $G$:

$$\begin{array}{cl} K & \\ | & \}\ |H| \\ E & \\ | & \}\ |G : H| \\ F & \end{array}$$

(3) $K/E$ is always Galois, with Galois group $\mathrm{Gal}(K/E) = H$:

$$\begin{array}{cl} K & \\ | & H \\ E & \end{array}$$

(4) $E$ is Galois over $F$ if and only if $H$ is a normal subgroup in $G$. If this is the
case, then the Galois group is isomorphic to the quotient group

$$\mathrm{Gal}(E/F) \cong G/H.$$

More generally, even if $H$ is not necessarily normal in $G$, the isomorphisms of
$E$ (into a fixed algebraic closure of $F$ containing $K$) which fix $F$ are in one to
one correspondence with the cosets $\{\sigma H\}$ of $H$ in $G$.

(5) If $E_1, E_2$ correspond to $H_1, H_2$, respectively, then the intersection $E_1 \cap E_2$
corresponds to the group $\langle H_1, H_2 \rangle$ generated by $H_1$ and $H_2$ and the composite
field $E_1E_2$ corresponds to the intersection $H_1 \cap H_2$. Hence the lattice of subfields
of $K$ containing $F$ and the lattice of subgroups of $G$ are "dual" (the lattice
diagram for one is the lattice diagram for the other turned upside down).

Proof: Given any subgroup $H$ of $G$ we obtain a unique fixed field $E = K_H$ by
Corollary 12. This shows that the correspondence above is injective from right to left.

If $K$ is the splitting field of the separable polynomial $f(x) \in F[x]$ then we may
also view $f(x)$ as an element of $E[x]$ for any subfield $E$ of $K$ containing $F$. Then $K$
is also the splitting field of $f(x)$ over $E$, so the extension $K/E$ is Galois. By Corollary
10, $E$ is the fixed field of $\mathrm{Aut}(K/E) \le G$, showing that every subfield of $K$ containing
$F$ arises as the fixed field for some subgroup of $G$. Hence the correspondence above is
surjective from right to left, hence a bijection. The correspondences are inverse to each
other since the automorphisms fixing $E$ are precisely $\mathrm{Aut}(K/E)$ by Corollary 10.

We have already seen that the Galois correspondence is inclusion reversing in 
Proposition 4, which gives (1). 

If $E = K_H$ is the fixed field of $H$, then Theorem 9 gives $[K : E] = |H|$ and
$[K : F] = |G|$. Taking the quotient gives $[E : F] = |G : H|$, which proves (2).

Corollary 11 gives (3) immediately.

Suppose $E = K_H$ is the fixed field of the subgroup $H$. Every $\sigma \in G = \mathrm{Gal}(K/F)$
when restricted to $E$ is an embedding $\sigma|_E$ of $E$ with the subfield $\sigma(E)$ of $K$. Conversely,
let $\tau : E \to \tau(E) \subseteq \bar{F}$ be any embedding of $E$ (into a fixed algebraic closure $\bar{F}$ of
$F$ containing $K$) which fixes $F$. Then $\tau(E)$ is in fact contained in $K$: if $\alpha \in E$ has
minimal polynomial $m_\alpha(x)$ over $F$ then $\tau(\alpha)$ is another root of $m_\alpha(x)$ and $K$ contains
all these roots by Theorem 13. As above $K$ is the splitting field of $f(x)$ over $E$ and so
also the splitting field of $\tau f(x)$ (which is the same as $f(x)$ since $f(x)$ has coefficients
in $F$) over $\tau(E)$. Theorem 13.27 on extending isomorphisms then shows that we can
extend $\tau$ to an isomorphism $\sigma$:

$$\begin{array}{rccc} \sigma : & K & \longrightarrow & K \\ & | & & | \\ \tau : & E & \longrightarrow & \tau(E) \end{array}$$

Since $\sigma$ fixes $F$ (because $\tau$ does), it follows that every embedding $\tau$ of $E$ fixing $F$
is the restriction to $E$ of some automorphism $\sigma$ of $K$ fixing $F$, in other words, every
embedding of $E$ is of the form $\sigma|_E$ for some $\sigma \in G$.

Two automorphisms $\sigma, \sigma' \in G$ restrict to the same embedding of $E$ if and only if
$\sigma^{-1}\sigma'$ is the identity map on $E$. But then $\sigma^{-1}\sigma' \in H$ (i.e., $\sigma' \in \sigma H$) since by (3) the
automorphisms of $K$ which fix $E$ are precisely the elements in $H$. Hence the distinct
embeddings of $E$ are in bijection with the cosets $\sigma H$ of $H$ in $G$. In particular this gives

$$|\mathrm{Emb}(E/F)| = [G : H] = [E : F]$$

where $\mathrm{Emb}(E/F)$ denotes the set of embeddings of $E$ (into a fixed algebraic closure
of $F$) which fix $F$. Note that $\mathrm{Emb}(E/F)$ contains the automorphisms $\mathrm{Aut}(E/F)$.

The extension $E/F$ will be Galois if and only if $|\mathrm{Aut}(E/F)| = [E : F]$. By the
equality above, this will be the case if and only if each of the embeddings of $E$ is actually
an automorphism of $E$, i.e., if and only if $\sigma(E) = E$ for every $\sigma \in G$.

If $\sigma \in G$, then the subgroup of $G$ fixing the field $\sigma(E)$ is the group $\sigma H \sigma^{-1}$, i.e.,

$$\sigma(E) = K_{\sigma H \sigma^{-1}}.$$

To see this observe that if $\sigma\alpha \in \sigma(E)$ then

$$(\sigma h \sigma^{-1})(\sigma\alpha) = \sigma(h\alpha) = \sigma\alpha \quad \text{for all } h \in H,$$

since $h$ fixes $\alpha \in E$, which shows that $\sigma H \sigma^{-1}$ fixes $\sigma(E)$. The group fixing $\sigma(E)$ has
order equal to the degree of $K$ over $\sigma(E)$. But this is the same as the degree of $K$ over
$E$ since the fields are isomorphic, hence the same as the order of $H$. Hence $\sigma H \sigma^{-1}$ is
precisely the group fixing $\sigma(E)$ since we have shown containment and their orders are
the same.

Because of the bijective nature of the Galois correspondence already proved we
know that two subfields of $K$ containing $F$ are equal if and only if their fixing subgroups
are equal in $G$. Hence $\sigma(E) = E$ for all $\sigma \in G$ if and only if $\sigma H \sigma^{-1} = H$ for all
$\sigma \in G$, in other words $E$ is Galois over $F$ if and only if $H$ is a normal subgroup of $G$.

We have already identified the embeddings of E over F as the set of cosets of H in 
G and when H is normal in G seen that the embeddings are automorphisms. It follows 
that in this case the group of cosets G/H is identified with the group of automorphisms 
of the Galois extension E /F by the definition of the group operation (composition of 
automorphisms). Hence $G/H \cong \mathrm{Gal}(E/F)$ when $H$ is normal in $G$, which completes
the proof of (4).

Suppose $H_1$ is the subgroup of elements of $G$ fixing the subfield $E_1$ and $H_2$ is the
subgroup of elements of $G$ fixing the subfield $E_2$. Any element in $H_1 \cap H_2$ fixes both
$E_1$ and $E_2$, hence fixes every element in the composite $E_1E_2$, since the elements in
this field are algebraic combinations of the elements of $E_1$ and $E_2$. Conversely, if an
automorphism $\sigma$ fixes the composite $E_1E_2$ then in particular $\sigma$ fixes $E_1$, i.e., $\sigma \in H_1$,
and $\sigma$ fixes $E_2$, i.e., $\sigma \in H_2$, hence $\sigma \in H_1 \cap H_2$. This proves that the composite $E_1E_2$
corresponds to the intersection $H_1 \cap H_2$. Similarly, the intersection $E_1 \cap E_2$ corresponds
to the group $\langle H_1, H_2 \rangle$ generated by $H_1$ and $H_2$, completing the proof of the theorem.

Example: ($\mathbb{Q}(\sqrt{2}, \sqrt{3})$ and $\mathbb{Q}(\sqrt[3]{2}, \rho)$)

We have already seen examples of this theorem at the beginning of this section. We now see
that the diagrams of subfields for the two fields $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ and $\mathbb{Q}(\sqrt[3]{2}, \rho)$ given before
indicate all the subfields for these two fields.

Since every subgroup of the Klein 4-group is normal, all the subfields of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$
are Galois extensions of $\mathbb{Q}$.

Similarly, since the only nontrivial normal subgroup of $S_3$ is the subgroup of order 3,
we see that only the subfield $\mathbb{Q}(\rho)$ of $K = \mathbb{Q}(\sqrt[3]{2}, \rho)$ is Galois over $\mathbb{Q}$, with Galois group
isomorphic to $S_3/\langle \sigma \rangle$, i.e., the cyclic group of order 2. For example, the nontrivial
automorphism of $\mathbb{Q}(\rho)$ is induced by restricting any element ($\tau$, for instance) in the nontrivial
coset of $\langle \sigma \rangle$ to $\mathbb{Q}(\rho)$. This is clear from the explicit descriptions of these automorphisms
given before: each of the elements $\tau, \tau\sigma, \tau\sigma^2$ in this coset maps $\rho$ to $\rho^2$. The restrictions
of the elements of $\mathrm{Gal}(K/\mathbb{Q})$ to the (non-Galois) cubic subfields do not give automorphisms
of these fields in general, rather giving isomorphisms of these fields with each other, in
accordance with (4) of the theorem.
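
The normality statement for $S_3$ can be checked by brute force. The following is a minimal sketch, assuming the Galois group of $\mathbb{Q}(\sqrt[3]{2}, \rho)$ is modeled as the permutations of three labels (standing for the three roots of $x^3 - 2$); the helper names `compose`, `inverse`, `closure` and `is_normal` are illustrative and not from the text. It lists every subgroup $H$ with $|H| = [K : E]$, the index $[G : H] = [E : \mathbb{Q}]$, and whether $H$ is normal, i.e., whether the fixed field $E$ is Galois over $\mathbb{Q}$.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, v in enumerate(p):
        inv[v] = i
    return tuple(inv)

def closure(gens, identity):
    """Subgroup generated by gens (finite group, so products suffice)."""
    group, frontier = {identity}, [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(g, x)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return frozenset(group)

G = list(permutations(range(3)))   # model of S_3 (assumption: acts on 3 roots)
identity = (0, 1, 2)

# Every subgroup of S_3 is generated by at most two elements.
subgroups = {closure((g, h), identity) for g in G for h in G}

def is_normal(H):
    """True if g H g^{-1} = H for every g in G."""
    return all(compose(compose(g, h), inverse(g)) in H for g in G for h in H)

normal_orders = sorted(len(H) for H in subgroups if is_normal(H))

for H in sorted(subgroups, key=len):
    print(f"|H| = {len(H)}, index = {len(G) // len(H)}, normal = {is_normal(H)}")
```

Only the subgroups of order 1, 3 and 6 come out normal, which matches the statement that among the proper intermediate fields only the quadratic subfield is Galois over $\mathbb{Q}$.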

Example: ($\mathbb{Q}(\sqrt{2} + \sqrt{3})$)

Consider the field $\mathbb{Q}(\sqrt{2} + \sqrt{3})$. This is clearly a subfield of the Galois extension
$\mathbb{Q}(\sqrt{2}, \sqrt{3})$. The other roots of the minimal polynomial for $\sqrt{2} + \sqrt{3}$ over $\mathbb{Q}$ are therefore
the distinct conjugates of $\sqrt{2} + \sqrt{3}$ under the Galois group. The conjugates are

$$\pm\sqrt{2} \pm \sqrt{3}$$

which are easily seen to be distinct. The minimal polynomial is therefore

$$[x - (\sqrt{2} + \sqrt{3})][x - (\sqrt{2} - \sqrt{3})][x - (-\sqrt{2} + \sqrt{3})][x - (-\sqrt{2} - \sqrt{3})]$$

which is quickly computed to be the polynomial $x^4 - 10x^2 + 1$. It follows that this
polynomial is irreducible and that

$$\mathbb{Q}(\sqrt{2}, \sqrt{3}) = \mathbb{Q}(\sqrt{2} + \sqrt{3}),$$

either by degree considerations or by noting that only the automorphism 1 of $\{1, \sigma, \tau, \sigma\tau\}$
fixes $\sqrt{2} + \sqrt{3}$ so the fixing group for this field is the same as for $\mathbb{Q}(\sqrt{2}, \sqrt{3})$.
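
The product over the four conjugates can be expanded numerically as a sanity check. This is only a floating-point sketch (the helper `poly_mul` and the rounding step are illustrative choices, not part of the text); it multiplies out the four linear factors and rounds the coefficients to the nearest integers.

```python
from itertools import product
from math import sqrt

def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists, highest degree first."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

# The four conjugates ±sqrt(2) ± sqrt(3) of sqrt(2) + sqrt(3).
conjugates = [s * sqrt(2) + t * sqrt(3) for s, t in product((1, -1), repeat=2)]

# Expand the product of the linear factors (x - c) over all conjugates c.
poly = [1.0]
for c in conjugates:
    poly = poly_mul(poly, [1.0, -c])

# Round away floating-point noise to read off the integer coefficients.
coeffs = [round(x) for x in poly]
print(coeffs)
```

The rounded coefficient list corresponds to $x^4 - 10x^2 + 1$, in agreement with the computation in the example.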

Example: (Splitting Field of $x^8 - 2$)

The splitting field of $x^8 - 2$ over $\mathbb{Q}$ is generated by $\theta = \sqrt[8]{2}$ (any fixed 8th root of 2, say
the real one) and a primitive 8th root of unity $\zeta = \zeta_8$. Recall from Section 13.6 that

$$\mathbb{Q}(\zeta_8) = \mathbb{Q}(i, \sqrt{2}).$$

Since $\theta^4 = \sqrt{2}$ we see that the splitting field is generated by $\theta$ and $i$. The subfield $\mathbb{Q}(\theta)$ is
of degree 8 over $\mathbb{Q}$ (since $x^8 - 2$ is irreducible, being Eisenstein), and all the elements of
this field are real. Hence $i \notin \mathbb{Q}(\theta)$ and since $i$ generates at most a quadratic extension of
this field, the splitting field

$$\mathbb{Q}(\sqrt[8]{2}, \zeta_8) = \mathbb{Q}(\sqrt[8]{2}, i)$$

is of degree 16 over $\mathbb{Q}$.

The Galois group is determined by the action on the generators $\theta$ and $i$ which gives
the possibilities

$$\theta \mapsto \zeta^a\theta, \qquad a = 0, 1, 2, \dots, 7$$
$$i \mapsto \pm i.$$

Since we have already seen that the degree of the extension is 16 and there are only 16
possible such maps, it follows that in fact each of the maps above is an automorphism of
$\mathbb{Q}(\sqrt[8]{2}, i)$ over $\mathbb{Q}$.

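Define the two automorphisms

$$\sigma : \begin{cases} \theta \mapsto \zeta\theta \\ i \mapsto i \end{cases} \qquad \tau : \begin{cases} \theta \mapsto \theta \\ i \mapsto -i \end{cases}$$

($\tau$ is the map induced by complex conjugation). Since

$$\zeta = \zeta_8 = \frac{\sqrt{2}}{2} + i\frac{\sqrt{2}}{2} = \frac{1}{2}(1 + i)\theta^4$$

we can easily compute what happens to $\zeta$ from the explicit expressions for the powers of
$\zeta$ in the following Figure 1.

Using these explicit values we find

$$\sigma : \begin{cases} \theta \mapsto \zeta\theta \\ i \mapsto i \\ \zeta \mapsto -\zeta = \zeta^5 \end{cases} \qquad \tau : \begin{cases} \theta \mapsto \theta \\ i \mapsto -i \\ \zeta \mapsto \zeta^7 \end{cases}$$

[Fig. 1: the powers of $\zeta = \zeta_8$]

Note that the reason we are interested in also keeping track of the action on the element
$\zeta$ is that it will be needed in computing the composites of automorphisms, for example in
computing

$$\sigma^2(\theta) = \sigma(\zeta\theta) = \sigma(\zeta)\sigma(\theta) = (-\zeta)(\zeta\theta) = -\zeta^2\theta = -i\theta.$$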


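We can similarly compute the following automorphisms:

$$\begin{array}{llll}
\sigma : & \theta \mapsto \zeta\theta & i \mapsto i & \zeta \mapsto \zeta^5 \\
\sigma^2 : & \theta \mapsto \zeta^6\theta & i \mapsto i & \zeta \mapsto \zeta \\
\sigma^3 : & \theta \mapsto \zeta^7\theta & i \mapsto i & \zeta \mapsto \zeta^5 \\
\sigma^4 : & \theta \mapsto -\theta & i \mapsto i & \zeta \mapsto \zeta \\
\sigma^5 : & \theta \mapsto \zeta^5\theta & i \mapsto i & \zeta \mapsto \zeta^5 \\
\sigma^6 : & \theta \mapsto \zeta^2\theta & i \mapsto i & \zeta \mapsto \zeta \\
\sigma^7 : & \theta \mapsto \zeta^3\theta & i \mapsto i & \zeta \mapsto \zeta^5 \\
\tau\sigma : & \theta \mapsto \zeta^7\theta & i \mapsto -i & \zeta \mapsto \zeta^3 \\
\tau\sigma^2 : & \theta \mapsto \zeta^2\theta & i \mapsto -i & \zeta \mapsto \zeta^7 \\
\tau\sigma^3 : & \theta \mapsto \zeta\theta & i \mapsto -i & \zeta \mapsto \zeta^3 \\
\tau\sigma^4 : & \theta \mapsto -\theta & i \mapsto -i & \zeta \mapsto \zeta^7 \\
\tau\sigma^5 : & \theta \mapsto \zeta^3\theta & i \mapsto -i & \zeta \mapsto \zeta^3 \\
\tau\sigma^6 : & \theta \mapsto \zeta^6\theta & i \mapsto -i & \zeta \mapsto \zeta^7 \\
\tau\sigma^7 : & \theta \mapsto \zeta^5\theta & i \mapsto -i & \zeta \mapsto \zeta^3
\end{array}$$

Since this exhausts the possibilities, these elements (together with 1 and $\tau$) are the Galois
group. We see in particular that $\sigma$ and $\tau$ generate the Galois group. To determine the
relations satisfied by these elements, we observe first that clearly $\tau^2 = 1$ and $(\sigma^4)^2 = 1$,
so that

$$\sigma^8 = \tau^2 = 1.$$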


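Also, we compute

$$\sigma\tau : \begin{cases} \theta \mapsto \zeta\theta \\ i \mapsto -i \end{cases}$$

so that

$$\sigma\tau = \tau\sigma^3.$$

It is not too difficult to show that these relations define the group completely, i.e.,

$$\mathrm{Gal}(\mathbb{Q}(\sqrt[8]{2}, i)/\mathbb{Q}) = \langle \sigma, \tau \mid \sigma^8 = \tau^2 = 1, \ \sigma\tau = \tau\sigma^3 \rangle.$$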
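
The relations can be verified mechanically. In the sketch below, an automorphism is encoded (this encoding is an assumption made for illustration, not notation from the text) as a pair `(a, u)` meaning $\theta \mapsto \zeta^a\theta$ and $\zeta \mapsto \zeta^u$, so that $\sigma$ is `(1, 5)` and $\tau$ is `(0, 7)`; composition then follows from $f(g(\theta)) = f(\zeta^{a_g}\theta) = \zeta^{u_f a_g + a_f}\theta$.

```python
IDENTITY = (0, 1)  # theta -> theta, zeta -> zeta

def compose(f, g):
    """Return f∘g (apply g first). A pair (a, u) means theta -> zeta^a*theta, zeta -> zeta^u."""
    a_f, u_f = f
    a_g, u_g = g
    return ((u_f * a_g + a_f) % 8, (u_f * u_g) % 8)

def power(f, k):
    """Return f composed with itself k times."""
    result = IDENTITY
    for _ in range(k):
        result = compose(f, result)
    return result

def generate(gens):
    """All elements obtained as products of the generators."""
    group, frontier = {IDENTITY}, [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(g, x)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

sigma = (1, 5)   # theta -> zeta*theta, zeta -> zeta^5 (so i = zeta^2 is fixed)
tau = (0, 7)     # theta -> theta,      zeta -> zeta^7 (complex conjugation)

G = generate([sigma, tau])
sigma_orders = [k for k in range(1, 9) if power(sigma, k) == IDENTITY]

print(len(G))                                             # size of the group
print(compose(sigma, tau) == compose(tau, power(sigma, 3)))  # sigma*tau = tau*sigma^3 ?
```

Under this encoding the group generated by the two pairs has 16 elements, $\sigma$ has order 8, $\tau$ has order 2, and $\sigma\tau$ agrees with $\tau\sigma^3$.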

Such a group is called a quasidihedral group (recall that the dihedral group of order 16
would have the relation $\sigma\tau = \tau\sigma^7$ instead of $\sigma\tau = \tau\sigma^3$) and is a subgroup of $S_8$ since the
Galois group is a subgroup of the permutations of the 8 roots of $x^8 - 2$.

This example again illustrates that one must take care in determining Galois groups
from the actions on generators. We first computed the degree of the Galois extension above
to determine the number of elements in the Galois group. Had we proceeded directly from
the original generators $\theta = \sqrt[8]{2}$ and $\zeta = \zeta_8$ we might have (incorrectly) concluded that
there were a total of 32 elements in the Galois group, since the first generator is mapped to
any of 8 possible roots of $x^8 - 2$ and the second generator is mapped to any of 4 possible
roots of its minimal polynomial $\Phi_8(x) = x^4 + 1$. The problem, as previously indicated,
is that these choices are not independent. Here the reason is provided by the algebraic
relation

$$\theta^4 = \sqrt{2} = \zeta + \zeta^7$$

which shows that one cannot specify the images of $\theta$ and $\zeta$ independently; their images
must again satisfy this algebraic relation. This relation is perhaps sufficiently subtle to serve
as a caution against rashly concluding maps are automorphisms. We note that in general it is
necessary to provide justification that maps are automorphisms. This can be accomplished
for example by using the extension theorems or by using degree considerations as we did
here.
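
The count can be illustrated numerically. The sketch below (floating-point, with a tolerance chosen for illustration; the function name `consistent` is hypothetical) runs over all 32 naive choices $\theta \mapsto \zeta^a\theta$, $\zeta \mapsto \zeta^u$ with $u$ odd, and keeps those whose images still satisfy $\theta^4 = \zeta + \zeta^7$.

```python
import cmath

theta = 2 ** (1 / 8)                   # the real 8th root of 2
zeta = cmath.exp(2j * cmath.pi / 8)    # a primitive 8th root of unity

def consistent(a, u, tol=1e-9):
    """Check that the images of theta and zeta satisfy theta^4 = zeta + zeta^7."""
    image_theta = zeta ** a * theta
    image_zeta = zeta ** u
    return abs(image_theta ** 4 - (image_zeta + image_zeta ** 7)) < tol

# 8 choices for the image of theta times 4 choices for the image of zeta.
candidates = [(a, u) for a in range(8) for u in (1, 3, 5, 7)]
valid = [c for c in candidates if consistent(*c)]

print(len(candidates), len(valid))
```

Exactly half of the candidate pairs survive: $a$ must be even when $u \in \{1, 7\}$ and odd when $u \in \{3, 5\}$, which recovers the 16 automorphisms found above.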

Determining the lattice of subgroups of this group $G$ is a straightforward problem.
The lattice is the following:

[Figure: lattice diagram of the subgroups of $G$ (not reproduced; one of its nodes is $\langle \tau\sigma \rangle$).]


Determining the subfields corresponding to these subgroups (which by the Fundamental
Theorem gives all the subfields of $\mathbb{Q}(\sqrt[8]{2}, i)$) is quite simple for a number of the
subgroups above using (2) of the Fundamental Theorem, which states that the degree of
the extension over $\mathbb{Q}$ is equal to the index of the fixing subgroup. It then suffices to find
a subfield of the right degree which is fixed by the subgroup in question. Remember also
that if a subfield is fixed by the generators of a subgroup, then it is fixed by the subgroup.
For example, from the explicit description for the automorphism $\sigma$ we see that $\mathbb{Q}(i)$ is fixed
by the group generated by $\sigma$. Since this is a subgroup of index 2 and $\mathbb{Q}(i)$ is of degree 2
over $\mathbb{Q}$, it must be the full fixed field. Most of the fixed fields for the subgroups above can
be determined in as simple a manner.

For the subgroups of order 4 on the right (namely, generated by $\tau\sigma^3$ and by $\tau\sigma$), it is
perhaps not so easy to see how to determine the corresponding fixed field. For the subgroup
$H$ generated by $\tau\sigma^3$ we may proceed as follows: the element $\theta^2 = \sqrt[4]{2}$ is clearly fixed by
$\sigma^4$. By the diagram above, $\langle \sigma^4 \rangle$ is a normal subgroup of $H$ of index 2 with representatives
$1, \tau\sigma^3$ for the cosets. Consider the element

$$\alpha = (1 + \tau\sigma^3)\theta^2 = \theta^2 + \tau\sigma^3\theta^2.$$

Then $\alpha$ is fixed by $\sigma^4$ (we are in a commutative group $H$ of order 4, so $\sigma^4$ commutes with
1 and $\tau\sigma^3$ and we already know $\theta^2$ is fixed by $\sigma^4$). But (and this is the point), $\alpha$ is also
fixed by $\tau\sigma^3$:

$$\tau\sigma^3\alpha = \tau\sigma^3(1 + \tau\sigma^3)\theta^2 = [\tau\sigma^3 + (\tau\sigma^3)^2]\theta^2 = (\tau\sigma^3 + \sigma^4)\theta^2$$

and the last expression is just $\alpha$ since $\sigma^4\theta^2 = \theta^2$. Hence $\alpha$ is an element of the fixed field
for $H$. Explicitly

$$\alpha = \sqrt[4]{2} + i\sqrt[4]{2} = (1 + i)\sqrt[4]{2}.$$

A quick check shows that $\alpha$ is not fixed by the automorphism $\sigma^2$, so by the diagram of
subgroups above, it follows that the fixing subgroup for the field $\mathbb{Q}(\alpha)$ is no larger than $H$,
hence is precisely $H$, which gives us our fixed field. This also gives the fixed field for $\langle \tau\sigma \rangle$
by recalling that in general if $E$ is the fixed field of $H$ then the fixed field of $\tau H \tau^{-1}$ is the
field $\tau(E)$. For $H = \langle \tau\sigma^3 \rangle$, $\tau H \tau^{-1} = \langle \tau\sigma \rangle$, with fixed field generated by $\tau(\alpha) = (1 - i)\sqrt[4]{2}$.

In general one tries to determine elements which are fixed by a given subgroup $H$
of the Galois group (cf. the exercises, which indicate where the element above arose) and
attempts to generate a sufficiently large field to give the full fixed field. In our case we
were able to accomplish this with a single generator. We shall see later that every finite
extension of $\mathbb{Q}$ is a simple extension, so there will be a single generator of this type, but in
general it may be difficult to produce it directly.

The element $\alpha$ is a root of the polynomial

$$x^4 + 8$$

which must therefore be irreducible since we have already determined that a root of this
polynomial generates an extension of degree 4 over $\mathbb{Q}$.
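
These claims about $\alpha = (1 + i)\sqrt[4]{2}$ are easy to spot-check with complex floating-point arithmetic. The sketch below assumes the actions $\tau\sigma^3 : \theta \mapsto \zeta\theta,\ i \mapsto -i$ and $\sigma^2 : \theta \mapsto \zeta^6\theta,\ i \mapsto i$ on $\theta = \sqrt[8]{2}$; the helper `apply` is illustrative only and simply substitutes the images of $\theta$ and $i$ into $\alpha = \theta^2 + i\theta^2$.

```python
import cmath

theta = 2 ** (1 / 8)                  # real 8th root of 2
zeta = cmath.exp(1j * cmath.pi / 4)   # primitive 8th root of unity

def apply(image_theta, image_i):
    """Image of alpha = theta^2 + i*theta^2 under theta -> image_theta, i -> image_i."""
    return image_theta ** 2 + image_i * image_theta ** 2

alpha = apply(theta, 1j)                          # identity: (1 + i) * 2**(1/4)
under_tau_sigma3 = apply(zeta * theta, -1j)       # tau*sigma^3
under_sigma2 = apply(zeta ** 6 * theta, 1j)       # sigma^2

print(abs(alpha ** 4 + 8) < 1e-9)             # alpha is a root of x^4 + 8
print(abs(under_tau_sigma3 - alpha) < 1e-9)   # fixed by tau*sigma^3
print(abs(under_sigma2 + alpha) < 1e-9)       # sigma^2 sends alpha to -alpha
```

So $\alpha^4 = -8$, the generator $\tau\sigma^3$ of $H$ fixes $\alpha$, and $\sigma^2$ sends $\alpha$ to $-\alpha$, consistent with the fixing group of $\mathbb{Q}(\alpha)$ being exactly $H$.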

In a similar way it is possible to complete the diagram of subfields of $\mathbb{Q}(\sqrt[8]{2}, i)$, which
we have inverted to emphasize its relation with the subgroup diagram above ($\theta = \sqrt[8]{2}$):

[Figure: inverted lattice diagram of the subfields of $\mathbb{Q}(\sqrt[8]{2}, i)$ (not reproduced).]

Note that the group $\langle \sigma^4 \rangle$ is normal in $G$ (in fact it is the center of $G$) with quotient
$G/\langle \sigma^4 \rangle \cong D_8$, so the corresponding fixed field $\mathbb{Q}(i, \sqrt[4]{2})$ is Galois over $\mathbb{Q}$ with $D_8$ as
Galois group. Being Galois it is a splitting field, evidently the splitting field for $x^4 - 2$.
The lattice of subfields for this field is then immediate from the lattice above.

We end this example with the following amusing aspect of this Galois extension. It is
an easy exercise to verify that

$$\langle \sigma^2, \tau \rangle \cong D_8 \qquad \langle \sigma \rangle \cong \mathbb{Z}/8\mathbb{Z} \qquad \langle \sigma^2, \tau\sigma^3 \rangle \cong Q_8$$

where $D_8$ is the dihedral group of order 8 and $Q_8$ is the quaternion group of order 8. It
follows that the field $\mathbb{Q}(\sqrt[8]{2}, i)$ is Galois of degree 8 over its three quadratic subfields

$$\mathbb{Q}(\sqrt{2}) \qquad \mathbb{Q}(i) \qquad \mathbb{Q}(\sqrt{-2})$$

with dihedral, cyclic and quaternion Galois groups, respectively, so that three of the 5
possible groups of order 8 (and both non-abelian ones) appear as Galois groups in this
extension.
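
One way to make the "easy exercise" concrete is to compare the multisets of element orders, which distinguish the five groups of order 8. The sketch below uses a hypothetical encoding (not from the text): a pair `(a, u)` stands for the automorphism $\theta \mapsto \zeta^a\theta$, $\zeta \mapsto \zeta^u$, with $\sigma$ = `(1, 5)` and $\tau$ = `(0, 7)`.

```python
IDENTITY = (0, 1)

def compose(f, g):
    """f∘g for pairs (a, u) meaning theta -> zeta^a*theta, zeta -> zeta^u."""
    a_f, u_f = f
    a_g, u_g = g
    return ((u_f * a_g + a_f) % 8, (u_f * u_g) % 8)

def generate(gens):
    """Subgroup generated by gens."""
    group, frontier = {IDENTITY}, [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(g, x)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return group

def order(f):
    """Order of the element f."""
    k, x = 1, f
    while x != IDENTITY:
        x = compose(f, x)
        k += 1
    return k

def order_profile(gens):
    """Sorted list of element orders in the subgroup generated by gens."""
    return sorted(order(h) for h in generate(gens))

sigma, tau = (1, 5), (0, 7)
sigma2 = compose(sigma, sigma)
tau_sigma3 = compose(tau, compose(sigma, sigma2))

dihedral = order_profile([sigma2, tau])           # five elements of order 2
cyclic = order_profile([sigma])                   # contains elements of order 8
quaternion = order_profile([sigma2, tau_sigma3])  # a unique element of order 2

print(dihedral, cyclic, quaternion, sep="\n")
```

A group of order 8 with five involutions and no element of order 8 is dihedral, one with an element of order 8 is cyclic, and one with a single involution and six elements of order 4 is the quaternion group, so the three profiles match the three isomorphism types claimed.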

We shall consider additional examples and applications in the following sections.

EXERCISES 

1. Determine the minimal polynomial over $\mathbb{Q}$ for the element $\sqrt{2} + \sqrt{5}$.

2. Determine the minimal polynomial over $\mathbb{Q}$ for the element $1 + \sqrt[3]{2} + \sqrt[3]{4}$.

3. Determine the Galois group of $(x^2 - 2)(x^2 - 3)(x^2 - 5)$. Determine all the subfields of
the splitting field of this polynomial.

4. Let $p$ be a prime. Determine the elements of the Galois group of $x^p - 2$.

5. Prove that the Galois group of $x^p - 2$ for $p$ a prime is isomorphic to the group of matrices
$\begin{pmatrix} a & b \\ 0 & 1 \end{pmatrix}$ where $a, b \in \mathbb{F}_p$, $a \ne 0$.

6. Let $K = \mathbb{Q}(\sqrt[8]{2}, i)$ and let $F_1 = \mathbb{Q}(i)$, $F_2 = \mathbb{Q}(\sqrt{2})$, $F_3 = \mathbb{Q}(\sqrt{-2})$. Prove that
$\mathrm{Gal}(K/F_1) \cong Z_8$, $\mathrm{Gal}(K/F_2) \cong D_8$, $\mathrm{Gal}(K/F_3) \cong Q_8$.

7. Determine all the subfields of the splitting field of $x^8 - 2$ which are Galois over $\mathbb{Q}$.

8. Suppose $K$ is a Galois extension of $F$ of degree $p^n$ for some prime $p$ and some $n \ge 1$.
Show there are Galois extensions of $F$ contained in $K$ of degrees $p$ and $p^{n-1}$.

9. Give an example of fields $F_1, F_2, F_3$ with $\mathbb{Q} \subset F_1 \subset F_2 \subset F_3$, $[F_3 : \mathbb{Q}] = 8$ and each
field is Galois over all its subfields with the exception that $F_2$ is not Galois over $\mathbb{Q}$.

10. Determine the Galois group of the splitting field over $\mathbb{Q}$ of $x^8 - 3$.

11. Suppose $f(x) \in \mathbb{Z}[x]$ is an irreducible quartic whose splitting field has Galois group $S_4$
over $\mathbb{Q}$ (there are many such quartics, cf. Section 6). Let $\theta$ be a root of $f(x)$ and set
$K = \mathbb{Q}(\theta)$. Prove that $K$ is an extension of $\mathbb{Q}$ of degree 4 which has no proper subfields.
Are there any Galois extensions of $\mathbb{Q}$ of degree 4 with no proper subfields?

12. Determine the Galois group of the splitting field over $\mathbb{Q}$ of $x^4 - 14x^2 + 9$.

13. Prove that if the Galois group of the splitting field of a cubic over $\mathbb{Q}$ is the cyclic group of
order 3 then all the roots of the cubic are real.

14. Show that $\mathbb{Q}(\sqrt{2 + \sqrt{2}})$ is a cyclic quartic field, i.e., is a Galois extension of degree 4 with
cyclic Galois group.

15. (Biquadratic Extensions) Let $F$ be a field of characteristic $\ne 2$.

(a) If $K = F(\sqrt{D_1}, \sqrt{D_2})$ where $D_1, D_2 \in F$ have the property that none of $D_1$, $D_2$
or $D_1D_2$ is a square in $F$, prove that $K/F$ is a Galois extension with $\mathrm{Gal}(K/F)$
isomorphic to the Klein 4-group.

(b) Conversely, suppose $K/F$ is a Galois extension with $\mathrm{Gal}(K/F)$ isomorphic to the
Klein 4-group. Prove that $K = F(\sqrt{D_1}, \sqrt{D_2})$ where $D_1, D_2 \in F$ have the property
that none of $D_1$, $D_2$ or $D_1D_2$ is a square in $F$.

16. (a) Prove that $x^4 - 2x^2 - 2$ is irreducible over $\mathbb{Q}$.

(b) Show the roots of this quartic are

$$\alpha_1 = \sqrt{1 + \sqrt{3}} \qquad \alpha_3 = -\sqrt{1 + \sqrt{3}}$$
$$\alpha_2 = \sqrt{1 - \sqrt{3}} \qquad \alpha_4 = -\sqrt{1 - \sqrt{3}}.$$

(c) Let $K_1 = \mathbb{Q}(\alpha_1)$ and $K_2 = \mathbb{Q}(\alpha_2)$. Show that $K_1 \ne K_2$, and $K_1 \cap K_2 = \mathbb{Q}(\sqrt{3}) = F$.

(d) Prove that $K_1$, $K_2$ and $K_1K_2$ are Galois over $F$ with $\mathrm{Gal}(K_1K_2/F)$ the Klein 4-group.
Write out the elements of $\mathrm{Gal}(K_1K_2/F)$ explicitly. Determine all the subgroups of
the Galois group and give their corresponding fixed subfields of $K_1K_2$ containing $F$.

(e) Prove that the splitting field of $x^4 - 2x^2 - 2$ over $\mathbb{Q}$ is of degree 8 with dihedral Galois
group.

The following two exercises indicate one method for constructing elements in subfields of a
given field and are quite useful in many computations.

17. Let $K/F$ be any finite extension and let $\alpha \in K$. Let $L$ be a Galois extension of $F$ containing
$K$ and let $H \le \mathrm{Gal}(L/F)$ be the subgroup corresponding to $K$. Define the norm of $\alpha$ from
$K$ to $F$ to be

$$N_{K/F}(\alpha) = \prod_{\sigma} \sigma(\alpha),$$

where the product is taken over all the embeddings of $K$ into an algebraic closure of $F$
(so over a set of coset representatives for $H$ in $\mathrm{Gal}(L/F)$ by the Fundamental Theorem
of Galois Theory). This is a product of Galois conjugates of $\alpha$. In particular, if $K/F$ is
Galois this is $\prod_{\sigma \in \mathrm{Gal}(K/F)} \sigma(\alpha)$.

(a) Prove that $N_{K/F}(\alpha) \in F$.

(b) Prove that $N_{K/F}(\alpha\beta) = N_{K/F}(\alpha)N_{K/F}(\beta)$, so that the norm is a multiplicative map
from $K$ to $F$.

(c) Let $K = F(\sqrt{D})$ be a quadratic extension of $F$. Show that $N_{K/F}(a + b\sqrt{D}) =
a^2 - Db^2$.

(d) Let $m_\alpha(x) = x^d + a_{d-1}x^{d-1} + \cdots + a_1x + a_0 \in F[x]$ be the minimal polynomial
for $\alpha \in K$ over $F$. Let $n = [K : F]$. Prove that $d$ divides $n$, that there are $d$ distinct
Galois conjugates of $\alpha$ which are all repeated $n/d$ times in the product above and
conclude that $N_{K/F}(\alpha) = (-1)^n a_0^{n/d}$.

18. With notation as in the previous problem, define the trace of $\alpha$ from $K$ to $F$ to be

$$\mathrm{Tr}_{K/F}(\alpha) = \sum_{\sigma} \sigma(\alpha),$$

a sum of Galois conjugates of $\alpha$.

(a) Prove that $\mathrm{Tr}_{K/F}(\alpha) \in F$.

(b) Prove that $\mathrm{Tr}_{K/F}(\alpha + \beta) = \mathrm{Tr}_{K/F}(\alpha) + \mathrm{Tr}_{K/F}(\beta)$, so that the trace is an additive map
from $K$ to $F$.

(c) Let $K = F(\sqrt{D})$ be a quadratic extension of $F$. Show that $\mathrm{Tr}_{K/F}(a + b\sqrt{D}) = 2a$.

(d) Let $m_\alpha(x)$ be as in the previous problem. Prove that $\mathrm{Tr}_{K/F}(\alpha) = -\dfrac{n}{d}\,a_{d-1}$.

19. With notation as in the previous problems show that $N_{K/F}(a\alpha) = a^n N_{K/F}(\alpha)$ and
$\mathrm{Tr}_{K/F}(a\alpha) = a\,\mathrm{Tr}_{K/F}(\alpha)$ for all $a$ in the base field $F$. In particular show that $N_{K/F}(a) =
a^n$ and $\mathrm{Tr}_{K/F}(a) = na$ for all $a \in F$.

20. With notation as in the previous problems show more generally that $\prod_{\sigma}(x - \sigma(\alpha)) =
(m_\alpha(x))^{n/d}$.


21. Use the linear independence of characters to show that for any Galois extension $K$ of $F$
there is an element $\alpha \in K$ with $\mathrm{Tr}_{K/F}(\alpha) \ne 0$.

22. Suppose $K/F$ is a Galois extension and let $\sigma$ be an element of the Galois group.

(a) Suppose $\alpha \in K$ is of the form $\alpha = \dfrac{\beta}{\sigma\beta}$ for some nonzero $\beta \in K$. Prove that
$N_{K/F}(\alpha) = 1$.

(b) Suppose $\alpha \in K$ is of the form $\alpha = \beta - \sigma\beta$ for some $\beta \in K$. Prove that $\mathrm{Tr}_{K/F}(\alpha) = 0$.


The next exercise and Exercise 26 following establish the multiplicative and additive forms of 
Hilbert’s Theorem 90. These are instances of the vanishing of a first cohomology group, as 
will be discussed in Section 17.3. 


23. (Hilbert's Theorem 90) Let $K$ be a Galois extension of $F$ with cyclic Galois group of order
$n$ generated by $\sigma$. Suppose $\alpha \in K$ has $N_{K/F}(\alpha) = 1$. Prove that $\alpha$ is of the form $\alpha = \dfrac{\beta}{\sigma\beta}$
for some nonzero $\beta \in K$. [By the linear independence of characters show there exists
some $\theta \in K$ such that

$$\beta = \theta + \alpha\sigma(\theta) + (\alpha\,\sigma\alpha)\sigma^2(\theta) + \cdots + (\alpha\,\sigma\alpha \cdots \sigma^{n-2}\alpha)\sigma^{n-1}(\theta)$$

is nonzero. Compute $\dfrac{\beta}{\sigma\beta}$ using the fact that $\alpha$ has norm 1 to $F$.]

24. Prove that the rational solutions $a, b \in \mathbb{Q}$ of Pythagoras' equation $a^2 + b^2 = 1$ are of
the form $a = \dfrac{s^2 - t^2}{s^2 + t^2}$ and $b = \dfrac{2st}{s^2 + t^2}$ for some $s, t \in \mathbb{Q}$ and hence show that any right
triangle with integer sides has sides of lengths $(m^2 - n^2, 2mn, m^2 + n^2)$ for some integers
$m, n$. [Note that $a^2 + b^2 = 1$ is equivalent to $N_{\mathbb{Q}(i)/\mathbb{Q}}(a + ib) = 1$, then use Hilbert's
Theorem 90 above with $\beta = s + it$.]

25. Generalize the previous problem to determine all the rational solutions of the equation
$a^2 + Db^2 = 1$ for $D \in \mathbb{Z}$, $D > 0$, $D$ not a perfect square in $\mathbb{Z}$.

26. (Additive Hilbert's Theorem 90) Let $K$ be a Galois extension of $F$ with cyclic Galois group
of order $n$ generated by $\sigma$. Suppose $\alpha \in K$ has $\mathrm{Tr}_{K/F}(\alpha) = 0$. Prove that $\alpha$ is of the form
$\alpha = \beta - \sigma\beta$ for some $\beta \in K$. [Let $\theta \in K$ be an element with $\mathrm{Tr}_{K/F}(\theta) \ne 0$ by a previous
exercise, let

$$\beta = \frac{1}{\mathrm{Tr}_{K/F}(\theta)}\left[\alpha\sigma(\theta) + (\alpha + \sigma\alpha)\sigma^2(\theta) + \cdots + (\alpha + \sigma\alpha + \cdots + \sigma^{n-2}\alpha)\sigma^{n-1}(\theta)\right]$$

and compute $\beta - \sigma\beta$.]

27. Let $\alpha = \sqrt{(2 + \sqrt{2})(3 + \sqrt{3})}$ (positive real square roots for concreteness) and consider
the extension $E = \mathbb{Q}(\alpha)$.

(a) Show that $a = (2 + \sqrt{2})(3 + \sqrt{3})$ is not a square in $F = \mathbb{Q}(\sqrt{2}, \sqrt{3})$. [If $a = c^2$,
$c \in F$, then $a\varphi(a) = (2 + \sqrt{2})^2(6) = (c\,\varphi c)^2$ for the automorphism $\varphi \in \mathrm{Gal}(F/\mathbb{Q})$
fixing $\mathbb{Q}(\sqrt{2})$. Since $c\,\varphi c = N_{F/\mathbb{Q}(\sqrt{2})}(c) \in \mathbb{Q}(\sqrt{2})$ conclude that this implies
$\sqrt{6} \in \mathbb{Q}(\sqrt{2})$, a contradiction.]

(b) Conclude from (a) that $[E : \mathbb{Q}] = 8$. Prove that the roots of the minimal polynomial
over $\mathbb{Q}$ for $\alpha$ are the 8 elements $\pm\sqrt{(2 \pm \sqrt{2})(3 \pm \sqrt{3})}$.

(c) Let $\beta = \sqrt{(2 - \sqrt{2})(3 + \sqrt{3})}$. Show that $\alpha\beta = \sqrt{2}(3 + \sqrt{3}) \in F$ so that $\beta \in E$.
Show similarly that the other roots are also elements of $E$ so that $E$ is a Galois
extension of $\mathbb{Q}$. Show that the elements of the Galois group are precisely the maps
determined by mapping $\alpha$ to one of the eight elements in (b).

(d) Let $\sigma \in \mathrm{Gal}(E/\mathbb{Q})$ be the automorphism which maps $\alpha$ to $\beta$. Show that since
$\sigma(\alpha^2) = \beta^2$ that $\sigma(\sqrt{2}) = -\sqrt{2}$ and $\sigma(\sqrt{3}) = \sqrt{3}$. From $\alpha\beta = \sqrt{2}(3 + \sqrt{3})$
conclude that $\sigma(\alpha\beta) = -\alpha\beta$ and hence $\sigma(\beta) = -\alpha$. Show that $\sigma$ is an element of
order 4 in $\mathrm{Gal}(E/\mathbb{Q})$.

(e) Show similarly that the map $\tau$ defined by $\tau(\alpha) = \sqrt{(2 + \sqrt{2})(3 - \sqrt{3})}$ is an element
of order 4 in $\mathrm{Gal}(E/\mathbb{Q})$. Prove that $\sigma$ and $\tau$ generate the Galois group, $\sigma^4 = \tau^4 = 1$,
$\sigma^2 = \tau^2$ and that $\sigma\tau = \tau\sigma^3$.

(f) Conclude that $\mathrm{Gal}(E/\mathbb{Q}) \cong Q_8$, the quaternion group of order 8.

28. Let $f(x) \in F[x]$ be an irreducible polynomial of degree $n$ over the field $F$, let $L$ be the
splitting field of $f(x)$ over $F$ and let $\alpha$ be a root of $f(x)$ in $L$. If $K$ is any Galois extension
of $F$ contained in $L$, show that the polynomial $f(x)$ splits into a product of $m$ irreducible
polynomials each of degree $d$ over $K$, where $m = [F(\alpha) \cap K : F]$ and $d = [K(\alpha) : K]$
(cf. also the generalization in Exercise 4 of Section 4). [If $H$ is the subgroup of the Galois
group of $L$ over $F$ corresponding to $K$ then the factors of $f(x)$ over $K$ correspond to the
orbits of $H$ on the roots of $f(x)$. Then use Exercise 9 of Section 4.1.]
29. Let $k$ be a field and let $k(t)$ be the field of rational functions in the variable $t$. Define the
maps $\sigma$ and $\tau$ of $k(t)$ to itself by $\sigma f(t) = f\!\left(\frac{1}{1-t}\right)$ and $\tau f(t) = f\!\left(\frac{1}{t}\right)$ for $f(t) \in k(t)$.

(a) Prove that $\sigma$ and $\tau$ are automorphisms of $k(t)$ (cf. Exercise 8 of Section 1) and that
the group $G = \langle \sigma, \tau \rangle$ they generate is isomorphic to $S_3$.

(b) Prove that the element $\mathbf{t} = \dfrac{(t^2 - t + 1)^3}{t^2(t - 1)^2}$ is fixed by all the elements of $G$.

(c) Prove that $k(\mathbf{t})$ is precisely the fixed field of $G$ in $k(t)$ [compute the degree of the
extension].


30. Prove that the fixed field of the subgroup of automorphisms generated by $\tau$ in the previous
problem is $k(t + \frac{1}{t})$. Prove that the fixed field of the subgroup generated by the
automorphism $\tau\sigma^2$ (which maps $t$ to $1 - t$) is $k(t(1 - t))$. Determine the fixed field of the subgroup
generated by $\tau\sigma$ and the fixed field of the subgroup generated by $\sigma$.

31. Let $K$ be a finite extension of $F$ of degree $n$. Let $\alpha$ be an element of $K$.

(a) Prove that $\alpha$ acting by left multiplication on $K$ is an $F$-linear transformation $T_\alpha$ of $K$.

(b) Prove that the minimal polynomial for $\alpha$ over $F$ is the same as the minimal polynomial
for the linear transformation $T_\alpha$.

(c) Prove that the trace $\mathrm{Tr}_{K/F}(\alpha)$ is the trace of the $n \times n$ matrix defined by $T_\alpha$ (which
justifies these two uses of the same word "trace"). Prove that the norm $N_{K/F}(\alpha)$ is
the determinant of $T_\alpha$.


14.3 FINITE FIELDS

A finite field $F$ has characteristic $p$ for some prime $p$ so is a finite dimensional vector
space over $\mathbb{F}_p$. If the dimension is $n$, i.e., $[F : \mathbb{F}_p] = n$, then $F$ has precisely $p^n$ elements.
We have already seen (following Proposition 13.37) that $F$ is then isomorphic to the
splitting field of the polynomial $x^{p^n} - x$, hence is unique up to isomorphism. We denote
the finite field of order $p^n$ by $\mathbb{F}_{p^n}$.

The field $\mathbb{F}_{p^n}$ is Galois over $\mathbb{F}_p$, with cyclic Galois group of order $n$ generated by
the Frobenius automorphism

\[ \mathrm{Gal}(\mathbb{F}_{p^n}/\mathbb{F}_p) = \langle \sigma_p \rangle \cong \mathbb{Z}/n\mathbb{Z} \]

where

\[ \sigma_p : \mathbb{F}_{p^n} \to \mathbb{F}_{p^n}, \qquad \alpha \mapsto \alpha^p \]

(Example 7 following Corollary 6). By the Fundamental Theorem, every subfield of $\mathbb{F}_{p^n}$
corresponds to a subgroup of $\mathbb{Z}/n\mathbb{Z}$. Hence for every divisor $d$ of $n$ there is precisely
one subfield of $\mathbb{F}_{p^n}$ of degree $d$ over $\mathbb{F}_p$, namely the fixed field of the subgroup generated
by $\sigma_p^d$ of order $n/d$, and there are no other subfields. This field is isomorphic to $\mathbb{F}_{p^d}$,
the unique finite field of order $p^d$.

Since the Galois group is abelian, every subgroup is normal, so each of the subfields
$\mathbb{F}_{p^d}$ ($d$ a divisor of $n$) is Galois over $\mathbb{F}_p$ (which is also clear from the fact that these
are themselves splitting fields). Further, the Galois group $\mathrm{Gal}(\mathbb{F}_{p^d}/\mathbb{F}_p)$ is generated by
the image of $\sigma_p$ in the quotient group $\mathrm{Gal}(\mathbb{F}_{p^n}/\mathbb{F}_p)/\langle \sigma_p^d \rangle$. If we denote this element
again by $\sigma_p$, we recover the Frobenius automorphism for the extension $\mathbb{F}_{p^d}/\mathbb{F}_p$. (Note,
however, that $\sigma_p$ has order $n$ in $\mathrm{Gal}(\mathbb{F}_{p^n}/\mathbb{F}_p)$ and order $d$ in $\mathrm{Gal}(\mathbb{F}_{p^d}/\mathbb{F}_p)$.)

We summarize this in the following proposition. 

Proposition 15. Any finite field is isomorphic to $\mathbb{F}_{p^n}$ for some prime $p$ and some integer
$n \ge 1$. The field $\mathbb{F}_{p^n}$ is the splitting field over $\mathbb{F}_p$ of the polynomial $x^{p^n} - x$, with cyclic
Galois group of order $n$ generated by the Frobenius automorphism $\sigma_p$. The subfields
of $\mathbb{F}_{p^n}$ are all Galois over $\mathbb{F}_p$ and are in one to one correspondence with the divisors $d$
of $n$. They are the fields $\mathbb{F}_{p^d}$, the fixed fields of $\sigma_p^d$.
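
The subfield correspondence in Proposition 15 can be checked by hand in a small case. The following Python sketch is only an illustration, under the assumption that $\mathbb{F}_{16}$ is modeled as $\mathbb{F}_2[x]/(x^4 + x + 1)$ with elements stored as 4-bit integers (bit $i$ is the coefficient of $x^i$); the names `MOD`, `mul`, `frobenius_power` and `fixed_field_size` are hypothetical helpers, not notation from the text. It counts the elements fixed by $\sigma_2^d$, which should be the $2^d$ elements of $\mathbb{F}_{2^d}$ when $d$ divides 4.

```python
# Model F_16 as F_2[x]/(x^4 + x + 1); bit i of an integer is the coefficient of x^i.
MOD = 0b10011  # x^4 + x + 1 (assumed irreducible over F_2)

def mul(a, b):
    """Multiply two elements of F_16 (shift-and-add with reduction)."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        b >>= 1
        a <<= 1
        if a & 0b10000:      # degree reached 4: subtract the modulus
            a ^= MOD
    return r

def frobenius_power(a, d):
    """Apply the Frobenius map a -> a^2 a total of d times."""
    for _ in range(d):
        a = mul(a, a)
    return a

def fixed_field_size(d):
    """Number of elements of F_16 fixed by the d-th power of Frobenius."""
    return sum(1 for a in range(16) if frobenius_power(a, d) == a)

for d in (1, 2, 4):
    print(d, fixed_field_size(d))
```

The divisors $d = 1, 2, 4$ of $n = 4$ correspond to the subfields $\mathbb{F}_2$, $\mathbb{F}_4$ and $\mathbb{F}_{16}$ respectively.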

The corresponding statements for the finite extensions of any finite field are easy 
consequences of Proposition 15 and are outlined in the exercises. 

As an elementary application we have the following result on the polynomial $x^4 + 1$
in $\mathbb{Z}[x]$.

Corollary 16. The irreducible polynomial $x^4 + 1 \in \mathbb{Z}[x]$ is reducible modulo every
prime $p$.

Proof: Consider the polynomial $x^4 + 1$ over $\mathbb{F}_p[x]$ for the prime $p$. If $p = 2$ we
have $x^4 + 1 = (x + 1)^4$ and the polynomial is reducible. Assume now that $p$ is odd.
Then $p^2 - 1$ is divisible by 8 since $p$ is congruent mod 8 to 1, 3, 5 or 7 and all of
these square to 1 mod 8. Hence $x^{p^2 - 1} - 1$ is divisible by $x^8 - 1$. Then we have the
divisibilities

\[ x^4 + 1 \ \big|\ x^8 - 1 \ \big|\ x^{p^2 - 1} - 1 \ \big|\ x^{p^2} - x \]

which shows that all the roots of $x^4 + 1$ are roots of $x^{p^2} - x$. (Equivalently, these roots
are fixed by the square of the Frobenius automorphism $\sigma_p^2$.) Since the roots of $x^{p^2} - x$
are the field $\mathbb{F}_{p^2}$, it follows that the extension generated by any root of $x^4 + 1$ is at most
of degree 2 over $\mathbb{F}_p$, which means that $x^4 + 1$ cannot be irreducible over $\mathbb{F}_p$.
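
Since every root of $x^4 + 1$ has degree at most 2 over $\mathbb{F}_p$, the polynomial must split as a product of two monic quadratics (possibly reducible themselves) modulo each prime. The brute-force Python sketch below illustrates this for a few small primes; the function name `factor_x4_plus_1` is a hypothetical helper, and the search is only practical for small $p$.

```python
from itertools import product

def factor_x4_plus_1(p):
    """Search for (x^2 + a x + b)(x^2 + c x + d) = x^4 + 1 over F_p.

    Returns ((a, b), (c, d)) for the first factorization found, or None.
    """
    for a, b, c, d in product(range(p), repeat=4):
        # Coefficients of x^3, x^2, x^1, x^0 in the product of the two quadratics.
        coeffs = ((a + c) % p, (b + d + a * c) % p, (a * d + b * c) % p, (b * d) % p)
        if coeffs == (0, 0, 0, 1):
            return (a, b), (c, d)
    return None

for p in [2, 3, 5, 7, 11, 13]:
    print(p, factor_x4_plus_1(p))
```

For $p = 2$ the search recovers $(x^2 + 1)^2 = (x + 1)^4$, in agreement with the first step of the proof.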

The multiplicative group $\mathbb{F}_{p^n}^\times$ is obviously a finite subgroup of the multiplicative
group of a field. By Proposition 9.18, this is a cyclic group. If $\theta$ is any generator, then
clearly $\mathbb{F}_{p^n} = \mathbb{F}_p(\theta)$. This proves the following result.

Proposition 17. The finite field $\mathbb{F}_{p^n}$ is simple. In particular, there exists an irreducible
polynomial of degree $n$ over $\mathbb{F}_p$ for every $n \ge 1$.

We have described the finite fields $\mathbb{F}_{p^n}$ above as the splitting fields of the polynomials
$x^{p^n} - x$. By the previous proposition, this field can also be described as a quotient of
$\mathbb{F}_p[x]$, namely by the minimal polynomial for $\theta$. Since $\theta$ is necessarily a root of $x^{p^n} - x$,
we see that the minimal polynomial for $\theta$ is a divisor of $x^{p^n} - x$ of degree $n$.

Conversely, let $p(x)$ be any irreducible polynomial of degree $d$, say, dividing
$x^{p^n} - x$. If $\alpha$ is a root of $p(x)$, then the extension $\mathbb{F}_p(\alpha)$ is a subfield of $\mathbb{F}_{p^n}$ of degree
$d$. Hence $d$ is a divisor of $n$ and the extension is Galois by Proposition 15 (in fact,
it is the extension $\mathbb{F}_{p^d}$) so in particular all the roots of $p(x)$ are contained in $\mathbb{F}_p(\alpha)$.

The elements of $\mathbb{F}_{p^n}$ are precisely the roots of $x^{p^n} - x$. If we group together the
factors $x - \alpha$ of this polynomial according to the degree $d$ of their minimal polynomials
over $\mathbb{F}_p$, we obtain

Proposition 18. The polynomial $x^{p^n} - x$ is precisely the product of all the distinct
irreducible polynomials in $\mathbb{F}_p[x]$ of degree $d$ where $d$ runs through all divisors of $n$.

This proposition can be used to produce irreducible polynomials over $\mathbb{F}_p$ recursively.
For example, the irreducible quadratics over $\mathbb{F}_2$ are the divisors of

\[ \frac{x^4 - x}{x(x - 1)} = x^2 + x + 1 \]

which gives the single polynomial $x^2 + x + 1$. Similarly, the irreducible cubics over
this field are the divisors of

\[ \frac{x^8 - x}{x(x - 1)} = x^6 + x^5 + x^4 + x^3 + x^2 + x + 1 \]

which factors into the two cubics $x^3 + x + 1$ and $x^3 + x^2 + 1$. The irreducible quartics
are given by dividing $x^{16} - x$ by $x(x - 1)$ and the irreducible quadratic $x^2 + x + 1$
above and then factoring into irreducible quartics:

\[ \frac{x^{16} - x}{x(x - 1)(x^2 + x + 1)} = (x^4 + x^3 + x^2 + x + 1)(x^4 + x^3 + 1)(x^4 + x + 1). \]

This gives a method for determining the product of all the irreducible polynomials
over $\mathbb{F}_p$ of a given degree. There exist efficient algorithms for factorization of polynomials
mod $p$ which will give the individual irreducible polynomials (cf. the exercises)
in practice. The importance of having irreducible polynomials at hand is that they give
a representation of the finite fields $\mathbb{F}_{p^n}$ (as quotients $\mathbb{F}_p[x]/(f(x))$ for $f(x)$ irreducible
of degree $n$) conducive to explicit computations.
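
The lists of irreducible quadratics, cubics and quartics over $\mathbb{F}_2$ above can be reproduced by a short trial-division search. The Python sketch below is an illustration only: polynomials over $\mathbb{F}_2$ are encoded as integers (bit $i$ is the coefficient of $x^i$), and the helper names `gf2_mod`, `gf2_mul` and `irreducibles_gf2` are hypothetical. It also multiplies together the irreducibles of degrees 1, 2 and 4 to compare with $x^{16} - x$, as in Proposition 18.

```python
def gf2_mod(a, m):
    """Remainder of a modulo m in F_2[x] (polynomials encoded as bit masks)."""
    dm = m.bit_length() - 1
    while a and a.bit_length() - 1 >= dm:
        a ^= m << (a.bit_length() - 1 - dm)
    return a

def gf2_mul(a, b):
    """Carry-less product of a and b in F_2[x]."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        b >>= 1
    return r

def irreducibles_gf2(max_degree):
    """Return {degree: [irreducible polynomials]} over F_2 by trial division."""
    found = {}
    for deg in range(1, max_degree + 1):
        found[deg] = []
        for f in range(1 << deg, 1 << (deg + 1)):
            # f is irreducible iff no irreducible of degree <= deg/2 divides it.
            if all(gf2_mod(f, g) != 0
                   for d in range(1, deg // 2 + 1) for g in found[d]):
                found[deg].append(f)
    return found

irr = irreducibles_gf2(4)
# Product of all irreducibles of degree d for d dividing 4 (d = 1, 2, 4).
prod = 1
for d in (1, 2, 4):
    for f in irr[d]:
        prod = gf2_mul(prod, f)
print({d: [bin(f) for f in fs] for d, fs in irr.items()}, bin(prod))
```

In this encoding `0b111` is $x^2 + x + 1$, `0b1011` and `0b1101` are the two cubics, and `0b10011`, `0b11001`, `0b11111` are the three quartics.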

Note also that since the finite field $\mathbb{F}_{p^n}$ is unique up to isomorphism, the quotients
of $\mathbb{F}_p[x]$ by any of the irreducible polynomials of degree $n$ are all isomorphic. If
$f_1(x)$ and $f_2(x)$ are irreducible of degree $n$, then $f_2(x)$ splits completely in the field
$\mathbb{F}_{p^n} \cong \mathbb{F}_p[x]/(f_1(x))$. If we denote a root of $f_2(x)$ by $\alpha(x)$ (to emphasize that it is a
polynomial of degree $< n$ in $x$ in $\mathbb{F}_p[x]/(f_1(x))$), then the isomorphism is given by

\[ \mathbb{F}_p[x]/(f_2(x)) \cong \mathbb{F}_p[x]/(f_1(x)), \qquad x \mapsto \alpha(x) \]

(we have mapped a root of $f_2(x)$ in the first field to a root of $f_2(x)$ in the second field).
For example, if $f_1(x) = x^4 + x^3 + 1$, $f_2(x) = x^4 + x + 1$ are two of the irreducible
quartics over $\mathbb{F}_2$ determined above, then a simple computation verifies that

\[ \alpha(x) = x^3 + x^2 \]

is a root of $f_2(x)$ in $\mathbb{F}_{16} = \mathbb{F}_2[x]/(x^4 + x^3 + 1)$. Then we have

\[ \mathbb{F}_2[x]/(x^4 + x + 1) \cong \mathbb{F}_2[x]/(x^4 + x^3 + 1) \quad (\cong \mathbb{F}_{16}), \qquad x \mapsto x^3 + x^2. \]
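
The "simple computation" for this example can be spelled out mechanically. The Python sketch below is a hedged illustration, again encoding polynomials over $\mathbb{F}_2$ as bit masks (bit $i$ is the coefficient of $x^i$); the helper names are hypothetical. It evaluates $f_2(\alpha) = \alpha^4 + \alpha + 1$ at $\alpha = x^3 + x^2$ in $\mathbb{F}_2[x]/(x^4 + x^3 + 1)$.

```python
def gf2_mul(a, b):
    """Carry-less product in F_2[x] (bit i = coefficient of x^i)."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        b >>= 1
    return r

def gf2_mod(a, m):
    """Remainder of a modulo m in F_2[x]."""
    dm = m.bit_length() - 1
    while a and a.bit_length() - 1 >= dm:
        a ^= m << (a.bit_length() - 1 - dm)
    return a

def mulmod(a, b, m):
    """Product of a and b in F_2[x]/(m)."""
    return gf2_mod(gf2_mul(a, b), m)

F1 = 0b11001      # f_1(x) = x^4 + x^3 + 1, the modulus
alpha = 0b1100    # alpha(x) = x^3 + x^2

alpha2 = mulmod(alpha, alpha, F1)     # alpha^2
alpha4 = mulmod(alpha2, alpha2, F1)   # alpha^4
value = alpha4 ^ alpha ^ 1            # f_2(alpha) = alpha^4 + alpha + 1
print(bin(alpha2), bin(alpha4), value)
```

Here $\alpha^2 = x^2 + x$ and $\alpha^4 = x^3 + x^2 + 1$, so $\alpha^4 + \alpha + 1 = 0$ in the quotient, confirming that $x \mapsto x^3 + x^2$ sends a root of $f_2$ to a root of $f_2$.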
If we assume a result from elementary number theory we can give a formula for
the number of irreducible polynomials of degree $n$. Define the Möbius $\mu$-function by

\[ \mu(n) = \begin{cases} 1 & \text{for } n = 1 \\ 0 & \text{if } n \text{ has a square factor} \\ (-1)^r & \text{if } n \text{ has } r \text{ distinct prime factors.} \end{cases} \]

If now $f(n)$ is a function defined for all nonnegative integers $n$ and $F(n)$ is defined by

\[ F(n) = \sum_{d \mid n} f(d), \qquad n = 1, 2, \ldots \]

then the Möbius inversion formula states that one can recover the function $f(n)$ from
$F(n)$:

\[ f(n) = \sum_{d \mid n} \mu(d)\, F\!\left(\frac{n}{d}\right), \qquad n = 1, 2, \ldots. \]

This is an elementary result from number theory which we take for granted. Define

\[ \psi(n) = \text{the number of irreducible polynomials of degree } n \text{ in } \mathbb{F}_p[x]. \]

Counting degrees in Proposition 18 we have

\[ p^n = \sum_{d \mid n} d\,\psi(d). \]

Applying the Möbius inversion formula (for $f(n) = n\psi(n)$) we obtain

\[ n\psi(n) = \sum_{d \mid n} \mu(d)\, p^{n/d} \]

which gives us a formula for the number of irreducible polynomials of degree $n$ over
$\mathbb{F}_p$:

\[ \psi(n) = \frac{1}{n} \sum_{d \mid n} \mu(d)\, p^{n/d}. \]

For example, in the case $p = 2$, $n = 4$ we have

\[ \psi(4) = \frac{1}{4}\bigl(\mu(1)2^4 + \mu(2)2^2 + \mu(4)2^1\bigr) = \frac{1}{4}(16 - 4 + 0) = 3 \]

as we determined directly above.
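
The counting formula is easy to evaluate by machine. The following Python sketch is one possible implementation under simple assumptions (trial division for the Möbius function, which is adequate only for small $n$); the function names `mobius` and `count_irreducibles` are hypothetical.

```python
def mobius(n):
    """Moebius mu-function of a positive integer, by trial division."""
    result, d = 1, 2
    while d * d <= n:
        if n % d == 0:
            n //= d
            if n % d == 0:       # d^2 divides the original n
                return 0
            result = -result
        d += 1
    if n > 1:                    # one remaining prime factor
        result = -result
    return result

def count_irreducibles(p, n):
    """Number of monic irreducible polynomials of degree n over F_p."""
    total = sum(mobius(d) * p ** (n // d)
                for d in range(1, n + 1) if n % d == 0)
    return total // n

print([count_irreducibles(2, n) for n in range(1, 5)])
```

For $p = 2$ the values for $n = 1, 2, 3, 4$ are 2, 1, 2, 3, matching the linear factors $x$, $x - 1$ and the quadratic, cubics and quartics found directly above.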

We have seen above that

\[ \mathbb{F}_{p^m} \subseteq \mathbb{F}_{p^n} \quad \text{if and only if } m \text{ divides } n. \]

In particular, given any two finite fields $\mathbb{F}_{p^{n_1}}$ and $\mathbb{F}_{p^{n_2}}$ there is a third finite field containing
(an isomorphic copy of) them, namely $\mathbb{F}_{p^{n_1 n_2}}$. This gives us a partial ordering on these
fields and allows us to think of their union. Since these give all the finite extensions
of $\mathbb{F}_p$, we see that the union of $\mathbb{F}_{p^n}$ for all $n$ is an algebraic closure of $\mathbb{F}_p$, unique up to
isomorphism:

\[ \overline{\mathbb{F}}_p = \bigcup_{n \ge 1} \mathbb{F}_{p^n}. \]

This provides a simple description of the algebraic closure of $\mathbb{F}_p$.
EXERCISES


1. Factor $x^8 - x$ into irreducibles in $\mathbb{Z}[x]$ and in $\mathbb{F}_2[x]$.

2. Write out the multiplication table for $\mathbb{F}_4$ and $\mathbb{F}_8$.

3. Prove that an algebraically closed field must be infinite.

4. Construct the finite field of 16 elements and find a generator for the multiplicative group.
How many generators are there?

5. Exhibit an explicit isomorphism between the splitting fields of $x^3 - x + 1$ and $x^3 - x - 1$
over $\mathbb{F}_3$.

6. Suppose $K = \mathbb{Q}(\theta) = \mathbb{Q}(\sqrt{D_1}, \sqrt{D_2})$ with $D_1, D_2 \in \mathbb{Z}$ is a biquadratic extension and
that $\theta = a + b\sqrt{D_1} + c\sqrt{D_2} + d\sqrt{D_1 D_2}$ where $a, b, c, d \in \mathbb{Z}$ are integers. Prove that the
minimal polynomial $m_\theta(x)$ for $\theta$ over $\mathbb{Q}$ is irreducible of degree 4 over $\mathbb{Q}$ but is reducible
modulo every prime $p$. In particular show that the polynomial $x^4 - 10x^2 + 1$ is irreducible
in $\mathbb{Z}[x]$ but is reducible modulo every prime. [Use the fact that there are no biquadratic
extensions over finite fields.]

7. Prove that one of 2, 3 or 6 is a square in $\mathbb{F}_p$ for every prime $p$. Conclude that the polynomial

\[ x^6 - 11x^4 + 36x^2 - 36 = (x^2 - 2)(x^2 - 3)(x^2 - 6) \]

has a root modulo $p$ for every prime $p$ but has no root in $\mathbb{Z}$.

8. Determine the splitting field of the polynomial $x^p - x - a$ over $\mathbb{F}_p$ where $a \ne 0$, $a \in \mathbb{F}_p$.
Show explicitly that the Galois group is cyclic. [Show $\alpha \mapsto \alpha + 1$ is an automorphism.]
Such an extension is called an Artin-Schreier extension (cf. Exercise 9 of Section 7).

9. Let $q = p^m$ be a power of the prime $p$ and let $\mathbb{F}_q = \mathbb{F}_{p^m}$ be the finite field with $q$ elements.
Let $\sigma_q = \sigma_p^m$ be the $m$-th power of the Frobenius automorphism $\sigma_p$, called the $q$-Frobenius
automorphism.

(a) Prove that $\sigma_q$ fixes $\mathbb{F}_q$.

(b) Prove that every finite extension of $\mathbb{F}_q$ of degree $n$ is the splitting field of $x^{q^n} - x$ over
$\mathbb{F}_q$, hence is unique.

(c) Prove that every finite extension of $\mathbb{F}_q$ of degree $n$ is cyclic with $\sigma_q$ as generator.

(d) Prove that the subfields of the unique extension of $\mathbb{F}_q$ of degree $n$ are in bijective
correspondence with the divisors $d$ of $n$.

10. Prove that $n$ divides $\varphi(p^n - 1)$. [Observe that $\varphi(p^n - 1)$ is the order of the group of
automorphisms of a cyclic group of order $p^n - 1$.]

11. Prove that $x^{p^n} - x + 1$ is irreducible over $\mathbb{F}_p$ only when $n = 1$ or $n = p = 2$. [Note that
if $\alpha$ is a root, then so is $\alpha + a$ for any $a \in \mathbb{F}_{p^n}$. Show that this implies $\mathbb{F}_p(\alpha)$ contains $\mathbb{F}_{p^n}$
and that $[\mathbb{F}_p(\alpha) : \mathbb{F}_{p^n}] = p$.]

(Berlekamp's Factorization Algorithm) The following exercises outline the Berlekamp
factorization algorithm for factoring polynomials in $\mathbb{F}_p[x]$. The efficiency of this algorithm is based
on the efficiency of computing greatest common divisors in $\mathbb{F}_p[x]$ by the Euclidean Algorithm
and on the efficiency of row-reduction matrix algorithms for solving systems of linear equations.

Let $f(x) \in \mathbb{F}_p[x]$ be a monic polynomial of degree $n$ and let $f(x) = p_1(x)p_2(x) \cdots p_k(x)$
where $p_1(x), p_2(x), \ldots, p_k(x)$ are powers of distinct monic irreducibles in $\mathbb{F}_p[x]$.

12. Show that in order to write $f(x)$ as a product of irreducible polynomials in $\mathbb{F}_p[x]$ it suffices
to determine the factors $p_1(x), \ldots, p_k(x)$. [If $p(x) = q(x)^N \in \mathbb{F}_p[x]$ with $q(x)$ monic
and irreducible, show that $q(x)$ can be determined from $p(x)$ by checking for $p$-th powers
and by computing greatest common divisors with derivatives.]

13. Let $g(x) \in \mathbb{F}_p[x]$ be any polynomial of degree $< n$. Denote by $R(h(x))$ the remainder of
$h(x)$ after division by $f(x)$. Prove the following are equivalent:

(a) $R(g(x^p)) = g(x)$.

(b) $f(x)$ divides $[g(x) - 0][g(x) - 1] \cdots [g(x) - (p - 1)]$. [Use the fact that $g(x^p) = g(x)^p$
together with the factorization of $x^p - x$ in $\mathbb{F}_p[x]$.]

(c) $p_i(x)$ divides the product in (b) for $i = 1, 2, \ldots, k$.

(d) For each $i$, $i = 1, 2, \ldots, k$ there is an $s_i \in \mathbb{F}_p$ such that $p_i(x)$ divides $g(x) - s_i$, i.e.,
$g(x) \equiv s_i \pmod{p_i(x)}$.

14. Prove that the polynomials $g(x)$ of degree $< n$ satisfying the equivalent conditions of
the previous exercise form a vector space $V$ over $\mathbb{F}_p$ of dimension $k$. [Use the Chinese
Remainder Theorem applied to the $p^k$ possible choices for the $s_i$ in 13(d).]

15. Let $g(x) = b_0 + b_1 x + \cdots + b_{n-1}x^{n-1} \in V$. For $j = 0, 1, \ldots, n - 1$ let

\[ R(x^{pj}) = a_{0,j} + a_{1,j}x + \cdots + a_{n-1,j}x^{n-1} \]

and let $A$ be the $n \times n$ matrix

\[ A = \begin{pmatrix} a_{0,0} & a_{0,1} & \cdots & a_{0,n-1} \\ a_{1,0} & a_{1,1} & \cdots & a_{1,n-1} \\ \vdots & \vdots & & \vdots \\ a_{n-1,0} & a_{n-1,1} & \cdots & a_{n-1,n-1} \end{pmatrix}. \qquad (*) \]

Show that condition (a) of Exercise 13 for $g(x) \in V$ is equivalent to

\[ (A - I)B = 0 \qquad (**) \]

where $B$ is the column matrix with entries $b_0, b_1, \ldots, b_{n-1}$. Conclude that the rank of the
matrix $A - I$ is $n - k$. Note that this already suffices to determine if $f(x)$ is irreducible,
without actually determining the factors.

16. Let $g_1(x), g_2(x), \ldots, g_k(x)$ be a basis of solutions to (**) (so a basis for $V$), where we
may take $g_1(x) = 1$. Beginning with $w(x) = f(x)$, compute the greatest common divisor
$(w(x), g_i(x) - s)$ for $i = 2, 3, \ldots, k$ and $s \in \mathbb{F}_p$ for every factor $w(x)$ of $f(x)$ already computed.
Note by Exercise 13(d) that every factor $p_i(x)$ of $f(x)$ divides such a g.c.d. The process
terminates when $k$ relatively prime factors have been determined.

Prove that this procedure actually gives all the factors $p_1(x), p_2(x), \ldots, p_k(x)$, i.e.,
one can separate the individual factors $p_1(x), p_2(x), \ldots, p_k(x)$ by this procedure, as follows:

If this were not the case, then for two of the factors, say $p_1(x)$ and $p_2(x)$, for each
$i = 1, 2, \ldots, k$ there would exist $s_i \in \mathbb{F}_p$ such that $g_i(x) - s_i$ is divisible by both $p_1(x)$
and $p_2(x)$. By the Chinese Remainder Theorem, choose a $g(x) \in V$ satisfying $g(x) \equiv 0$
(mod $p_1(x)$) and $g(x) \equiv 1$ (mod $p_2(x)$). Write $g(x) = \sum_{i=1}^{k} c_i g_i(x)$ in terms of the
basis for $V$ and let $s = \sum_{i=1}^{k} c_i s_i \in \mathbb{F}_p$. Show that $s \equiv 0$ (mod $p_1(x)$) so that $s = 0$
and $s \equiv 1$ (mod $p_2(x)$) so that $s = 1$, a contradiction.

17. This exercise follows Berlekamp's Factorization Algorithm outlined in the previous
exercises to determine the factorization of $f(x) = x^5 + x^2 + 4x + 6$ in $\mathbb{F}_7[x]$.

(a) Show that $x^7 \equiv x^2 + 3x^3 + 6x^4 \pmod{f(x)}$. Similarly compute $x^{14}$, $x^{21}$, and $x^{28}$
modulo $f(x)$ (note that $x^{14}$ can most easily be computed by squaring the result for
$x^7$ and then reducing, etc.) to show that in this case the matrix $A$ in Exercise 15 is

\[ \begin{pmatrix} 1 & 0 & 5 & 1 & 4 \\ 0 & 0 & 1 & 1 & 2 \\ 0 & 1 & 3 & 3 & 3 \\ 0 & 3 & 4 & 2 & 2 \\ 0 & 6 & 3 & 1 & 1 \end{pmatrix}. \]

(b) Show that the reduced row echelon form for $A - I$ is the matrix

\[ \begin{pmatrix} 0 & 1 & 0 & 0 & 6 \\ 0 & 0 & 1 & 0 & 6 \\ 0 & 0 & 0 & 1 & 2 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix}. \]

Conclude that $k = 2$ (so $f(x)$ is the product of precisely two factors which are powers
of irreducible polynomials) and that $g_1(x) = 1$ and $g_2(x) = x^4 + 5x^3 + x^2 + x$ give
a basis for the solutions to (**) in Exercise 15.

(c) Following the procedure in Exercise 16, show that $(f(x), g_2(x) - 1) = x^2 + 3x + 5 =
p_1(x)$, with $f(x)/p_1(x) = x^3 + 4x^2 + 4x + 4 = p_2(x)$, giving the powers of the
irreducible polynomials dividing $f(x)$ in $\mathbb{F}_7[x]$. Show that neither factor is a 7-th
power in $\mathbb{F}_7[x]$ and that each is relatively prime to its derivative to conclude that both
factors are irreducible polynomials, giving the complete factorization of $f(x)$ into
irreducible polynomials:

\[ f(x) = (x^2 + 3x + 5)(x^3 + 4x^2 + 4x + 4) \in \mathbb{F}_7[x]. \]
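
The linear-algebra step of Exercises 15 and 17 can be sketched in a few lines of Python. This is only an illustration of the matrix $A$ and the rank computation, not a full factorization routine; polynomials are stored as coefficient lists with the constant term first, and the helper names `poly_mulmod`, `berlekamp_matrix` and `rank_mod_p` are hypothetical.

```python
def poly_mulmod(a, b, f, p):
    """Product of a and b modulo the monic polynomial f over F_p.

    All polynomials are coefficient lists, lowest degree first; a and b
    have length n = deg f, and so does the result.
    """
    n = len(f) - 1
    prod = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            prod[i + j] = (prod[i + j] + ai * bj) % p
    # Eliminate the terms of degree >= n, highest first, by subtracting c * x^(k-n) * f.
    for k in range(len(prod) - 1, n - 1, -1):
        c = prod[k]
        if c:
            for i in range(n + 1):
                prod[k - n + i] = (prod[k - n + i] - c * f[i]) % p
    return prod[:n]

def berlekamp_matrix(f, p):
    """Matrix A whose j-th column holds the coefficients of x^(p*j) mod f (deg f >= 2)."""
    n = len(f) - 1
    x = [0, 1] + [0] * (n - 2)
    xp = [1] + [0] * (n - 1)
    for _ in range(p):
        xp = poly_mulmod(xp, x, f, p)        # after the loop: xp = x^p mod f
    cols, cur = [], [1] + [0] * (n - 1)
    for _ in range(n):
        cols.append(cur)                     # cur = x^(p*j) mod f
        cur = poly_mulmod(cur, xp, f, p)
    return [[cols[j][i] for j in range(n)] for i in range(n)]

def rank_mod_p(M, p):
    """Rank of an integer matrix over F_p by Gauss-Jordan elimination."""
    M = [[v % p for v in row] for row in M]
    rank = 0
    for c in range(len(M[0])):
        pivot = next((r for r in range(rank, len(M)) if M[r][c]), None)
        if pivot is None:
            continue
        M[rank], M[pivot] = M[pivot], M[rank]
        inv = pow(M[rank][c], -1, p)         # modular inverse (Python 3.8+)
        M[rank] = [(v * inv) % p for v in M[rank]]
        for r in range(len(M)):
            if r != rank and M[r][c]:
                factor = M[r][c]
                M[r] = [(v - factor * w) % p for v, w in zip(M[r], M[rank])]
        rank += 1
    return rank

p = 7
f = [6, 4, 1, 0, 0, 1]                       # x^5 + x^2 + 4x + 6
A = berlekamp_matrix(f, p)
A_minus_I = [[(A[i][j] - (i == j)) % p for j in range(5)] for i in range(5)]
k = 5 - rank_mod_p(A_minus_I, p)             # number of prime-power factors
print(A, k)
```

With these conventions the computed matrix agrees with the one displayed in Exercise 17(a), and $n - \operatorname{rank}(A - I) = 2$ as in part (b).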


14.4 COMPOSITE EXTENSIONS AND SIMPLE EXTENSIONS 

We now consider the effect of taking composites with Galois extensions. The first result 
states that “sliding up” a Galois extension gives a Galois extension. 

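Proposition 19. Suppose $K/F$ is a Galois extension and $F'/F$ is any extension. Then
$KF'/F'$ is a Galois extension, with Galois group

\[ \mathrm{Gal}(KF'/F') \cong \mathrm{Gal}(K/K \cap F') \]

isomorphic to a subgroup of $\mathrm{Gal}(K/F)$. Pictorially,

[Diagram: lattice of fields with $KF'$ at the top, $K$ and $F'$ below it, $K \cap F'$ below both, and $F$ at the bottom.]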


Proof: If $K/F$ is Galois, then $K$ is the splitting field of some separable polynomial
$f(x)$ in $F[x]$. Then $KF'/F'$ is the splitting field of $f(x)$ viewed as a polynomial in
$F'[x]$, hence this extension is Galois. Since $K/F$ is Galois, every embedding of $K$
fixing $F$ is an automorphism of $K$, so the map

\[ \varphi : \mathrm{Gal}(KF'/F') \to \mathrm{Gal}(K/F), \qquad \sigma \mapsto \sigma|_K \]

defined by restricting an automorphism $\sigma$ to the subfield $K$ is well defined. It is clearly
a homomorphism, with kernel

\[ \ker \varphi = \{ \sigma \in \mathrm{Gal}(KF'/F') \mid \sigma|_K = 1 \}. \]

Since an element in $\mathrm{Gal}(KF'/F')$ is trivial on $F'$, the elements in the kernel are trivial
both on $K$ and on $F'$, hence on their composite, so the kernel consists only of the
identity automorphism. Hence $\varphi$ is injective.

Let $H$ denote the image of $\varphi$ in $\mathrm{Gal}(K/F)$ and let $K_H$ denote the corresponding fixed
subfield of $K$ containing $F$. Since every element in $H$ is the restriction of an automorphism
fixing $F'$, it fixes $K \cap F'$, so $K_H$ contains $K \cap F'$. On
the other hand, the composite $K_H F'$ is fixed by $\mathrm{Gal}(KF'/F')$ (any $\sigma \in \mathrm{Gal}(KF'/F')$
fixes $F'$ and acts on $K_H \subseteq K$ via its restriction $\sigma|_K \in H$, which fixes $K_H$ by definition).
By the Fundamental Theorem it follows that $K_H F' = F'$, so that $K_H \subseteq F'$, which
gives the reverse inclusion $K_H \subseteq K \cap F'$. Hence $K_H = K \cap F'$, so again by the
Fundamental Theorem, $H = \mathrm{Gal}(K/K \cap F')$, completing the proof.


Corollary 20. Suppose $K/F$ is a Galois extension and $F'/F$ is any finite extension.
Then

\[ [KF' : F] = \frac{[K : F][F' : F]}{[K \cap F' : F]}. \]

Proof: This follows by the proposition from the equality $[KF' : F'] = [K : K \cap F']$
given by the orders of the Galois groups in the proposition.

The example $F = \mathbb{Q}$, $K = \mathbb{Q}(\sqrt[3]{2})$, $F' = \mathbb{Q}(\rho\sqrt[3]{2})$, $\rho$ a primitive 3rd root of unity,
shows that the formula of Corollary 20 does not hold in general if neither of the two
extensions is Galois.
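
To see the failure concretely, one can work out both sides of the formula for the example $K = \mathbb{Q}(\sqrt[3]{2})$, $F' = \mathbb{Q}(\rho\sqrt[3]{2})$ over $F = \mathbb{Q}$. The following is a sketch of that arithmetic, using only the standard facts that $x^3 - 2$ is irreducible over $\mathbb{Q}$ and that its splitting field $\mathbb{Q}(\sqrt[3]{2}, \rho)$ has degree 6.

```latex
\begin{align*}
[K : \mathbb{Q}] &= [F' : \mathbb{Q}] = 3
  && \text{(both are generated by a root of } x^3 - 2\text{)} \\
K \cap F' &= \mathbb{Q}
  && \text{(}K \subset \mathbb{R},\ F' \not\subset \mathbb{R},\ \text{so } K \cap F' \neq K
     \text{ and its degree divides } 3\text{)} \\
KF' &= \mathbb{Q}(\sqrt[3]{2}, \rho), \quad [KF' : \mathbb{Q}] = 6 \\
\frac{[K : \mathbb{Q}]\,[F' : \mathbb{Q}]}{[K \cap F' : \mathbb{Q}]} &= \frac{3 \cdot 3}{1} = 9 \neq 6 .
\end{align*}
```

Neither $K$ nor $F'$ is Galois over $\mathbb{Q}$ here, so the hypothesis of Corollary 20 is not satisfied and there is no contradiction.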


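Proposition 21. Let $K_1$ and $K_2$ be Galois extensions of a field $F$. Then

(1) The intersection $K_1 \cap K_2$ is Galois over $F$.

(2) The composite $K_1 K_2$ is Galois over $F$. The Galois group is isomorphic to the
subgroup

\[ H = \{ (\sigma, \tau) \mid \sigma|_{K_1 \cap K_2} = \tau|_{K_1 \cap K_2} \} \]

of the direct product $\mathrm{Gal}(K_1/F) \times \mathrm{Gal}(K_2/F)$ consisting of elements whose
restrictions to the intersection $K_1 \cap K_2$ are equal.

[Diagram: lattice of fields with $K_1 K_2$ at the top, $K_1$ and $K_2$ below it, $K_1 \cap K_2$ below both, and $F$ at the bottom.]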
Proof: (1) Suppose $p(x)$ is an irreducible polynomial in $F[x]$ with a root $\alpha$ in
$K_1 \cap K_2$. Since $\alpha \in K_1$ and $K_1/F$ is Galois, all the roots of $p(x)$ lie in $K_1$. Similarly
all the roots lie in $K_2$, hence all the roots of $p(x)$ lie in $K_1 \cap K_2$. It follows easily that
$K_1 \cap K_2$ is Galois as in Theorem 13.

(2) If $K_1$ is the splitting field of the separable polynomial $f_1(x)$ and $K_2$ is the
splitting field of the separable polynomial $f_2(x)$ then the composite is the splitting field
for the squarefree part of the polynomial $f_1(x)f_2(x)$, hence is Galois over $F$.

The map

\[ \varphi : \mathrm{Gal}(K_1 K_2/F) \to \mathrm{Gal}(K_1/F) \times \mathrm{Gal}(K_2/F), \qquad \sigma \mapsto (\sigma|_{K_1}, \sigma|_{K_2}) \]

is clearly a homomorphism. The kernel consists of the elements $\sigma$ which are trivial on
both $K_1$ and $K_2$, hence trivial on the composite, so the map is injective. The image lies
in the subgroup $H$, since

\[ (\sigma|_{K_1})|_{K_1 \cap K_2} = \sigma|_{K_1 \cap K_2} = (\sigma|_{K_2})|_{K_1 \cap K_2}. \]

The order of $H$ can be computed by observing that for every $\sigma \in \mathrm{Gal}(K_1/F)$ there
are $|\mathrm{Gal}(K_2/K_1 \cap K_2)|$ elements $\tau \in \mathrm{Gal}(K_2/F)$ whose restrictions to $K_1 \cap K_2$ are
$\sigma|_{K_1 \cap K_2}$. Hence

\[ |H| = |\mathrm{Gal}(K_1/F)| \cdot |\mathrm{Gal}(K_2/K_1 \cap K_2)| = |\mathrm{Gal}(K_1/F)| \, \frac{|\mathrm{Gal}(K_2/F)|}{|\mathrm{Gal}(K_1 \cap K_2/F)|}. \]

By Corollary 20 and the diagram above we see that the orders of $H$ and $\mathrm{Gal}(K_1 K_2/F)$
are then both equal to

\[ [K_1 K_2 : F] = \frac{[K_1 : F][K_2 : F]}{[K_1 \cap K_2 : F]}. \]

Hence the image of $\varphi$ is precisely $H$, completing the proof.


Corollary 22. Let $K_1$ and $K_2$ be Galois extensions of a field $F$ with $K_1 \cap K_2 = F$.
Then

\[ \mathrm{Gal}(K_1 K_2/F) \cong \mathrm{Gal}(K_1/F) \times \mathrm{Gal}(K_2/F). \]

Conversely, if $K$ is Galois over $F$ and $G = \mathrm{Gal}(K/F) = G_1 \times G_2$ is the direct product
of two subgroups $G_1$ and $G_2$, then $K$ is the composite of two Galois extensions $K_1$ and
$K_2$ of $F$ with $K_1 \cap K_2 = F$.


Proof: The first part follows immediately from the proposition. For the second,
let $K_1$ be the fixed field of $G_1 \subset G$ and let $K_2$ be the fixed field of $G_2 \subset G$. Then
$K_1 \cap K_2$ is the field corresponding to the subgroup $G_1 G_2$, which is all of $G$ in this
case, so $K_1 \cap K_2 = F$. The composite $K_1 K_2$ is the field corresponding to the subgroup
$G_1 \cap G_2$, which is the identity here, so $K_1 K_2 = K$, completing the proof.
Corollary 23. Let $E/F$ be any finite separable extension. Then $E$ is contained in an
extension $K$ which is Galois over $F$ and is minimal in the sense that in a fixed algebraic
closure of $K$ any other Galois extension of $F$ containing $E$ contains $K$.

Proof: There exists a Galois extension of $F$ containing $E$, for example the composite
of the splitting fields of the minimal polynomials for a basis for $E$ over $F$ (which
are all separable since $E$ is separable over $F$). Then the intersection of all the Galois
extensions of $F$ containing $E$ is the field $K$.

Definition. The Galois extension K of F containing E in the previous corollary is 
called the Galois closure of E over F. 

It is often simpler to work in a Galois extension (for example in computing degrees 
as in Corollary 20). The existence of a Galois closure for a separable extension is 
frequently useful for reducing computations to consideration of Galois extensions. 

Recall that an extension $K$ of $F$ is called simple if $K = F(\theta)$ for some element $\theta$,
in which case $\theta$ is called a primitive element for $K$.

Proposition 24. Let $K/F$ be a finite extension. Then $K = F(\theta)$ if and only if there
exist only finitely many subfields of $K$ containing $F$.

Proof: Suppose first that $K = F(\theta)$ is simple. Let $E$ be a subfield of $K$ containing
$F$: $F \subseteq E \subseteq K$. Let $f(x) \in F[x]$ be the minimal polynomial for $\theta$ over $F$ and let
$g(x) \in E[x]$ be the minimal polynomial for $\theta$ over $E$. Then $g(x)$ divides $f(x)$ in $E[x]$.
Let $E'$ be the field generated over $F$ by the coefficients of $g(x)$. Then $E' \subseteq E$ and
clearly the minimal polynomial for $\theta$ over $E'$ is still $g(x)$. But then

\[ [K : E] = \deg g(x) = [K : E'] \]

implies that $E = E'$. It follows that the subfields of $K$ containing $F$ are the subfields
generated by the coefficients of the monic factors of $f(x)$, hence there are finitely many
such subfields.

Suppose conversely that there are finitely many subfields of $K$ containing $F$. If $F$
is a finite field, then we have already seen that $K$ is a simple extension (Proposition 17).
Hence we may suppose $F$ is infinite. It clearly suffices to show that $F(\alpha, \beta)$ is generated
by a single element since $K$ is finitely generated over $F$. Consider the subfields

\[ F(\alpha + c\beta), \qquad c \in F. \]

Then since there are infinitely many choices for $c \in F$ and only finitely many such
subfields, there exist $c, c'$ in $F$, $c \ne c'$, with

\[ F(\alpha + c\beta) = F(\alpha + c'\beta). \]

Then $\alpha + c\beta$ and $\alpha + c'\beta$ both lie in $F(\alpha + c\beta)$, and taking their difference shows that
$(c - c')\beta \in F(\alpha + c\beta)$. Hence $\beta \in F(\alpha + c\beta)$ and then also $\alpha \in F(\alpha + c\beta)$. Therefore
$F(\alpha, \beta) \subseteq F(\alpha + c\beta)$ and since the reverse inclusion is obvious, we have

\[ F(\alpha, \beta) = F(\alpha + c\beta), \]

completing the proof.
Theorem 25. (The Primitive Element Theorem) If $K/F$ is finite and separable, then
$K/F$ is simple. In particular, any finite extension of fields of characteristic 0 is simple.

Proof: Let $L$ be the Galois closure of $K$ over $F$. Then any subfield of $K$ containing
$F$ corresponds to a subgroup of the Galois group $\mathrm{Gal}(L/F)$ by the Fundamental Theorem.
Since there are only finitely many such subgroups, the previous proposition shows
that $K/F$ is simple. The last statement follows since any finite extension of fields in
characteristic 0 is separable.

As the proof of the proposition indicates, a primitive element for an extension can
be obtained as a simple linear combination of the generators for the extension. In the
case of Galois extensions it is only necessary to determine a linear combination which
is not fixed by any nontrivial element of the Galois group since then by the Fundamental
Theorem this linear combination could not lie in any proper subfield.

Examples 

(1) The element $\sqrt{2} + \sqrt{3}$ generates the field $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ as we have already seen (it is
not fixed by any of the four Galois automorphisms of this field).

(2) The field $\overline{\mathbb{F}}_p(x, y)$ of rational functions in the variables $x$ and $y$ over the algebraic
closure $\overline{\mathbb{F}}_p$ of $\mathbb{F}_p$ is not a simple extension of the subfield $F = \overline{\mathbb{F}}_p(x^p, y^p)$. It is easy
to see that

\[ [\overline{\mathbb{F}}_p(x, y) : \overline{\mathbb{F}}_p(x^p, y^p)] = p^2 \]

and that the subfields

\[ F(x + cy), \qquad c \in \overline{\mathbb{F}}_p \]

are all of degree $p$ over $\overline{\mathbb{F}}_p(x^p, y^p)$ (note that $(x + cy)^p = x^p + c^p y^p \in \overline{\mathbb{F}}_p(x^p, y^p)$).
If any two of these subfields were equal, then just as in the proof of Proposition 24 we
would have

\[ \overline{\mathbb{F}}_p(x, y) = F(x + cy) \]

which is impossible by degree considerations. Hence there are infinitely many such
subfields and the extension cannot be simple.
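
The first example can be checked numerically. The images of $\sqrt{2} + \sqrt{3}$ under the four automorphisms of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ are $\pm\sqrt{2} \pm \sqrt{3}$, and the Python sketch below (floating-point only, so a sanity check rather than a proof) confirms that these four values are distinct and are all roots of $x^4 - 10x^2 + 1$, the quartic appearing in Exercise 6 of Section 14.3. The names `quartic` and `conjugates` are hypothetical.

```python
import math
from itertools import product

def quartic(x):
    """Evaluate x^4 - 10 x^2 + 1."""
    return x**4 - 10 * x**2 + 1

# Images of sqrt(2) + sqrt(3) under the four sign choices sqrt(2) -> +-sqrt(2), sqrt(3) -> +-sqrt(3).
conjugates = [s * math.sqrt(2) + t * math.sqrt(3)
              for s, t in product((1, -1), repeat=2)]

for c in conjugates:
    print(round(c, 6), abs(quartic(c)) < 1e-9)
```

Four distinct conjugates mean the element is moved by every nontrivial automorphism, so it lies in no proper subfield, which is the criterion described before the examples.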


EXERCISES 

1. Determine the Galois closure of the field $\mathbb{Q}(\sqrt{1 + \sqrt{2}})$ over $\mathbb{Q}$.

2. Find a primitive generator for $\mathbb{Q}(\sqrt{2}, \sqrt{3}, \sqrt{5})$ over $\mathbb{Q}$.

3. Let $F$ be a field contained in the ring of $n \times n$ matrices over $\mathbb{Q}$. Prove that $[F : \mathbb{Q}] \le n$.
(Note that, by Exercise 19 of Section 13.2, the ring of $n \times n$ matrices over $\mathbb{Q}$ does contain
fields of degree $n$ over $\mathbb{Q}$.)

4. Let $f(x) \in F[x]$ be an irreducible polynomial of degree $n$ over the field $F$, let $L$ be
the splitting field of $f(x)$ over $F$ and let $\alpha$ be a root of $f(x)$ in $L$. If $K$ is any Galois
extension of $F$, show that the polynomial $f(x)$ splits into a product of $m$ irreducible
polynomials each of degree $d$ over $K$, where $d = [K(\alpha) : K] = [(L \cap K)(\alpha) : L \cap K]$
and $m = n/d = [F(\alpha) \cap K : F]$. [Show first that the factorization of $f(x)$ over $K$ is the
same as its factorization over $L \cap K$. Then if $H$ is the subgroup of the Galois group of $L$
over $F$ corresponding to $L \cap K$ the factors of $f(x)$ over $L \cap K$ correspond to the orbits of
$H$ on the roots of $f(x)$. Use Exercise 9 of Section 4.1.]

5. Let $p$ be a prime and let $F$ be a field. Let $K$ be a Galois extension of $F$ whose Galois
group is a $p$-group (i.e., the degree $[K : F]$ is a power of $p$). Such an extension is called
a $p$-extension (note that $p$-extensions are Galois by definition).

(a) Let $L$ be a $p$-extension of $K$. Prove that the Galois closure of $L$ over $F$ is a $p$-extension
of $F$.

(b) Give an example to show that (a) need not hold if $[K : F]$ is a power of $p$ but $K/F$
is not Galois.

6. Prove that $\mathbb{F}_p(x, y)/\mathbb{F}_p(x^p, y^p)$ is not a simple extension by explicitly exhibiting an infinite
number of intermediate subfields.

7. Let $F \subseteq K \subseteq L$ and let $\theta \in L$ with $p(x) = m_{\theta,F}(x)$. Prove that $K \otimes_F F(\theta) \cong
K[x]/(p(x))$ as $K$-algebras.

8. Let $K_1$ and $K_2$ be two algebraic extensions of a field $F$ contained in the field $L$ of characteristic
zero. Prove that the $F$-algebra $K_1 \otimes_F K_2$ has no nonzero nilpotent elements. [Use
the preceding exercise.]


14.5 CYCLOTOMIC EXTENSIONS AND ABELIAN EXTENSIONS
OVER Q

We have already determined that the cyclotomic field $\mathbb{Q}(\zeta_n)$ of $n$-th roots of unity is
a Galois extension of $\mathbb{Q}$ of degree $\varphi(n)$ where $\varphi$ denotes the Euler $\varphi$-function. Any
automorphism of this field is uniquely determined by its action on the primitive $n$-th root
of unity $\zeta_n$. This element must be mapped to another primitive root of unity (recall
these are the roots of the irreducible cyclotomic polynomial $\Phi_n(x)$). Hence $\sigma(\zeta_n) = \zeta_n^a$
for some integer $a$, $1 \le a < n$, relatively prime to $n$. Since there are precisely $\varphi(n)$
such integers $a$ it follows that in fact each of these maps is indeed an automorphism of
$\mathbb{Q}(\zeta_n)$. Note also that we can define $\sigma_a$ for any integer $a$ relatively prime to $n$ by the
same formula and that $\sigma_a$ depends only on the residue class of $a$ modulo $n$.

Theorem 26. The Galois group of the cyclotomic field Q(£„) of / 1 th roots of unity is 
isomorphic to the multiplicative group (Z/nZ) x . The isomorphism is given explicitly 
by the map 

(Z/«Z ) x Gal(Q(f„)/Q) 

a (mod n ) i — > o a 

where o a is the automorphism defined by 

oifo) = tf- 


Proof: The discussion above shows that $\sigma_a$ is an automorphism for any $a \pmod{n}$,
so the map above is well defined. It is a homomorphism since

$$(\sigma_a \sigma_b)(\zeta_n) = \sigma_a(\zeta_n^b) = (\zeta_n^a)^b = \zeta_n^{ab}$$

which shows that $\sigma_a \sigma_b = \sigma_{ab}$. The map is bijective by the discussion above since
we know that every Galois automorphism is of the form $\sigma_a$ for a uniquely defined $a \pmod{n}$.
Hence the map is an isomorphism.
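
The isomorphism of Theorem 26 reduces computations in the Galois group to arithmetic modulo $n$. The following Python fragment is a minimal sketch of that dictionary, not part of the proof; the function names `units`, `compose` and `order` are our own illustrative choices.

```python
from math import gcd

def units(n):
    """Residues a with 1 <= a <= n and gcd(a, n) = 1; these index the automorphisms sigma_a."""
    return [a for a in range(1, n + 1) if gcd(a, n) == 1]

def compose(a, b, n):
    """sigma_a sigma_b = sigma_{ab}: composing automorphisms multiplies exponents mod n."""
    return (a * b) % n

def order(a, n):
    """Multiplicative order of the unit a modulo n, i.e. the order of sigma_a."""
    one = 1 % n          # handles the degenerate case n = 1
    k, x = 1, a % n
    while x != one:
        x = (x * a) % n
        k += 1
    return k
```

For instance, `order(2, 5)` computes the order of $\sigma_2$ in $\operatorname{Gal}(\mathbb{Q}(\zeta_5)/\mathbb{Q})$, and `len(units(n))` is $\varphi(n)$.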


Examples 

(1) The field $\mathbb{Q}(\zeta_5)$ is Galois over $\mathbb{Q}$ with Galois group $(\mathbb{Z}/5\mathbb{Z})^\times \cong \mathbb{Z}/4\mathbb{Z}$. This is our
first example of a Galois extension of $\mathbb{Q}$ of degree 4 with a cyclic Galois group.
The elements of the Galois group are $\{\sigma_1 = 1, \sigma_2, \sigma_3, \sigma_4\}$ in the notation above. A
generator for this cyclic group is $\sigma_2 : \zeta_5 \mapsto \zeta_5^2$ (since 2 has order 4 in $(\mathbb{Z}/5\mathbb{Z})^\times$).

There is precisely one nontrivial subfield, a quadratic extension of $\mathbb{Q}$, the fixed
field of the subgroup $\{1, \sigma_4 = \sigma_{-1}\}$. An element in this subfield is given by

$$\alpha = \zeta_5 + \sigma_{-1}\zeta_5 = \zeta_5 + \zeta_5^{-1}$$

since this element is clearly fixed by $\sigma_{-1}$. The element $\zeta_5$ satisfies

$$\zeta_5^4 + \zeta_5^3 + \zeta_5^2 + \zeta_5 + 1 = 0.$$

Notice then that

$$\alpha^2 + \alpha - 1 = (\zeta_5^2 + 2 + \zeta_5^{-2}) + (\zeta_5 + \zeta_5^{-1}) - 1$$

$$= \zeta_5^2 + 2 + \zeta_5^3 + \zeta_5 + \zeta_5^4 - 1 = 0.$$

Solving explicitly for $\alpha$ we see that the quadratic extension of $\mathbb{Q}$ generated by $\alpha$ is
$\mathbb{Q}(\sqrt{5})$:

$$\mathbb{Q}(\zeta_5 + \zeta_5^{-1}) = \mathbb{Q}(\sqrt{5}).$$

It can be shown in general (this is not completely trivial) that for $p$ an odd prime
the field $\mathbb{Q}(\zeta_p)$ contains the quadratic field $\mathbb{Q}(\sqrt{\pm p})$, where the $+$ sign is correct if
$p \equiv 1 \bmod 4$ and the $-$ sign is correct if $p \equiv 3 \bmod 4$ (cf. Exercise 11 in Section 7).

(2) For $p$ an odd prime we can construct a primitive element for any of the subfields
of $\mathbb{Q}(\zeta_p)$ as in the previous example. A basis for $\mathbb{Q}(\zeta_p)$ over $\mathbb{Q}$ is given by

$$1, \zeta_p, \zeta_p^2, \dots, \zeta_p^{p-2}.$$

Since

$$\zeta_p^{p-1} + \zeta_p^{p-2} + \cdots + \zeta_p + 1 = 0$$

we see that also the elements

$$\zeta_p, \zeta_p^2, \dots, \zeta_p^{p-2}, \zeta_p^{p-1}$$

form a basis. The reason for choosing this basis is that any $\sigma$ in the Galois group
$\operatorname{Gal}(\mathbb{Q}(\zeta_p)/\mathbb{Q})$ simply permutes these basis elements since these are precisely the
primitive $p$th roots of unity. Note that it is at this point that we need $p$ to be a prime:
in general the primitive $n$th roots of unity do not give a basis for the cyclotomic field
of $n$th roots of unity over $\mathbb{Q}$ (for example, the primitive 4th roots of unity, $\pm i$, are not
linearly independent).

Let $H$ be any subgroup of the Galois group of $\mathbb{Q}(\zeta_p)$ over $\mathbb{Q}$ and let

$$\alpha = \sum_{\sigma \in H} \sigma\zeta_p, \tag{14.10}$$

the sum of the conjugates of $\zeta_p$ by the elements in $H$. For any $\tau \in H$, the elements $\tau\sigma$
run over the elements of $H$ as $\sigma$ runs over the elements of $H$. It follows that $\tau\alpha = \alpha$, so
that $\alpha$ lies in the fixed field for $H$. If now $\tau$ is not an element of $H$, then $\tau\alpha$ is the sum
of basis elements (recall that any automorphism permutes the basis elements here),
one of which is $\tau(\zeta_p)$. If we had $\tau\alpha = \alpha$ then since these elements are a basis, we
must have $\tau(\zeta_p) = \sigma(\zeta_p)$ for one of the terms $\sigma\zeta_p$ in (10). But this implies $\tau\sigma^{-1} = 1$
since this automorphism is the identity on $\zeta_p$. Then $\tau = \sigma \in H$, a contradiction. This
shows that $\alpha$ is not fixed by any automorphism not contained in $H$, so that $\mathbb{Q}(\alpha)$ is
precisely the fixed field of $H$.

For a specific example, consider the subfields of $\mathbb{Q}(\zeta_{13})$, which correspond to the
subgroups of $(\mathbb{Z}/13\mathbb{Z})^\times \cong \mathbb{Z}/12\mathbb{Z}$. A generator for this cyclic group is the
automorphism $\sigma = \sigma_2$ which maps $\zeta_{13}$ to $\zeta_{13}^2$. The nontrivial subgroups correspond to the
nontrivial divisors of 12, hence are of orders 2, 3, 4, and 6 with generators $\sigma^6$, $\sigma^4$, $\sigma^3$
and $\sigma^2$, respectively. The corresponding fixed fields will be of degrees 6, 4, 3 and 2
over $\mathbb{Q}$, respectively. Generators are given by ($\zeta = \zeta_{13}$)

$$\zeta + \sigma^6\zeta = \zeta + \zeta^{2^6} = \zeta + \zeta^{-1}$$

$$\zeta + \sigma^4\zeta + \sigma^8\zeta = \zeta + \zeta^{2^4} + \zeta^{2^8} = \zeta + \zeta^3 + \zeta^9$$

$$\zeta + \sigma^3\zeta + \sigma^6\zeta + \sigma^9\zeta = \zeta + \zeta^8 + \zeta^{12} + \zeta^5$$

$$\zeta + \sigma^2\zeta + \sigma^4\zeta + \sigma^6\zeta + \sigma^8\zeta + \sigma^{10}\zeta = \zeta + \zeta^4 + \zeta^3 + \zeta^{12} + \zeta^9 + \zeta^{10}.$$

The lattice of subfields for this extension is the following:

[Figure: lattice of subfields of $\mathbb{Q}(\zeta_{13})$]

The elements constructed in equation (10) and their conjugates are called the periods
of $\zeta$ and are useful in the study of the arithmetic of the cyclotomic fields. The study
of their combinatorial properties is referred to as cyclotomy.
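
The exponents appearing in the periods above are just the elements of the cyclic subgroups of $(\mathbb{Z}/13\mathbb{Z})^\times$ generated by powers of 2, so they can be listed mechanically. The following Python fragment is a small sketch of that bookkeeping, with a floating-point check of the period $\zeta_5 + \zeta_5^{-1}$ from Example (1); the helper names `subgroup` and `period` are our own, and the numerical values are approximations only.

```python
import cmath

def subgroup(g, p):
    """Cyclic subgroup of (Z/pZ)^x generated by g, as a sorted list of residues."""
    H, x = [], 1
    while True:
        H.append(x)
        x = (x * g) % p
        if x == 1:
            return sorted(H)

def period(H, p):
    """Numerical value of the period: the sum of zeta_p^a over a in H."""
    zeta = cmath.exp(2j * cmath.pi / p)
    return sum(zeta ** a for a in H)

# Subgroups generated by sigma^6, sigma^4, sigma^3, sigma^2, where sigma = sigma_2.
exponents_13 = {k: subgroup(pow(2, k, 13), 13) for k in (6, 4, 3, 2)}

# The period zeta_5 + zeta_5^(-1), attached to the subgroup {1, 4} of (Z/5Z)^x.
alpha5 = period(subgroup(4, 5), 5)
```

Here `exponents_13[4]` lists the exponents in the period for the subgroup of order 3, and `alpha5` should satisfy $\alpha^2 + \alpha - 1 = 0$ up to rounding error.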

Suppose that $n = p_1^{a_1} p_2^{a_2} \cdots p_k^{a_k}$ is the decomposition of $n$ into distinct prime powers.
Since $\zeta_n^{p_2^{a_2} \cdots p_k^{a_k}}$ is a primitive $p_1^{a_1}$-th root of unity, the field $K_1 = \mathbb{Q}(\zeta_{p_1^{a_1}})$ is a subfield
of $\mathbb{Q}(\zeta_n)$. Similarly, each of the fields $K_i = \mathbb{Q}(\zeta_{p_i^{a_i}})$, $i = 1, 2, \dots, k$ is a subfield of
$\mathbb{Q}(\zeta_n)$. The composite of the fields contains the product $\zeta_{p_1^{a_1}} \zeta_{p_2^{a_2}} \cdots \zeta_{p_k^{a_k}}$, which is a
primitive $n$th root of unity, hence the composite field is $\mathbb{Q}(\zeta_n)$. Since the extension
degrees $[K_i : \mathbb{Q}]$ equal $\varphi(p_i^{a_i})$, $i = 1, 2, \dots, k$ and $\varphi(n) = \varphi(p_1^{a_1})\varphi(p_2^{a_2}) \cdots \varphi(p_k^{a_k})$,
the degree of the composite of the fields $K_i$ is precisely the product of the degrees of
the $K_i$. It follows from Proposition 21 (and a simple induction from the two fields
considered in the proposition to the $k$ fields here) that the intersection of all these fields
is precisely $\mathbb{Q}$. Then Corollary 22 shows that the Galois group for $\mathbb{Q}(\zeta_n)$ is the direct
product of the Galois groups over $\mathbb{Q}$ for the subfields $K_i$. We summarize this as the
following corollary.

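Corollary 27. Let $n = p_1^{a_1} p_2^{a_2} \cdots p_k^{a_k}$ be the decomposition of the positive integer $n$
into distinct prime powers. Then the cyclotomic fields $\mathbb{Q}(\zeta_{p_i^{a_i}})$, $i = 1, 2, \dots, k$ intersect
only in the field $\mathbb{Q}$ and their composite is the cyclotomic field $\mathbb{Q}(\zeta_n)$. We have

$$\operatorname{Gal}(\mathbb{Q}(\zeta_n)/\mathbb{Q}) \cong \operatorname{Gal}(\mathbb{Q}(\zeta_{p_1^{a_1}})/\mathbb{Q}) \times \operatorname{Gal}(\mathbb{Q}(\zeta_{p_2^{a_2}})/\mathbb{Q}) \times \cdots \times \operatorname{Gal}(\mathbb{Q}(\zeta_{p_k^{a_k}})/\mathbb{Q})$$

which under the isomorphism in Theorem 26 is the Chinese Remainder Theorem:

$$(\mathbb{Z}/n\mathbb{Z})^\times \cong (\mathbb{Z}/p_1^{a_1}\mathbb{Z})^\times \times (\mathbb{Z}/p_2^{a_2}\mathbb{Z})^\times \times \cdots \times (\mathbb{Z}/p_k^{a_k}\mathbb{Z})^\times.$$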


Proof: The only statement which has not been proved is the identification of the
isomorphism of Galois groups with the statement of the Chinese Remainder Theorem
on the group $(\mathbb{Z}/n\mathbb{Z})^\times$, which is quite simple and is left for the exercises.
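
For a concrete value of $n$ the Chinese Remainder Theorem decomposition of $(\mathbb{Z}/n\mathbb{Z})^\times$ can be tabulated directly. The sketch below does this for the illustrative choice $n = 60 = 4 \cdot 3 \cdot 5$ (our example, not one from the text); the names `units` and `crt_map` are hypothetical helpers.

```python
from math import gcd

def units(n):
    """Residues a with 1 <= a <= n and gcd(a, n) = 1."""
    return [a for a in range(1, n + 1) if gcd(a, n) == 1]

def crt_map(n, prime_powers):
    """Send each unit a mod n to the tuple (a mod q) over the prime powers q dividing n."""
    return {a: tuple(a % q for q in prime_powers) for a in units(n)}

# n = 60 with prime power factors 4, 3 and 5.
image = crt_map(60, [4, 3, 5])
```

The map is injective and its image consists of tuples of units, so it is a bijection onto $(\mathbb{Z}/4\mathbb{Z})^\times \times (\mathbb{Z}/3\mathbb{Z})^\times \times (\mathbb{Z}/5\mathbb{Z})^\times$, which has $2 \cdot 2 \cdot 4 = 16 = \varphi(60)$ elements.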

By Theorem 26 the Galois group of $\mathbb{Q}(\zeta_n)/\mathbb{Q}$ is in particular an abelian group.

Definition. The extension $K/F$ is called an abelian extension if $K/F$ is Galois and
$\operatorname{Gal}(K/F)$ is an abelian group.

Since all the subgroups and quotient groups of abelian groups are abelian, we see
by the Fundamental Theorem of Galois Theory that every subfield containing $F$ of an
abelian extension of $F$ is again an abelian extension of $F$. By the results on composites
of extensions in the last section, we also see that the composite of abelian extensions is
again an abelian extension (since the Galois group of the composite is isomorphic to a
subgroup of the direct product of the Galois groups, hence is abelian).

It is an open problem to determine which groups arise as the Galois groups of
Galois extensions of $\mathbb{Q}$. Using the results above we can see that every finite abelian group
appears as the Galois group of some extension of $\mathbb{Q}$, in fact as the Galois group of some
subfield of a cyclotomic field.

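Let $n = p_1 p_2 \cdots p_k$ be the product of distinct primes. Then by the Chinese
Remainder Theorem

$$(\mathbb{Z}/n\mathbb{Z})^\times \cong (\mathbb{Z}/p_1\mathbb{Z})^\times \times (\mathbb{Z}/p_2\mathbb{Z})^\times \times \cdots \times (\mathbb{Z}/p_k\mathbb{Z})^\times$$

$$\cong Z_{p_1 - 1} \times Z_{p_2 - 1} \times \cdots \times Z_{p_k - 1}. \tag{14.11}$$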

Now, suppose $G$ is any finite abelian group. By the Fundamental Theorem for
Abelian Groups,

$$G \cong Z_{n_1} \times Z_{n_2} \times \cdots \times Z_{n_k}$$

for some integers $n_1, n_2, \dots, n_k$. We take as known that given any integer $m$ there are
infinitely many primes $p$ with $p \equiv 1 \bmod m$ (see the exercises following Section 13.6
for one proof using cyclotomic polynomials). Given this result, choose distinct primes
$p_1, p_2, \dots, p_k$ such that

$$p_1 \equiv 1 \bmod n_1$$
$$p_2 \equiv 1 \bmod n_2$$
$$\vdots$$
$$p_k \equiv 1 \bmod n_k$$

and let $n = p_1 p_2 \cdots p_k$ as above.

By construction, $n_i$ divides $p_i - 1$ for $i = 1, 2, \dots, k$, so the group $Z_{p_i - 1}$ has a
subgroup $H_i$ of order $\frac{p_i - 1}{n_i}$ for $i = 1, 2, \dots, k$, and the quotient by this subgroup is
cyclic of order $n_i$. Hence the quotient of $(\mathbb{Z}/n\mathbb{Z})^\times$ in equation (11) by $H_1 \times H_2 \times \cdots \times H_k$
is isomorphic to the group $G$.

By Theorem 26 and the Fundamental Theorem of Galois Theory, we see that there is
a subfield of $\mathbb{Q}(\zeta_{p_1 p_2 \cdots p_k})$ which is Galois over $\mathbb{Q}$ with $G$ as Galois group. We summarize
this in the following corollary.


Corollary 28. Let $G$ be any finite abelian group. Then there is a subfield $K$ of a
cyclotomic field with $\operatorname{Gal}(K/\mathbb{Q}) \cong G$.
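
The construction preceding Corollary 28 is effective once suitable primes are found. The following Python sketch carries out the choice of primes for a given list of cyclic orders $n_1, \dots, n_k$; it simply searches the progression $1 + n_i, 1 + 2n_i, \dots$ by trial division, which is adequate only for small inputs. The function names are our own assumptions, and picking the smallest available prime is one convenient choice among infinitely many.

```python
def is_prime(m):
    """Trial-division primality test, adequate for small m."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def choose_primes(orders):
    """For each n_i pick the smallest unused prime p_i with p_i = 1 (mod n_i)."""
    chosen = []
    for n_i in orders:
        p = n_i + 1
        while not is_prime(p) or p in chosen:
            p += n_i
        chosen.append(p)
    return chosen

def cyclotomic_data(orders):
    """Return (n, primes, subgroup orders) for G = Z_{n_1} x ... x Z_{n_k}."""
    primes = choose_primes(orders)
    n = 1
    for p in primes:
        n *= p
    # H_i has order (p_i - 1)/n_i, so the quotient of Z_{p_i - 1} by H_i is cyclic of order n_i.
    h_orders = [(p - 1) // n_i for p, n_i in zip(primes, orders)]
    return n, primes, h_orders
```

For example, `cyclotomic_data([2, 2])` shows that the Klein 4-group is realized by a subfield of $\mathbb{Q}(\zeta_{15})$, namely the fixed field of a subgroup of order $1 \cdot 2$ in $(\mathbb{Z}/15\mathbb{Z})^\times$.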


There is a converse to this result (whose proof is beyond our scope), the celebrated
Kronecker-Weber Theorem:


Theorem (Kronecker-Weber). Let $K$ be a finite abelian extension of $\mathbb{Q}$. Then $K$ is
contained in a cyclotomic extension of $\mathbb{Q}$.

The abelian extensions of $\mathbb{Q}$ are the “easiest” Galois extensions (at least in so far
as the structure of their Galois groups is concerned) and the previous result shows they
can be classified by the cyclotomic extensions of $\mathbb{Q}$. For other finite extensions of $\mathbb{Q}$
as base field, it is more difficult to describe the abelian extensions. The study of the
abelian extensions of an arbitrary finite extension $F$ of $\mathbb{Q}$ is referred to as class field
theory. There is a classification of the abelian extensions of $F$ by invariants associated
to $F$ which greatly generalizes the results on cyclotomic fields over $\mathbb{Q}$. In general,
however, the construction of abelian extensions is not nearly as explicit as in the case of
the cyclotomic fields. One case where such a description is possible is for the abelian
extensions of an imaginary quadratic field ($\mathbb{Q}(\sqrt{-D})$ for $D$ positive), where the abelian
extensions can be constructed by adjoining values of certain elliptic functions (this is
the analogue of adjoining the roots of unity, which are the values of the exponential
function $e^x$ for certain $x$). The study of the arithmetic of such abelian extensions and
the search for similar results for non-abelian extensions are rich and fascinating areas
of current mathematical research.

We end our discussion of the cyclotomic fields with the problem of the constructibility
of the regular $n$-gon by straightedge and compass.

Recall (cf. Section 13.3) that an element $\alpha$ is constructible over $\mathbb{Q}$ if and only if the
field $\mathbb{Q}(\alpha)$ is contained in a field $K$ obtained by a series of quadratic extensions:

$$\mathbb{Q} = K_0 \subset K_1 \subset \cdots \subset K_i \subset K_{i+1} \subset \cdots \subset K_m = K \tag{14.12}$$

with

$$[K_{i+1} : K_i] = 2, \qquad i = 0, 1, \dots, m - 1.$$

The construction of the regular $n$-gon in $\mathbb{R}^2$ is evidently equivalent to the construction
of the $n$th roots of unity, since the $n$th roots of unity form the vertices of a regular
$n$-gon on the unit circle in $\mathbb{C}$ with one vertex at the point 1.

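The construction of $\zeta_n$ is equivalent to the constructibility of the first coordinate $x$
in $\mathbb{R}^2$ of $\zeta_n$, namely the real part of $\zeta_n$. Since the complex conjugate of $\zeta_n$ is just $\zeta_n^{-1}$,
the real part of $\zeta_n$ is $x = \frac{1}{2}(\zeta_n + \zeta_n^{-1})$. Note that $\zeta_n$ satisfies the quadratic equation
$\zeta_n^2 - 2x\zeta_n + 1 = 0$ over $\mathbb{Q}(x)$. Since $\mathbb{Q}(x)$ consists only of real numbers, it follows
(for $n > 2$) that $[\mathbb{Q}(\zeta_n) : \mathbb{Q}(x)] = 2$, so that $\mathbb{Q}(x)$ is an extension of degree $\varphi(n)/2$ of $\mathbb{Q}$.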

It follows that if the regular $n$-gon can be constructed by straightedge and compass
then $\varphi(n)$ must be a power of 2. Conversely, if $\varphi(n) = 2^m$ is a power of 2, then the
Galois group $\operatorname{Gal}(\mathbb{Q}(\zeta_n)/\mathbb{Q})$ is an abelian group whose order is a power of 2, so the same
is true for the Galois group $\operatorname{Gal}(\mathbb{Q}(x)/\mathbb{Q})$. It is easy to see by the Fundamental Theorem
for Abelian Groups that an abelian group $G$ of order $2^m$ has a chain of subgroups

$$G = G_m > G_{m-1} > \cdots > G_{i+1} > G_i > \cdots > G_0 = 1$$

with

$$[G_{i+1} : G_i] = 2, \qquad i = 0, 1, 2, \dots, m - 1.$$

Applying this to the group $G = \operatorname{Gal}(\mathbb{Q}(x)/\mathbb{Q})$ and taking the fixed fields for the
subgroups $G_i$, $i = 0, 1, \dots, m - 1$, we obtain (by the Fundamental Theorem of Galois
Theory) a sequence of quadratic extensions as in (12) above.

We conclude that the regular $n$-gon can be constructed by straightedge and compass
if and only if $\varphi(n)$ is a power of 2. Decomposing $n$ into prime powers to compute $\varphi(n)$
we see that this means $n = 2^k p_1 \cdots p_r$ is the product of a power of 2 and distinct odd
primes $p_i$ where $p_i - 1$ is a power of 2. It is an elementary exercise to see that a prime
$p$ with $p - 1$ a power of 2 must be of the form

$$p = 2^{2^s} + 1$$

for some integer $s$. Such primes are called Fermat primes. The first few are

$$3 = 2^1 + 1$$
$$5 = 2^2 + 1$$
$$17 = 2^4 + 1$$
$$257 = 2^8 + 1$$
$$65537 = 2^{16} + 1$$

(but $2^{32} + 1$ is not a prime, being divisible by 641). It is not known if there are infinitely
many Fermat primes. We summarize this in the following proposition.

Proposition 29. The regular $n$-gon can be constructed by straightedge and compass if
and only if $n = 2^k p_1 \cdots p_r$ is the product of a power of 2 and distinct Fermat primes.
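
The criterion in Proposition 29 is easy to test for small $n$ through the equivalent condition that $\varphi(n)$ be a power of 2. The Python fragment below is a brute-force sketch of that test (the function names are ours, and $\varphi$ is computed naively, so it is meant only for small $n$).

```python
from math import gcd

def phi(n):
    """Euler phi-function, computed by direct count."""
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

def is_power_of_two(m):
    return m >= 1 and (m & (m - 1)) == 0

def constructible(n):
    """The regular n-gon is constructible iff phi(n) is a power of 2."""
    return is_power_of_two(phi(n))

constructible_up_to_20 = [n for n in range(3, 21) if constructible(n)]
```

In particular the regular 7-gon, 9-gon and 13-gon are not constructible, while the 15-gon and 17-gon are. The remark about $2^{32} + 1$ can be confirmed by the single computation `(2**32 + 1) % 641`.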

The proof above actually indicates a procedure for constructing the regular $n$-gon
as a succession of square roots. For example, the construction of the regular 17-gon
(solved by Gauss in 1796 at age 19) requires the construction of the subfields of degrees
2, 4, 8 and 16 in $\mathbb{Q}(\zeta_{17})$. These subfields can be constructed by forming the periods of
$\zeta_{17}$ as in the example of the 13th roots of unity above. In this case, the fact that $\mathbb{Q}(\zeta_{17})$
is obtained by a series of quadratic extensions reflects itself in the fact that the periods
can be “halved” successively (i.e., if $H_1 < H_2$ are subgroups with $[H_2 : H_1] = 2$ then
the periods for $H_1$ satisfy a quadratic equation whose coefficients involve the periods
for $H_2$). For example, the periods for the subgroup of index 2 (generated by $\sigma_2$) in the
Galois group are ($\zeta = \zeta_{17}$)

$$\eta_1 = \zeta + \zeta^2 + \zeta^4 + \zeta^8 + \zeta^9 + \zeta^{13} + \zeta^{15} + \zeta^{16}$$

$$\eta_2 = \zeta^3 + \zeta^5 + \zeta^6 + \zeta^7 + \zeta^{10} + \zeta^{11} + \zeta^{12} + \zeta^{14}$$

which “halve” the period for the full Galois group and which satisfy

$$\eta_1 + \eta_2 = -1$$

(from the minimal polynomial satisfied by $\zeta_{17}$) and

$$\eta_1 \eta_2 = -4$$

(which requires computation; we know that it must be rational by Galois Theory,
since this product is fixed by all the elements of the Galois group). Hence these two
periods are the roots of the quadratic equation

$$x^2 + x - 4 = 0$$

which we can solve explicitly. In a similar way, the periods for the subgroup of index 4
(generated by $\sigma_4$) naturally halve these periods, so are quadratic over these, etc. In this
way one can determine $\zeta_{17}$ explicitly in terms of iterated square roots. For example, one
finds that $8(\zeta + \zeta^{-1}) = 16\cos\left(\frac{2\pi}{17}\right)$ (which is enough to construct the regular 17-gon)
is given explicitly by

$$-1 + \sqrt{17} + \sqrt{2(17 - \sqrt{17})} + 2\sqrt{17 + 3\sqrt{17} - \sqrt{2(17 - \sqrt{17})} - 2\sqrt{2(17 + \sqrt{17})}}.$$

A relatively simple construction of the regular 17-gon (shown to us by J.H. Conway)
is indicated in the exercises.
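
The relations among the periods and the closed formula for $16\cos(2\pi/17)$ can be checked numerically. The following Python fragment is only a floating-point sanity check under the labelling of the periods used above, not a substitute for the computation; the variable names are our own.

```python
import cmath
import math

zeta = cmath.exp(2j * cmath.pi / 17)

# Exponents in the subgroup generated by 2 in (Z/17Z)^x, and its complement.
H = [1, 2, 4, 8, 9, 13, 15, 16]
eta1 = sum(zeta ** a for a in H)
eta2 = sum(zeta ** a for a in range(1, 17) if a not in H)

# The iterated square root expression for 16 cos(2 pi / 17).
s17 = math.sqrt(17)
inner = math.sqrt(2 * (17 - s17))
gauss = -1 + s17 + inner + 2 * math.sqrt(17 + 3 * s17 - inner - 2 * math.sqrt(2 * (17 + s17)))
```

Up to rounding error one finds `eta1 + eta2` equal to $-1$, `eta1 * eta2` equal to $-4$, and `gauss` equal to `16 * math.cos(2 * math.pi / 17)`.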

While we have seen that it is not possible to solve for $\zeta_n$ using only successive
square roots in general, by definition it is possible to obtain $\zeta_n$ by successive extraction
of higher roots (namely, taking an $n$th root of 1). This is not the case for solutions
of general equations of degree $n$, where one cannot generally determine solutions by
radicals, as we shall see in the next sections.

EXERCISES


1. Determine the minimal polynomials satisfied by the primitive generators given in the text
for the subfields of $\mathbb{Q}(\zeta_{13})$.

2. Determine the subfields of $\mathbb{Q}(\zeta_8)$ generated by the periods of $\zeta_8$ and in particular show
that not every subfield has such a period as primitive element.

3. Determine the quadratic equation satisfied by the period $\alpha = \zeta_5 + \zeta_5^{-1}$ of the 5th root
of unity $\zeta_5$. Determine the quadratic equation satisfied by $\zeta_5$ over $\mathbb{Q}(\alpha)$ and use this to
explicitly solve for the 5th root of unity.

4. Let $\sigma_a \in \operatorname{Gal}(\mathbb{Q}(\zeta_n)/\mathbb{Q})$ denote the automorphism of the cyclotomic field of $n$th roots of
unity which maps $\zeta_n$ to $\zeta_n^a$ where $a$ is relatively prime to $n$ and $\zeta_n$ is a primitive $n$th root of
unity. Show that $\sigma_a(\zeta) = \zeta^a$ for every $n$th root of unity $\zeta$.

5. Let $p$ be a prime and let $\epsilon_1, \epsilon_2, \dots, \epsilon_{p-1}$ denote the primitive $p$th roots of unity. Set
$p_n = \epsilon_1^n + \epsilon_2^n + \cdots + \epsilon_{p-1}^n$, the sum of the $n$th powers of the $\epsilon_i$. Prove that $p_n = -1$ if $p$
does not divide $n$ and that $p_n = p - 1$ if $p$ does divide $n$. [One approach: $p_1 = -1$ from $\Phi_p(x)$;
show that $p_n$ is a Galois conjugate of $p_1$ for $p$ not dividing $n$, hence is also $-1$.]

6. Let $\zeta_n$ denote a primitive $n$th root of unity and let $K = \mathbb{Q}(\zeta_n)$ be the associated cyclotomic
field. Let $a$ denote the trace of $\zeta_n$ from $K$ to $\mathbb{Q}$ (cf. Exercise 18 of Section 2). Prove that
$a = 1$ if $n = 1$, $a = 0$ if $n$ is divisible by the square of a prime, and $a = (-1)^r$ if $n$ is the
product of $r$ distinct primes.

7. Show that complex conjugation restricts to the automorphism $\sigma_{-1} \in \operatorname{Gal}(\mathbb{Q}(\zeta_n)/\mathbb{Q})$ of
the cyclotomic field of $n$th roots of unity. Show that the field $K^+ = \mathbb{Q}(\zeta_n + \zeta_n^{-1})$ is the
subfield of real elements in $K = \mathbb{Q}(\zeta_n)$, called the maximal real subfield of $K$.

8. Let $K_n = \mathbb{Q}(\zeta_{2^{n+2}})$ be the cyclotomic field of $2^{n+2}$-th roots of unity, $n \ge 0$. Set
$\alpha_n = \zeta_{2^{n+2}} + \zeta_{2^{n+2}}^{-1}$ and $K_n^+ = \mathbb{Q}(\alpha_n)$, the maximal real subfield of $K_n$.

(a) Show that for all $n \ge 0$, $[K_n : \mathbb{Q}] = 2^{n+1}$, $[K_n : K_n^+] = 2$, $[K_n^+ : \mathbb{Q}] = 2^n$, and
$[K_{n+1}^+ : K_n^+] = 2$.

(b) Determine the quadratic equation satisfied by $\zeta_{2^{n+2}}$ over $K_n^+$ in terms of $\alpha_n$.

(c) Show that for $n \ge 0$, $\alpha_{n+1}^2 = 2 + \alpha_n$ and hence show that

$$\alpha_n = \sqrt{2 + \sqrt{2 + \cdots + \sqrt{2}}} \qquad (n \text{ times}),$$

giving an explicit formula for the (constructible) $2^{n+2}$-th roots of unity.

9. Notation as in the previous exercise.

(a) Prove that $K_n^+$ is a cyclic extension of $\mathbb{Q}$ of degree $2^n$. [Use an explicit isomorphism
$(\mathbb{Z}/2^{n+2}\mathbb{Z})^\times \cong \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/2^n\mathbb{Z}$ as abelian groups (i.e., $(\mathbb{Z}/2^{n+2}\mathbb{Z})^\times$ is isomorphic to
the direct product of a cyclic group of order 2 and a cyclic group of order $2^n$, cf. Exercises 22
and 23 of Section 2.3).]

(b) Prove that $K_n$ is a biquadratic extension of $K_{n-1}^+$ and that two of the three intermediate
subfields are $K_n^+$ and $K_{n-1}$. Prove that the remaining field intermediate between $K_{n-1}^+$
and $K_n$ is a cyclic extension of $\mathbb{Q}$ of degree $2^n$.

10. Prove that $\mathbb{Q}(\sqrt[3]{2})$ is not a subfield of any cyclotomic field over $\mathbb{Q}$.

11. Prove that the primitive $n$th roots of unity form a basis over $\mathbb{Q}$ for the cyclotomic field of
$n$th roots of unity if and only if $n$ is squarefree (i.e., $n$ is not divisible by the square of any
prime).

12. Let $\sigma_p$ denote the Frobenius automorphism $x \mapsto x^p$ of the finite field $\mathbb{F}_q$ of $q = p^n$
elements. Viewing $\mathbb{F}_q$ as a vector space $V$ of dimension $n$ over $\mathbb{F}_p$ we can consider $\sigma_p$ as a
linear transformation of $V$ to $V$. Determine the characteristic polynomial of $\sigma_p$ and prove
that the linear transformation $\sigma_p$ is diagonalizable over $\mathbb{F}_p$ if and only if $n$ divides $p - 1$,
and is diagonalizable over the algebraic closure of $\mathbb{F}_p$ if and only if $(n, p) = 1$.

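13. Let $n = p_1^{a_1} p_2^{a_2} \cdots p_k^{a_k}$ be the prime factorization of $n$ and let $\zeta_n$ be a primitive $n$th root of
unity. For each $i = 1, 2, \dots, k$ define $d_i$ by $n = p_i^{a_i} d_i$ and let $\zeta_{p_i^{a_i}} = \zeta_n^{d_i}$, so that $\zeta_{p_i^{a_i}}$ is
a particular primitive $p_i^{a_i}$-th root of unity. Let $\sigma_a \in \operatorname{Gal}(\mathbb{Q}(\zeta_n)/\mathbb{Q})$ be the automorphism
mapping $\zeta_n$ to $\zeta_n^a$ for $a$ relatively prime to $n$.

(a) Prove that for $i = 1, 2, \dots, k$, $\sigma_a$ maps $\zeta_{p_i^{a_i}}$ to $\zeta_{p_i^{a_i}}^a$ and gives an automorphism of
$\mathbb{Q}(\zeta_{p_i^{a_i}})/\mathbb{Q}$ which depends only on $a \pmod{p_i^{a_i}}$, which we may denote $\sigma_{a \,(\mathrm{mod}\, p_i^{a_i})}$.

(b) Prove that the map $\sigma_a \mapsto (\sigma_{a \,(\mathrm{mod}\, p_1^{a_1})}, \dots, \sigma_{a \,(\mathrm{mod}\, p_k^{a_k})})$ is the isomorphism of
Corollary 27 corresponding to the Chinese Remainder Theorem for $(\mathbb{Z}/n\mathbb{Z})^\times$.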

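The following Exercises 14 to 18 determine the periods associated to a primitive 17th root of
unity and provide a proof for the simple geometric construction indicated in Exercise 17 for
the regular 17-gon. Let $\zeta = \zeta_{17} = \cos\frac{2\pi}{17} + i\sin\frac{2\pi}{17}$ be a fixed primitive 17th root of unity
in $\mathbb{C}$.

14. Define the periods of $\zeta$ as follows:

$$\eta_1 = \zeta + \zeta^2 + \zeta^4 + \zeta^8 + \zeta^9 + \zeta^{13} + \zeta^{15} + \zeta^{16}$$
$$\eta_2 = \zeta^3 + \zeta^5 + \zeta^6 + \zeta^7 + \zeta^{10} + \zeta^{11} + \zeta^{12} + \zeta^{14}$$
$$\eta_1' = \zeta + \zeta^4 + \zeta^{13} + \zeta^{16}$$
$$\eta_2' = \zeta^2 + \zeta^8 + \zeta^9 + \zeta^{15}$$
$$\eta_3' = \zeta^6 + \zeta^7 + \zeta^{10} + \zeta^{11}$$
$$\eta_4' = \zeta^3 + \zeta^5 + \zeta^{12} + \zeta^{14}$$
$$\eta_1'' = \zeta + \zeta^{16}$$
$$\eta_2'' = \zeta^4 + \zeta^{13}.$$

(a) Show that all of these periods are real numbers and that $\eta_1'' = 2\cos\frac{2\pi}{17}$. Show that
as real numbers these periods are approximately

$$\eta_1 \approx 1.562 \qquad \eta_1' \approx 2.049 \qquad \eta_3' \approx -2.906 \qquad \eta_1'' \approx 1.865$$
$$\eta_2 \approx -2.562 \qquad \eta_2' \approx -0.488 \qquad \eta_4' \approx 0.344 \qquad \eta_2'' \approx 0.185.$$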

(b) Prove that $\eta_1$ and $\eta_2$ are roots of the equation $x^2 + x - 4 = 0$.

(c) Prove that $\eta_1'$ and $\eta_2'$ are roots of the equation $x^2 - \eta_1 x - 1 = 0$ and that $\eta_3'$ and $\eta_4'$
are roots of the equation $x^2 - \eta_2 x - 1 = 0$.

(d) Prove that $\eta_1''$ and $\eta_2''$ are roots of the equation $x^2 - \eta_1' x + \eta_4' = 0$.

15. Prove that if $\tan 2\theta = a$ ($0 < 2\theta < \frac{\pi}{2}$) then $\tan\theta$ satisfies the equation
$x^2 + \frac{2}{a}x - 1 = 0$.

16. Let $C$ be the circle in $\mathbb{R}^2$ having the points $(h, k)$ and $(0, 1)$ as a diameter. Prove that this
circle intersects the $x$-axis if and only if $h^2 - 4k \ge 0$ and in this case the two intercepts
are the roots of the equation $x^2 - hx + k = 0$.

17. (Construction of the Regular 17-gon) Draw a circle of radius 2 centered at the origin $(0, 0)$.

(a) Join the point $(4, 0)$ to the point $(0, 1)$ and construct the line $\ell_1$ bisecting the angle
between this line and the $y$-axis. Construct the line $\ell_2$ perpendicular to $\ell_1$ as in Figure 2.

[Fig. 2]

(b) Using the intersection of $\ell_1$ and the $x$-axis as center and radius equal to the distance
to $(0, 1)$, construct the circle $C_1$ and let $A = (s, 0)$ be the right-hand point of
intersection of $C_1$ with the $x$-axis. Similarly, let $B = (t, 0)$ denote the right-hand point of
intersection of the $x$-axis and the circle $C_2$ whose center is the intersection of $\ell_2$ and
the $x$-axis and whose radius is equal to the distance to $(0, 1)$ as in Figure 3.

[Fig. 3]

(c) Construct a perpendicular to the $x$-axis at the point $A$ and mark off the distance $t$ from
$(0, 0)$ to $B$ to construct the point $(s, t)$. Construct the circle with $(s, t)$ and $(0, 1)$ as a
diameter and let $P$ denote the right-hand point of intersection of this circle with the
$x$-axis. The perpendicular to the $x$-axis at $P$ intersects the circle of radius 2 at the
second vertex of a regular 17-gon whose first vertex is at $(2, 0)$, hence constructs the
regular 17-gon by straightedge and compass as in Figure 4.

[Fig. 4]

18. Notation as in the previous exercises.

(a) Prove that $\ell_1$ intersects the $x$-axis in the point $(\eta_1/2, 0)$ and that $\ell_2$ intersects the
$x$-axis in the point $(\eta_2/2, 0)$.

(b) Prove that $C_1$ is the circle having the points $(\eta_1, -1)$ and $(0, 1)$ as diameter. Prove
that $s = \eta_1'$. Similarly prove that $C_2$ is the circle having the points $(\eta_2, -1)$ and $(0, 1)$
as diameter and that $t = \eta_4'$.

(c) Prove that $P$ has coordinates $(\eta_1'', 0)$ and hence that the construction in the previous
problem constructs the regular 17-gon by straightedge and compass.


14.6 GALOIS GROUPS OF POLYNOMIALS 

Recall that the Galois group of a separable polynomial $f(x) \in F[x]$ is defined to be
the Galois group of the splitting field of $f(x)$ over $F$.

If $K$ is a Galois extension of $F$ then $K$ is the splitting field for some separable
polynomial $f(x)$ over $F$. Any automorphism $\sigma \in \operatorname{Gal}(K/F)$ maps a root of an irreducible
factor of $f(x)$ to another root of the irreducible factor and $\sigma$ is uniquely determined by
its action on these roots (since they generate $K$ over $F$). If we fix a labelling of the roots
$\alpha_1, \dots, \alpha_n$ of $f(x)$ we see that any $\sigma \in \operatorname{Gal}(K/F)$ defines a unique permutation of
$\alpha_1, \dots, \alpha_n$, hence defines a unique permutation of the subscripts $\{1, 2, \dots, n\}$ (which
depends on the fixed labelling of the roots). This gives an injection

$$\operatorname{Gal}(K/F) \hookrightarrow S_n$$

of the Galois group into the symmetric group on $n$ letters which is clearly a
homomorphism (both group operations are composition). We may therefore think of Galois
groups as subgroups of symmetric groups. Since the degree of the splitting field is the
same as the order of the Galois group by the Fundamental Theorem, this explains from
the group-theoretic side why the splitting field for a polynomial of degree $n$ over $F$ is
of degree at most $n!$ over $F$ (Proposition 13.26).

In general, if the factorization of $f(x)$ into irreducibles is $f(x) = f_1(x) \cdots f_k(x)$
where $f_i(x)$ has degree $n_i$, $i = 1, 2, \dots, k$, then since the Galois group permutes the
roots of the irreducible factors among themselves we have $\operatorname{Gal}(K/F) \le S_{n_1} \times \cdots \times S_{n_k}$.

If $f(x)$ is irreducible, then given any two roots of $f(x)$ there is an automorphism
in the Galois group $G$ of $f(x)$ which maps the first root to the second (this follows from
our extension Theorem 13.27). Such a group is said to be transitive on the roots, i.e.,
you can get from any given root to any other root by applying some element of $G$. The
fact that the Galois group must be transitive on blocks of roots (namely, the roots of the
irreducible factors) can often be helpful in reducing the number of possibilities for the
structure of $G$ (cf. the discussion of Galois groups of polynomials of degree 4 below).

Examples 

(1) Consider the biquadratic extension $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ over $\mathbb{Q}$, which is the splitting field of
$(x^2 - 2)(x^2 - 3)$. Label the roots as $\alpha_1 = \sqrt{2}$, $\alpha_2 = -\sqrt{2}$, $\alpha_3 = \sqrt{3}$ and $\alpha_4 = -\sqrt{3}$.
The elements of the Galois group are $\{1, \sigma, \tau, \sigma\tau\}$ where $\sigma$ maps $\sqrt{2}$ to $-\sqrt{2}$ and
fixes $\sqrt{3}$ and $\tau$ fixes $\sqrt{2}$ and maps $\sqrt{3}$ to $-\sqrt{3}$. As permutations of the roots for this
labelling we see that $\sigma$ interchanges the first two and fixes the second two and $\tau$ fixes
the first two and interchanges the second two, i.e.,

$$\sigma = (1\,2) \quad \text{and} \quad \tau = (3\,4)$$

as elements of $S_4$. Similarly, or by taking the product of these two elements, we see
that

$$\sigma\tau = (1\,2)(3\,4) \in S_4.$$

Hence

$$\operatorname{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{3})/\mathbb{Q}) \cong \{1, (1\,2), (3\,4), (1\,2)(3\,4)\} \subset S_4$$

identifying this Galois group with a Klein 4-subgroup of $S_4$. Note that if we had
changed the labelling of the roots above we would have obtained a different (isomorphic)
representation of the Galois group as a subgroup of $S_4$ (for example, interchanging
the second and third roots would have given the subgroup $\{1, (1\,3), (2\,4), (1\,3)(2\,4)\}$).

(2) The Galois group of $x^3 - 2$ acts as permutations on the three roots $\sqrt[3]{2}$, $\rho\sqrt[3]{2}$ and
$\rho^2\sqrt[3]{2}$ where $\rho$ is a primitive 3rd root of unity. With this ordering, the generators $\sigma$
and $\tau$ we have defined earlier give the permutations

$$\sigma = (1\,2\,3) \qquad \tau = (2\,3)$$

which gives

$$\{1, \sigma, \sigma^2, \tau, \tau\sigma, \tau\sigma^2\} = \{1, (1\,2\,3), (1\,3\,2), (2\,3), (1\,3), (1\,2)\} = S_3,$$

in this case the full symmetric group on 3 letters.
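
The products in Example (2) can be checked by composing permutations explicitly. The following Python sketch uses the convention that a product acts from right to left, $(fg)(i) = f(g(i))$, which is the convention matching the list of elements in that example; the representation of permutations as dictionaries and the names used are our own choices.

```python
def compose(f, g):
    """Return the product f g acting by (f g)(i) = f(g(i))."""
    return {i: f[g[i]] for i in g}

identity = {1: 1, 2: 2, 3: 3}
sigma = {1: 2, 2: 3, 3: 1}    # the 3-cycle (1 2 3)
tau = {1: 1, 2: 3, 3: 2}      # the transposition (2 3)

sigma2 = compose(sigma, sigma)
elements = [identity, sigma, sigma2, tau, compose(tau, sigma), compose(tau, sigma2)]
```

With this convention `compose(tau, sigma)` is the transposition $(1\,3)$ and `compose(tau, sigma2)` is $(1\,2)$, and the six entries of `elements` are distinct, so they exhaust $S_3$.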

Recall that every finite group is isomorphic to a subgroup of some symmetric group
$S_n$. It is an open problem to determine whether every finite group appears as the Galois
group for some polynomial over $\mathbb{Q}$. We have seen in the last section that every finite abelian
group is a Galois group over $\mathbb{Q}$ (for some subfield of a cyclotomic field). We shall
explicitly determine the Galois groups for polynomials of small degree ($\le 4$) below
which will in particular show that every subgroup of $S_4$ arises as a Galois group.

We first introduce some definitions and show that the “general” polynomial of
degree $n$ has $S_n$ as Galois group (so the second example above should be viewed as
“typical”).

Definition. Let $x_1, x_2, \dots, x_n$ be indeterminates. The elementary symmetric functions
$s_1, s_2, \dots, s_n$ are defined by

$$s_1 = x_1 + x_2 + \cdots + x_n$$
$$s_2 = x_1 x_2 + x_1 x_3 + \cdots + x_2 x_3 + x_2 x_4 + \cdots + x_{n-1} x_n$$
$$\vdots$$
$$s_n = x_1 x_2 \cdots x_n$$

i.e., the $i$th symmetric function $s_i$ of $x_1, x_2, \dots, x_n$ is the sum of all products of the $x_j$'s
taken $i$ at a time.

Definition. The general polynomial of degree $n$ is the polynomial

$$(x - x_1)(x - x_2) \cdots (x - x_n)$$

whose roots are the indeterminates $x_1, x_2, \dots, x_n$.

It is easy to see by induction that the coefficients of the general polynomial of
degree $n$ are given by the elementary symmetric functions in the roots:

$$(x - x_1)(x - x_2) \cdots (x - x_n) = x^n - s_1 x^{n-1} + s_2 x^{n-2} + \cdots + (-1)^n s_n. \tag{14.13}$$

For any field $F$, the extension $F(x_1, x_2, \dots, x_n)$ is then a Galois extension of the
field $F(s_1, s_2, \dots, s_n)$ since it is the splitting field of the general polynomial of degree
$n$.

If $\sigma \in S_n$ is any permutation of $\{1, 2, \dots, n\}$, then $\sigma$ acts on the rational functions in
$F(x_1, x_2, \dots, x_n)$ by permuting the subscripts of the variables $x_1, x_2, \dots, x_n$. It is clear
that this gives an automorphism of $F(x_1, x_2, \dots, x_n)$. Identifying $\sigma \in S_n$ with this
automorphism of $F(x_1, x_2, \dots, x_n)$ identifies $S_n$ as a subgroup of $\operatorname{Aut}(F(x_1, x_2, \dots, x_n))$.

The elementary symmetric functions $s_1, s_2, \dots, s_n$ are fixed under any permutation
of their subscripts (this is the reason they are called symmetric), which shows that the
subfield $F(s_1, s_2, \dots, s_n)$ is contained in the fixed field of $S_n$. By the Fundamental
Theorem of Galois Theory, the fixed field of $S_n$ has index precisely $n!$ in $F(x_1, x_2, \dots, x_n)$.
Since $F(x_1, x_2, \dots, x_n)$ is the splitting field over $F(s_1, s_2, \dots, s_n)$ of the polynomial
of degree $n$ in (13), we have

$$[F(x_1, x_2, \dots, x_n) : F(s_1, s_2, \dots, s_n)] \le n!. \tag{14.14}$$

It follows that we actually have equality and that $F(s_1, s_2, \dots, s_n)$ is precisely the fixed
field of $S_n$. This proves the following result.

Proposition 30. The fixed field of the symmetric group $S_n$ acting on the field of
rational functions in $n$ variables $F(x_1, x_2, \dots, x_n)$ is the field of rational functions in
the elementary symmetric functions $F(s_1, s_2, \dots, s_n)$.

Definition. A rational function $f(x_1, x_2, \dots, x_n)$ is called symmetric if it is not changed
by any permutation of the variables $x_1, x_2, \dots, x_n$.

Corollary 31. (Fundamental Theorem on Symmetric Functions) Any symmetric
function in the variables $x_1, x_2, \dots, x_n$ is a rational function in the elementary symmetric
functions $s_1, s_2, \dots, s_n$.

Proof: A symmetric function lies in the fixed field of $S_n$ above, hence is a rational
function in $s_1, \dots, s_n$.

This corollary explains why these are called the elementary symmetric functions.

Remark: If $f(x_1, \dots, x_n)$ is a polynomial in $x_1, x_2, \dots, x_n$ which is symmetric then it
can be seen that $f$ is actually a polynomial in $s_1, s_2, \dots, s_n$, which strengthens the
statement of the corollary. It is in fact true that a symmetric polynomial whose coefficients
lie in $R$, where $R$ is any commutative ring with identity, is a polynomial in the
elementary symmetric functions with coefficients in $R$. A proof of this fact is implicit in the
algorithm outlined in the exercises for writing a symmetric polynomial as a polynomial
in the elementary symmetric functions.

Examples

(1) The expression $(x_1 - x_2)^2$ is symmetric in $x_1, x_2$. We have

$$(x_1 - x_2)^2 = (x_1 + x_2)^2 - 4x_1 x_2 = s_1^2 - 4s_2,$$

a polynomial in the elementary symmetric functions.

(2) The polynomial $x_1^2 + x_2^2 + x_3^2$ is symmetric in $x_1, x_2, x_3$, and in this case we have

$$x_1^2 + x_2^2 + x_3^2 = (x_1 + x_2 + x_3)^2 - 2(x_1 x_2 + x_1 x_3 + x_2 x_3)$$

$$= s_1^2 - 2s_2.$$

(3) The polynomial $x_1^2 x_2^2 + x_1^2 x_3^2 + x_2^2 x_3^2$ is symmetric. Since

$$(x_1 x_2 + x_1 x_3 + x_2 x_3)^2 = x_1^2 x_2^2 + x_1^2 x_3^2 + x_2^2 x_3^2 + 2(x_1^2 x_2 x_3 + x_2^2 x_1 x_3 + x_3^2 x_1 x_2)$$

$$= x_1^2 x_2^2 + x_1^2 x_3^2 + x_2^2 x_3^2 + 2x_1 x_2 x_3 (x_1 + x_2 + x_3)$$

we have

$$x_1^2 x_2^2 + x_1^2 x_3^2 + x_2^2 x_3^2 = s_2^2 - 2s_1 s_3.$$
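
Identities of this kind are easy to test by substituting integers for the variables. The Python sketch below computes the elementary symmetric functions directly from the definition and checks the three examples on random integer inputs; agreement on random inputs is of course evidence rather than proof, and the function names are our own.

```python
from itertools import combinations
from math import prod
import random

def elementary(xs):
    """Return [s_1, ..., s_n], where s_i is the sum of all products of the x_j taken i at a time."""
    return [sum(prod(c) for c in combinations(xs, i)) for i in range(1, len(xs) + 1)]

def check_examples(trials=100, seed=0):
    """Test the identities of Examples (1)-(3) on random integer values."""
    rng = random.Random(seed)
    for _ in range(trials):
        x1, x2, x3 = (rng.randint(-50, 50) for _ in range(3))
        t1, t2 = elementary([x1, x2])
        s1, s2, s3 = elementary([x1, x2, x3])
        assert (x1 - x2) ** 2 == t1 ** 2 - 4 * t2
        assert x1 ** 2 + x2 ** 2 + x3 ** 2 == s1 ** 2 - 2 * s2
        assert (x1 * x2) ** 2 + (x1 * x3) ** 2 + (x2 * x3) ** 2 == s2 ** 2 - 2 * s1 * s3
    return True
```

For instance `elementary([1, 2, 3])` returns the coefficients (up to sign) of $(x - 1)(x - 2)(x - 3) = x^3 - 6x^2 + 11x - 6$.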


Suppose now we start with the general polynomial

$$x^n - s_1 x^{n-1} + s_2 x^{n-2} + \cdots + (-1)^n s_n$$

over the field $F(s_1, s_2, \dots, s_n)$ where we view the $s_i$, $i = 1, 2, \dots, n$ as indeterminates.
If we define the roots of this polynomial to be $x_1, x_2, \dots, x_n$ then the $s_i$ are precisely
the elementary symmetric functions in the roots $x_1, \dots, x_n$. Moreover, these roots
are indeterminates as well in the sense that there are no polynomial relations over $F$
between them. For suppose $p(t_1, \dots, t_n)$ is a nonzero polynomial in $n$ variables with
coefficients in $F$ such that $p(x_1, \dots, x_n) = 0$. Then the product, $\tilde{p}$, over all $\sigma$ in
$S_n$ of $p(t_{\sigma(1)}, \dots, t_{\sigma(n)})$ is a nonzero symmetric polynomial with $\tilde{p}(x_1, \dots, x_n) = 0$.
This gives a nonzero polynomial relation over $F$ among $s_1, \dots, s_n$, a contradiction.
Conversely, if the roots of a polynomial $f(x)$ are independent indeterminates over $F$,
then so are the coefficients of $f(x)$ (cf. the beginning of Section 9). Thus defining the
general polynomial over $F$ as having indeterminate roots or indeterminate coefficients
is equivalent. From this point of view our result can be stated in the following form.


Theorem 32. The general polynomial

$$x^n - s_1x^{n-1} + s_2x^{n-2} + \cdots + (-1)^n s_n$$

over the field $F(s_1, s_2, \ldots, s_n)$ is separable with Galois group $S_n$.


This result says that if there are no relations among the coefficients of a polynomial
of degree n (which is what we mean when we say the $s_i$ are indeterminates above) then
the Galois group of this polynomial over the field generated by its coefficients is the
full symmetric group $S_n$. Loosely speaking, this means that the "generic" polynomial
of degree n will have $S_n$ as Galois group. Note, however, that over finite fields every
polynomial has a cyclic Galois group (all extensions of finite fields are cyclic), so that
"generic" polynomials in this sense do not exist. Over Q one can make precise the
notion of "generic" polynomial and then it is true that most polynomials have the full
symmetric group as Galois group.

For $n \ge 5$ there is only one nontrivial proper normal subgroup of $S_n$, namely the
subgroup $A_n$ of index 2. Hence in general there is only one normal subfield of
$F(x_1, x_2, \ldots, x_n)$ properly containing $F(s_1, s_2, \ldots, s_n)$ other than
$F(x_1, x_2, \ldots, x_n)$ itself, and it is an extension of degree 2.

Definition. Define the discriminant D of $x_1, x_2, \ldots, x_n$ by the formula

$$D = \prod_{i<j} (x_i - x_j)^2.$$

Define the discriminant of a polynomial to be the discriminant of the roots of the
polynomial.

The discriminant D is a symmetric function in $x_1, \ldots, x_n$, hence is an element of
$K = F(s_1, s_2, \ldots, s_n)$.

When we first defined the alternating group $A_n$ we saw that a permutation $\sigma \in S_n$
is an element of the subgroup $A_n$ if and only if $\sigma$ fixes the product

$$\sqrt{D} = \prod_{i<j} (x_i - x_j) \in \mathbb{Z}[x_1, x_2, \ldots, x_n].$$

It follows (by the Fundamental Theorem) that if F has characteristic different from 2
then $\sqrt{D}$ generates the fixed field of $A_n$ and generates a quadratic extension of K. This
proves the following proposition.

Proposition 33. If ch(F) $\ne$ 2 then the permutation $\sigma \in S_n$ is an element of $A_n$ if and
only if it fixes the square root of the discriminant D.
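
The relation between the sign of a permutation and its action on the square root of the discriminant can be checked numerically. The following is a minimal sketch, assuming four sample integer values stand in for the indeterminates; the function names `sqrt_disc` and `sign` are illustrative and do not come from the text.

```python
from itertools import permutations

def sqrt_disc(xs):
    """Product of (x_i - x_j) over i < j: a square root of the discriminant."""
    prod = 1
    n = len(xs)
    for i in range(n):
        for j in range(i + 1, n):
            prod *= xs[i] - xs[j]
    return prod

def sign(perm):
    """Sign of a permutation of 0..n-1, computed by counting inversions."""
    n = len(perm)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n)
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

xs = [2, 3, 5, 11]  # assumed sample values standing in for x_1, ..., x_4
base = sqrt_disc(xs)
for perm in permutations(range(len(xs))):
    permuted = [xs[perm[i]] for i in range(len(xs))]
    # A permutation fixes the product exactly when it is even.
    assert sqrt_disc(permuted) == sign(perm) * base
```

Every permutation multiplies the product by its sign, so the even permutations are exactly those that fix it.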

We now consider the Galois groups of separable polynomials of small degree ($\le 4$)
over a field F which we assume is of characteristic different from 2 and 3. Note that
over Q or over a finite field (or, more generally, over any perfect field) the splitting field
of an arbitrary polynomial f(x) is the same as the splitting field for the product of the
irreducible factors of f(x) taken precisely once, which is a separable polynomial.

If the roots of the polynomial $f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1x + a_0$ are
$\alpha_1, \alpha_2, \ldots, \alpha_n$, then the discriminant of f(x) is²

$$D = \prod_{i<j} (\alpha_i - \alpha_j)^2.$$

Note that D = 0 if and only if f(x) is not separable, i.e., if the roots $\alpha_1, \ldots, \alpha_n$ are
not distinct. Recall that over a perfect field (e.g., Q or a finite field) this implies f(x)
is reducible since every irreducible polynomial over a perfect field is separable.

The discriminant D is symmetric in the roots of f(x), hence is fixed by all the
automorphisms of the Galois group of f(x). By the Fundamental Theorem it follows that
$D \in F$. The discriminant can in general be written as a polynomial in the coefficients
of f(x) (by Corollary 31), and these polynomials are fairly complicated for larger degrees
(we shall give formulas for $n \le 4$ below). Finally, note that since

$$\sqrt{D} = \prod_{i<j} (\alpha_i - \alpha_j)$$

we have the useful fact that $\sqrt{D}$ is always contained in the splitting field for f(x).

² If $f(x) = a_nx^n + \cdots + a_0$ is not monic then its discriminant is defined to be $a_n^{2n-2}$
times the D defined above.

If the roots of f(x) are distinct, fix some ordering of the roots and view the Galois
group of f(x) as a subgroup of $S_n$ as above.

Proposition 34. The Galois group of $f(x) \in F[x]$ is a subgroup of $A_n$ if and only if
the discriminant $D \in F$ is the square of an element of F.

Proof: This is a restatement of Proposition 33 in this case. The Galois group is
contained in $A_n$ if and only if every element of the Galois group fixes

$$\sqrt{D} = \prod_{i<j} (\alpha_i - \alpha_j),$$

i.e., if and only if $\sqrt{D} \in F$.

This property, together with the fact that D = 0 determines the presence of multiple
roots, is the reason D is called the discriminant.

Polynomials of Degree 2 

Consider the polynomial $x^2 + ax + b$ with roots $\alpha, \beta$. The discriminant D for this
polynomial is $(\alpha - \beta)^2$, which can be written as a polynomial in the elementary symmetric
functions of the roots. We did this in Example 1 above:

$$D = s_1^2 - 4s_2 = (-a)^2 - 4(b) = a^2 - 4b,$$

the usual discriminant for this quadratic.

The polynomial is separable if and only if $a^2 - 4b \ne 0$. The Galois group is a
subgroup of $S_2$, the cyclic group of order 2, and is trivial (i.e., $A_2$ in this case) if and
only if $a^2 - 4b$ is the square of an element of F, which completely determines the
possible Galois groups.

Note that this restates results we obtained previously by explicitly solving for the
roots: if the polynomial is reducible (namely D is a square in F), then the Galois group
is trivial (the splitting field is just F), while if the polynomial is irreducible the Galois
group is isomorphic to Z/2Z since the splitting field is the quadratic extension $F(\sqrt{D})$.

Polynomials of Degree 3

Suppose the cubic polynomial is

$$f(x) = x^3 + ax^2 + bx + c. \qquad (14.15)$$

If we make the substitution x = y - a/3 the polynomial becomes

$$g(y) = y^3 + py + q \qquad (14.16)$$

where

$$p = \frac{1}{3}(3b - a^2), \qquad q = \frac{1}{27}(2a^3 - 9ab + 27c). \qquad (14.17)$$

The splitting fields for these two polynomials are the same since their roots differ by the
constant $a/3 \in F$, and since the formula for the discriminant involves the differences of
roots, we see that these two polynomials also have the same discriminant.

Let the roots of the polynomial in (16) be $\alpha$, $\beta$, and $\gamma$. We first compute the
discriminant of this polynomial in terms of p and q. Note that

$$g(y) = (y - \alpha)(y - \beta)(y - \gamma)$$

so that if we differentiate we have

$$D_y g(y) = (y - \alpha)(y - \beta) + (y - \alpha)(y - \gamma) + (y - \beta)(y - \gamma).$$

Then

$$D_y g(\alpha) = (\alpha - \beta)(\alpha - \gamma)$$
$$D_y g(\beta) = (\beta - \alpha)(\beta - \gamma)$$
$$D_y g(\gamma) = (\gamma - \alpha)(\gamma - \beta).$$

Taking the product we see that

$$D = [(\alpha - \beta)(\alpha - \gamma)(\beta - \gamma)]^2 = -D_y g(\alpha)\, D_y g(\beta)\, D_y g(\gamma).$$

Since $D_y g(y) = 3y^2 + p$, we have

$$-D = (3\alpha^2 + p)(3\beta^2 + p)(3\gamma^2 + p)$$
$$= 27\alpha^2\beta^2\gamma^2 + 9p(\alpha^2\beta^2 + \alpha^2\gamma^2 + \beta^2\gamma^2) + 3p^2(\alpha^2 + \beta^2 + \gamma^2) + p^3.$$

The corresponding expressions in the elementary symmetric functions of the roots were
determined in Examples 2 and 3 above. Note that here $s_1 = 0$, $s_2 = p$ and $s_3 = -q$.
We obtain

$$-D = 27(-q)^2 + 9p(p^2) + 3p^2(-2p) + p^3$$

so that

$$D = -4p^3 - 27q^2. \qquad (14.18)$$

This is the same as the discriminant of f(x) in (15). Expressing D in terms of a, b, c
using (17) we obtain

$$D = a^2b^2 - 4b^3 - 4a^3c - 27c^2 + 18abc. \qquad (14.18')$$
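
As a sanity check on (14.17), (14.18) and (14.18'), the two discriminant formulas can be compared on a sample cubic using exact rational arithmetic. This is a minimal sketch; the function names `depress`, `disc_depressed` and `disc_cubic` are illustrative choices, not notation from the text.

```python
from fractions import Fraction

def depress(a, b, c):
    """Return (p, q) of (14.17) for x^3 + a x^2 + b x + c under x = y - a/3."""
    a, b, c = Fraction(a), Fraction(b), Fraction(c)
    p = (3 * b - a * a) / 3
    q = (2 * a**3 - 9 * a * b + 27 * c) / 27
    return p, q

def disc_depressed(p, q):
    """Discriminant of y^3 + p y + q, formula (14.18)."""
    return -4 * p**3 - 27 * q**2

def disc_cubic(a, b, c):
    """Discriminant of x^3 + a x^2 + b x + c, formula (14.18')."""
    return a*a*b*b - 4*b**3 - 4*a**3*c - 27*c*c + 18*a*b*c

# Sample cubic x^3 + x^2 - 2x - 1: both formulas give the same value.
p, q = depress(1, -2, -1)
assert disc_depressed(p, q) == disc_cubic(1, -2, -1) == 49
```

Here p = -7/3 and q = -7/27, and both routes give D = 49, which is a square.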


(Galois Group of a Cubic) 

a. If the cubic polynomial f(x) is reducible, then it splits either into three linear
factors or into a linear factor and an irreducible quadratic. In the first case the Galois
group is trivial and in the second case the Galois group is of order 2.

b. If the cubic polynomial f(x) is irreducible then a root of f(x) generates an
extension of degree 3 over F, so the degree of the splitting field over F is divisible by
3. Since the Galois group is a subgroup of $S_3$, there are only two possibilities, namely
$A_3$ or $S_3$. The Galois group is $A_3$ (i.e., cyclic of order 3) if and only if the discriminant
D in (18) is a square.

Explicitly, if D is the square of an element of F, then the splitting field of the
irreducible cubic f(x) is obtained by adjoining any single root of f(x) to F. The
resulting field is Galois over F of degree 3 with a cyclic group of order 3 as Galois
group. If D is not the square of an element of F then the splitting field of f(x) is of
degree 6 over F, hence is the field $F(\theta, \sqrt{D})$ for any one of the roots $\theta$ of f(x). This
extension is Galois over F with Galois group $S_3$ (generators are given by $\sigma$, which takes
$\theta$ to one of the other roots of f(x) and fixes $\sqrt{D}$, and $\tau$, which takes $\sqrt{D}$ to $-\sqrt{D}$ and
fixes $\theta$).

We see that in both cases the splitting field for the irreducible cubic f(x) is obtained
by adjoining $\sqrt{D}$ and a root of f(x) to F.
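
The criterion for cubics can be turned into a short procedure over F = Q. The sketch below assumes a monic cubic with integer coefficients, so that any rational root is an integer and D is a rational square exactly when it is an integer square; the function names are hypothetical, and a reducible cubic is only reported as such, not analyzed further.

```python
from math import isqrt

def integer_roots(a, b, c):
    """Integer roots of x^3 + a x^2 + b x + c for integers a, b, c.

    Every root of a monic polynomial has absolute value at most
    1 + max(|a|, |b|, |c|), so a finite scan finds all integer roots.
    """
    bound = 1 + max(abs(a), abs(b), abs(c))
    return [t for t in range(-bound, bound + 1)
            if t**3 + a*t*t + b*t + c == 0]

def disc_cubic(a, b, c):
    """Discriminant of x^3 + a x^2 + b x + c, formula (14.18')."""
    return a*a*b*b - 4*b**3 - 4*a**3*c - 27*c*c + 18*a*b*c

def cubic_galois_group(a, b, c):
    """Galois group over Q of x^3 + a x^2 + b x + c (integer coefficients)."""
    if integer_roots(a, b, c):
        return "reducible"  # a cubic is reducible over Q iff it has a rational root
    D = disc_cubic(a, b, c)
    is_square = D >= 0 and isqrt(D) ** 2 == D
    return "A3" if is_square else "S3"

print(cubic_galois_group(0, 0, -2))   # x^3 - 2
print(cubic_galois_group(0, -3, 1))   # x^3 - 3x + 1
```

For x^3 - 2 the discriminant is -108, not a square, so the group is S_3; for x^3 - 3x + 1 the discriminant is 81 = 9^2, so the group is A_3.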

We shall give explicit formulas for the roots of (16) (Cardano's Formulas) in the
next section after introducing the notion of a Lagrange Resolvent.

Polynomials of Degree 4 

Let the quartic polynomial be

$$f(x) = x^4 + ax^3 + bx^2 + cx + d$$

which under the substitution x = y - a/4 becomes the quartic

$$g(y) = y^4 + py^2 + qy + r$$

with

$$p = \frac{1}{8}(-3a^2 + 8b)$$
$$q = \frac{1}{8}(a^3 - 4ab + 8c)$$
$$r = \frac{1}{256}(-3a^4 + 16a^2b - 64ac + 256d).$$

Let the roots of g(y) be $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\alpha_4$ and let G denote the Galois group for the
splitting field of g(y) (or of f(x)).

Suppose first that g(y) is reducible. If g(y) splits into a linear and a cubic, then
G is the Galois group of the cubic, which we determined above. Suppose then that
g(y) splits into two irreducible quadratics. Then the splitting field is the extension
$F(\sqrt{D_1}, \sqrt{D_2})$ where $D_1$ and $D_2$ are the discriminants of the two quadratics. If $D_1$
and $D_2$ do not differ by a square factor then this extension is a biquadratic extension
and G is isomorphic to the Klein 4-subgroup of $S_4$. If $D_1$ is a square times $D_2$ then this
extension is a quadratic extension and G is isomorphic to Z/2Z.

We are reduced to the situation where g(y) is irreducible. In this case recall that
the Galois group is transitive on the roots, i.e., it is possible to get from a given root
to any other root by applying some automorphism of the Galois group. Examining the
possibilities we see that the only transitive subgroups of $S_4$, hence the only possibilities
for our Galois group G, are the groups

$S_4$, $A_4$

$D_8$ = {1, (1324), (12)(34), (1423), (13)(24), (14)(23), (12), (34)} and its conjugates

V = {1, (12)(34), (13)(24), (14)(23)}

C = {1, (1234), (13)(24), (1432)} and its conjugates.

($D_8$ is the dihedral group, a Sylow 2-subgroup of $S_4$, with 3 (isomorphic) conjugate
subgroups in $S_4$, V is the Klein 4-subgroup of $S_4$, normal in $S_4$, and C is a cyclic group,
with 3 (isomorphic) conjugates in $S_4$.)

Consider the elements

$$\theta_1 = (\alpha_1 + \alpha_2)(\alpha_3 + \alpha_4)$$
$$\theta_2 = (\alpha_1 + \alpha_3)(\alpha_2 + \alpha_4)$$
$$\theta_3 = (\alpha_1 + \alpha_4)(\alpha_2 + \alpha_3)$$

in the splitting field for g(y). These elements are permuted amongst themselves by the
permutations in $S_4$. The stabilizer of $\theta_1$ in $S_4$ is the dihedral group $D_8$. The stabilizers
in $S_4$ of $\theta_2$ and $\theta_3$ are the conjugate dihedral subgroups of order 8. The subgroup of
$S_4$ which stabilizes all three of these elements is the intersection of these subgroups,
namely the Klein 4-group V.

Since $S_4$ merely permutes $\theta_1, \theta_2, \theta_3$ it follows that the elementary symmetric
functions in the $\theta$'s are fixed by all the elements of $S_4$, hence are in F. An elementary
computation in symmetric functions shows that these elementary symmetric functions
are $2p$, $p^2 - 4r$, and $-q^2$, which shows that $\theta_1, \theta_2, \theta_3$ are the roots of

$$h(x) = x^3 - 2px^2 + (p^2 - 4r)x + q^2$$

called the resolvent cubic for the quartic g(y). Since

$$\theta_1 - \theta_2 = \alpha_1\alpha_3 + \alpha_2\alpha_4 - \alpha_1\alpha_2 - \alpha_3\alpha_4 = -(\alpha_1 - \alpha_4)(\alpha_2 - \alpha_3)$$

and similarly

$$\theta_1 - \theta_3 = -(\alpha_1 - \alpha_3)(\alpha_2 - \alpha_4)$$
$$\theta_2 - \theta_3 = -(\alpha_1 - \alpha_2)(\alpha_3 - \alpha_4)$$

we see that the discriminant of the resolvent cubic is the same as the discriminant of
the quartic g(y), hence also as the discriminant of the quartic f(x). Using our formula
for the discriminant of the cubic, we can easily compute the discriminant in terms of
p, q, r:

$$D = 16p^4r - 4p^3q^2 - 128p^2r^2 + 144pq^2r - 27q^4 + 256r^3$$

from which one can give the formula for D in terms of a, b, c, d:

$$D = -128b^2d^2 - 4a^3c^3 + 16b^4d - 4b^3c^2 - 27a^4d^2 + 18abc^3$$
$$+ 144a^2bd^2 - 192acd^2 + a^2b^2c^2 - 4a^2b^3d - 6a^2c^2d$$
$$+ 144bc^2d + 256d^3 - 27c^4 - 80ab^2cd + 18a^3bcd.$$
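
The claim that the resolvent cubic and the quartic $y^4 + py^2 + qy + r$ have the same discriminant can be spot-checked by feeding the coefficients of the resolvent cubic into the cubic discriminant formula (14.18') and comparing with the formula in p, q, r. This is a minimal sketch over a small grid of integer values; the function names are illustrative only.

```python
def disc_cubic(a, b, c):
    """Discriminant of x^3 + a x^2 + b x + c, formula (14.18')."""
    return a*a*b*b - 4*b**3 - 4*a**3*c - 27*c*c + 18*a*b*c

def resolvent_cubic(p, q, r):
    """Coefficients (a, b, c) of h(x) = x^3 - 2p x^2 + (p^2 - 4r) x + q^2."""
    return (-2 * p, p * p - 4 * r, q * q)

def disc_quartic_depressed(p, q, r):
    """Discriminant of y^4 + p y^2 + q y + r in terms of p, q, r."""
    return (16 * p**4 * r - 4 * p**3 * q**2 - 128 * p**2 * r**2
            + 144 * p * q**2 * r - 27 * q**4 + 256 * r**3)

# Compare the two expressions on a small grid of integer triples (p, q, r).
for p in range(-3, 4):
    for q in range(-3, 4):
        for r in range(-3, 4):
            assert disc_cubic(*resolvent_cubic(p, q, r)) == disc_quartic_depressed(p, q, r)
```

For instance, with (p, q, r) = (1, 1, 1) both expressions evaluate to 257.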
The splitting field for the resolvent cubic is a subfield of the splitting field of the
quartic, so the Galois group of the resolvent cubic is a quotient of G. Hence knowing
the action of the Galois group on the roots of the resolvent cubic h(x) gives information
about the Galois group of g(y), as follows:

(Galois group of a quartic) 

a. Suppose first that the resolvent cubic is irreducible. If D is not a square, then G
is not contained in $A_4$ and the Galois group of the resolvent cubic is $S_3$, which implies
that the degree of the splitting field for g(y) is divisible by 6. The only possibility is
then $G = S_4$.

b. If the resolvent cubic is irreducible and D is a square, then G is a subgroup of
$A_4$ and 3 divides the order of G (the Galois group of the resolvent cubic is $A_3$). The
only possibility is $G = A_4$.

c1. We are left with the case where the resolvent cubic is reducible. The first
possibility is that h(x) has 3 roots in F (i.e., splits completely). Since each of the
elements $\theta_1, \theta_2, \theta_3$ is in F, every element of G fixes all three of these elements, which
means $G \subseteq V$. The only possibility is G = V.

c2. If h(x) splits into a linear and a quadratic, then precisely one of $\theta_1, \theta_2, \theta_3$ is in
F, say $\theta_1$. Then G stabilizes $\theta_1$ but not $\theta_2$ and $\theta_3$, so we have $G \subseteq D_8$ and $G \not\subseteq V$.
This leaves two possibilities: $G = D_8$ or G = C. One way to distinguish between
these is to observe that $F(\sqrt{D})$ is the fixed field of the elements of G in $A_4$. For the
two cases being considered, we have $D_8 \cap A_4 = V$, $C \cap A_4 = \{1, (13)(24)\}$. The first
group is transitive on the roots of g(y), the second is not. It follows that the first case
occurs if and only if g(y) is irreducible over $F(\sqrt{D})$. We may therefore determine
G completely by factoring g(y) in $F(\sqrt{D})$, and so completely determine the Galois
group in all cases (cf. the exercises following and in the next section, where it is shown
that over Q the Galois group cannot be cyclic of degree 4 if D is not the sum of two
squares, so in particular if D < 0).
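
Cases a, b, c1 and c2 can be sketched as a procedure over F = Q. The code below makes several assumptions that are not part of the text: the quartic is already in the form $y^4 + py^2 + qy + r$ with integer p, q, r, it is irreducible and separable (this is not checked), and the final step of case c2 (factoring g(y) over $Q(\sqrt{D})$) is not carried out, so that case is reported as undecided. All function names are hypothetical.

```python
from math import isqrt

def disc_cubic(a, b, c):
    """Discriminant of x^3 + a x^2 + b x + c, formula (14.18')."""
    return a*a*b*b - 4*b**3 - 4*a**3*c - 27*c*c + 18*a*b*c

def integer_roots(a, b, c):
    """Integer (hence all rational) roots of the monic cubic x^3 + a x^2 + b x + c."""
    bound = 1 + max(abs(a), abs(b), abs(c))  # bound on the size of any root
    return [t for t in range(-bound, bound + 1)
            if t**3 + a*t*t + b*t + c == 0]

def classify_quartic(p, q, r):
    """Galois group over Q of y^4 + p y^2 + q y + r, ASSUMED irreducible."""
    a, b, c = -2 * p, p * p - 4 * r, q * q   # resolvent cubic h(x)
    D = disc_cubic(a, b, c)                  # equals the discriminant of the quartic
    roots = integer_roots(a, b, c)
    if not roots:                            # h irreducible: cases a and b
        is_square = D >= 0 and isqrt(D) ** 2 == D
        return "A4" if is_square else "S4"
    if len(roots) == 3:                      # h splits completely: case c1
        return "V"
    return "D8 or C"                         # case c2, left undecided here

print(classify_quartic(0, 1, 1))    # y^4 + y + 1
print(classify_quartic(0, 0, 1))    # y^4 + 1
print(classify_quartic(0, 0, -2))   # y^4 - 2
```

For $y^4 + y + 1$ the resolvent cubic $x^3 - 4x + 1$ has no rational root and D = 229 is not a square, giving $S_4$; for $y^4 + 1$ the resolvent cubic is x(x - 2)(x + 2), giving V.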

We shall give explicit formulas for the roots of a quartic polynomial at the end of 
the next section. 

The Fundamental Theorem of Algebra 

We end this section with two proofs of the Fundamental Theorem of Algebra. We need
two facts regarding the field C:

(a) Every polynomial with real coefficients of odd degree has a root in the reals.
Equivalently, there are no nontrivial finite extensions of R of odd degree.

(b) Quadratic polynomials with coefficients in C have roots in C. Equivalently, there
are no quadratic extensions of C.

The first result follows from the Intermediate Value Theorem in calculus, since the
graph of a monic polynomial $f(x) \in R[x]$ of odd degree is negative for large negative
values of x and positive for large positive values of x, hence crosses the axis somewhere.
The equivalence with the second statement follows since a finite extension of R is a
simple extension and the minimal polynomial of a primitive element would have odd
degree, hence would be both irreducible over R and have a root in R, hence must be of
degree 1.

The second result follows by a direct computation. By the quadratic formula it
suffices to show that every complex number $\alpha = a + bi$, $a, b \in R$, has a square root
in C. Write $\alpha = re^{i\theta}$ for some $r \ge 0$ and some $\theta \in [0, 2\pi)$. Then $\sqrt{r}\,e^{i\theta/2}$ is a square
root of $\alpha$. (Explicitly, let $c \in R$ be a square root of the real number

$$\frac{a + \sqrt{a^2 + b^2}}{2}$$

and let $d \in R$ be a square root of the real number

$$\frac{-a + \sqrt{a^2 + b^2}}{2}$$

where the signs of the two square roots are chosen so that cd has the same sign as b.
Then multiplying out we see that $(c + di)^2 = a + bi$.)
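
The explicit square root formula can be tried out in floating point. This is a minimal sketch of the recipe in the parenthetical remark above; the function name `complex_sqrt` is an illustrative choice, and c is taken nonnegative so that the sign condition on cd reduces to giving d the sign of b.

```python
import math

def complex_sqrt(a, b):
    """Return real (c, d) with (c + d i)^2 = a + b i, following the text's formula."""
    m = math.hypot(a, b)                       # sqrt(a^2 + b^2)
    c = math.sqrt(max(0.0, (a + m) / 2))       # guard against tiny negative rounding
    d = math.sqrt(max(0.0, (-a + m) / 2))
    if b < 0:
        d = -d                                 # make cd have the same sign as b
    return c, d

c, d = complex_sqrt(3, 4)
# (c + d i)^2 = (c^2 - d^2) + (2cd) i should recover 3 + 4i.
print(c, d, c * c - d * d, 2 * c * d)
```

Indeed $c^2 - d^2 = a$ and $(cd)^2 = b^2/4$, so the sign choice gives $2cd = b$.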


Theorem 35. (Fundamental Theorem of Algebra) Every polynomial $f(x) \in C[x]$ of
degree n has precisely n roots in C (counted with multiplicity). Equivalently, C is
algebraically closed.

Proof: I. It suffices to prove that every polynomial $f(x) \in C[x]$ has a root in C.
Let $\tau$ denote the automorphism complex conjugation. If f(x) has no root in C then
neither does the conjugate polynomial $\bar{f}(x) = \tau f(x)$ obtained by applying $\tau$ to the
coefficients of f(x) (since its roots are the conjugates of the roots of f(x)). The product
$f(x)\bar{f}(x)$ has coefficients which are invariant under complex conjugation, hence has
real coefficients. It suffices then to prove that a polynomial with real coefficients has a
root in C.

Suppose that f(x) is a polynomial of degree n with real coefficients and write
$n = 2^k m$ where m is odd. We prove that f(x) has a root in C by induction on k.
For k = 0, f(x) has odd degree and by (a) above f(x) has a root in R so we are
done. Suppose now that $k \ge 1$. Let $\alpha_1, \alpha_2, \ldots, \alpha_n$ be the roots of f(x) and set
$K = R(\alpha_1, \alpha_2, \ldots, \alpha_n, i)$. Then K is a Galois extension of R containing C and the
roots of f(x). For any $t \in R$ consider the polynomial

$$L_t = \prod_{1 \le i < j \le n} [x - (\alpha_i + \alpha_j + t\alpha_i\alpha_j)].$$

Any automorphism of K/R permutes the terms in this product so the coefficients of $L_t$
are invariant under all the elements of Gal(K/R). Hence $L_t$ is a polynomial with real
coefficients. The degree of $L_t$ is

$$\frac{n(n-1)}{2} = 2^{k-1}m(2^k m - 1) = 2^{k-1}m'$$

where $m'$ is odd (since $k \ge 1$). The power of 2 in this degree is therefore less than k,
so by induction the polynomial $L_t$ has a root in C. Hence for each $t \in R$ one of the
elements $\alpha_i + \alpha_j + t\alpha_i\alpha_j$ for some i, j ($1 \le i < j \le n$) is an element of C. Since there
are infinitely many choices for t and only finitely many values of i and j we see that
for some i and j (say, i = 1 and j = 2) there are distinct real numbers s and t with

$$\alpha_1 + \alpha_2 + s\alpha_1\alpha_2 \in C, \qquad \alpha_1 + \alpha_2 + t\alpha_1\alpha_2 \in C.$$

Since $s \ne t$ it follows that $a = \alpha_1 + \alpha_2 \in C$ and $b = \alpha_1\alpha_2 \in C$. But then $\alpha_1$ and $\alpha_2$
are the roots of the quadratic $x^2 - ax + b$ with coefficients in C, hence are elements of
C by (b) above, completing the proof.

II. The second proof again uses (a) and (b) above, but replaces the computations with the
polynomials $L_t$ above with a simple group-theoretic argument involving the nilpotency
of a Sylow 2-subgroup of the Galois group:

Let f(x) be a polynomial of degree n with real coefficients and let K be the splitting
field of f(x) over R. Then K(i) is a Galois extension of R. Let G denote its Galois
group and let $P_2$ denote a Sylow 2-subgroup of G. The fixed field of $P_2$ is an extension
of R of odd degree, hence by (a) is trivial, so $G = P_2$ is a 2-group.

It follows that the subgroup Gal(K(i)/C) is a 2-group. Since 2-groups have subgroups of all
orders (recall this is true of a finite p-group for any prime p, cf. Theorem 6.1), if this
group is nontrivial, there would exist a quadratic extension of C, impossible by (b),
completing the proof.

The Fundamental Theorem of Algebra was first rigorously proved by Gauss in
1816 (his doctoral dissertation in 1798 provides a proof using geometric considerations
requiring some topological justification). The first proof above is essentially due to
Laplace in 1795 (hence the reason for naming the polynomials $L_t$). The reason Laplace's
proof was deemed unacceptable was that he assumed the existence of a splitting field
for polynomials (i.e., that the roots existed somewhere in some field), which had not
been established at that time. The elegant second proof is a simplification due to Artin.

EXERCISES 

1. Show that a cubic with a multiple root has a linear factor. Is the same true for quartics?

2. Determine the Galois groups of the following polynomials:

(a) $x^3 - x^2 - 4$

(b) $x^3 - 2x + 4$

(c) $x^3 - x + 1$

(d) $x^3 + x^2 - 2x - 1$.

3. Prove for any $a, b \in F_{p^n}$ that if $x^3 + ax + b$ is irreducible then $-4a^3 - 27b^2$ is a square
in $F_{p^n}$.

4. Determine the Galois group of $x^4 - 25$.

5. Determine the Galois group of $x^4 + 4$.

6. Determine the Galois group of $x^4 + 3x^3 - 3x - 2$.

7. Determine the Galois group of $x^4 + 2x^2 + x + 3$.

8. Determine the Galois group of $x^4 + 8x + 12$.

9. Determine the Galois group of $x^4 + 4x - 1$ (cf. Exercise 19).

10. Determine the Galois group of $x^5 + x - 1$.

11. Let F be an extension of Q of degree 4 that is not Galois over Q. Prove that the Galois
closure of F has Galois group either $S_4$, $A_4$ or the dihedral group $D_8$ of order 8. Prove
that the Galois group is dihedral if and only if F contains a quadratic extension of Q.

12. Prove that an extension F of Q of degree 4 can be generated by the root of an irreducible
biquadratic $x^4 + ax^2 + b$ over Q if and only if F contains a quadratic extension of Q.

13. (a) Let $\pm\alpha$, $\pm\beta$ denote the roots of the polynomial $f(x) = x^4 + ax^2 + b \in Z[x]$. Prove
that f(x) is irreducible if and only if $\alpha^2$, $\alpha \pm \beta$ are not elements of Q.³

(b) Suppose f(x) is irreducible and let G be the Galois group of f(x). Prove that

(i) G = V, the Klein 4-group, if and only if b is a square in Q if and only if $\alpha\beta \in Q$
is rational.

(ii) G = C, the cyclic group of order 4, if and only if $b(a^2 - 4b)$ is a square in Q if
and only if $Q(\alpha\beta) = Q(\alpha^2)$.

(iii) $G = D_8$, the dihedral group of order 8, if and only if b and $b(a^2 - 4b)$ are not
squares in Q if and only if $\alpha\beta \notin Q(\alpha^2)$.

14. Prove the polynomial $x^4 - px^2 + q \in Q[x]$ is irreducible for any distinct odd primes p
and q and has as Galois group the dihedral group of order 8.⁴

15. Prove the polynomial $x^4 + px + p \in Q[x]$ is irreducible for every prime p and for $p \ne 3, 5$
has Galois group $S_4$. Prove the Galois group for p = 3 is dihedral of order 8 and for p = 5
is cyclic of order 4.⁵

16. Determine the Galois group over Q of the polynomial $x^4 + 8x^2 + 8x + 4$. Determine which
of the subfields of this field are Galois over Q and for those which are Galois determine a
polynomial $f(x) \in Q[x]$ for which they are the splitting field over Q.

17. Find the Galois group of $x^4 - 7$ over Q explicitly as a permutation group on the roots.

18. Let $\theta$ be a root of $x^3 - 3x + 1$. Prove that the splitting field of this polynomial is $Q(\theta)$ and
that the Galois group is cyclic of order 3. In particular the other roots of this polynomial
can be written in the form $a + b\theta + c\theta^2$ for some $a, b, c \in Q$. Determine the other roots
explicitly in terms of $\theta$.

19. Let f(x) be an irreducible polynomial of degree 4 in Q[x] with discriminant D. Let K
denote the splitting field of f(x), viewed as a subfield of the complex numbers C.

(a) Prove that $Q(\sqrt{D}) \subset K$.

(b) Let $\tau$ denote complex conjugation and let $\tau_K$ denote the restriction of complex
conjugation to K. Prove that $\tau_K$ is an element of Gal(K/Q) of order 1 or 2 depending
on whether every element of K is real or not.

(c) Prove that if D < 0 then K cannot be cyclic of degree 4 over Q (i.e., Gal(K/Q)
cannot be a cyclic group of order 4).

(d) Prove generally that $Q(\sqrt{D})$ for squarefree D < 0 is not a subfield of a cyclic quartic
field (cf. also Exercise 19 of Section 7).

20. Determine the Galois group of $(x^3 - 2)(x^3 - 3)$ over Q. Determine all the subfields which
contain $Q(\rho)$ where $\rho$ is a primitive 3rd root of unity.

21. Let $G \le S_n$ be a subgroup of the symmetric group and suppose $\sigma_1, \ldots, \sigma_k$ are generators
for G. If the function $f(x_1, x_2, \ldots, x_n)$ is fixed by the generators $\sigma_i$ show it is fixed by G.

³ cf. the note An Elementary Test for the Galois Group of a Quartic Polynomial, Luise-Charlotte
Kappe and Bette Warren, Amer. Math. Monthly, 96 (1989), pp. 133-137.

⁴ Ibid.

⁵ Ibid.

22. (Newton's Formulas) Let f(x) be a monic polynomial of degree n with roots $\alpha_1, \ldots, \alpha_n$.
Let $s_i$ be the elementary symmetric function of degree i in the roots and define $s_i = 0$ for
i > n. Let $p_i = \alpha_1^i + \cdots + \alpha_n^i$, $i \ge 0$, be the sum of the $i$-th powers of the roots of f(x).
Prove Newton's Formulas:

$$p_1 - s_1 = 0$$
$$p_2 - s_1p_1 + 2s_2 = 0$$
$$p_3 - s_1p_2 + s_2p_1 - 3s_3 = 0$$
$$\vdots$$
$$p_i - s_1p_{i-1} + s_2p_{i-2} - \cdots + (-1)^{i-1}s_{i-1}p_1 + (-1)^i i s_i = 0.$$
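
Newton's Formulas give a recursion for the power sums in terms of the elementary symmetric functions. The sketch below solves each formula for $p_i$ and checks the result on the sample polynomial with roots 1, 2, 3; the function name `power_sums` and the choice of example are assumptions made for illustration.

```python
def power_sums(s, count):
    """Given s = [s_1, ..., s_n], return [p_1, ..., p_count] via Newton's Formulas."""
    n = len(s)

    def e(k):
        # s_k, with the convention s_k = 0 for k > n
        return s[k - 1] if k <= n else 0

    p = []
    for i in range(1, count + 1):
        # p_i = s_1 p_{i-1} - s_2 p_{i-2} + ... + (-1)^(i-2) s_{i-1} p_1 + (-1)^(i-1) i s_i
        total = sum((-1) ** (k - 1) * e(k) * p[i - k - 1] for k in range(1, i))
        total += (-1) ** (i - 1) * i * e(i)
        p.append(total)
    return p

# Roots 1, 2, 3: s_1 = 6, s_2 = 11, s_3 = 6.
print(power_sums([6, 11, 6], 4))
```

The output agrees with the direct computation 1 + 2 + 3 = 6, 1 + 4 + 9 = 14, 1 + 8 + 27 = 36 and 1 + 16 + 81 = 98.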

23. (a) If $x + y + z = 1$, $x^2 + y^2 + z^2 = 2$ and $x^3 + y^3 + z^3 = 3$, determine $x^4 + y^4 + z^4$.

(b) Prove generally that x, y, z are not rational but that $x^n + y^n + z^n$ is rational for every
positive integer n.

24. Prove that an n x n matrix A over a field of characteristic 0 is nilpotent if and only if the
trace of $A^k$ is 0 for all k > 0.

25. Prove that two n x n matrices A and B over a field of characteristic 0 have the same
characteristic polynomial if and only if the trace of $A^k$ equals the trace of $B^k$ for all k > 0.

26. Use the fact that the trace of AB is the same as the trace of BA for any two n x n matrices
A and B to show that AB and BA have the same characteristic polynomial over a field of
characteristic 0 (the same result is true over a field of arbitrary characteristic).

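27. Let f(x) be a monic polynomial of degree n with roots $\alpha_1, \alpha_2, \ldots, \alpha_n$.

(a) Show that the discriminant D of f(x) is the square of the Vandermonde determinant

$$\begin{vmatrix} 1 & \alpha_1 & \alpha_1^2 & \cdots & \alpha_1^{n-1} \\ 1 & \alpha_2 & \alpha_2^2 & \cdots & \alpha_2^{n-1} \\ \vdots & \vdots & \vdots & & \vdots \\ 1 & \alpha_n & \alpha_n^2 & \cdots & \alpha_n^{n-1} \end{vmatrix} = \prod_{i>j} (\alpha_i - \alpha_j).$$

(b) Taking the Vandermonde matrix above, multiplying on the left by its transpose and
taking the determinant show that one obtains

$$D = \begin{vmatrix} p_0 & p_1 & p_2 & \cdots & p_{n-1} \\ p_1 & p_2 & p_3 & \cdots & p_n \\ \vdots & \vdots & \vdots & & \vdots \\ p_{n-1} & p_n & p_{n+1} & \cdots & p_{2n-2} \end{vmatrix}$$

where $p_i = \alpha_1^i + \cdots + \alpha_n^i$ is the sum of the $i$-th powers of the roots of f(x), which
can be computed in terms of the coefficients of f(x) using Newton's formulas above.
This gives an efficient procedure for calculating the discriminant of a polynomial.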

28. Let $\alpha$ be a root of the irreducible polynomial $f(x) \in F[x]$ and let $K = F(\alpha)$. Let D be the
discriminant of f(x). Prove that $D = (-1)^{n(n-1)/2} N_{K/F}(f'(\alpha))$, where $f'(x) = D_x f(x)$
is the derivative of f(x).

The following exercises describe the resultant of two polynomials and in particular provide
another efficient method for calculating the discriminant of a polynomial.

29. Let F be a field and let $f(x) = a_nx^n + \cdots + a_1x + a_0$ and $g(x) = b_mx^m +
b_{m-1}x^{m-1} + \cdots + b_1x + b_0$ be two polynomials in F[x].

(a) Prove that a necessary and sufficient condition for f(x) and g(x) to have a common
root (or, equivalently, a common divisor in F[x]) is the existence of a polynomial
$a(x) \in F[x]$ of degree at most m - 1 and a polynomial $b(x) \in F[x]$ of degree at most
n - 1 with a(x)f(x) = b(x)g(x).

(b) Writing a(x) and b(x) explicitly as polynomials show that equating coefficients in
the equation a(x)f(x) = b(x)g(x) gives a system of n + m linear equations for the
coefficients of a(x) and b(x). Prove that this system has a nontrivial solution (hence
f(x) and g(x) have a common zero) if and only if the determinant

$$R(f, g) = \begin{vmatrix}
a_n & a_{n-1} & \cdots & a_0 & & & \\
 & a_n & a_{n-1} & \cdots & a_0 & & \\
 & & \ddots & & & \ddots & \\
 & & & a_n & a_{n-1} & \cdots & a_0 \\
b_m & b_{m-1} & \cdots & b_0 & & & \\
 & b_m & b_{m-1} & \cdots & b_0 & & \\
 & & \ddots & & & \ddots & \\
 & & & b_m & b_{m-1} & \cdots & b_0
\end{vmatrix}$$

is zero. Here R(f, g), called the resultant of the two polynomials, is the determinant
of an (n+m) x (n+m) matrix R with m rows involving the coefficients of f(x) and
n rows involving the coefficients of g(x).

30. (a) With notations as in the previous problem, show that we have the matrix equation

$$R \begin{pmatrix} x^{n+m-1} \\ x^{n+m-2} \\ \vdots \\ x \\ 1 \end{pmatrix} = \begin{pmatrix} x^{m-1}f(x) \\ x^{m-2}f(x) \\ \vdots \\ f(x) \\ x^{n-1}g(x) \\ x^{n-2}g(x) \\ \vdots \\ g(x) \end{pmatrix}.$$

(b) Let R' denote the matrix of cofactors of R as in Theorem 30 of Section 11.4, so
R'R = R(f, g)I, where I is the identity matrix. Multiply both sides of the matrix
equation above by R' and equate the bottom entry of the resulting column matrices
to prove that there are polynomials $r(x), s(x) \in F[x]$ such that R(f, g) is equal to
r(x)f(x) + s(x)g(x), i.e., the resultant of two polynomials is a linear combination
(in F[x]) of the polynomials.

31. Consider f(x) and g(x) as general polynomials and suppose the roots of f(x) are $x_1, \ldots, x_n$
and the roots of g(x) are $y_1, \ldots, y_m$. The coefficients of f(x) are powers of $a_n$ times the
elementary symmetric functions in $x_1, x_2, \ldots, x_n$ and the coefficients of g(x) are powers
of $b_m$ times the elementary symmetric functions in $y_1, y_2, \ldots, y_m$.

(a) By expanding the determinant show that R(f, g) is homogeneous of degree m in the
coefficients $a_i$ and homogeneous of degree n in the coefficients $b_j$.

(b) Show that R(f, g) is $a_n^m b_m^n$ times a symmetric function in $x_1, \ldots, x_n$ and $y_1, \ldots, y_m$.

(c) Since R(f, g) is 0 if f(x) and g(x) have a common root, say $x_i = y_j$, show that
R(f, g) is divisible by $x_i - y_j$ for $i = 1, 2, \ldots, n$, $j = 1, 2, \ldots, m$. Conclude by
degree considerations that

$$R = a_n^m b_m^n \prod_{i=1}^{n} \prod_{j=1}^{m} (x_i - y_j).$$

(d) Show that the product in (c) can also be written

$$R(f, g) = a_n^m \prod_{i=1}^{n} g(x_i) = (-1)^{nm} b_m^n \prod_{j=1}^{m} f(y_j).$$

This gives an interesting reciprocity between the product of g evaluated at the roots
of f and the product of f evaluated at the roots of g.

32. Consider now the special case where g(x) = f'(x) is the derivative of the polynomial
$f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1x + a_0$ and suppose the roots of f(x) are $\alpha_1, \alpha_2, \ldots, \alpha_n$.
Using the formula

$$R(f, f') = \prod_{i=1}^{n} f'(\alpha_i)$$

of the previous exercise, prove that

$$D = (-1)^{n(n-1)/2} R(f, f')$$

where D is the discriminant of f(x).
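
The resultant of Exercise 29 and the discriminant formula of Exercise 32 can be evaluated directly. The following is a minimal sketch, assuming polynomials with rational coefficients given as coefficient lists with the highest degree first, and assuming f is monic for the discriminant; the determinant is computed by exact Gaussian elimination, and all function names are illustrative.

```python
from fractions import Fraction

def sylvester(f, g):
    """Matrix R of Exercise 29: m shifted rows of f, then n shifted rows of g."""
    n, m = len(f) - 1, len(g) - 1
    rows = [[0] * i + list(f) + [0] * (m - 1 - i) for i in range(m)]
    rows += [[0] * i + list(g) + [0] * (n - 1 - i) for i in range(n)]
    return rows

def det(matrix):
    """Determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    size = len(a)
    result = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result              # a row swap changes the sign
        result *= a[col][col]
        for r in range(col + 1, size):
            factor = a[r][col] / a[col][col]
            for c in range(col, size):
                a[r][c] -= factor * a[col][c]
    return result

def resultant(f, g):
    return det(sylvester(f, g))

def discriminant(f):
    """D = (-1)^(n(n-1)/2) R(f, f') for a monic f (characteristic 0 assumed)."""
    n = len(f) - 1
    fprime = [coeff * (n - i) for i, coeff in enumerate(f[:-1])]
    sign = -1 if (n * (n - 1) // 2) % 2 else 1
    return sign * resultant(f, fprime)

print(discriminant([1, 0, -3, 1]))   # x^3 - 3x + 1
```

For $x^3 - 3x + 1$ the Sylvester determinant is -81 and the sign factor is -1, giving D = 81, in agreement with $-4p^3 - 27q^2$ for p = -3, q = 1.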

33. (a) Prove that the discriminant of the cyclotomic polynomial $\Phi_p(x)$ of the $p$-th roots of
unity for an odd prime p is $(-1)^{(p-1)/2} p^{p-2}$. [One approach: use Exercise 5 of the
previous section together with the determinant form for the discriminant in terms of
the power sums $p_i$.]

(b) Prove that $Q(\sqrt{(-1)^{(p-1)/2} p}) \subset Q(\zeta_p)$ for p an odd prime. (Cf. also Exercise 11 of
Section 7.)

34. Use the previous exercise to prove that every quadratic extension of Q is contained in a
cyclotomic extension (a special case of the Kronecker-Weber Theorem).

35. Prove that the discriminant D of the polynomial $x^n + px + q$ is given by the formula

$$(-1)^{n(n-1)/2} n^n q^{n-1} + (-1)^{(n-1)(n-2)/2} (n-1)^{n-1} p^n.$$

36. Prove that the discriminant of $x^n + nx^{n-1} + n(n-1)x^{n-2} + \cdots + n(n-1)\cdots(3)(2)x + n!$
is $(-1)^{n(n-1)/2} (n!)^n$.

The following exercises 37 to 43 outline two procedures for writing a symmetric function in
terms of the elementary symmetric functions. Let $f(x_1, \ldots, x_n)$ be a polynomial which is
symmetric in $x_1, \ldots, x_n$. Recall that the degree (sometimes called the weight) of the monomial
$Ax_1^{a_1}x_2^{a_2} \cdots x_n^{a_n}$ ($a_i \ge 0$) is $a_1 + a_2 + \cdots + a_n$ and that a polynomial is homogeneous (of degree
m) if every monomial has the same degree (m).

37. (a) Show that every polynomial $f(x_1, \ldots, x_n)$ can be written as a sum of homogeneous
polynomials. Show that if $f(x_1, \ldots, x_n)$ is symmetric then each of these homogeneous
polynomials is also symmetric.

(b) Show that the monomial $Bs_1^{a_1}s_2^{a_2} \cdots s_n^{a_n}$ in the elementary symmetric functions is a
homogeneous polynomial in $x_1, x_2, \ldots, x_n$ of degree $a_1 + 2a_2 + \cdots + na_n$.

In writing $f(x_1, \ldots, x_n)$ as a polynomial in the symmetric functions it therefore suffices to
assume that $f(x_1, \ldots, x_n)$ is homogeneous.
Recall the lexicographic monomial order with $x_1 > x_2 > \cdots > x_n$ defined in Section 9.6,
where the nonzero monomial term with exponents $(a_1, a_2, \ldots, a_n)$ comes before the nonzero
monomial term with exponents $(b_1, b_2, \ldots, b_n)$ if the initial components of the two n-tuples
of exponents are equal and the first component where they differ has $a_i > b_i$. If $f(x_1, \ldots, x_n)$
contains the monomial $Ax_1^{a_1}x_2^{a_2} \cdots x_n^{a_n}$ then since $f(x_1, \ldots, x_n)$ is symmetric it also contains
all the permuted monomials. Among these choose the lexicographically largest monomial,
which therefore satisfies $a_1 \ge a_2 \ge \cdots \ge a_n \ge 0$.

38. (a) Show that the monomial $As_1^{a_1-a_2}s_2^{a_2-a_3} \cdots s_n^{a_n}$ in the elementary symmetric functions
has the same lexicographic initial term.

(b) Show that subtracting $As_1^{a_1-a_2}s_2^{a_2-a_3} \cdots s_n^{a_n}$ from $f(x_1, \ldots, x_n)$ yields either 0 or a
symmetric polynomial of the same degree whose terms are lexicographically smaller than the
terms in $f(x_1, \ldots, x_n)$.

(c) Show that the iteration of this procedure (lexicographic ordering, choosing the
lexicographically largest term, subtracting the associated monomial in the elementary
symmetric functions) terminates, expressing $f(x_1, \ldots, x_n)$ as a polynomial in the
elementary symmetric functions.

39. Use the algorithm described in Exercise 38 to prove that a polynomial $f(x_1, \dots, x_n)$ that
is symmetric in $x_1, \dots, x_n$ can be expressed uniquely as a polynomial in the elementary
symmetric functions.

40. Use the procedure in Exercise 38 to express each of the following symmetric functions as 
a polynomial in the elementary symmetric functions: 

(a) $(x_1 - x_2)^2$

(b) $x_1^2 + x_2^2 + x_3^2$

(c) $x_1^2 x_2^2 + x_1^2 x_3^2 + x_2^2 x_3^2$.

41. Use the procedure in Exercise 38 to express $\sum_{i \ne j} x_i^2 x_j$ as a polynomial in the elementary
symmetric functions.
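
The procedure of Exercise 38 can be sketched in a few lines of code. The following is only an illustration under stated assumptions, not part of the exercises: polynomials are stored as dictionaries from exponent tuples to coefficients, the input is assumed to be symmetric, and the helper names (`poly_mul`, `elementary`, `reduce_symmetric`) are hypothetical.

```python
from itertools import combinations

def poly_mul(f, g):
    """Multiply two polynomials stored as {exponent tuple: coefficient}."""
    h = {}
    for ea, ca in f.items():
        for eb, cb in g.items():
            e = tuple(x + y for x, y in zip(ea, eb))
            h[e] = h.get(e, 0) + ca * cb
    return {e: c for e, c in h.items() if c != 0}

def elementary(n, k):
    """Elementary symmetric function s_k in n variables."""
    return {tuple(1 if i in idx else 0 for i in range(n)): 1
            for idx in combinations(range(n), k)}

def reduce_symmetric(f, n):
    """Return {(e_1, ..., e_n): A}, meaning f = sum of A * s_1^e_1 ... s_n^e_n.

    Python compares tuples lexicographically, which matches the
    monomial order x_1 > x_2 > ... > x_n on exponent vectors.
    """
    f = dict(f)
    result = {}
    while f:
        a = max(f)                      # lexicographically largest exponent vector
        A = f[a]
        # exponents a_1 - a_2, a_2 - a_3, ..., a_n of s_1, ..., s_n
        e = tuple(a[i] - (a[i + 1] if i + 1 < n else 0) for i in range(n))
        if min(e) < 0:
            raise ValueError("input is not symmetric")
        result[e] = A
        # build A * s_1^e_1 ... s_n^e_n and subtract it from f
        term = {(0,) * n: A}
        for k, power in enumerate(e, start=1):
            for _ in range(power):
                term = poly_mul(term, elementary(n, k))
        for mono, c in term.items():
            f[mono] = f.get(mono, 0) - c
            if f[mono] == 0:
                del f[mono]
    return result

# Illustration: x_1^3 + x_2^3 + x_3^3 in three variables.
power_sum = {(3, 0, 0): 1, (0, 3, 0): 1, (0, 0, 3): 1}
expression = reduce_symmetric(power_sum, 3)
```

Here `expression` maps the exponent vector of each monomial in $s_1, s_2, s_3$ to its coefficient, so the result encodes $s_1^3 - 3s_1s_2 + 3s_3$. Each pass removes the current leading term and introduces only lexicographically smaller ones, which is the termination argument of part (c).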

We now know that a symmetric polynomial $f(x_1, \dots, x_n)$ can be written uniquely as a
polynomial in the elementary symmetric functions. Using this existence and uniqueness we can
describe an alternate and computationally useful method for determining the coefficients of
the elementary symmetric functions in this polynomial. As in Exercise 37 we may assume
that $f(x_1, \dots, x_n)$ is homogeneous of degree $M$. Let $N$ be the maximum degree of any of the
variables $x_1, \dots, x_n$ in $f(x_1, \dots, x_n)$.

(a) Determine all of the possible monomials $A_i s_1^{a_1} s_2^{a_2} \cdots s_n^{a_n}$ appearing in $f(x_1, \dots, x_n)$
from the constraints

$$a_1 + 2a_2 + \cdots + n a_n = M$$

$$a_1 + a_2 + \cdots + a_n \le N.$$

(b) Since $f(x_1, \dots, x_n) = \sum A_i s_1^{a_1} s_2^{a_2} \cdots s_n^{a_n}$ is a polynomial identity, it is valid for
any substitution of values for $x_1, \dots, x_n$. Each substitution into this equation gives a
linear relation on the coefficients $A_i$ and so a sufficient number of substitutions will
determine the $A_i$.
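
Steps (a) and (b) can be sketched as follows. This is a minimal illustration, assuming exact rational arithmetic and a list of substitution points supplied by the user that happens to give an invertible system; the function names are hypothetical, and the sample function is the one that appears in Exercise 42.

```python
from fractions import Fraction
from itertools import combinations, product
from math import prod

def elementary_values(xs):
    """Values s_1, ..., s_n of the elementary symmetric functions at the point xs."""
    n = len(xs)
    return [sum(prod(c) for c in combinations(xs, k)) for k in range(1, n + 1)]

def candidate_exponents(n, M, N):
    """All (a_1, ..., a_n) with a_1 + 2a_2 + ... + n*a_n = M and a_1 + ... + a_n <= N."""
    ranges = [range(M // k + 1) for k in range(1, n + 1)]
    return [a for a in product(*ranges)
            if sum(k * ak for k, ak in enumerate(a, start=1)) == M and sum(a) <= N]

def solve(rows, rhs):
    """Gauss-Jordan elimination over the rationals (extra rows are not checked)."""
    m = [[Fraction(v) for v in row] + [Fraction(b)] for row, b in zip(rows, rhs)]
    ncols = len(rows[0])
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            raise ValueError("substitutions do not determine the coefficients")
        m[r], m[pivot] = m[pivot], m[r]
        m[r] = [v / m[r][c] for v in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                factor = m[i][c]
                m[i] = [vi - factor * vr for vi, vr in zip(m[i], m[r])]
        r += 1
    return [m[i][ncols] for i in range(ncols)]

def coefficients_by_substitution(f, n, M, N, points):
    """Step (a): list the candidate monomials; step (b): one linear relation per point."""
    exps = candidate_exponents(n, M, N)
    rows, rhs = [], []
    for xs in points:
        s = elementary_values(xs)
        rows.append([prod(sk ** ak for sk, ak in zip(s, a)) for a in exps])
        rhs.append(f(*xs))
    return dict(zip(exps, solve(rows, rhs)))

# The function of Exercise 42: n = 4 variables, degree M = 3, N = 3.
def f42(x1, x2, x3, x4):
    return (x1 + x2 - x3 - x4) * (x1 + x3 - x2 - x4) * (x1 + x4 - x2 - x3)

coeffs = coefficients_by_substitution(
    f42, 4, 3, 3, [(1, 0, 0, 0), (1, 1, 0, 0), (1, 1, 1, 0)])
```

Only three monomials ($s_1^3$, $s_1 s_2$, $s_3$) satisfy the constraints here, so three substitutions suffice. The sketch does not verify that the input is symmetric; that still has to be shown separately.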

42. Show that the function $(x_1 + x_2 - x_3 - x_4)(x_1 + x_3 - x_2 - x_4)(x_1 + x_4 - x_2 - x_3)$ is
symmetric in $x_1, x_2, x_3, x_4$ and use the preceding procedure to prove it can be expressed
as a polynomial in the elementary symmetric functions as $s_1^3 - 4s_1 s_2 + 8s_3$.

43. Express each of the following in terms of the elementary symmetric functions: 

(a) $\sum_{i \ne j} x_i^2 x_j$   (b) $\sum_{i,j,k \text{ distinct}} x_i^2 x_j x_k$   (c) $\sum_{i,j,k \text{ distinct}} x_i^2 x_j^2 x_k^2$.
44. Let $\alpha_1, \alpha_2, \alpha_3, \alpha_4$ be the roots of a quartic polynomial $f(x)$ over $\mathbb{Q}$. Show that the quantities
$\alpha_1\alpha_2 + \alpha_3\alpha_4$, $\alpha_1\alpha_3 + \alpha_2\alpha_4$, and $\alpha_1\alpha_4 + \alpha_2\alpha_3$ are permuted by the Galois group of $f(x)$.
Conclude that these elements are the roots of a cubic polynomial with coefficients in $\mathbb{Q}$
(also sometimes referred to as the resolvent cubic of $f(x)$).

45. If $f(x) = x^3 + px + q \in \mathbb{Z}[x]$ is irreducible, prove that its discriminant $D = -4p^3 - 27q^2$
is an integer not equal to 0, ±1.

46. Prove that every finite group occurs as the Galois group of a field extension of the form
$F(x_1, x_2, \dots, x_n)/E$.

47. Let $F$ be a field of characteristic 0 in which every cubic polynomial has a root. Let $f(x)$ be
an irreducible quartic polynomial over $F$ whose discriminant is a square in $F$. Determine
the Galois group of $f(x)$.

48. This exercise determines the splitting field $K$ for the polynomial $f(x) = x^6 - 2x^3 - 2$
over $\mathbb{Q}$ (cf. also Exercise 2 of Section 8).

(a) Prove that $f(x)$ is irreducible over $\mathbb{Q}$ with roots the three cube roots of $1 \pm \sqrt{3}$.

(b) Prove that $K$ contains the field $\mathbb{Q}(\sqrt{-3})$ of 3rd roots of unity and contains $\mathbb{Q}(\sqrt{3})$,
hence contains the biquadratic field $F = \mathbb{Q}(i, \sqrt{3})$. Take the product of two of the
roots in (a) to prove that $K$ contains $\sqrt[3]{2}$ and conclude that $K$ is an extension of the
field $L = \mathbb{Q}(\sqrt[3]{2}, i, \sqrt{3})$.

(c) Prove that $[L : \mathbb{Q}] = 12$ and that $K$ is obtained from $L$ by adjoining the cube root of
an element in $L$, so that $[K : \mathbb{Q}] = 12$ or 36.

(d) Prove that if $[K : \mathbb{Q}] = 12$ then $K = \mathbb{Q}(\sqrt[3]{2}, i, \sqrt{3})$ and that $\mathrm{Gal}(K/\mathbb{Q})$ is isomorphic
to the direct product of the cyclic group of order 2 and $S_3$. Prove that if $[K : \mathbb{Q}] = 12$
then there is a unique real cubic subfield in $K$, namely $\mathbb{Q}(\sqrt[3]{2})$.

(e) Take the quotient of the two real roots in (a) to show that $\sqrt[3]{2 + \sqrt{3}}$ and $\sqrt[3]{2 - \sqrt{3}}$ (real
roots) are both elements of $K$. Show that $\alpha = \sqrt[3]{2 + \sqrt{3}} + \sqrt[3]{2 - \sqrt{3}}$ is a real root of
the irreducible cubic equation $x^3 - 3x - 4$ whose discriminant is $-2^2 3^4$. Conclude
that the Galois closure of $\mathbb{Q}(\alpha)$ contains $\mathbb{Q}(i)$ so in particular $\mathbb{Q}(\alpha) \ne \mathbb{Q}(\sqrt[3]{2})$.

(f) Conclude from (e) that $G = \mathrm{Gal}(K/\mathbb{Q})$ is of order 36. Determine all the elements of
$G$ explicitly and in particular show that $G$ is isomorphic to $S_3 \times S_3$.

49. Prove that the Galois group over $\mathbb{Q}$ of $x^6 - 4x^3 + 1$ is isomorphic to the dihedral group of
order 12. [Observe that the two real roots are inverses of each other.]

50. (Criterion for the Galois Group of an Irreducible Cubic over an Arbitrary Field) Suppose
$K$ is a field and $f(x) = x^3 + ax^2 + bx + c \in K[x]$ is irreducible, so the Galois group of
$f(x)$ over $K$ is either $S_3$ or $A_3$.

(a) Show that the Galois group of $f(x)$ is $A_3$ if and only if the resultant quadratic
polynomial $g(x) = x^2 + (ab - 3c)x + (b^3 + a^3c - 6abc + 9c^2)$ has a root in $K$. [If $\alpha, \beta, \gamma$
are the roots of $f(x)$ show that the Galois group is $A_3$ if and only if the element
$\theta = \alpha\beta^2 + \beta\gamma^2 + \gamma\alpha^2$ is an element of $K$ and that $\theta$ is a root of $g(x)$.] Show that the
discriminant of $g(x)$ is the same as the discriminant of $f(x)$.

(b) ($\mathrm{ch}(K) \ne 2$) If $K$ has characteristic different from 2 show either from (a) or directly
from the definition of the discriminant that the Galois group of $f(x)$ is $A_3$ if and only
if the discriminant of $f(x)$ is a square in $K$.

(c) ($\mathrm{ch}(K) = 2$) If $K$ has characteristic 2 show that the discriminant of $f(x)$ is always
a square. Show that $f(x)$ can be taken to be of the form $x^3 + px + q$ and that the
Galois group of $f(x)$ is $A_3$ if and only if the quadratic $x^2 + qx + (p^3 + q^2)$ has a root
in $K$ (equivalently, if $(p^3 + q^2)/q^2 \in K$ is in the image of the Artin-Schreier map
$x \mapsto x^2 - x$ mapping $K$ to $K$).
(d) Let $K = \mathbb{F}_2(t)$ where $t$ is transcendental over $\mathbb{F}_2$. Prove that the polynomials $x^3 +
t^2 x + t^3$, $x^3 + (t^2 + t + 1)x + (t^2 + t + 1)$, and $x^3 + (t^2 + t + 1)x + (t^3 + t^2 + t)$
have $A_3$ as Galois group while $x^3 + t^2 x + 1$ and $x^3 + x + t$ have $S_3$ as Galois group.

51. This exercise proves Sturm's Theorem determining the number of real roots of a polynomial
$f(x) \in \mathbb{R}[x]$ in an interval $[a, b]$. The multiple roots of $f(x)$ are zeros of the g.c.d. of
$f(x)$ and its derivative $f'(x)$, and it follows that to determine the real roots of $f(x)$ in
$[a, b]$ we may assume that the roots of $f(x)$ are simple.

Apply the Euclidean algorithm to $f_0(x) = f(x)$ and its derivative $f_1(x) = f'(x)$
using the negative of the remainder at each stage to find a sequence of polynomials
$f(x), f'(x), f_2(x), \dots, f_n(x)$ with

$$f_{i-1}(x) = q_i(x) f_i(x) - f_{i+1}(x), \qquad i = 1, 2, \dots, n - 1$$

where $f_n(x) \in \mathbb{R}$ is a nonzero constant.

(a) Prove that consecutive polynomials $f_i(x)$, $f_{i+1}(x)$ for $i = 0, 1, \dots, n - 1$ have no
common zeros. [Show that otherwise $f_{i+2}(c) = f_{i+3}(c) = \cdots = 0$, and derive a
contradiction.]

(b) If $f_i(c) = 0$ for some $i = 1, 2, \dots, n - 1$, prove that one of the two values $f_{i-1}(c)$,
$f_{i+1}(c)$ is strictly negative and the other is strictly positive.

For any real number $\alpha$, let $V(\alpha)$ denote the number of sign changes in the Sturm sequence
of real numbers

$$f(\alpha), f'(\alpha), f_2(\alpha), \dots, f_n(\alpha),$$

ignoring any 0's that appear (for example $-1, -2, 0, +3, -4$ has signs $-\,-\,+\,-$ disregarding
the 0, so there are 2 sign changes, the first from $-2$ to $+3$, the second from $+3$ to $-4$).

(c) Suppose $\alpha < \beta$ and that all the elements in the Sturm sequences for $\alpha$ and for $\beta$
are nonzero. Prove that unless $f_i(c) = 0$ for some $\alpha < c < \beta$ and some
$i = 0, 1, \dots, n - 1$, then the signs of all the elements in these two Sturm sequences are
the same, so in particular $V(\alpha) = V(\beta)$.

(d) If $f_i(c) = 0$ prove that there is a sufficiently small interval $(\alpha, \beta)$ containing $c$ so that
$f_i(x)$ has no zero other than $c$ for $\alpha < x < \beta$.

(e) If $i \ge 1$ in (d), prove that the number of sign changes in $f_{i-1}(\alpha), f_i(\alpha), f_{i+1}(\alpha)$
and in $f_{i-1}(\beta), f_i(\beta), f_{i+1}(\beta)$ are the same. [Observe that $f_{i-1}(c)$ and $f_{i+1}(c)$ have
opposite signs by (b) and $f_{i-1}(x)$ and $f_{i+1}(x)$ do not change sign in $(\alpha, \beta)$.]

(f) If $i = 0$ in (d) show that the number of sign changes in $f(\alpha), f'(\alpha)$ is one more than
the number of sign changes in $f(\beta), f'(\beta)$. [If $f'(c) > 0$ then $f(x)$ is increasing at
$c$, so that $f(\alpha) < 0$, $f(\beta) > 0$, and $f'(x)$ does not change sign in $(\alpha, \beta)$, so the signs
change from $-+$ to $++$. Similarly if $f'(c) < 0$.]

(g) Prove Sturm's Theorem: if $f(x)$ is a polynomial with real coefficients all of whose
real roots are simple then the number of real zeros of $f(x)$ in an interval $[a, b]$ where
$f(a)$ and $f(b)$ are both nonzero is given by $V(a) - V(b)$. [Use (c), (e) and (f) to
see that as $\alpha$ runs from $a$ to $b$ the number $V(\alpha)$ of sign changes is constant unless $\alpha$
passes through a zero of $f(x)$, in which case it decreases by precisely 1.]

(h) Suppose $f(x) = x^5 + px + q \in \mathbb{R}[x]$ has simple roots. Show that the sequence
of polynomials above is given by $f(x)$, $5x^4 + p$, $(-4p/5)x - q$, and $-D/(256p^4)$
where $D = 256p^5 + 3125q^4$ is the discriminant of $f(x)$. Conclude for $p > 0$ that
$f(x)$ has precisely one real root and for $p < 0$ that $f(x)$ has precisely 1 or 3 real roots
depending on whether $D > 0$ or $D < 0$, respectively. [E.g., if $p < 0$ and $D < 0$ then
at $-\infty$ the signs are $-\,+\,-\,+$ with 3 sign changes and at $+\infty$ the signs are $+\,+\,+\,+$
with no sign changes.]
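
The construction in Exercise 51 translates directly into a short computation. The sketch below is an illustration only: it assumes rational coefficients (so that exact arithmetic is available), represents a polynomial by its coefficient list from the highest degree down, and uses hypothetical helper names.

```python
from fractions import Fraction

def poly_eval(p, x):
    """Evaluate p (coefficients from highest degree down) at x by Horner's rule."""
    v = Fraction(0)
    for c in p:
        v = v * x + c
    return v

def derivative(p):
    n = len(p) - 1
    return [c * (n - i) for i, c in enumerate(p[:-1])]

def remainder(f, g):
    """Remainder of f on division by g; g must have a nonzero leading coefficient."""
    f = [Fraction(c) for c in f]
    while len(f) >= len(g):
        factor = f[0] / g[0]
        for i in range(len(g)):
            f[i] -= factor * g[i]
        f.pop(0)                      # the leading coefficient is now zero
    while f and f[0] == 0:            # strip any remaining leading zeros
        f.pop(0)
    return f

def sturm_sequence(f):
    """f, f', then the negatives of the successive Euclidean remainders."""
    seq = [[Fraction(c) for c in f], [Fraction(c) for c in derivative(f)]]
    while len(seq[-1]) > 1:
        r = remainder(seq[-2], seq[-1])
        if not r:
            raise ValueError("f has a multiple root")
        seq.append([-c for c in r])
    return seq

def sign_changes(seq, x):
    """V(x): number of sign changes in the sequence at x, ignoring zeros."""
    signs = [v > 0 for v in (poly_eval(p, x) for p in seq) if v != 0]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)

def count_real_roots(f, a, b):
    """V(a) - V(b); f(a) and f(b) are assumed to be nonzero."""
    seq = sturm_sequence(f)
    return sign_changes(seq, Fraction(a)) - sign_changes(seq, Fraction(b))

# x^5 - 6x + 3, i.e. p = -6 and q = 3 in part (h).
quintic = [1, 0, 0, 0, -6, 3]
roots_in_interval = count_real_roots(quintic, -2, 2)
```

For this quintic the sequence is $f$, $5x^4 - 6$, $(24/5)x - 3$ and a positive constant, in agreement with part (h) for $p < 0$ and $D < 0$.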
14.7 SOLVABLE AND RADICAL EXTENSIONS: INSOLVABILITY OF THE QUINTIC

We now investigate the question of solving for the roots of a polynomial by radicals, that
is, in terms of the algebraic operations of addition, subtraction, multiplication, division
and the extraction of $n$-th roots. The quadratic formula for the roots of a polynomial
of degree 2 is familiar from elementary algebra and we shall derive below similar
formulas for the roots of cubic and quartic polynomials. For polynomials of degree
$\ge 5$, however, we shall see that such formulas are not possible; this is Abel's Theorem
on the insolvability of the general quintic. The reason for this is quite simple: we shall
see that a polynomial is solvable by radicals if and only if its Galois group is a solvable
group (which explains the terminology) and for $n \ge 5$ the group $S_n$ is not solvable.

We first discuss simple radical extensions, namely extensions obtained by adjoining
to a field $F$ the $n$-th root of an element $a$ in $F$. Since all the roots of the polynomial $x^n - a$
for $a \in F$ differ by factors of the $n$-th roots of unity, adjoining one such root will give a
Galois extension if and only if this field contains the $n$-th roots of unity. Simple radical
extensions are best behaved when the base field $F$ already contains the appropriate roots
of unity. The symbol $\sqrt[n]{a}$ for $a \in F$ will be used to denote any root of the polynomial
$x^n - a \in F[x]$.

Definition. The extension $K/F$ is said to be cyclic if it is Galois with a cyclic Galois
group.

Proposition 36. Let $F$ be a field of characteristic not dividing $n$ which contains the
$n$-th roots of unity. Then the extension $F(\sqrt[n]{a})$ for $a \in F$ is cyclic over $F$ of degree
dividing $n$.

Proof: The extension $K = F(\sqrt[n]{a})$ is Galois over $F$ if $F$ contains the $n$-th roots
of unity since it is the splitting field for $x^n - a$. For any $\sigma \in \mathrm{Gal}(K/F)$, $\sigma(\sqrt[n]{a})$ is
another root of this polynomial, hence $\sigma(\sqrt[n]{a}) = \zeta_\sigma \sqrt[n]{a}$ for some $n$-th root of unity $\zeta_\sigma$.
This gives a map

$$\mathrm{Gal}(K/F) \to \mu_n, \qquad \sigma \mapsto \zeta_\sigma$$

where $\mu_n$ denotes the group of $n$-th roots of unity. Since $F$ contains $\mu_n$, every $n$-th root
of unity is fixed by every element of $\mathrm{Gal}(K/F)$. Hence

$$\sigma\tau(\sqrt[n]{a}) = \sigma(\zeta_\tau \sqrt[n]{a}) = \zeta_\tau \sigma(\sqrt[n]{a}) = \zeta_\tau \zeta_\sigma \sqrt[n]{a} = \zeta_\sigma \zeta_\tau \sqrt[n]{a}$$

which shows that $\zeta_{\sigma\tau} = \zeta_\sigma \zeta_\tau$, so the map above is a homomorphism. The kernel consists
precisely of the automorphisms which fix $\sqrt[n]{a}$, namely the identity. This gives an
injection of $\mathrm{Gal}(K/F)$ into the cyclic group $\mu_n$ of order $n$, which proves the proposition.

Let now $K$ be any cyclic extension of degree $n$ over a field $F$ of characteristic not
dividing $n$ which contains the $n$-th roots of unity. Let $\sigma$ be a generator for the cyclic
group $\mathrm{Gal}(K/F)$.
Definition. For $\alpha \in K$ and any $n$-th root of unity $\zeta$, define the Lagrange resolvent
$(\alpha, \zeta) \in K$ by

$$(\alpha, \zeta) = \alpha + \zeta\sigma(\alpha) + \zeta^2\sigma^2(\alpha) + \cdots + \zeta^{n-1}\sigma^{n-1}(\alpha).$$

If we apply the automorphism $\sigma$ to $(\alpha, \zeta)$ we obtain

$$\sigma(\alpha, \zeta) = \sigma\alpha + \zeta\sigma^2(\alpha) + \zeta^2\sigma^3(\alpha) + \cdots + \zeta^{n-1}\sigma^n(\alpha)$$

since $\zeta$ is an element of the base field $F$ so is fixed by $\sigma$. We have $\zeta^n = 1$ in $\mu_n$ and
$\sigma^n = 1$ in $\mathrm{Gal}(K/F)$ so this can be written

$$\sigma(\alpha, \zeta) = \sigma\alpha + \zeta\sigma^2(\alpha) + \zeta^2\sigma^3(\alpha) + \cdots + \zeta^{-1}\alpha$$

$$= \zeta^{-1}(\alpha + \zeta\sigma(\alpha) + \zeta^2\sigma^2(\alpha) + \cdots + \zeta^{n-1}\sigma^{n-1}(\alpha))$$

$$= \zeta^{-1}(\alpha, \zeta). \qquad (14.19)$$

It follows that

$$\sigma(\alpha, \zeta)^n = (\zeta^{-1})^n (\alpha, \zeta)^n = (\alpha, \zeta)^n$$

so that $(\alpha, \zeta)^n$ is fixed by $\mathrm{Gal}(K/F)$, hence is an element of $F$ for any $\alpha \in K$.

Let $\zeta$ be a primitive $n$-th root of unity. By the linear independence of the
automorphisms $1, \sigma, \dots, \sigma^{n-1}$ (Theorem 7), there is an element $\alpha \in K$ with $(\alpha, \zeta) \ne 0$.
Iterating (19) we have

$$\sigma^i(\alpha, \zeta) = \zeta^{-i}(\alpha, \zeta), \qquad i = 0, 1, \dots,$$

and it follows that $\sigma^i$ does not fix $(\alpha, \zeta)$ for any $0 < i < n$. Hence this element cannot lie in
any proper subfield of $K$, so $K = F((\alpha, \zeta))$. Since we proved $(\alpha, \zeta)^n = a \in F$ above,
we have $F(\sqrt[n]{a}) = F((\alpha, \zeta)) = K$. This proves the following converse of Proposition
36.
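
As a small sanity check of this construction, consider the simplest case $n = 2$. The following worked example is an added illustration, not part of the original argument; it assumes $F$ has characteristic different from 2, $K = F(\sqrt{d})$ with $d \in F$ not a square, $\sigma(\sqrt{d}) = -\sqrt{d}$, $\zeta = -1$, and $\alpha = u + v\sqrt{d}$ with $u, v \in F$.

```latex
\begin{aligned}
(\alpha, -1) &= \alpha + (-1)\,\sigma(\alpha)
              = (u + v\sqrt{d}) - (u - v\sqrt{d}) = 2v\sqrt{d}, \\
\sigma(\alpha, -1) &= -2v\sqrt{d} = \zeta^{-1}(\alpha, -1), \\
(\alpha, -1)^2 &= 4v^2 d \in F .
\end{aligned}
```

The resolvent is nonzero exactly when $v \ne 0$, and in that case $K = F((\alpha, -1)) = F(\sqrt{4v^2 d})$, in line with the general statement.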

Proposition 37. Any cyclic extension of degree $n$ over a field $F$ of characteristic not
dividing $n$ which contains the $n$-th roots of unity is of the form $F(\sqrt[n]{a})$ for some $a \in F$.

Remark: The two propositions above form a part of what is referred to as Kummer
theory. A group $G$ is said to have exponent $n$ if $g^n = 1$ for every $g \in G$. Let $F$ be a
field of characteristic not dividing $n$ which contains the $n$-th roots of unity. If we take
elements $a_1, \dots, a_k \in F^\times$ then as in Proposition 36 we can see that the extension

$$F(\sqrt[n]{a_1}, \dots, \sqrt[n]{a_k}) \qquad (14.20)$$

is an abelian extension of $F$ whose Galois group is of exponent $n$. Conversely, any
abelian extension of exponent $n$ is of this form.

Denote by $(F^\times)^n$ the subgroup of the multiplicative group $F^\times$ consisting of the $n$-th
powers of nonzero elements of $F$. The quotient group $F^\times/(F^\times)^n$ is an abelian group
of exponent $n$. The Galois group of the extension in (20) is isomorphic to the group
generated in $F^\times/(F^\times)^n$ by the elements $a_1, \dots, a_k$ and two extensions as in (20) are
equal if and only if their associated groups in $F^\times/(F^\times)^n$ are equal.

Hence the (finitely generated) subgroups of $F^\times/(F^\times)^n$ classify the abelian
extensions of exponent $n$ over fields containing the $n$-th roots of unity (and characteristic not
dividing $n$). Such extensions are called Kummer extensions.

These results generalize the case $k = 1$ above and can be proved in a similar way.


For simplicity we now consider the situation of a base field $F$ of characteristic 0.
As in the previous propositions the results are valid over fields whose characteristics do
not divide any of the orders of the roots that will be taken.

Definition.

(1) An element $\alpha$ which is algebraic over $F$ can be expressed by radicals or solved
for in terms of radicals if $\alpha$ is an element of a field $K$ which can be obtained
by a succession of simple radical extensions

$$F = K_0 \subset K_1 \subset \cdots \subset K_i \subset K_{i+1} \subset \cdots \subset K_s = K \qquad (14.21)$$

where $K_{i+1} = K_i(\sqrt[n_i]{a_i})$ for some $a_i \in K_i$, $i = 0, 1, \dots, s - 1$. Here $\sqrt[n_i]{a_i}$
denotes some root of the polynomial $x^{n_i} - a_i$. Such a field $K$ will be called a
root extension of $F$.

(2) A polynomial $f(x) \in F[x]$ can be solved by radicals if all its roots can be
solved for in terms of radicals.

This gives a precise meaning to the intuitive notion that a is obtained by successive 
algebraic operations (addition, subtraction, multiplication and division) and successive 
root extractions. For example, the element 


$$-1 + \sqrt{17} + \sqrt{2(17 - \sqrt{17})} + 2\sqrt{17 + 3\sqrt{17} - \sqrt{2(17 - \sqrt{17})} - 2\sqrt{2(17 + \sqrt{17})}}$$

encountered at the end of Section 5 (used to construct the regular 17-gon) is expressed
by radicals and is contained in the field $K_4$, where

$K_0 = \mathbb{Q}$
$K_1 = K_0(\sqrt{a_0})$, $a_0 = 17$
$K_2 = K_1(\sqrt{a_1})$, $a_1 = 2(17 - \sqrt{17})$
$K_3 = K_2(\sqrt{a_2})$, $a_2 = 2(17 + \sqrt{17})$
$K_4 = K_3(\sqrt{a_3})$, $a_3 = 17 + 3\sqrt{17} - \sqrt{2(17 - \sqrt{17})} - 2\sqrt{2(17 + \sqrt{17})}$.

Each of these extensions is a radical extension. The fact that no roots other than square
roots are required reflects the fact that the regular 17-gon is constructible by straightedge
and compass.
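
The tower above can be followed numerically, one square root at a time. The snippet below is a floating-point illustration only; it assumes the standard identity that this element equals $16\cos(2\pi/17)$, which is not restated in this section, and the variable names are hypothetical.

```python
from math import sqrt, cos, pi

# Each line adjoins one square root, mirroring K_1, ..., K_4.
r17 = sqrt(17)                                   # generates K_1, a_0 = 17
a1 = 2 * (17 - r17)                              # a_1 lies in K_1
a2 = 2 * (17 + r17)                              # a_2 lies in K_1 (hence in K_2)
a3 = 17 + 3 * r17 - sqrt(a1) - 2 * sqrt(a2)      # a_3 lies in K_3
element = -1 + r17 + sqrt(a1) + 2 * sqrt(a3)     # lies in K_4

# Assumed identity: element = 16 * cos(2*pi/17).
difference = abs(element / 16 - cos(2 * pi / 17))
```

The value of `difference` is at the level of floating-point rounding, which is consistent with the identity but of course does not prove it.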


In considering radical extensions one may always adjoin roots of unity, since by 
definition the roots of unity are radicals. This is useful because then cyclic extensions 
become radical extensions and conversely. In particular we have: 
Lemma 38. If $\alpha$ is contained in a root extension $K$ as in (21) above, then $\alpha$ is contained
in a root extension which is Galois over $F$ and where each extension $K_{i+1}/K_i$ is cyclic.

Proof: Let $L$ be the Galois closure of $K$ over $F$. For any $\sigma \in \mathrm{Gal}(L/F)$ we have
the chain of subfields

$$F = \sigma K_0 \subset \sigma K_1 \subset \cdots \subset \sigma K_i \subset \sigma K_{i+1} \subset \cdots \subset \sigma K_s = \sigma K$$

where $\sigma K_{i+1}/\sigma K_i$ is again a simple radical extension (since it is generated by the
element $\sigma(\sqrt[n_i]{a_i})$, which is a root of the equation $x^{n_i} - \sigma(a_i)$ over $\sigma(K_i)$). It is easy
to see that the composite of two root extensions is again a root extension (if $K'$ is
another root extension with subfields $K'_i$, first take the composite of $K'_1$ with the fields
$K_0, K_1, \dots, K_s$, then the composite of these fields with $K'_2$, etc. so that each individual
extension in this process is a simple radical extension). It follows that the composite
of all the conjugate fields $\sigma(K)$ for $\sigma \in \mathrm{Gal}(L/F)$ is again a root extension. Since this
field is precisely $L$, we see that $\alpha$ is contained in a Galois root extension.

We now adjoin to $F$ the $n_i$-th roots of unity for all the roots $\sqrt[n_i]{a_i}$ of the simple
radical extensions in the Galois root extension $K/F$, obtaining the field $F'$, say, and
then form the composite of $F'$ with the root extension:

$$F \subseteq F' = F'K_0 \subseteq F'K_1 \subseteq \cdots \subseteq F'K_i \subseteq F'K_{i+1} \subseteq \cdots \subseteq F'K_s = F'K.$$

The field $F'K$ is a Galois extension of $F$ since it is the composite of two Galois
extensions. The extension from $F$ to $F' = F'K_0$ can be given as a chain of subfields with
each individual extension cyclic (this is true for any abelian extension). Each extension
$F'K_{i+1}/F'K_i$ is a simple radical extension and since we now have the appropriate roots
of unity in the base fields, each of these individual extensions from $F'$ to $F'K$ is a cyclic
extension by Proposition 36. Hence $F'K/F$ is a root extension which is Galois over $F$
with cyclic intermediate extensions, completing the proof.

Recall from Section 3.4 (cf. also Section 6.1) that a finite group $G$ is solvable if
there exists a chain of subgroups

$$1 = G_s \trianglelefteq G_{s-1} \trianglelefteq \cdots \trianglelefteq G_{i+1} \trianglelefteq G_i \trianglelefteq \cdots \trianglelefteq G_0 = G \qquad (14.22)$$

with $G_i/G_{i+1}$ cyclic, $i = 0, 1, \dots, s - 1$. We have proved that subgroups and quotient
groups of solvable groups are solvable and that if $H \trianglelefteq G$ and $G/H$ are both solvable,
then $G$ is solvable.

We now prove Galois' fundamental connection between solving for the roots of
polynomials in terms of radicals and the Galois group of the polynomial. We continue 
to work over a field F of characteristic 0, but it is easy to see that the proof is valid over 
any field of characteristic not dividing the order of the Galois group or the orders of the 
radicals involved. 

Theorem 39. The polynomial f(x) can be solved by radicals if and only if its Galois 
group is a solvable group. 

Proof: Suppose first that $f(x)$ can be solved by radicals. Then each root of $f(x)$
is contained in an extension as in the lemma. The composite $L$ of such extensions is
again of the same type by Proposition 21. Let $G_i$ be the subgroups corresponding to
the subfields $K_i$, $i = 0, 1, \dots, s - 1$. Since

$$\mathrm{Gal}(K_{i+1}/K_i) = G_i/G_{i+1}, \qquad i = 0, 1, \dots, s - 1$$

it follows that the Galois group $G = \mathrm{Gal}(L/F)$ is a solvable group. The field $L$ contains
the splitting field of $f(x)$ so the Galois group of $f(x)$ is a quotient group of the solvable
group $G$, hence is solvable.

Suppose now that the Galois group $G$ of $f(x)$ is a solvable group and let $K$ be the
splitting field for $f(x)$. Taking the fixed fields of the subgroups in a chain (22) for $G$
gives a chain

$$F = K_0 \subset K_1 \subset \cdots \subset K_i \subset K_{i+1} \subset \cdots \subset K_s = K$$

where $K_{i+1}/K_i$, $i = 0, 1, \dots, s - 1$ is a cyclic extension of degree $n_i$. Let $F'$ be the
cyclotomic field over $F$ of all roots of unity of order $n_i$, $i = 0, 1, \dots, s - 1$ and form
the composite fields $K'_i = F'K_i$. We obtain a sequence of extensions

$$F \subseteq F' = F'K_0 \subseteq F'K_1 \subseteq \cdots \subseteq F'K_i \subseteq F'K_{i+1} \subseteq \cdots \subseteq F'K_s = F'K.$$

The extension $F'K_{i+1}/F'K_i$ is cyclic of degree dividing $n_i$, $i = 0, 1, \dots, s - 1$ (by
Proposition 19). Since we now have the appropriate roots of unity in the base fields,
each of these cyclic extensions is a simple radical extension by Proposition 37. Each
of the roots of $f(x)$ is therefore contained in the root extension $F'K$ so that $f(x)$ can
be solved by radicals.

Corollary 40. The general equation of degree $n$ cannot be solved by radicals for $n \ge 5$.

Proof: For $n \ge 5$ the group $S_n$ is not solvable as we showed in Chapter 4. The
corollary follows immediately from Theorems 32 and 39.
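
The group-theoretic input to this corollary can be checked by brute force for small $n$. The sketch below is an illustration, not the argument of Chapter 4: it relies on the standard equivalent criterion (assumed here, not taken from this section) that a finite group is solvable if and only if its derived series reaches the trivial group, and the function names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """The permutation 'p after q' on {0, ..., n-1}, as a tuple of images."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def closure(gens, n):
    """Subgroup of S_n generated by gens (finite, so products suffice)."""
    identity = tuple(range(n))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def derived_subgroup(G, n):
    """Subgroup generated by all commutators a b a^-1 b^-1 with a, b in G."""
    comms = {compose(compose(a, b), compose(inverse(a), inverse(b)))
             for a in G for b in G}
    return closure(comms, n)

def derived_series_orders(n):
    """Orders of S_n, its derived subgroup, and so on until the series stabilizes."""
    G = set(permutations(range(n)))
    orders = [len(G)]
    while True:
        H = derived_subgroup(G, n)
        if len(H) == len(G):          # H is contained in G, so equal order means H = G
            break
        orders.append(len(H))
        G = H
    return orders

s4_orders = derived_series_orders(4)
s5_orders = derived_series_orders(5)
```

For $S_4$ the series descends through subgroups of orders 24, 12, 4, 1, while for $S_5$ it stalls at order 60 (the alternating group), so $S_5$ is not solvable.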

This corollary shows that there is no formula involving radicals analogous to the
quadratic formula for polynomials of degree 2 for the roots of a polynomial of degree
5. To give an example of a specific polynomial over $\mathbb{Q}$ of degree 5 whose roots cannot
be expressed in terms of radicals we must demonstrate a polynomial of degree 5 with
rational coefficients having $S_5$ (or $A_5$, which is also not solvable) as Galois group (cf.
also Exercise 21, which gives a criterion for the solvability of a quintic).

Example 

Consider the polynomial $f(x) = x^5 - 6x + 3 \in \mathbb{Q}[x]$. This polynomial is irreducible since
it is Eisenstein at 3. The splitting field $K$ for this polynomial therefore has degree divisible
by 5, since adjoining one root of $f(x)$ to $\mathbb{Q}$ generates an extension of degree 5. The Galois
group $G$ is therefore a subgroup of $S_5$ of order divisible by 5 so contains an element of
order 5. The only elements in $S_5$ of order 5 are 5-cycles, so $G$ contains a 5-cycle.

Since $f(-2) = -17$, $f(0) = 3$, $f(1) = -2$, and $f(2) = 23$ we see that $f(x)$ has a
real root in each of the intervals $(-2, 0)$, $(0, 1)$ and $(1, 2)$. By the Mean Value Theorem,
if there were 4 real roots then the derivative $f'(x) = 5x^4 - 6$ would have at least 3 real
zeros, which it does not. Hence these are the only real roots. (This also follows easily by
Descartes' rule of signs.) By the Fundamental Theorem of Algebra $f(x)$ has 5 roots in $\mathbb{C}$.
Hence $f(x)$ has two complex roots which are not real. Let $\tau$ denote the automorphism of
complex conjugation in $\mathbb{C}$. Since the coefficients of $f(x)$ are real, the two complex roots
must be interchanged by $\tau$ (since they are not fixed, not being real). Hence the restriction
of complex conjugation to $K$ fixes three of the roots of $f(x)$ and interchanges the other
two. As an element of $G$, $\tau|_K$ is therefore a transposition.

It is now a simple exercise to show that any 5-cycle together with any transposition
generate all of $S_5$. It follows that $G = S_5$, so the roots of $x^5 - 6x + 3$ cannot be expressed
by radicals.
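
The "simple exercise" in the last step can also be confirmed by exhaustive computation. The following sketch is only a finite check, not a proof in the sense intended by the text: it enumerates every 5-cycle and every transposition in $S_5$ and computes the subgroup each pair generates. All names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """The permutation 'p after q' on {0, ..., n-1}, as a tuple of images."""
    return tuple(p[i] for i in q)

def generated_group(gens):
    """Closure of the generators under composition (a subgroup, since S_n is finite)."""
    identity = tuple(range(len(gens[0])))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(s, g)
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def cycle_type(p):
    """Sorted list of cycle lengths of the permutation p."""
    seen, lengths = set(), []
    for i in range(len(p)):
        if i not in seen:
            length, j = 0, i
            while j not in seen:
                seen.add(j)
                j = p[j]
                length += 1
            lengths.append(length)
    return sorted(lengths)

s5 = list(permutations(range(5)))
five_cycles = [p for p in s5 if cycle_type(p) == [5]]
transpositions = [p for p in s5 if cycle_type(p) == [1, 1, 1, 2]]

# Order of the subgroup generated by each (5-cycle, transposition) pair.
orders = {len(generated_group([c, t])) for c in five_cycles for t in transpositions}
```

Every one of the 24 x 10 pairs generates a group of order 120, that is, all of $S_5$.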

As indicated in this example, a great deal of information regarding the Galois group
can be obtained by understanding the cycle types of the automorphisms in $G$ considered
as a subgroup of $S_n$. In practice this is the most efficient way of determining the Galois
groups of polynomials of degrees $\ge 5$ (becoming more difficult the larger the degree,
of course, if only because the possible subgroups of $S_n$ are vastly more numerous). We
describe this procedure in the next section.

By Theorem 39, any polynomial of degree $n \le 4$ can be solved by radicals, since
$S_n$ is a solvable group for these $n$. For $n = 2$ this is just the familiar quadratic formula.
For $n = 3$ the formula is known as Cardano's Formula (named for Geronimo Cardano
(1501-1576)) and the formula for $n = 4$ can be reduced to this one. The formulas are
valid over any field $F$ of characteristic $\ne 2, 3$, which are the characteristics dividing
the orders of the radicals necessary and the orders of the possible Galois groups (which
are subgroups of $S_3$ and $S_4$). For simplicity we shall derive the formulas over $\mathbb{Q}$.

Solution of Cubic Equations by Radicals: Cardano's Formulas 

From the proof of Theorem 39 and the fact that a composition series for $S_3$ as in equation
(22) is given by $1 \le A_3 \le S_3$ we should expect that the solution of the cubic

$$f(x) = x^3 + ax^2 + bx + c$$

(or equivalently, under the substitution $x = y - a/3$,

$$g(y) = y^3 + py + q,$$

where

$$p = \tfrac{1}{3}(3b - a^2), \qquad q = \tfrac{1}{27}(2a^3 - 9ab + 27c))$$

to involve adjoining the 3rd roots of unity and the formation of Lagrange resolvents
involving these roots of unity.

Let $\rho$ denote a primitive 3rd root of unity, so that $\rho^2 + \rho + 1 = 0$. Let the roots of
$g(y)$ be $\alpha$, $\beta$, and $\gamma$, so that

$$\alpha + \beta + \gamma = 0$$

(one of the reasons for changing from $f(x)$ to $g(y)$). Over the field $\mathbb{Q}(\sqrt{D})$ where $D$
is the discriminant (computed in the last section) the Galois group of $g(y)$ is $A_3$, i.e.,
a cyclic group of order 3. If we adjoin $\rho$ then this extension is a radical extension of
degree 3, with generator given by a Lagrange Resolvent, as in the proof of Proposition
37. Consider therefore the elements

$$(\alpha, 1) = \alpha + \beta + \gamma = 0$$

$$\theta_1 = (\alpha, \rho) = \alpha + \rho\beta + \rho^2\gamma$$

$$\theta_2 = (\alpha, \rho^2) = \alpha + \rho^2\beta + \rho\gamma.$$

Note that the sum of these resolvents is

$$\theta_1 + \theta_2 = 3\alpha \qquad (14.23)$$

since $1 + \rho + \rho^2 = 0$. Similarly

$$\rho^2\theta_1 + \rho\theta_2 = 3\beta$$

$$\rho\theta_1 + \rho^2\theta_2 = 3\gamma. \qquad (14.23')$$

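We also showed in general before Proposition 37 that the cube of these resolvents must
lie in $\mathbb{Q}(\sqrt{D}, \rho)$. Expanding $\theta_1^3$ we obtain

$$\alpha^3 + \beta^3 + \gamma^3 + 3\rho(\alpha^2\beta + \beta^2\gamma + \alpha\gamma^2) + 3\rho^2(\alpha\beta^2 + \beta\gamma^2 + \alpha^2\gamma) + 6\alpha\beta\gamma. \qquad (14.24)$$

We have

$$\sqrt{D} = (\alpha - \beta)(\alpha - \gamma)(\beta - \gamma) = (\alpha^2\beta + \beta^2\gamma + \alpha\gamma^2) - (\alpha\beta^2 + \beta\gamma^2 + \alpha^2\gamma).$$

Using this equation we see that (24) can be written

$$\alpha^3 + \beta^3 + \gamma^3 + 3\rho[\tfrac{1}{2}(S + \sqrt{D})] + 3\rho^2[\tfrac{1}{2}(S - \sqrt{D})] + 6\alpha\beta\gamma \qquad (14.24')$$

where for simplicity we have denoted by $S$ the expression

$$(\alpha^2\beta + \beta^2\gamma + \alpha\gamma^2) + (\alpha\beta^2 + \beta\gamma^2 + \alpha^2\gamma).$$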

Since $S$ is symmetric in the roots, each of the expressions in (24') is a symmetric
polynomial in $\alpha$, $\beta$ and $\gamma$, hence is a polynomial in the elementary symmetric functions
$s_1 = 0$, $s_2 = p$ and $s_3 = -q$. After a short calculation one finds

$$\alpha^3 + \beta^3 + \gamma^3 = -3q, \qquad S = 3q$$

so that from (24') we find ($\rho + \rho^2 = -1$ and $\rho - \rho^2 = \sqrt{-3}$)

$$\theta_1^3 = -3q + \tfrac{3}{2}\rho(3q + \sqrt{D}) + \tfrac{3}{2}\rho^2(3q - \sqrt{D}) - 6q$$

$$= -\tfrac{27}{2}q + \tfrac{3}{2}\sqrt{-3D}. \qquad (14.25)$$

Similarly, we find

$$\theta_2^3 = -\tfrac{27}{2}q - \tfrac{3}{2}\sqrt{-3D}. \qquad (14.25')$$
Equations (25) and (23) essentially give the solutions of our cubic. One small point
remains, however, namely the issue of extracting the cube roots of the expressions
in (25) to obtain $\theta_1$ and $\theta_2$. There are 3 possible cube roots, which might suggest a
total of 9 expressions in (23). This is not the case since $\theta_1$ and $\theta_2$ are not independent
(adjoining one of them already gives the Galois extension containing all of the roots).
A computation like the one above (but easier) shows that

$$\theta_1\theta_2 = -3p \qquad (14.26)$$

showing that the choice of cube root for $\theta_1$ determines $\theta_2$. Using $D = -4p^3 - 27q^2$,
we obtain Cardano's explicit formulas, as follows.

Let

$$A = \sqrt[3]{-\tfrac{27}{2}q + \tfrac{3}{2}\sqrt{-3D}}, \qquad B = \sqrt[3]{-\tfrac{27}{2}q - \tfrac{3}{2}\sqrt{-3D}}$$

where the cube roots are chosen so that $AB = -3p$. Then the roots of the equation

$$y^3 + py + q = 0$$

are

$$\alpha = \frac{A + B}{3}, \qquad \beta = \frac{\rho^2 A + \rho B}{3}, \qquad \gamma = \frac{\rho A + \rho^2 B}{3} \qquad (14.27)$$

where $\rho = -\tfrac{1}{2} + \tfrac{1}{2}\sqrt{-3}$.
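
Formulas (14.27) can be tried out numerically. The following is a floating-point sketch under stated assumptions, not an exact implementation: the function name `cardano` is hypothetical, the cube root of $A^3$ is taken to be the principal complex one, and $B$ is then obtained from the constraint $AB = -3p$ rather than from a second, independent cube root.

```python
import cmath

def cardano(p, q):
    """Approximate roots of y^3 + p*y + q = 0 from formulas (14.27)."""
    D = -4 * p**3 - 27 * q**2
    root = cmath.sqrt(-3 * D)
    A_cubed = -13.5 * q + 1.5 * root          # -(27/2) q + (3/2) sqrt(-3D)
    B_cubed = -13.5 * q - 1.5 * root
    # Work with the larger of the two cubes so that the division below is safe;
    # exchanging A and B only interchanges the roots beta and gamma.
    if abs(A_cubed) < abs(B_cubed):
        A_cubed, B_cubed = B_cubed, A_cubed
    A = A_cubed ** (1 / 3) if A_cubed != 0 else 0   # principal complex cube root
    B = -3 * p / A if A != 0 else 0                 # enforces A * B = -3p
    rho = complex(-0.5, 3 ** 0.5 / 2)               # primitive cube root of unity
    return ((A + B) / 3,
            (rho**2 * A + rho * B) / 3,
            (rho * A + rho**2 * B) / 3)

def residual(y, p, q):
    """Size of y^3 + p*y + q, which should be near zero at a root."""
    return abs(y**3 + p * y + q)

roots_one_real = cardano(-1, 1)       # y^3 - y + 1
roots_three_real = cardano(-21, -7)   # y^3 - 21y - 7
```

Because the principal cube root is used, the first entry returned need not be the real root even when the text's choice of a real $A$ would make it so; the three values are the same roots in a possibly different order.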


Examples 

(1) Consider the cubic equation $x^3 - x + 1 = 0$. The discriminant of this cubic is

$$D = -4(-1)^3 - 27(1)^2 = -23$$

which is not the square of a rational number, so the Galois group for this polynomial
is $S_3$. Substituting into the formulas above we have

$$A = \sqrt[3]{-\tfrac{27}{2} + \tfrac{3}{2}\sqrt{69}}, \qquad B = \sqrt[3]{-\tfrac{27}{2} - \tfrac{3}{2}\sqrt{69}}$$

where we choose $A$ to be the real cube root and then from $AB = 3$ we see that $B$ is
also real. The roots of the cubic are given by (27) and we see that there is one real root
and two (conjugate) complex roots (which we could have determined without solving
for the roots, of course).

(2) Consider the equation $x^3 + x^2 - 2x - 1 = 0$. Letting $x = r - 1/3$ the equation
becomes $r^3 - \tfrac{7}{3}r - \tfrac{7}{27} = 0$. Multiplying through by 27 to clear denominators and
letting $y = 3r$ we see that $y$ satisfies the cubic equation

$$y^3 - 21y - 7 = 0.$$

The discriminant $D$ for this cubic is

$$D = -4(-21)^3 - 27(-7)^2 = 3^6 7^2$$

which shows that the Galois group for this (Eisenstein at 7) cubic is $A_3$. Substituting
into the formulas above we have

$$A = 3\sqrt[3]{\tfrac{7}{2} + \tfrac{21}{2}\sqrt{-3}}, \qquad B = 3\sqrt[3]{\tfrac{7}{2} - \tfrac{21}{2}\sqrt{-3}}$$

and the roots of our cubic can be expressed in terms of $A$ and $B$ using the formulas
above. This cubic arises from trying to express a primitive 7th root of unity $\zeta_7$ in terms
of radicals similar to the explicit formulas for the other roots of unity of small order
(cf. the exercises).

In this case we have $g(-5) = -27$, $g(-1) = 13$, $g(0) = -7$ and $g(5) = 13$, so
that this cubic has 3 real roots. The expressions above for these roots are sums of the
conjugates of complex numbers. We shall see later that this is necessary, namely that
it is impossible to solve for these real roots using only radicals involving real numbers.


A cubic with rational coefficients has either one real root and two complex conjugate
imaginary roots or has three real roots. These two cases can be distinguished by the
sign of the discriminant:

Suppose in the first case that the roots are $a$ and $b \pm ic$ where $a$, $b$, and $c$ are real
and $c \ne 0$. Then

$$\sqrt{D} = [a - (b + ic)][a - (b - ic)][(b + ic) - (b - ic)]$$

$$= 2ic[(a - b)^2 + c^2]$$

is purely imaginary, so that the discriminant $D$ is negative. Then in the formulas for $A$
and $B$ above we may choose both to be real. The first root in (27) is then real and the
second two are complex conjugates.

If all three roots are real, then clearly $\sqrt{D}$ is real, so $D \ge 0$ is a nonnegative
real number. If $D = 0$ then the cubic has repeated roots. For $D > 0$ (sometimes
called the Casus irreducibilis), the formulas for the roots involve radicals of nonreal
numbers, as in Example 2. We now show that for irreducible cubics this is necessary.
The exercises outline the proof of the following generalization: if all the roots of the
irreducible polynomial $f(x) \in \mathbb{Q}[x]$ are real and if one of these roots can be expressed
by real radicals, then the degree of $f(x)$ is a power of 2, the Galois group of $f(x)$ is a
2-group, and the roots of $f(x)$ can be constructed by straightedge and compass.

Suppose that the irreducible cubic $f(x)$ has three real roots and that it were possible
to express one of these roots by radicals involving only real numbers. Then the splitting
field for the cubic would be contained in a root extension

$$\mathbb{Q} = K_0 \subset K_1 = \mathbb{Q}(\sqrt{D}) \subset \cdots \subset K_i \subset K_{i+1} \subset \cdots \subset K_s = K$$

where each field $K_i$, $i = 0, 1, \dots, s$, is contained in the real numbers $\mathbb{R}$ and $s \ge 2$ since
the quadratic extension $\mathbb{Q}(\sqrt{D})$ cannot contain the root of an irreducible cubic. We
have begun this root extension with $\mathbb{Q}(\sqrt{D})$ because over this field the Galois group
of the polynomial is cyclic of degree 3.
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Note that for any field F the extension F( m {fa ) of F can be obtained by two smaller 
simple radical extensions: let 

F, = F(yG) 

and let b = ifa e F\, so that 

F('XZS) = Fi(Vb). 

We may therefore always assume our radical extensions are of the form F( sja ) where 
p is a prime. 

Suppose now that $F$ is a subfield of the real numbers $\mathbb{R}$ and let $a$ be an element of
$F$. Let $p$ be a prime and let $\alpha = \sqrt[p]{a}$ denote a real $p$th root of $a$. Then $[F(\sqrt[p]{a}) : F]$
must be either 1 or $p$, as follows. The conjugates of $\alpha$ over $F$ all differ from $\alpha$ by a $p$th
root of unity. It follows that the constant term of the minimal polynomial of $\alpha$ over $F$
is $\alpha^d \zeta$ where $d = [F(\sqrt[p]{a}) : F]$ is the degree of the minimal polynomial and $\zeta$ is some
$p$th root of unity. Since $\alpha$ is real and $\alpha^d \zeta \in F$ is real, it follows that $\zeta = \pm 1$, so that
$\alpha^d \in F$. Then, if $d \neq p$, $\alpha^d \in F$ and $\alpha^p = a \in F$ implies $\alpha \in F$, so $d = 1$.

Hence we may assume for the radical extensions above that $[K_{i+1} : K_i]$ is a prime
$p_i$ and $K_{i+1} = K_i(\sqrt[p_i]{a_i})$ for some $a_i \in K_i$, $i = 0, 1, \dots, s - 1$. In other words, the
original tower of real radical extensions can be refined to a tower where each of the
successive radical extensions has prime degree.

If any field containing $\sqrt{D}$ contains one of the roots of $f(x)$ then it contains the
splitting field for $f(x)$, hence contains all the roots of the cubic. We suppose $s$ is chosen
so that $K_{s-1}$ does not contain any of the roots of the cubic.

Consider the extension $K_s/K_{s-1}$. The field $K_s$ contains all the roots of the cubic
$f(x)$ and the field $K_{s-1}$ contains none of these roots. It follows that $f(x)$ is irreducible
over $K_{s-1}$, so $[K_s : K_{s-1}]$ is divisible by 3. Since we have reduced to the case where
this extension degree is a prime, it follows that the extension degree is precisely 3 and
that the extension $K_s/K_{s-1}$ is Galois (being the splitting field of $f(x)$ over $K_{s-1}$). Since
also $K_s = K_{s-1}(\sqrt[3]{a})$ for some $a \in K_{s-1}$, the Galois extension $K_s$ must also contain
the other cube roots of $a$. This implies that $K_s$ contains $\rho$, a primitive 3rd root of unity.
This contradicts the assumption that $K_s$ is a subfield of $\mathbb{R}$ and shows that it is impossible
to express the roots of this cubic in terms of real radicals only.


Solution of Quartic Equations by Radicals 


Consider now the case of a quartic polynomial

$$f(x) = x^4 + ax^3 + bx^2 + cx + d$$

which under the substitution $x = y - a/4$ becomes the quartic

$$g(y) = y^4 + py^2 + qy + r$$

with

$$p = \frac{1}{8}(-3a^2 + 8b)$$

$$q = \frac{1}{8}(a^3 - 4ab + 8c)$$

$$r = \frac{1}{256}(-3a^4 + 16a^2 b - 64ac + 256d).$$

Let the roots of $g(y)$ be $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\alpha_4$. The resolvent cubic introduced in the
previous section for this quartic is

$$h(x) = x^3 - 2px^2 + (p^2 - 4r)x + q^2$$

and has roots

$$\theta_1 = (\alpha_1 + \alpha_2)(\alpha_3 + \alpha_4)$$

$$\theta_2 = (\alpha_1 + \alpha_3)(\alpha_2 + \alpha_4)$$

$$\theta_3 = (\alpha_1 + \alpha_4)(\alpha_2 + \alpha_3).$$

The Galois group of the splitting field for $f(x)$ (or $g(y)$) over the splitting field of
the resolvent cubic $h(x)$ is the Klein 4-group. Such extensions are biquadratic, which
means that it is possible to solve for the roots $\alpha_1$, $\alpha_2$, $\alpha_3$, and $\alpha_4$ in terms of square roots
of expressions involving the roots $\theta_1$, $\theta_2$, and $\theta_3$ of the resolvent cubic. In this case we
evidently have

$$(\alpha_1 + \alpha_2)(\alpha_3 + \alpha_4) = \theta_1 \qquad (\alpha_1 + \alpha_2) + (\alpha_3 + \alpha_4) = 0$$

which gives

$$\alpha_1 + \alpha_2 = \sqrt{-\theta_1} \qquad \alpha_3 + \alpha_4 = -\sqrt{-\theta_1}.$$

Similarly,

$$\alpha_1 + \alpha_3 = \sqrt{-\theta_2} \qquad \alpha_2 + \alpha_4 = -\sqrt{-\theta_2}$$

$$\alpha_1 + \alpha_4 = \sqrt{-\theta_3} \qquad \alpha_2 + \alpha_3 = -\sqrt{-\theta_3}.$$

An easy computation shows that $\sqrt{-\theta_1}\,\sqrt{-\theta_2}\,\sqrt{-\theta_3} = -q$, so that the choice of two
of the square roots determines the third. Since $\alpha_1 + \alpha_2 + \alpha_3 + \alpha_4 = 0$, if we add the
left-hand equations above we obtain $2\alpha_1$, and similarly we may solve for the other roots
of $g(y)$. We find

$$2\alpha_1 = \sqrt{-\theta_1} + \sqrt{-\theta_2} + \sqrt{-\theta_3}$$

$$2\alpha_2 = \sqrt{-\theta_1} - \sqrt{-\theta_2} - \sqrt{-\theta_3}$$

$$2\alpha_3 = -\sqrt{-\theta_1} + \sqrt{-\theta_2} - \sqrt{-\theta_3}$$

$$2\alpha_4 = -\sqrt{-\theta_1} - \sqrt{-\theta_2} + \sqrt{-\theta_3}$$

which reduces the solution of the quartic equation to the solution of the associated
resolvent cubic.
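
The last step of this reduction can be sketched numerically. The snippet below is an illustration only: it assumes the three roots of the resolvent cubic are already known, and the function name `quartic_roots_from_resolvent` is a hypothetical helper, not notation from the text. The sign of the third square root is fixed by the relation that the three square roots multiply to $-q$.

```python
import cmath

def quartic_roots_from_resolvent(q, thetas):
    """Roots of y^4 + p y^2 + q y + r, given q and the three roots of the
    resolvent cubic h(x) = x^3 - 2p x^2 + (p^2 - 4r) x + q^2."""
    t1, t2, t3 = thetas
    s1 = cmath.sqrt(-t1)   # plays the role of alpha_1 + alpha_2
    s2 = cmath.sqrt(-t2)   # plays the role of alpha_1 + alpha_3
    s3 = cmath.sqrt(-t3)   # alpha_1 + alpha_4, up to a sign fixed below
    # The product s1 * s2 * s3 must equal -q; flip s3 if the other sign fits better.
    if abs(s1 * s2 * s3 + q) > abs(s1 * s2 * (-s3) + q):
        s3 = -s3
    return [
        (s1 + s2 + s3) / 2,
        (s1 - s2 - s3) / 2,
        (-s1 + s2 - s3) / 2,
        (-s1 - s2 + s3) / 2,
    ]

# Example chosen for illustration: g(y) = (y - 1)(y - 2)(y - 3)(y + 6)
#   = y^4 - 25 y^2 + 60 y - 36, so p = -25, q = 60, r = -36,
# and the resolvent cubic x^3 + 50 x^2 + 769 x + 3600 has roots -9, -16, -25.
roots = quartic_roots_from_resolvent(60, (-9, -16, -25))
print(sorted(round(z.real) for z in roots))
```

Choosing the other sign for both of the first two square roots only permutes the four roots, which is why fixing the third from the product relation is enough.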

EXERCISES 

1. Use Cardano's Formulas to solve the equation $x^3 + x^2 - 2 = 0$. In particular show that
the equation has the real root

$$\frac{1}{3}\left(\sqrt[3]{26 + 15\sqrt{3}} + \sqrt[3]{26 - 15\sqrt{3}} - 1\right).$$

Show directly that the roots of this cubic are $1$, $-1 \pm i$. Explain this by proving that

$$\sqrt[3]{26 + 15\sqrt{3}} = 2 + \sqrt{3} \qquad \sqrt[3]{26 - 15\sqrt{3}} = 2 - \sqrt{3}$$

so that

$$\sqrt[3]{26 + 15\sqrt{3}} + \sqrt[3]{26 - 15\sqrt{3}} = 4.$$


2. Let $\zeta_7$ be a primitive 7th root of unity and let $\alpha = \zeta_7 + \zeta_7^{-1}$.

(a) Show that $\zeta_7$ is a root of the quadratic $z^2 - \alpha z + 1$ over $\mathbb{Q}(\alpha)$.

(b) Show using the minimal polynomial for $\zeta_7$ that $\alpha$ is a root of the cubic $x^3 + x^2 - 2x - 1$.

(c) Use (a) and (b) together with the explicit solution of the cubic in (b) in the text to
express $\zeta_7$ in terms of radicals similar to the expressions given earlier for the other
roots of unity of small order. (The complicated nature of the expression explains why
we did not include $\zeta_7$ earlier in our list of explicit roots of unity.)

3. Let $F$ be a field of characteristic $\neq 2$. State and prove a necessary and sufficient condition
on $\alpha, \beta \in F$ so that $F(\sqrt{\alpha}) = F(\sqrt{\beta})$. Use this to determine whether
$\mathbb{Q}(\sqrt{1 - \sqrt{2}}) = \mathbb{Q}(i, \sqrt{2})$.

4. Let $K = \mathbb{Q}(\sqrt[n]{a})$, where $a \in \mathbb{Q}$, $a > 0$ and suppose $[K : \mathbb{Q}] = n$ (i.e., $x^n - a$ is irreducible).
Let $E$ be any subfield of $K$ and let $[E : \mathbb{Q}] = d$. Prove that $E = \mathbb{Q}(\sqrt[d]{a})$. [Consider
$N_{K/E}(\sqrt[n]{a}) \in E$.]

5. Let $K$ be as in the previous exercise. Prove that if $n$ is odd then $K$ has no nontrivial
subfields which are Galois over $\mathbb{Q}$ and if $n$ is even then the only nontrivial subfield of $K$
which is Galois over $\mathbb{Q}$ is $\mathbb{Q}(\sqrt{a})$.

6. Let $L$ be the Galois closure of $K$ in the previous two exercises (i.e., the splitting field
of $x^n - a$). Prove that $[L : \mathbb{Q}] = n\varphi(n)$ or $\frac{1}{2}n\varphi(n)$. [Note that $\mathbb{Q}(\zeta_n) \cap K$ is a Galois
extension of $\mathbb{Q}$.]


7. (Kummer Generators for Cyclic Extensions) Let $F$ be a field of characteristic not dividing
$n$ containing the $n$th roots of unity and let $K$ be a cyclic extension of degree $d$ dividing $n$.
Then $K = F(\sqrt[n]{a})$ for some nonzero $a \in F$. Let $\sigma$ be a generator for the cyclic group
$\mathrm{Gal}(K/F)$.

(a) Show that $\sigma(\sqrt[n]{a}) = \zeta \sqrt[n]{a}$ for some primitive $d$th root of unity $\zeta$.

(b) Suppose $K = F(\sqrt[n]{a}) = F(\sqrt[n]{b})$. Use (a) to show that

$$\frac{\sigma(\sqrt[n]{a})}{\sqrt[n]{a}} = \left(\frac{\sigma(\sqrt[n]{b})}{\sqrt[n]{b}}\right)^{i}$$

for some integer $i$ relatively prime to $d$. Conclude that $\sigma$ fixes the element $\sqrt[n]{a} / (\sqrt[n]{b})^{i}$ so this
is an element of $F$.

(c) Prove that $K = F(\sqrt[n]{a}) = F(\sqrt[n]{b})$ if and only if $a = b^i c^n$ and $b = a^j d^n$ for some
$c, d \in F$, i.e., if and only if $a$ and $b$ generate the same subgroup of $F^\times$ modulo $n$th
powers.


8. Let $p$, $q$ and $r$ be primes in $\mathbb{Z}$ with $q \neq r$. Let $\sqrt[p]{q}$ denote any root of $x^p - q$ and let $\sqrt[p]{r}$
denote any root of $x^p - r$. Prove that $\mathbb{Q}(\sqrt[p]{q}) \neq \mathbb{Q}(\sqrt[p]{r})$.

9. (Artin-Schreier Extensions) Let $F$ be a field of characteristic $p$ and let $K$ be a cyclic
extension of $F$ of degree $p$. Prove that $K = F(\alpha)$ where $\alpha$ is a root of the polynomial
$x^p - x - a$ for some $a \in F$. [Note that $\mathrm{Tr}_{K/F}(-1) = 0$ since $F$ is of characteristic $p$ so
that $-1 = \alpha - \sigma\alpha$ for some $\alpha \in K$ where $\sigma$ is a generator of $\mathrm{Gal}(K/F)$ by Exercise 26
of Section 2. Show that $a = \alpha^p - \alpha$ is an element of $F$.] Note that since $F$ contains the
$p$th roots of unity (namely, 1), this completes the description of all cyclic extensions
of prime degree $p$ over fields containing the $p$th roots of unity in all characteristics.

10. Let $K = \mathbb{Q}(\zeta_p)$ be the cyclotomic field of $p$th roots of unity for the prime $p$ and let
$G = \mathrm{Gal}(K/\mathbb{Q})$. Let $\zeta$ denote any $p$th root of unity. Prove that $\sum_{\sigma \in G} \sigma(\zeta)$ (the trace
from $K$ to $\mathbb{Q}$ of $\zeta$) is $-1$ or $p - 1$ depending on whether $\zeta$ is or is not a primitive $p$th root
of unity.

11. (The Classical Gauss Sum) Let $K = \mathbb{Q}(\zeta_p)$ be the cyclotomic field of $p$th roots of unity
for the odd prime $p$, viewed as a subfield of $\mathbb{C}$, and let $G = \mathrm{Gal}(K/\mathbb{Q})$. Let $H$ denote the
subgroup of index 2 in the cyclic group $G$. Define $\eta_0 = \sum_{\tau \in H} \tau(\zeta_p)$, $\eta_1 = \sum_{\tau \in \sigma H} \tau(\zeta_p)$,
where $\sigma$ is a generator of $\mathrm{Gal}(K/\mathbb{Q})$ (the two periods of $\zeta_p$ with respect to $H$, i.e., the sum
of the conjugates of $\zeta_p$ with respect to the two cosets of $H$ in $G$, cf. Section 5).

(a) Prove that $\sigma(\eta_0) = \eta_1$, $\sigma(\eta_1) = \eta_0$ and that

$$\eta_0 = \sum_{a = \text{square}} \zeta_p^{a} \qquad \eta_1 = \sum_{b \neq \text{square}} \zeta_p^{b}$$

where the sums are over the squares and nonsquares (respectively) in $(\mathbb{Z}/p\mathbb{Z})^\times$. [Observe
that $H$ is the subgroup of squares in $(\mathbb{Z}/p\mathbb{Z})^\times$.]

(b) Prove that $\eta_0 + \eta_1 = (\zeta_p, 1) = -1$ and $\eta_0 - \eta_1 = (\zeta_p, -1)$ where $(\zeta_p, 1)$ and $(\zeta_p, -1)$
are two of the Lagrange resolvents of $\zeta_p$.

(c) Let $g = \sum_{i=0}^{p-1} \zeta_p^{i^2}$ (the classical Gauss sum). Prove that

$$g = (\zeta_p, -1) = \sum_{i=0}^{p-2} (-1)^i \sigma^i(\zeta_p).$$

(d) Prove that $\tau g = g$ if $\tau \in H$ and $\tau g = -g$ if $\tau \notin H$. Conclude in particular that
$[\mathbb{Q}(g) : \mathbb{Q}] = 2$. Recall that complex conjugation is the automorphism $\sigma_{-1}$ on $K$
(cf. Exercise 7 of Section 5). Conclude that $\bar{g} = g$ if $-1$ is a square mod $p$ (i.e., if
$p \equiv 1 \bmod 4$) and $\bar{g} = -g$ if $-1$ is not a square mod $p$ (i.e., if $p \equiv 3 \bmod 4$) where
$\bar{g}$ denotes the complex conjugate of $g$.

(e) Prove that $g\bar{g} = p$. [The complex conjugate of a root of unity is its reciprocal. Then
$\bar{g} = \sum_{j=0}^{p-2} (-1)^j (\sigma^j(\zeta_p))^{-1}$ gives

$$g\bar{g} = \sum_{i,j=0}^{p-2} (-1)^{i-j} \frac{\sigma^i(\zeta_p)}{\sigma^j(\zeta_p)}
= \sum_{i,j=0}^{p-2} (-1)^{i-j} \sigma^j\!\left(\frac{\sigma^{i-j}(\zeta_p)}{\zeta_p}\right)
= \sum_{k=0}^{p-2} (-1)^k \sum_{j=0}^{p-2} \sigma^j\!\left(\frac{\sigma^{k}(\zeta_p)}{\zeta_p}\right)$$

where $k = i - j$. If $k = 0$ the element $\sigma^k(\zeta_p)/\zeta_p$ is 1, and if $k \neq 0$ then this is a primitive
$p$th root of unity. Use the previous exercise to conclude that the inner sum is $p - 1$
when $k = 0$ and is $-1$ otherwise.]

(f) Conclude that $g^2 = (-1)^{(p-1)/2} p$ and that $\mathbb{Q}(\sqrt{(-1)^{(p-1)/2} p})$ is the unique quadratic
subfield of $\mathbb{Q}(\zeta_p)$. (Cf. also Exercise 33 of Section 6.)

12. Let $L$ be the Galois closure of the finite extension $\mathbb{Q}(\alpha)$ of $\mathbb{Q}$. For any prime $p$ dividing
the order of $\mathrm{Gal}(L/\mathbb{Q})$ prove there is a subfield $F$ of $L$ with $[L : F] = p$ and $L = F(\alpha)$.

13. Let $F$ be a subfield of the real numbers $\mathbb{R}$. Let $a$ be an element of $F$ and let $K = F(\sqrt[n]{a})$
where $\sqrt[n]{a}$ denotes a real $n$th root of $a$. Prove that if $L$ is any Galois extension of $F$
contained in $K$ then $[L : F] \leq 2$.


14. This exercise shows that in general it is necessary to use complex numbers when expressing
real roots in terms of radicals and generalizes the Casus irreducibilis of cubic equations.
Let $f(x) \in \mathbb{Q}[x]$ be an irreducible polynomial all of whose roots are real. Suppose further
that one of the roots, $\alpha$, of $f(x)$ can be expressed in terms of real radicals (i.e., there is a
root extension of real fields $\mathbb{Q} = K_0 \subset K_1 \subset \cdots \subset K_m \subset \mathbb{R}$ with $K_{i+1} = K_i(\sqrt[n_i]{a_i})$,
$i = 0, 1, \dots, m - 1$, for some integers $n_i$ and some $a_i \in K_i$ and $\alpha \in K_m$). Prove that
the Galois group of $f(x)$ is a 2-group. Conclude in particular that the degree of $f(x)$ is a
power of 2 and that the real roots of such a polynomial can be expressed entirely in terms
of real radicals if and only if these roots can be constructed by straightedge and compass.
[The argument is similar to the case of cubics. Let $L \subseteq \mathbb{R}$ be the Galois closure of $\mathbb{Q}(\alpha)$
and suppose the order of $\mathrm{Gal}(L/\mathbb{Q})$ is divisible by some odd prime $p$. Let $F$ be a subfield
of $L$ with $[L : F] = p$ and $L = F(\alpha)$ (by Exercise 12) and consider the composite fields
$K'_i = FK_i$, $i = 0, 1, \dots, m$. These are again real radical extensions and by the argument
in the text for the Casus irreducibilis, we may assume each $[K'_{i+1} : K'_i]$ is a prime. Since
$\alpha \notin F = FK_0$, there is an integer $s$ with $\alpha \notin K'_{s-1}$, $\alpha \in K'_s$. Since the extensions are of
prime degree, we have $K'_s = K'_{s-1}(\alpha)$. Since $L = F(\alpha)$ is Galois of degree $p$, $K'_s$ is a
Galois extension of $K'_{s-1}$ of degree $p$, contradicting the previous exercise.]

15. ('Cardano's Formulas' for a Cubic in Characteristic 2) Suppose $f(x) = x^3 + px + q$ is
an irreducible cubic over a field of characteristic 2. Let $\rho$ be a primitive 3rd root of unity
and let $\theta$, $\theta'$ be the roots of the quadratic $x^2 + qx + (p^3 + q^2)$ (cf. Exercise 50 of Section
6). Let $\theta_1$ and $\theta_2$ be cube roots of $\rho q + \theta$ and $\rho q + \theta'$, respectively, where the cube roots
are chosen so that $\theta_1\theta_2 = p$. Prove that the roots of $f(x)$ are given by $\alpha = \theta_1 + \theta_2$,
$\beta = \rho\alpha + \theta_1$, and $\gamma = \rho\alpha + \theta_2 = \alpha + \beta$.

16. Let $a$ be a nonzero rational number.

(a) Determine when the extension $\mathbb{Q}(\sqrt{ai})$ ($i^2 = -1$) is of degree 4 over $\mathbb{Q}$.

(b) When $K = \mathbb{Q}(\sqrt{ai})$ is of degree 4 over $\mathbb{Q}$ show that $K$ is Galois over $\mathbb{Q}$ with the
Klein 4-group as Galois group. In this case determine the quadratic extensions of $\mathbb{Q}$
contained in $K$.

17. Let $D \in \mathbb{Z}$ be a squarefree integer and let $a \in \mathbb{Q}$ be a nonzero rational number. Show that
$\mathbb{Q}(\sqrt{a\sqrt{D}})$ cannot be a cyclic extension of degree 4 over $\mathbb{Q}$.

18. Let $D \in \mathbb{Z}$ be a squarefree integer and let $a \in \mathbb{Q}$ be a nonzero rational number. Prove that
if $\mathbb{Q}(\sqrt{a\sqrt{D}})$ is Galois over $\mathbb{Q}$ then $D = -1$.

19. Let $D \in \mathbb{Z}$ be a squarefree integer and let $K = \mathbb{Q}(\sqrt{D})$.

(a) Prove that if $D = s^2 + t^2$ is the sum of two rational squares then there exists an
extension $L/\mathbb{Q}$ containing $K$ which is Galois over $\mathbb{Q}$ with a cyclic Galois group of
order 4. [Consider the extension $\mathbb{Q}(\sqrt{D + s\sqrt{D}})$.] (Note also that $D$ is the sum of
two rational squares if and only if $D$ is also the sum of two integer squares, so one
may assume $s$ and $t$ are integral without loss.)

(b) Prove conversely that if $K$ can be embedded in a cyclic extension $L$ of degree 4 as
in (a) then $D$ is the sum of two squares. [One approach: (i) observe first that $L$
is quadratic over $K$, so $L = K(\sqrt{a + b\sqrt{D}})$ for some $a, b \in \mathbb{Q}$, (ii) show that $L$
contains the quadratic subfield $\mathbb{Q}(\sqrt{a^2 - b^2 D})$, which must be $\mathbb{Q}(\sqrt{D})$ if $L/\mathbb{Q}$ is
cyclic, and use Exercise 7.]

(c) Conclude in particular that $\mathbb{Q}(\sqrt{3})$ is not a subfield of any cyclic extension of degree
4 over $\mathbb{Q}$. Similarly conclude that the fields $\mathbb{Q}(\sqrt{D})$ for squarefree integers $D < 0$
are never contained in cyclic extensions of degree 4 over $\mathbb{Q}$ (this gives an alternate
proof for Exercise 19, Section 6).

20. Let $p$ be a prime. Show that any solvable subgroup of $S_p$ of order divisible by $p$ is
contained in the normalizer of a Sylow $p$-subgroup of $S_p$ (a Frobenius group of order
$p(p - 1)$). Conclude that an irreducible polynomial $f(x) \in \mathbb{Q}[x]$ of degree $p$ is solvable
by radicals if and only if its Galois group is contained in the Frobenius group of order
$p(p - 1)$. [Let $G \leq S_p$ be a solvable subgroup of order divisible by $p$. Then $G$ contains
a $p$-cycle, hence is transitive on $\{1, 2, \dots, p\}$. Let $H < G$ be the stabilizer in $G$ of the
element 1, so $H$ has index $p$ in $G$. Show that $H$ contains no nontrivial normal subgroups
of $G$ (note that the conjugates of $H$ are the stabilizers of the other points). Let $G^{(n-1)}$ be
the last nontrivial subgroup in the derived series for $G$. Show that $H \cap G^{(n-1)} = 1$ and
conclude that $|G^{(n-1)}| = p$, so that the Sylow $p$-subgroup of $G$ (which is also a Sylow
$p$-subgroup in $S_p$) is normal in $G$.]

21. (Criterion for the Solvability of a Quintic) By the previous exercise, an irreducible
polynomial $f(x)$ in $\mathbb{Q}[x]$ of degree 5 can be solved by radicals if and only if its Galois group
(considered as a subgroup of $S_5$) is contained in the Frobenius group of order 20. It is
known that this is the case if and only if an associated polynomial $g(x)$ of degree 6 has a
rational root (cf. Dummit, Solving Solvable Quintics, Math. Comp., 57(1991), pp. 387-401).
If the quintic is in the general form (where a translation is performed so that the
coefficient of $x^4$ is zero)

$$f(x) = x^5 + px^3 + qx^2 + rx + s \qquad p, q, r, s \in \mathbb{Q}$$

then the associated polynomial of degree 6 is
$$\begin{aligned}
g(x) = x^6 &+ 8rx^5 + (2pq^2 - 6p^2r + 40r^2 - 50qs)\,x^4 \\
&+ (-2q^4 + 21pq^2r - 40p^2r^2 + 160r^3 - 15p^2qs - 400qrs + 125ps^2)\,x^3 \\
&+ (p^2q^4 - 8q^4r + 9p^4r^2 - 136p^2r^3 + 625q^2s^2 + 400r^4 - 6p^3q^2r \\
&\qquad + 76pq^2r^2 - 50pq^3s - 1400qr^2s + 500prs^2 + 90p^2qrs)\,x^2 \\
&+ (-108p^5s^2 + 32p^4r^3 - 256p^2r^4 - 3125s^4 + 512r^5 - 2pq^6 + 3q^4r^2 \\
&\qquad - 58q^5s + 2750q^2rs^2 - 31p^3q^3s - 500pr^2s^2 + 19p^2q^4r \\
&\qquad - 51p^3q^2r^2 + 76pq^2r^3 - 2400qr^3s - 325p^2q^2s^2 + 525p^3rs^2 \\
&\qquad + 625pqs^3 + 117p^4qrs + 105pq^3rs + 260p^2qr^2s)\,x \\
&+ (q^8 + 256r^6 + 17q^4r^3 - 27p^7s^2 - 4p^6r^3 + 48p^4r^4 - 192p^2r^5 \\
&\qquad + 3125p^2s^4 - 9375rs^4 - 1600qr^4s - 99p^5rs^2 - 125pq^4s^2 \\
&\qquad - 124q^5rs + 3250q^2r^2s^2 - 2000pr^3s^2 - 13pq^6r + p^5q^2r^2 \\
&\qquad + 65p^2q^4r^2 - 128p^3q^2r^3 - 16pq^2r^4 - 4p^5q^3s - 12p^2q^5s \\
&\qquad - 150p^4q^2s^2 + 1200p^3r^2s^2 + 18p^6qrs + 12p^3q^3rs + 196p^4qr^2s \\
&\qquad + 590pq^3r^2s - 160p^2qr^3s - 725p^2q^2rs^2 - 1250pqrs^3).
\end{aligned}$$

In the particular case where $f(x) = x^5 + Ax + B$ this polynomial is simply

$$g(x) = x^6 + 8Ax^5 + 40A^2x^4 + 160A^3x^3 + 400A^4x^2 + (512A^5 - 3125B^4)x - 9375AB^4 + 256A^6.$$

(a) Use this criterion to prove that the Galois group over $\mathbb{Q}$ of the polynomial $x^5 - 5x + 12$
is the dihedral group of order 10. [Show the associated sixth degree polynomial is

$$x^6 - 40x^5 + 1000x^4 - 20000x^3 + 250000x^2 - 66400000x + 976000000$$

and has $x = 40$ as a rational root. Cf. also Exercise 35 in Section 6.]

(b) Use this criterion to prove that $x^5 - x - 1$ is not solvable by radicals.


14.8 COMPUTATION OF GALOIS GROUPS OVER Q


In the determination of the Galois groups of polynomials of degrees $\leq 4$ in Section 6 and
in the determination of the Galois group of the polynomial $x^5 - 6x + 3$ in the previous
section we observed that it was possible to obtain useful information regarding the
Galois group from the cycle types of the automorphisms as elements in $S_n$. This is very
useful in computing Galois groups of polynomials over $\mathbb{Q}$ and we now briefly describe
the theoretical justification.

Let $f(x)$ be a polynomial with rational coefficients. In determining the Galois
group of $f(x)$ we may assume that $f(x)$ is separable and has integer coefficients. Then
the discriminant $D$ of $f(x)$ is an integer and is nonzero.

For any prime $p$, consider the reduction $\bar{f}(x) \in \mathbb{F}_p[x]$ of $f(x)$ modulo $p$. If $p$
divides $D$ then the reduced polynomial $\bar{f}(x)$ has discriminant $\bar{D} = 0$ in $\mathbb{F}_p$, so is not
separable.

If $p$ does not divide $D$, then $\bar{f}(x)$ is a separable polynomial over $\mathbb{F}_p$ and we can
factor $\bar{f}(x)$ into distinct irreducibles

$$\bar{f}(x) = f_1(x) f_2(x) \cdots f_k(x) \quad \text{in } \mathbb{F}_p[x].$$

Let $n_i$ be the degree of $f_i(x)$, $i = 1, 2, \dots, k$.

The importance of this reduction is provided by the following theorem from algebraic
number theory which is an elementary consequence of the study of the arithmetic
in finite extensions of $\mathbb{Q}$ (and which we take for granted).

Theorem. For any prime $p$ not dividing the discriminant $D$ of $f(x) \in \mathbb{Z}[x]$, the Galois
group over $\mathbb{F}_p$ of the reduction $\bar{f}(x) = f(x) \pmod{p}$ is permutation group isomorphic
to a subgroup of the Galois group over $\mathbb{Q}$ of $f(x)$.

The meaning of the statement "permutation group isomorphic" in the theorem is
that not only is the Galois group of the reduction $\bar{f}(x)$ mod $p$ of $f(x)$ isomorphic to
a subgroup of the Galois group of $f(x)$ but that there is an ordering of the roots of
$\bar{f}(x)$ and of $f(x)$ (depending on $p$) so that under this isomorphism the action of the
corresponding automorphisms as permutations of these roots is the same. In particular
there are automorphisms in the Galois group of $f(x)$ with the same cycle types as the
automorphisms of $\bar{f}(x)$.

The Galois group of $\bar{f}(x)$ is a cyclic group since every finite extension of $\mathbb{F}_p$ is
a cyclic extension. Let $\sigma$ be a generator for this Galois group over $\mathbb{F}_p$ (for example,
the Frobenius automorphism). The roots of $f_1(x)$ are permuted amongst themselves
by the Galois group, and given any two of these roots there is a Galois automorphism
taking the first root to the second (recall that the group is said to be transitive on the
roots when this is the case). Similarly, the Galois group permutes the roots of each of
the factors $f_i(x)$, $i = 1, 2, \dots, k$ transitively. Since these factors are relatively prime
we also see that no root of one factor is mapped to a root of any other factor by any
element of the Galois group.

View $\sigma$ as an element in $S_n$ by labelling the $n$ roots of $\bar{f}(x)$ and consider the cycle
decomposition of $\sigma$, which is a product of $k$ distinct permutations since $\sigma$ permutes
the roots of each of the factors $f_i(x)$ amongst themselves. By the observations we just
made, the action of $\sigma$ on the roots of $f_1(x)$ must be a cycle of length $n_1$, since otherwise
the powers of $\sigma$ could not be transitive on the roots of $f_1(x)$. Similarly the action of $\sigma$
on the roots of $f_i(x)$ gives a cycle of length $n_i$, $i = 1, 2, \dots, k$.

We see that the automorphism $\sigma$ generating the Galois group of $\bar{f}(x)$ has cycle
decomposition $(n_1, n_2, \dots, n_k)$ where $n_1, n_2, \dots, n_k$ are the degrees of the irreducible
factors of $f(x)$ reduced modulo $p$, which gives us the following result.

Corollary 41. For any prime $p$ not dividing the discriminant of $f(x) \in \mathbb{Z}[x]$, the Galois
group of $f(x)$ over $\mathbb{Q}$ contains an element with cycle decomposition $(n_1, n_2, \dots, n_k)$
where $n_1, n_2, \dots, n_k$ are the degrees of the irreducible factors of $f(x)$ reduced modulo
$p$.

Example 

Consider the polynomial $x^5 - x - 1$. The discriminant of this polynomial is $2869 = 19 \cdot 151$
so we reduce at primes $\neq 19, 151$. Reducing mod 2 the polynomial $x^5 - x - 1$ factors
as $(x^2 + x + 1)(x^3 + x^2 + 1) \pmod 2$ so the Galois group has a (2,3)-cycle. Cubing this
element we see the Galois group contains a transposition.

Reducing mod 3 the polynomial is irreducible, as follows: $x^5 - x - 1$ has no roots
mod 3 so if it were reducible mod 3 then it would have an irreducible quadratic factor,
hence would have a factor in common with $x^9 - x$ (which is the product of all irreducible
polynomials of degrees 1 and 2 over $\mathbb{F}_3$), hence a factor in common with either $x^4 - 1$ or
$x^4 + 1$, hence a factor in common with either $x^5 - x$ or $x^5 + x$, hence a factor in common
with either $-1$ or $2x + 1$, which it obviously does not. This shows both that $x^5 - x - 1$ is
irreducible in $\mathbb{Z}[x]$ and that there is a 5-cycle in its Galois group.

Since $S_5$ is generated by any 5-cycle and any transposition, it follows that the Galois
group of $x^5 - x - 1$ is $S_5$ (so in particular this polynomial cannot be solved by radicals,
cf. Exercise 21 of Section 7).
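
The factorization degrees used in this example can be checked by machine. The following is a minimal sketch, not the algorithm of the exercises cited later in the section: it uses the standard fact that $x^{p^d} - x$ is the product of all monic irreducibles over $\mathbb{F}_p$ of degree dividing $d$, and it assumes the reduction mod $p$ is squarefree (i.e., $p$ does not divide the discriminant). All function names are hypothetical helpers; polynomials are coefficient lists with the constant term first.

```python
def trim(a):
    """Drop trailing zero coefficients in place and return the list."""
    while a and a[-1] == 0:
        a.pop()
    return a

def poly_rem(a, m, p):
    """Remainder of a modulo m over F_p (m nonzero, reduced mod p)."""
    a = trim([c % p for c in a])
    inv = pow(m[-1], -1, p)              # inverse of the leading coefficient of m
    while len(a) >= len(m):
        coef, shift = a[-1] * inv % p, len(a) - len(m)
        for i, c in enumerate(m):
            a[shift + i] = (a[shift + i] - coef * c) % p
        trim(a)
    return a

def poly_mulmod(a, b, m, p):
    """Product a*b reduced modulo m over F_p."""
    if not a or not b:
        return []
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] = (out[i + j] + x * y) % p
    return poly_rem(out, m, p)

def poly_gcd(a, b, p):
    """A greatest common divisor over F_p (not normalized to be monic)."""
    a, b = trim([c % p for c in a]), trim([c % p for c in b])
    while b:
        a, b = b, poly_rem(a, b, p)
    return a

def poly_quo(a, b, p):
    """Quotient a / b over F_p, assuming b divides a exactly."""
    a = [c % p for c in a]
    inv = pow(b[-1], -1, p)
    quo = [0] * (len(a) - len(b) + 1)
    for k in range(len(quo) - 1, -1, -1):
        quo[k] = a[k + len(b) - 1] * inv % p
        for i, c in enumerate(b):
            a[k + i] = (a[k + i] - quo[k] * c) % p
    return quo

def factor_degrees(f, p):
    """Degrees of the irreducible factors of f mod p (f mod p assumed squarefree)."""
    f = trim([c % p for c in f])
    x, h, d, degrees = [0, 1], [0, 1], 0, []
    while len(f) - 1 >= 2 * (d + 1):
        d += 1
        base, e, power = h, p, [1]       # raise h to the p-th power modulo f
        while e:
            if e & 1:
                power = poly_mulmod(power, base, f, p)
            base = poly_mulmod(base, base, f, p)
            e >>= 1
        h = power                        # h = x^(p^d) mod f
        diff = trim([((h[i] if i < len(h) else 0) - (x[i] if i < len(x) else 0)) % p
                     for i in range(max(len(h), len(x)))])
        g = poly_gcd(f, diff, p)         # product of the factors of degree exactly d
        count = (len(g) - 1) // d
        if count:
            degrees += [d] * count
            f = poly_quo(f, g, p)
            h = poly_rem(h, f, p)
    if len(f) > 1:
        degrees.append(len(f) - 1)       # whatever is left is irreducible
    return sorted(degrees)

quintic = [-1, -1, 0, 0, 0, 1]           # x^5 - x - 1
print(factor_degrees(quintic, 2), factor_degrees(quintic, 3))
```

By Corollary 41 each returned list is the cycle type of some element of the Galois group, so the two calls reproduce the (2,3)-cycle and the 5-cycle found above.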

The arguments in the example above indicate how to construct polynomials with
$S_n$ as Galois group. We use the fact that a transitive subgroup of $S_n$ containing a
transposition and an $(n-1)$-cycle is $S_n$. Let $f_1$ be an irreducible polynomial of degree
$n$ over $\mathbb{F}_2$. Let $f_2 \in \mathbb{F}_3[x]$ be the product of an irreducible polynomial of degree 2
with irreducible polynomials of odd degree (for example, an irreducible polynomial of
degree $n - 3$ and $x$ if $n$ is even and an irreducible polynomial of degree $n - 2$ if $n$ is
odd). Let $f_3 \in \mathbb{F}_5[x]$ be the product of $x$ with an irreducible polynomial of degree
$n - 1$. Finally, let $f(x) \in \mathbb{Z}[x]$ be any polynomial with

$$f(x) \equiv f_1(x) \pmod 2$$

$$f(x) \equiv f_2(x) \pmod 3$$

$$f(x) \equiv f_3(x) \pmod 5.$$

The reduction of $f(x)$ mod 2 shows that $f(x)$ is irreducible in $\mathbb{Z}[x]$, hence the Galois
group is transitive on the $n$ roots of $f(x)$. Raising the element given by the factorization
of $f(x)$ mod 3 to a suitable odd power shows the Galois group contains a transposition.
The factorization mod 5 shows the Galois group contains an $(n-1)$-cycle, hence the
Galois group is $S_n$.

Proposition 42. For each $n \in \mathbb{Z}^+$ there exist infinitely many polynomials $f(x) \in \mathbb{Z}[x]$
with $S_n$ as Galois group over $\mathbb{Q}$.
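
The construction can be carried out coefficient by coefficient with the Chinese Remainder Theorem. The sketch below does this for $n = 4$; the three polynomials $f_1$, $f_2$, $f_3$ are choices made here for illustration (they are not taken from the text), and the helper names `crt_coefficient` and `lift` are hypothetical.

```python
def crt_coefficient(residues, moduli):
    """Smallest nonnegative integer with the given residues (moduli pairwise coprime)."""
    modulus = 1
    for m in moduli:
        modulus *= m
    for c in range(modulus):
        if all(c % m == r % m for r, m in zip(residues, moduli)):
            return c
    raise ValueError("no solution; the moduli are probably not coprime")

def lift(polys, moduli):
    """Combine equal-length coefficient lists (constant term first) coefficientwise."""
    return [crt_coefficient(column, moduli) for column in zip(*polys)]

# Degree n = 4 (n even), coefficients listed from the constant term upward.
f1 = [1, 1, 0, 0, 1]   # x^4 + x + 1, irreducible over F_2
f2 = [0, 1, 1, 1, 1]   # x (x + 1)(x^2 + 1) over F_3: a quadratic times odd-degree factors
f3 = [0, 1, 1, 0, 1]   # x (x^3 + x + 1) over F_5: x times an irreducible cubic
f = lift([f1, f2, f3], [2, 3, 5])
print(f)               # coefficients of a monic quartic with the three prescribed reductions
```

Under these assumptions the result is $x^4 + 10x^3 + 16x^2 + x + 15$, and adding any multiple of 30 to a non-leading coefficient preserves all three reductions, which is one way to see why there are infinitely many such polynomials.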

There are extremely efficient algorithms for factoring polynomials $f(x) \in \mathbb{Z}[x]$
modulo $p$ (cf. Exercises 12 to 17 of Section 3), so the corollary above is an effective
procedure for determining some of the cycle types of the elements of the Galois group.
In using Corollary 41 some care should be taken not to assume that a particular cycle
is an element of the Galois group. For example, one factorization might imply the
existence of a (2,2) cycle, say (12)(34) and another factorization imply the existence of
a transposition. One cannot conclude that the transposition is necessarily (12), however
(nor (34), nor (13), etc.). The choice of (12)(34) to represent the first cycle fixes a
particular ordering on the roots and this may not be the ordering with respect to which
the transposition appears as (12).

Corollary 41 is particularly efficient in determining when the Galois group is large
(e.g., $S_n$), since a transitive group containing sufficiently many cycle types must be
$S_n$ (for example, a transitive subgroup of $S_n$ containing a transposition and an
$(n-1)$-cycle is $S_n$, as used above). The most difficult Galois groups to determine in this
way are the small Galois groups (e.g., a cyclic group of order $n$), since factorization
after factorization will produce only elements of orders dividing $n$ and one is not sure
whether there will be some $p$ yet to come producing a cycle type inconsistent with the
assumption of a cyclic Galois group. If one could "compute forever" one could at least
be sure of the precise distribution of cycle types among the elements of the Galois group
in the following sense: suppose the Galois group $G \subseteq S_n$ has order $N$ and that there
are $n_T$ elements of $G$ with cycle type $T$ (e.g., (2,2)-cycles, transpositions, etc.) so that
the "density" of cycle type $T$ in $G$ is $d_T = n_T/N$. Then it is possible to define a density
on the set of prime numbers (so that it makes sense to speak of "1/2" the primes, etc.)
and we have the following result (which relies on the Tchebotarov Density Theorem in
algebraic number theory).

Theorem. The density of primes $p$ for which $f(x)$ splits into type $T$ modulo $p$ is
precisely $d_T$.

This says that if we knew the factorization of $f(x)$ modulo every prime we could
at least determine the number of elements of $G$ with a given cycle type. Unfortunately,
even this would not be sufficient to determine $G$ (up to isomorphism): it is known
that there are nonisomorphic groups containing the same number of elements of all
cycle types (there are two nonisomorphic groups of order 96 in $S_8$ both having cycle
type distributions: 1 1-cycle, 6 (2,2)-cycles, 13 (2,2,2,2)-cycles, 32 (3,3)-cycles, 12
(4,4)-cycles, 32 (2,6)-cycles). There are infinitely many such examples (the regular
representations of the elementary abelian group of order $p^3$ and of the nonabelian group
of order $p^3$ of exponent $p$ give two nonisomorphic groups in $S_{p^3}$ whose nonidentity
elements are all the product of $p^2$ $p$-cycles for any odd prime $p$).
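
To make the density statement concrete, here is a small worked instance. It is a sketch under the assumption that $f(x)$ is an irreducible cubic whose Galois group is $G = S_3$, so $N = 6$; the counts $n_T$ are those of the identity, the three transpositions, and the two 3-cycles.

```latex
\[
\begin{array}{lccc}
\text{cycle type } T & 1 & 2 & 3 \\
n_T                  & 1 & 3 & 2 \\
d_T = n_T / N        & 1/6 & 1/2 & 1/3
\end{array}
\]
```

In this situation the theorem says that such a cubic splits into three linear factors modulo $p$ for a set of primes of density $1/6$, factors as a linear times an irreducible quadratic for density $1/2$, and remains irreducible for density $1/3$.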

In practice one uses the factorizations of $f(x)$ modulo small primes to get an idea of
the probable Galois group (based on the previous result). One then tries to prove this is
indeed the Galois group, often a difficult problem. For polynomials of small degree,
definitive algorithms exist, based in part on the computation of resolvent polynomials.
These are analogues of the cubic resolvent used in the previous sections to determine
the Galois group of quartic polynomials. These resolvent polynomials have rational
coefficients and have as roots certain combinations of the roots of $f(x)$ (similar to the
combinations $(\alpha_1 + \alpha_2)(\alpha_3 + \alpha_4)$ for the cubic resolvent). One then determines the
factorization of these resolvent polynomials to obtain information on the Galois group
of $f(x)$; for example, the existence of a linear factor implies the Galois group lies
in the stabilizer in $S_n$ of the combination of the roots of $f(x)$ chosen (for example,
the dihedral group of order 8 for our resolvent cubic). It should be observed, however,
that the degrees of the resolvent polynomials constructed, unlike the situation of the
resolvent cubic for quartic polynomials, are in general much larger than the degree of
$f(x)$. The effectiveness of this computational technique also depends heavily on the
explicit knowledge of the possible transitive subgroups of $S_n$. For $n = 2, 3, \dots, 8$
the number of isomorphism classes of transitive subgroups of $S_n$ is 1, 2, 5, 5, 16, 7,
50, respectively. There is a great deal of interest in the computation of Galois groups,
motivated in part by the problem of determining which groups occur as Galois groups
over $\mathbb{Q}$.

We illustrate these techniques with some easier examples (from The Computation 
of Galois Groups , L. Soicher, Master’s Thesis, Concordia University, Montreal, 1981). 

Examples 

(1) There are 5 isomorphism classes of transitive subgroups of $S_5$ given by the groups $Z_5$,
$D_{10}$, $F_{20}$, the so-called Frobenius group of order 20 (the Galois group of $x^5 - 2$ with
generators (1 2 3 4 5) and (2 3 5 4) in $S_5$), $A_5$ and $S_5$. The cycle type distributions for
these groups are as follows:

cycle type:    1     2    (2,2)    3    (2,3)    4     5
Z_5            1     -      -      -      -      -     4
D_10           1     -      5      -      -      -     4
F_20           1     -      5      -      -     10     4
A_5            1     -     15     20      -      -    24
S_5            1    10     15     20     20     30    24

Given this information, the irreducibility of $x^5 - x - 1$ (giving the transitivity on the 5
roots) and the cycle type (2,3) immediately shows that the Galois group of $x^5 - x - 1$
is $S_5$.

Consider now the polynomial $x^5 + 15x + 12$. The discriminant is $2^{10} 3^4 5^5$ so the
Galois group is not contained in $A_5$. There are two possibilities: $S_5$ or $F_{20}$. One can
easily determine which is more likely by factoring the polynomial modulo a number
of small primes and comparing the distribution of cycle types with those in the table
above. This does not prove the probable Galois group is actually correct. To decide
which of $S_5$ and $F_{20}$ is correct one can compute the resolvent polynomial $R(x)$ of
degree 15 whose roots are the distinct permutations under $S_5$ of $(\alpha_1 + \alpha_2 - \alpha_3 - \alpha_4)^2$
for 4 of the roots $\alpha_1, \alpha_2, \alpha_3, \alpha_4$ of $f(x)$. By definition, $S_5$ is transitive on the roots of
$R(x)$ and it is not difficult to check using the explicit generators for $F_{20}$ given above
that $F_{20}$ is not transitive on these 15 values. It follows that $R(x)$ will be a reducible
polynomial over $\mathbb{Q}$ if and only if the Galois group of the quintic is $F_{20}$. One finds that
for $x^5 + 15x + 12$ the resolvent polynomial $R(x)$ factors into a polynomial of degree
5 and a polynomial of degree 10, hence the Galois group for this quintic is $F_{20}$. One
can also use Exercise 21 of the previous section (cf. Exercise 6), which is also based
on the computation of a related resolvent polynomial.

(2) Consider the polynomial $x^7 - 14x^5 + 56x^3 - 56x + 22$. The discriminant is computed
to be $2^6 7^{10}$ so the Galois group is contained in $A_7$.

Factoring the polynomial for the 42 primes not equal to 7 between 3 and 193
gives a cycle type distribution of 1 1-cycle (2.38 %), 30 (3,3)-cycles (71.43 %), 11
7-cycles (26.19 %). There are 7 isomorphism classes of transitive subgroups of $S_7$,
4 of them contained in $A_7$. Of these, one contains no (3,3)-cycles, which leaves the
three possibilities $A_7$, $GL_3(\mathbb{F}_2)$, or $F_{21}$, the Frobenius group of order 21 (which has
generators (1 2 3 4 5 6 7) and (2 3 5)(4 7 6) in $S_7$). The cycle type distributions for these
three are as follows:

cycle type:    1    2   (2,2)    3   (2,2,3)   (3,3)   (2,4)    5     7
F_21           1    -     -      -      -       14       -      -     6
GL_3(F_2)      1    -    21      -      -       56      42      -    48
A_7            1    -   105     70    210      280     630    504   720

It follows that there is a strong probability that the Galois group of this polynomial
is the Frobenius group of order 21. This is actually the case (the verification requires
computation of a resolvent of degree 35 and factoring it over $\mathbb{Z}$: there are three
factors, of degrees 7, 7, and 21).
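
The rows for $F_{20}$ and $F_{21}$ in the two tables can be recomputed directly from the generators quoted in the examples. The following is a minimal sketch; the helper names are hypothetical, and the group is obtained by closing the identity under multiplication by the generators, which suffices for a finite group.

```python
from collections import Counter

def from_cycles(n, cycles):
    """Permutation of {0,...,n-1} as a tuple, from cycles written on {1,...,n}."""
    perm = list(range(n))
    for cycle in cycles:
        for a, b in zip(cycle, cycle[1:] + cycle[:1]):
            perm[a - 1] = b - 1
    return tuple(perm)

def cycle_type(perm):
    """Sorted lengths of the nontrivial cycles of perm (empty tuple for the identity)."""
    seen, lengths = set(), []
    for start in range(len(perm)):
        length, j = 0, start
        while j not in seen:
            seen.add(j)
            j = perm[j]
            length += 1
        if length > 1:
            lengths.append(length)
    return tuple(sorted(lengths))

def generated_group(generators):
    """All elements of the finite group generated by the given permutations."""
    identity = tuple(range(len(generators[0])))
    group, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in generators:
            h = tuple(s[g[i]] for i in range(len(g)))   # apply g, then s
            if h not in group:
                group.add(h)
                frontier.append(h)
    return group

def cycle_type_distribution(generators):
    """Number of group elements of each cycle type."""
    return Counter(cycle_type(g) for g in generated_group(generators))

f20 = cycle_type_distribution([from_cycles(5, [[1, 2, 3, 4, 5]]),
                               from_cycles(5, [[2, 3, 5, 4]])])
f21 = cycle_type_distribution([from_cycles(7, [[1, 2, 3, 4, 5, 6, 7]]),
                               from_cycles(7, [[2, 3, 5], [4, 7, 6]])])
print(dict(f20))
print(dict(f21))
```

Comparing such exact distributions with the observed frequencies of factorization types modulo small primes is the heuristic step described in the examples; it suggests a probable Galois group but does not prove it.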


EXERCISES 

1. Let p be a prime. Prove that the polynomial * 4 + 1 splits mod p either into two irreducible 
quadratics or into 4 linear factors using Corollary 41 together with the knowledge that the 
Galois group of this polynomial is the Klein 4-group. 

2. (Cf. Exercise 48 of Section 6). 

(a) Let K be the splitting field of $x^6 - 2x^3 - 2$. Prove that if [K : Q] = 12 then
$K = \mathbb{Q}(\sqrt[3]{2}, i, \sqrt{3})$ and K is generated over the biquadratic field $F = \mathbb{Q}(i, \sqrt{3})$ by
$\alpha = \sqrt[3]{1 + \sqrt{3}}$ and by $\beta = \sqrt[3]{1 - \sqrt{3}}$. Show that if this is the case then the elements
of order 3 in Gal(K/Q) lie in Gal(K/F). Conclude that any element of Gal(K/Q)
of order 3 maps $\alpha$ to another cube root of $1 + \sqrt{3}$ and maps $\beta$ to another cube root of
$1 - \sqrt{3}$ and if it is the identity on $\alpha$ or $\beta$ then it is the identity on all of K.

(b) Show that the factorization of f(x) into irreducibles over $\mathbb{F}_{13}$ is the polynomial
$(x - 7)(x - 8)(x - 11)(x^3 + 3)$ and use Corollary 41 to show that [K : Q] = 36.

(c) Knowing that G = Gal(K/Q) is of order 36 determine all the elements of G explicitly
and in particular show that G is isomorphic to $S_3 \times S_3$.

3. Prove that the Galois group of $x^5 + 20x + 16$ is $A_5$.

4. Prove that the Galois group of $x^5 + x^4 - 4x^3 - 3x^2 + 3x + 1$ is cyclic of order 5. [Show
this is the minimal polynomial of $\zeta_{11} + \zeta_{11}^{-1}$.]

5. Prove that the Galois group of $x^5 + 11x + 44$ is the dihedral group $D_{10}$ (cf. Exercise 21
of Section 7).

6. Prove that the Galois group of $x^5 + 15x + 12$ is $F_{20}$, the Frobenius group of order 20 (cf.
Exercise 21 of Section 7).

7. Prove that the Galois group of $x^6 + 24x - 20$ is $A_6$.

8. Prove that the Galois group of $x^7 + 7x^4 + 14x + 3$ is $A_7$.

9. Determine a polynomial of degree 7 whose Galois group is cyclic of order 7.

10. Determine the probable Galois group of $x^7 - 7x + 3$.

14.9 TRANSCENDENTAL EXTENSIONS, INSEPARABLE EXTENSIONS,
INFINITE GALOIS GROUPS

This section collects some results on arbitrary extensions E/F . These results supple- 
ment those of the preceding sections and complete the basic picture of how an arbitrary 
(possibly infinite) extension decomposes. Since this section is primarily intended as a 
survey, none of the proofs are included; whenever these proofs can be easily supplied 
by the reader we indicate this either in the text or (with hints) in the exercises. 

Throughout this section E/F is an extension of fields. Recall that an element of E 
which is not algebraic over F is called transcendental over F. Keep in mind that
extensions involving transcendentals are always of infinite degree. We generally reserve the
expression “t is an indeterminate over F” for when we are thinking of evaluating t. Field
theoretically, however, the terms transcendental and indeterminate are synonymous (so
that the subfield $\mathbb{Q}(\pi)$ of $\mathbb{R}$ and the field $\mathbb{Q}(t)$ are isomorphic).

Definition. 

(1) A subset $\{a_1, a_2, \dots, a_n\}$ of E is called algebraically independent over F if
there is no nonzero polynomial $f(x_1, x_2, \dots, x_n) \in F[x_1, x_2, \dots, x_n]$ such
that $f(a_1, a_2, \dots, a_n) = 0$. An arbitrary subset S of E is called algebraically
independent over F if every finite subset of S is algebraically independent. The
elements of S are called independent transcendentals over F.

(2) A transcendence base for E/F is a maximal subset (with respect to inclusion) 
of E which is algebraically independent over F. 

Note that if E/F is algebraic, the empty set is the only algebraically independent 
subset of E. In particular, elements of an algebraically independent set are necessarily 
transcendental. Moreover, one easily checks that $S \subseteq E$ is an algebraically independent
set over F if and only if each $s \in S$ is transcendental over $F(S - \{s\})$. It is also an
easy exercise to see that S is a transcendence base for E/F if and only if S is a set of
algebraically independent transcendentals over F and E is algebraic over F(S).

Theorem. The extension E/F has a transcendence base and any two transcendence 
bases of E/F have the same cardinality. 

Proof: The first statement is a standard Zorn’s Lemma argument. The proof of the
second uses the same “Replacement Lemma” idea as was used to prove that any two 
bases of a vector space have the same cardinality. 

Definition. The cardinality of a transcendence base for E/F is called the transcen- 
dence degree of E/F. 

Algebraic extensions are precisely the extensions of transcendence degree 0. 

One special case of this theorem is when E is finitely generated over F, that is,
$E = F(\alpha_1, \alpha_2, \dots, \alpha_n)$ for some (not necessarily algebraically independent) elements
$\alpha_1, \dots, \alpha_n$ of E. It is clear that we may renumber so that $\alpha_1, \dots, \alpha_m$ are
independent transcendentals and $\alpha_{m+1}, \dots, \alpha_n$ are algebraic over $F(\alpha_1, \dots, \alpha_m)$ (so
E is a finite extension of the latter field). In this case E is called a function field in
m variables over F. Such fields play a fundamental role in algebraic geometry as
fields of functions on m-dimensional surfaces. For instance, when $F = \mathbb{C}$ and m = 1,
these fields arise in analysis as fields of meromorphic functions on compact Riemann
surfaces.

Note that if $S_1$ and $S_2$ are transcendence bases for E/F it is not necessarily the case
that $F(S_1) = F(S_2)$. For example, if t is transcendental over $\mathbb{Q}$, $\{t\}$ and $\{t^2\}$ are both
transcendence bases for $\mathbb{Q}(t)/\mathbb{Q}$ but (as we shall see shortly) $\mathbb{Q}(t^2)$ is a proper subfield
of $\mathbb{Q}(t)$.

We now see that if $x_1, x_2, \dots, x_n$ are indeterminates over F and

$$f(x) = (x - x_1)(x - x_2) \cdots (x - x_n) \qquad (14.28)$$

is the general polynomial of degree n, then the n elementary symmetric functions
$s_1, s_2, \dots, s_n$ in the $x_i$'s are also independent transcendentals over F. This is because
$x_1, \dots, x_n$ is a transcendence base for $E = F(x_1, \dots, x_n)$ over F (so the transcendence
degree is n) and E is algebraic over $F(s_1, \dots, s_n)$ (of degree n!). The theorem forces
$s_1, \dots, s_n$ to be a transcendence base for this extension as well (in particular, they are
independent transcendentals). The general polynomial of degree n over F may therefore
equivalently be defined by taking $a_1, \dots, a_n$ to be any independent transcendentals (or
indeterminates) and letting

$$f(x) = x^n + a_1 x^{n-1} + \cdots + a_n \qquad (14.29)$$

where the roots of f are denoted by $x_1, \dots, x_n$ (and $s_i = (-1)^i a_i$).
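
The sign convention relating the coefficients of the general polynomial to the elementary symmetric functions of its roots is easy to get wrong. The following small Python sketch, with the roots specialized to the integers 1, 2, 3 purely for illustration, expands the product of the factors (x - r) and compares the coefficients with the elementary symmetric functions. The function names are our own.

```python
from itertools import combinations
from math import prod

def elementary_symmetric(roots):
    """Return [s_1, ..., s_n], where s_i is the sum of all products of i distinct roots."""
    n = len(roots)
    return [sum(prod(c) for c in combinations(roots, i)) for i in range(1, n + 1)]

def expand_monic(roots):
    """Coefficients [1, a_1, ..., a_n] of prod (x - r), highest degree first."""
    coeffs = [1]
    for r in roots:
        # multiply the current polynomial by (x - r)
        coeffs = [c - r * d for c, d in zip(coeffs + [0], [0] + coeffs)]
    return coeffs

roots = [1, 2, 3]
a = expand_monic(roots)            # [1, a_1, a_2, a_3]
s = elementary_symmetric(roots)    # [s_1, s_2, s_3]
print(a, s)
```

For these roots the product is $x^3 - 6x^2 + 11x - 6$, so $a_1 = -6$, $a_2 = 11$, $a_3 = -6$ while $s_1 = 6$, $s_2 = 11$, $s_3 = 6$, in agreement with $s_i = (-1)^i a_i$.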

Definition. An extension E/F is called purely transcendental if it has a transcendence
base S such that E = F(S).


In the preceding discussion, both $F(x_1, \dots, x_n)$ and $F(s_1, \dots, s_n)$ are purely
transcendental over F. As an exercise (following) one can show that $\mathbb{Q}(t, \sqrt{t^3 - t})$ is not
a purely transcendental extension of $\mathbb{Q}$ even though it contains no elements that are
algebraic over $\mathbb{Q}$ other than those in $\mathbb{Q}$ itself (i.e., the process of decomposing a general
extension into a purely transcendental extension followed by an algebraic extension
cannot generally be reversed so that the algebraic piece occurs first).

If E is a purely transcendental extension of F of transcendence degree n = 1 or 2
and L is an intermediate field, $F \subseteq L \subseteq E$ with the same transcendence degree, then L
is again a purely transcendental extension of F (Lüroth (n = 1), Castelnuovo (n = 2)).
This result is not true if the transcendence degree is $\geq 3$, however, although examples
where L fails to be purely transcendental are difficult to construct. For extensions of
transcendence degree 1 the intermediate fields are described by the following theorem.

Theorem. Let t be transcendental over F.

(1) (Lüroth) If $F \subseteq K \subseteq F(t)$, then K = F(r) for some $r \in F(t)$. In particular,
every nontrivial extension of F contained in F(t) is purely transcendental
over F.

(2) If P = P(t), Q = Q(t) are nonzero relatively prime polynomials in F[t] which
are not both constant, then

$$[F(t) : F(P/Q)] = \max(\deg P, \deg Q).$$

Proof: The proof of (2) is outlined in Exercise 18 of Section 13.2.

By part (2) of this theorem we see that F(P/Q) = F(t) if and only if P, Q
are nonzero relatively prime polynomials of degree $\leq 1$ (not both constant). Thus
F(r) = F(t) if and only if

$$r = \frac{at + b}{ct + d}, \qquad a, b, c, d \in F, \quad ad - bc \neq 0$$

(called a fractional linear transformation of t). For any $r \in F(t) - F$ the map $t \mapsto r$ extends to
an embedding of F(t) into itself which is the identity on F. This embedding is surjective
(i.e., is an automorphism of F(t)) precisely for the fractional linear transformations.
Furthermore, the map

$$GL_2(F) \to \mathrm{Aut}(F(t)/F) \quad \text{defined by} \quad A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \mapsto \sigma_A,$$

where $\sigma_A$ denotes the automorphism of F(t) defined by mapping t to (at + b)/(ct + d),
is a surjective homomorphism with kernel consisting of the scalar matrices. Thus

$$\mathrm{Aut}(F(t)/F) \cong PGL_2(F)$$

where $PGL_2(F) = GL_2(F)/\{\lambda I \mid \lambda \in F^\times\}$ gives the group of automorphisms of this
transcendental extension (cf. Exercise 8 of Section 1).

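When $\mathbb{F}$ is a finite field of order q, $\mathrm{Aut}(\mathbb{F}(t)/\mathbb{F}) \cong PGL_2(\mathbb{F})$ is a finite group of
order q(q - 1)(q + 1). By Corollary 11 if K is the fixed field of $\mathrm{Aut}(\mathbb{F}(t)/\mathbb{F})$, then $\mathbb{F}(t)$
is Galois over K with Galois group equal to $\mathrm{Aut}(\mathbb{F}(t)/\mathbb{F})$. In particular, the fixed field
of $\mathrm{Aut}(\mathbb{F}(t)/\mathbb{F})$ is not $\mathbb{F}$ in this case.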
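
The order formula q(q - 1)(q + 1) can be confirmed by direct enumeration for small fields. The sketch below handles only prime fields $\mathbb{F}_p$, an assumption made so that arithmetic is just integer arithmetic mod p. It counts invertible 2 x 2 matrices up to nonzero scalar multiples, and the function name `pgl2_order` is our own.

```python
from itertools import product

def pgl2_order(p):
    """Count invertible 2x2 matrices over F_p (p prime) modulo nonzero scalars."""
    classes = set()
    for a, b, c, d in product(range(p), repeat=4):
        if (a * d - b * c) % p == 0:
            continue                      # not invertible
        # canonical representative of the scalar class: first nonzero entry scaled to 1
        first = next(x for x in (a, b, c, d) if x)
        inv = pow(first, p - 2, p)        # inverse mod p by Fermat's little theorem
        classes.add(tuple(x * inv % p for x in (a, b, c, d)))
    return len(classes)

for p in (2, 3, 5, 7):
    print(p, pgl2_order(p), p * (p - 1) * (p + 1))
```

For p = 2 the count is 6 and for p = 3 it is 24, matching the orders of $S_3$ and $S_4$.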

This also provides further examples of the Galois correspondence which can be
written out completely for small values of q. For instance, if $q = |\mathbb{F}| = 2$, $PGL_2(\mathbb{F})$
is nonabelian of order 6, hence is isomorphic to $S_3$, and has the following lattice of
subgroups:


[Fig. 5: lattice of subgroups of $S_3$]


The field $\mathbb{F}(t)$ is of degree 6 over the fixed field K of $\mathrm{Aut}(\mathbb{F}(t)/\mathbb{F})$ and the lattice
of subfields $K \subseteq L \subseteq \mathbb{F}(t)$ is dual to the lattice of subgroups of $S_3$. The fixed field of a
cyclic subgroup $\langle \sigma \rangle$ is easily found (via the preceding theorem) by finding a rational
function r in t which is fixed by $\sigma$ such that $[\mathbb{F}(t) : \mathbb{F}(r)] = |\sigma|$. For example, if
$\sigma : t \mapsto 1/(1 + t)$, then $\sigma$ has order 3. The rational function

$$r = t + \sigma(t) + \sigma^2(t) = \frac{t^3 + t + 1}{t(t + 1)}$$

is fixed by $\sigma$ and $[\mathbb{F}(t) : \mathbb{F}(r)] = 3$ (by part (2) of the theorem). Since $\mathbb{F}(r)$ is contained
in the fixed field of $\langle \sigma \rangle$ and the degree of $\mathbb{F}(t)$ over the fixed field is 3, $\mathbb{F}(r)$ is the fixed
field of $\langle \sigma \rangle$. In this way one can explicitly describe the lattice of all subfields of $\mathbb{F}(t)$
containing K shown in Figure 6.
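
The claims in this example (that the substitution has order 3 and fixes r) reduce to polynomial identities over $\mathbb{F}_2$, which can be checked mechanically. In the sketch below, a polynomial over $\mathbb{F}_2$ is stored as an integer bitmask (bit i is the coefficient of $t^i$) and a rational function as a pair (numerator, denominator). This representation and all helper names are assumptions of the illustration, not notation from the text.

```python
def clmul(a, b):
    """Product of two F_2-polynomials stored as bitmasks (carry-less multiplication)."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        b >>= 1
    return r

def clpow(a, n):
    r = 1
    for _ in range(n):
        r = clmul(r, a)
    return r

def deg(a):
    return a.bit_length() - 1

def substitute(r, s):
    """Return r(s(t)) for rational functions r = (P, Q) and s = (a, b) over F_2."""
    (P, Q), (a, b) = r, s
    n = max(deg(P), deg(Q))
    def hom(poly):
        # b^n * poly(a/b), which clears denominators
        total = 0
        for i in range(n + 1):
            if (poly >> i) & 1:
                total ^= clmul(clpow(a, i), clpow(b, n - i))
        return total
    return hom(P), hom(Q)

def add(r, s):
    return clmul(r[0], s[1]) ^ clmul(s[0], r[1]), clmul(r[1], s[1])

def same(r, s):
    """Equality of fractions by cross-multiplication."""
    return clmul(r[0], s[1]) == clmul(r[1], s[0])

t = (0b10, 0b1)                       # t
sigma = (0b1, 0b11)                   # 1/(1 + t)
sigma2 = substitute(sigma, sigma)     # image of t under the square of the substitution
sigma3 = substitute(sigma2, sigma)
r = add(add(t, sigma), sigma2)        # t + sigma(t) + sigma^2(t)
print(sigma2, sigma3, r)
```

Here `r` comes out as the pair `(0b1011, 0b110)`, that is, $(t^3 + t + 1)/(t^2 + t)$, and `same(substitute(r, sigma), r)` confirms that it is fixed.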



Purely transcendental extensions of $\mathbb{Q}$ play an important role in the problem of
realizing finite groups as Galois groups over $\mathbb{Q}$. We describe a deep result of Hilbert which
is fundamental to this area of research. If $a_1, a_2, \dots, a_n$ are independent indeterminates
over a field F, we may evaluate (or specialize) $a_1, \dots, a_n$ at any elements of F, i.e.,
substitute values in F for the “variables” $a_1, a_2, \dots, a_n$. If E is a Galois extension of
$F(a_1, \dots, a_n)$, then E is obtained as a splitting field of a polynomial whose coefficients
lie in $F[a_1, \dots, a_n]$. Any specialization of $a_1, \dots, a_n$ into F maps this polynomial into
one whose coefficients lie in F. The specialization of E is the splitting field of the
resulting specialized polynomial.

Theorem. (Hilbert) Let $x_1, x_2, \dots, x_n$ be independent transcendentals over $\mathbb{Q}$, let
$E = \mathbb{Q}(x_1, \dots, x_n)$ and let G be a finite group of automorphisms of E with fixed field K.
If K is a purely transcendental extension of $\mathbb{Q}$ with transcendence basis $a_1, a_2, \dots, a_n$,
then there are infinitely many specializations of $a_1, \dots, a_n$ in $\mathbb{Q}$ such that E specializes
to a Galois extension of $\mathbb{Q}$ with Galois group isomorphic to G.

Hilbert’s Theorem gives a sufficient condition for the specialized extension not to 
collapse. In general, the Galois group of the specialized extension is a subgroup of G 
(cf. Proposition 19) and may be a proper subgroup of G. It is also known that the fixed 
field K need not always be a purely transcendental extension of Q. An example of this
occurs when G is the cyclic group of order 47. 

This theorem can be used to give another proof of Proposition 42: 

Corollary. $S_n$ is a Galois group over $\mathbb{Q}$, for all n.

Proof of the Corollary: We have already proved that the fixed field of $S_n$ acting in
the obvious fashion on $\mathbb{Q}(x_1, \dots, x_n)$ is purely transcendental over $\mathbb{Q}$ (with the
elementary symmetric functions as a transcendence base), so Hilbert’s Theorem immediately
implies the corollary.

The hypothesis that K be purely transcendental over $\mathbb{Q}$ is crucial to the proof of
Hilbert’s Theorem. Every finite group is isomorphic to a subgroup of $S_n$ and so acts
on $\mathbb{Q}(x_1, \dots, x_n)$ for some n. It is not known, however, even for the subgroup $A_n$ of
$S_n$ whether its fixed field under the obvious action is a purely transcendental extension
of $\mathbb{Q}$ (although it is known by other means that $A_n$ is a Galois group over $\mathbb{Q}$ for all n).
Thus there are a number of important open problems in this area of research.

One should also notice that Hilbert’s Theorem does not work when the base field $\mathbb{Q}$
is replaced by an arbitrary field F (suppose F were algebraically closed, for instance).
In particular, as noted earlier, the general polynomial f(x) in Section 6 has Galois group
$S_n$ over $F(a_1, \dots, a_n)$ for any F, but when F is a finite field, the specialized extension
obtained from its splitting field is always cyclic.

We next expand on the theory of inseparable extensions described in Section 13.5. 
Let p be a prime and let F be a field of characteristic p. 

Definition. An algebraic extension E/F is called purely inseparable if for each $\alpha \in E$
the minimal polynomial of $\alpha$ over F has only one distinct root.

It is easy to see that the following are equivalent:

(1) E/F is purely inseparable

(2) if $\alpha \in E$ is separable over F, then $\alpha \in F$

(3) if $\alpha \in E$, then $\alpha^{p^n} \in F$ for some n (depending on $\alpha$), and $m_{\alpha,F}(x) = x^{p^n} - \alpha^{p^n}$.

The following easy proposition describes composites of separable and purely in- 
separable extensions. 

Proposition. If $E_1$ and $E_2$ are subfields of E which are both separable (or both purely
inseparable) extensions of F, then their composite $E_1 E_2$ is separable (purely inseparable,
respectively) over F.

Proof: Exercise. 

One immediate consequence of this is the following result. 

Proposition. Let E/F be an algebraic extension. Then there is a unique field $E_{sep}$ with
$F \subseteq E_{sep} \subseteq E$ such that $E_{sep}$ is separable over F and E is purely inseparable over
$E_{sep}$. The field $E_{sep}$ is the set of elements of E which are separable over F.

The degree of $E_{sep}/F$ is called the separable degree of E/F and the degree of
$E/E_{sep}$ is called the inseparable degree of E/F (often denoted as $[E : F]_s$ and $[E : F]_i$,
respectively). The product of these two degrees is the (ordinary) degree. The
propositions immediately give the following corollary.

Corollary. Separable degrees (respectively inseparable degrees) are multiplicative. 

When E is generated over F by the root of an irreducible polynomial $p(x) \in F[x]$
the separable and inseparable degrees of the extension E/F are the same as the separable
and inseparable degrees of the polynomial p(x) defined in Section 13.5.

The proposition asserts that any algebraic extension may be decomposed into a 
separable extension followed by a purely inseparable one. Exercise 3 at the end of 
this section outlines an example illustrating that this decomposition cannot generally 
be reversed, namely an extension which is not a separable extension of a purely insep- 
arable extension. We shall shortly state conditions on an extension under which the 
decomposition into separable and purely inseparable subextensions may be reversed. 

We now know that an arbitrary extension E/F can be decomposed into a purely
transcendental extension F(S) of F followed by a separable extension $E_1$ of F(S)
followed by a purely inseparable extension $E/E_1$. In certain instances the
inseparability in the algebraic extension at the “top” may be removed by a judicious choice of
transcendence base:

Proposition. If E is a finitely generated extension of a perfect field F, then there is a
transcendence base T of E/F such that E is a separable (algebraic) extension of F(T).

A transcendence base T as described in the proposition is called a separating 
transcendence base. Exercise 4 at the end of this section illustrates this with a nontrivial 
example. 

Recall that an extension E/F is normal if it is the splitting field of some (possibly 
infinite) set of polynomials in F[x] (in particular, normal extensions are algebraic but not 
necessarily finite or separable). We previously used the synonymous term splitting field 
and the term normal is reintroduced here in the context of arbitrary algebraic extensions 
since it is used frequently in the literature, often in the context of embeddings of a field 
into an algebraic closure. Although the following set of equivalences can be gleaned 
from the preceding sections, the reader should write out a complete proof, checking 
that the arguments work for both infinite and inseparable extensions: 

Proposition. Let E/F be an arbitrary algebraic extension and let $\Omega$ be an algebraic
closure of E. The following are equivalent:

(1) E/F is a normal extension (i.e., is the splitting field over F of some set of
polynomials in F[x])

(2) whenever $\sigma : E \to \Omega$ is an embedding such that $\sigma|_F$ is the identity, $\sigma(E) = E$

(3) whenever an irreducible polynomial $f(x) \in F[x]$ has one root in E, it has all
its roots in E.

In general, any embedding of a normal extension E/F into an algebraic closure
of E which extends the identity embedding of F is an automorphism of E, i.e., is
an element of Aut(E/F). Moreover, the number of such automorphisms equals the
separable degree of E/F, provided the latter is finite:

if E/F is a normal extension and $[E : F]_s$ is finite, $|\mathrm{Aut}(E/F)| = [E : F]_s$.

If $[E : F]_s$ is infinite we shall see shortly that |Aut(E/F)| is also infinite but need not
be of the same cardinality.

If E/F is a normal extension whose separable degree is finite, let $E_0$ be the fixed
field of Aut(E/F). By Corollary 11, $E/E_0$ is a (separable) Galois extension whose
degree equals |Aut(E/F)|. It follows that $E_0/F$ must be purely inseparable (of degree
equal to $[E : F]_i$), i.e., the separable and purely inseparable pieces of the extension
may be reversed for normal extensions. More precisely, we easily obtain the following
proposition.

Proposition. If E/F is normal with $[E : F]_s < \infty$, then $E = E_{sep} E_{pi}$, where $E_{pi}$ is
a purely inseparable extension of F ($E_{pi}$ consists of all purely inseparable elements of
E over F) and $E_{sep} \cap E_{pi} = F$.

Finally, we mention how Galois Theory generalizes to infinite extensions. 

Definition. An extension E/F is called Galois if it is algebraic, normal and separable.
In this case Aut(E/F) is called the Galois group of the extension and is denoted by
Gal(E/F).

For infinite extensions there need not be a bijection between the set of all subgroups 
of the Galois group and the set of all subfields of E containing F, as the following 
example illustrates. 

Let E be the subfield of $\mathbb{R}$ obtained by adjoining to $\mathbb{Q}$ all square roots of positive
rational numbers. One easily sees that E may also be described as the splitting field
of the set of polynomials $x^2 - p$, where p runs over all primes in $\mathbb{Z}^+$. Note that E
is a (countably) infinite Galois extension of $\mathbb{Q}$. Since every automorphism $\sigma$ of E is
determined by its action on the square roots of the primes and $\sigma$ either fixes or negates
each of these, $\sigma^2$ is the identity automorphism. It follows that Aut(E) is an infinite
elementary abelian 2-group. Thus Aut(E) is an infinite dimensional vector space over
$\mathbb{F}_2$. By an exercise in the section on dual spaces (Section 11.3) the number of nonzero
homomorphisms of Aut(E) into $\mathbb{F}_2$ is uncountable, whence their kernels (which are
subspaces of codimension 1) are uncountable in number (and distinct). Thus Aut(E)
has uncountably many subgroups of index 2, whereas $\mathbb{Q}$ has only a countable number
of quadratic extensions.

The basic problem is that many (most) subgroups of Gal(E/F) do not correspond
(in a bijective fashion) to subfields of E containing F. In order to pick out the “right”
set of subgroups of Gal(E/F) we must introduce a topology on this group (called the
Krull topology). The axioms for the collection of (topologically) closed subsets of a
topological space are precisely the bookkeeping devices which single out the relevant
subgroups (these are listed in Section 15.2). Galois theory for finite extensions forces
certain subgroups of finite index to be closed sets and these in turn determine the
topology on the entire group (as we might expect since every extension of F inside E
is a composite of finite extensions). Moreover, the Galois group of E/F is the inverse
limit of the collection of finite groups Gal(K/F), where K runs over all finite Galois
extensions of F contained in E (cf. Exercise 10, Section 7.6).

Theorem. (Krull) Let E/F be a Galois extension with Galois group G. Topologize G by
taking as a base for the closed sets the subgroups of G which are the fixing subgroups of
the finite extensions of F in E, together with all left and right cosets of these subgroups.
Then with this (“Krull”) topology the closed subgroups of G correspond bijectively with
the subfields of E containing F and the corresponding lattices are dual. Closed normal
subgroups of G correspond to normal extensions of F in E.

One important area of current research is to describe (as a topological group) the
Galois group of certain field extensions such as $\bar{F}/F$, where $\bar{F}$ is the algebraic closure
of F. Little is known about the latter group when $F = \mathbb{Q}$ (in particular, its normal
subgroups of finite index, i.e., which finite groups occur as Galois groups over $\mathbb{Q}$, are
not known). If E is the algebraic closure of the finite field $\mathbb{F}_p$, the Galois group of
this extension is the topologically cyclic group $\hat{\mathbb{Z}}$ with the Frobenius automorphism
as a topological generator. The group $\hat{\mathbb{Z}}$ is an uncountable group (in particular, is not
isomorphic to $\mathbb{Z}$) with the property that every closed subgroup of finite index is normal
with cyclic quotient. Note that $\hat{\mathbb{Z}}$ must also have nontrivial closed subgroups of infinite
index (unlike $\mathbb{Z}$) since E contains proper subfields which are infinite over $\mathbb{F}_p$ (such as the
composite of all extensions of $\mathbb{F}_p$ of q-power degree, for any prime q; this Galois
extension of $\mathbb{F}_p$ has Galois group $\mathbb{Z}_q$, the q-adic integers, as described in Exercise 11
of Section 7.6).


EXERCISES 

1. Prove that every purely inseparable extension is normal. 

2. Let p be a prime and let $K = \mathbb{F}_p(x, y)$ with x and y independent transcendentals over $\mathbb{F}_p$.
Let $F = \mathbb{F}_p(x^p - x, y^p - x)$.

(a) Prove that $[K : F] = p^2$ and the separable degree and inseparable degree of K/F are
both equal to p.

(b) Prove that there is a subfield E of K containing F which is purely inseparable over F
of degree p (so then K is a separable extension of E of degree p). [Let $s = x^p - x \in F$
and $t = y^p - x \in F$ and consider s - t.]

3. Let p be an odd prime, let s and t be independent transcendentals over $\mathbb{F}_p$, and let F be
the field $\mathbb{F}_p(s, t)$. Let $\beta$ be a root of $x^2 - sx + t = 0$ and let $\alpha$ be a root of $x^p - \beta = 0$ (in
some algebraic closure of F). Set $E = F(\beta)$ and $K = F(\alpha)$.

(a) Prove that E is a Galois extension of F of degree 2 and that K is a purely inseparable
extension of E of degree p.

(b) Prove that K is not a normal extension of F. [If it were, conjugate over F to show
that K would contain a p-th root of s and then also a p-th root of t, so $[K : F] \geq p^2$, a
contradiction.]

(c) Prove that there is no field $K_0$ such that $F \subseteq K_0 \subseteq K$ with $K_0/F$ purely inseparable
and $K/K_0$ separable. [If there were such a field, use Exercise 1 and the fact that
the composite of two normal extensions is again normal to show that K would be a
normal extension of F.]

4. Under the notation of the previous exercise prove that $\alpha, s$ is a separating transcendence
base for K over $\mathbb{F}_p$.

5. Let p be a prime, let t be transcendental over $\mathbb{F}_p$ and let K be obtained by adjoining to
$\mathbb{F}_p(t)$ all p-power roots of t. Prove that K has transcendence degree 1 over $\mathbb{F}_p$ and has no
separating transcendence base.

6. Show that if t is transcendental over $\mathbb{Q}$ then $\mathbb{Q}(t, \sqrt{t^3 - t})$ is not a purely transcendental
extension of $\mathbb{Q}$. (This is an example of what is called an elliptic function field.)

7. Let k be the field with 4 elements, t a transcendental over k, $F = k(t^4 + t)$ and K = k(t).

(a) Show that [K : F] = 4. 

(b) Show that K is separable over F. 

(c) Show that K is Galois over F. 

(d) Describe the lattice of subgroups of the Galois group and the corresponding lattice of 
subfields of K, giving each subfield in the form k(r), for some rational function r. 

8. Let p be an odd prime, k an algebraically closed field of characteristic p and let t be
transcendental over k. Suppose F is a degree 2 field extension of k(t). Show that F can
be written in the form k(t, y), for some $y \in F$ with $y^2 \in k(t)$ and y transcendental over k.
If $y^2 = 4t^3 - t - 1$, find [F : k(y)] and describe $k(t) \cap k(y)$ as k(r), for some $r \in k(t)$.

9. Let t be transcendental over $\mathbb{F}_3$, let $K = \mathbb{F}_3(t)$, let $G = \mathrm{Aut}(K/\mathbb{F}_3)$ and let F be the fixed
field of G.

(a) Prove $G \cong S_4$ and deduce that there is a unique field E with $F \subseteq E \subseteq K$ and
[E : F] = 2. [Recall that $G \cong PGL_2(\mathbb{F}_3)$; show that $GL_2(\mathbb{F}_3)$ permutes the 4
lines in a 2-dimensional vector space over $\mathbb{F}_3$ and the kernel of this permutation
representation is the scalar matrices.]

(b) Complete the description of the lattice of subfields of K containing E: 



Give each subfield in the form E(r) for some rational function r. (The lattice of
subgroups of $A_4$ appears in Section 3.5).

10. Prove that a purely transcendental proper extension of a field is never algebraically closed. 

11. Let S be a set of independent transcendentals over a field F and let $\Omega$ be an algebraic
closure of F(S). Prove that any permutation on S extends to an element of Aut(F(S)/F).
Prove that any such automorphism of F(S) extends to an automorphism of $\Omega$. Deduce
that $\mathbb{C}$ has infinitely many automorphisms.

12. Let K be a subfield of $\mathbb{C}$ maximal with respect to the property “$\sqrt{2} \notin K$.”

(a) Show such a field K exists.

(b) Show that $\mathbb{C}$ is algebraic over K.

(c) Prove that every finite extension of K in $\mathbb{C}$ is Galois with Galois group a cyclic
2-group.

(d) Deduce that $[\mathbb{C} : K]$ is countable (and not finite).

13. Let K be the fixed field in $\mathbb{C}$ of an automorphism of $\mathbb{C}$. Prove that every finite extension
of K in $\mathbb{C}$ is cyclic.

14. Let $K_n$ be the splitting field of $(x^2 - p_1)(x^2 - p_2) \cdots (x^2 - p_n)$ over $\mathbb{Q}$, where $p_1, \dots, p_n$ are
the first n primes. Prove that the Galois group of $K_n/\mathbb{Q}$ is an elementary abelian 2-group
of order $2^n$.

15. Let $K_0 = \mathbb{Q}$ and for $n \geq 0$ define the field $K_{n+1}$ as the extension of $K_n$ obtained by
adjoining to $K_n$ all roots of all cubic polynomials over $K_n$. Let K be the union of the
subfields $K_n$, $n \geq 0$. Prove that K is a Galois extension of $\mathbb{Q}$. Prove that every cubic
polynomial over K splits completely over K. Prove that there are nontrivial algebraic
extensions of K.

16. Let F be the composite of all the splitting fields of irreducible cubics over Q. Prove that 
F does not contain all quadratic extensions of Q. 

17. Let $K_0 = \mathbb{Q}$ and for $n \geq 0$ define the field $K_{n+1}$ as the extension of $K_n$ obtained by
adjoining to $K_n$ all radicals of elements in $K_n$. Let K be the union of the subfields $K_n$,
$n \geq 0$. Prove that K is a Galois extension of $\mathbb{Q}$. Prove that there are no nontrivial solvable
Galois extensions of K. Prove that there are nontrivial Galois extensions of K.

18. Let $F_0 = \mathbb{Q}$ and for $n \geq 0$ define the field $F_{n+1}$ as the extension of $F_n$ obtained by
adjoining to $F_n$ all real radicals of elements in $F_n$. Let F be the union of the subfields $F_n$,
$n \geq 0$. Let $K^+$ be the fixed field of complex conjugation restricted to the field K in the
previous exercise (the maximal real subfield of K). Prove that $F \neq K^+$.

19. This exercise proves that if K/F is a Galois extension of fields, then Gal(K/F) is
isomorphic to $\varprojlim \mathrm{Gal}(L/F)$, where the inverse limit is taken over all the finite Galois extensions
L of F contained in K.

(a) Show that K is the union of the fields L.

(b) Prove that the map $\varphi : \mathrm{Gal}(K/F) \to \varprojlim \mathrm{Gal}(L/F)$ defined by mapping $\sigma$ in
Gal(K/F) to $(\dots, \sigma|_L, \dots)$, where $\sigma|_L$ is the restriction of $\sigma$ to L, is a
homomorphism.

(c) Show that $\varphi$ is injective.

(d) If $(\dots, \sigma_L, \dots) \in \varprojlim \mathrm{Gal}(L/F)$, define $\sigma \in \mathrm{Gal}(K/F)$ by $\sigma(a) = \sigma_L(a)$ if $a \in L$.
Prove that $\sigma$ is a well defined automorphism and deduce that $\varphi$ is surjective.

Part V


INTRODUCTION TO COMMUTATIVE RINGS, 
ALGEBRAIC GEOMETRY, AND 
HOMOLOGICAL ALGEBRA 


In this part of the book we continue the study of rings and modules, concentrating 
first on commutative rings. The topic of commutative algebra, which is of interest in 
its own right, is also a basic foundation for other areas of algebra. To indicate some 
of the importance of the algebraic topics introduced, we parallel the development of 
the ring theory in Chapter 15 with an introduction to affine algebraic geometry. Each 
section first presents the basic algebraic theory and then follows with an application of 
those ideas to geometry together with an indication of computational methods using 
the theory of Grobner bases from Chapter 9. The purpose here is twofold: the first is to 
present an application of algebraic techniques in the important branch of mathematics 
called Algebraic Geometry, and the second is to indicate some of the motivations for 
the algebraic concepts introduced from their origins in geometric questions. 

This connection of geometry and algebra shows a rich interplay between these two 
areas of mathematics and demonstrates again how results and structures in one circle 
of mathematical ideas provide insights into another. 

In Chapter 16 we continue with some of the fundamental structures involving 
commutative rings, culminating with Dedekind Domains and a structure theorem for 
modules over such rings which is a generalization of the structure theorem for modules 
over P.I.D.s in Chapter 12. 

In Chapter 17 we describe some of the basic techniques of “homological algebra,” 
which continues with some of the questions raised by the failure of exactness of some 
of the sequences considered in Chapter 10. The cohomology of groups in this chapter 
is intended to serve both as a more in-depth application of homological algebra to see 
its uses in practice, and as a relatively self contained exposition of this important topic. 

CHAPTER 15


Commutative Rings 
and Algebraic Geometry 


Throughout this chapter R will denote a commutative ring with $1 \neq 0$.

15.1 NOETHERIAN RINGS AND AFFINE ALGEBRAIC SETS 

In this section we study Noetherian rings in greater detail. These are a natural gener- 
alization of Principal Ideal Domains and were introduced briefly in Chapter 12. Note 
that when R is considered as a left module over itself, its R -submodules are precisely 
its ideals, so the definition in Section 1 of Chapter 12 may be phrased in the following 
form: 

Definition. A commutative ring R is said to be Noetherian or to satisfy the ascending
chain condition on ideals (or A.C.C. on ideals) if there is no infinite increasing chain
of ideals in R, i.e., whenever $I_1 \subseteq I_2 \subseteq I_3 \subseteq \cdots$ is an increasing chain of ideals of R,
then there is a positive integer m such that $I_k = I_m$ for all $k \geq m$.

Proposition 1. If I is an ideal of the Noetherian ring R, then the quotient R/I is a
Noetherian ring. Any homomorphic image of a Noetherian ring is Noetherian.

Proof: If R is a ring and I is an ideal in R, then any infinite ascending chain of
ideals in the quotient R/I would correspond by the Lattice Isomorphism Theorem to
an infinite ascending chain of ideals in R. This gives the first statement, and the second
follows by the First Isomorphism Theorem.

Theorem 2. The following are equivalent: 

(1) R is a Noetherian ring. 

(2) Every nonempty set of ideals of R contains a maximal element under inclusion. 

(3) Every ideal of R is finitely generated. 

Proof: The proof is identical to that of Theorem 1 in Section 12.1 in the special 
case where the R-module M is R itself (and submodules are ideals).

Examples

Every Principal Ideal Domain is Noetherian since it satisfies condition (3) of Theorem 2.
In particular, $\mathbb{Z}$, the polynomial ring $k[x]$ where $k$ is a field, and the Gaussian integers $\mathbb{Z}[i]$,
are Noetherian rings. The ring $\mathbb{Z}[x_1, x_2, \ldots]$ is not Noetherian since the ideal $(x_1, x_2, \ldots)$
cannot be generated by any finite set (any finite set of generators involves only finitely
many of the $x_i$). Exercise 33(d) in Section 7.4 shows that the ring of continuous real valued
functions on [0, 1] is not Noetherian.

A Noetherian ring may have arbitrarily long ascending chains of ideals and may have
infinitely long descending chains of ideals. For example, $\mathbb{Z}$ has the infinite descending
chain

\[ (2) \supset (4) \supset (8) \supset \cdots \]

i.e., a Noetherian ring need not satisfy the descending chain condition on ideals (D.C.C.).
We shall see, however, that a commutative ring satisfying D.C.C. on ideals necessarily
also satisfies A.C.C., i.e., is Noetherian; such rings are called Artinian and are studied
in Chapter 16.
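
The contrast between ascending and descending chains in $\mathbb{Z}$ can be sketched computationally. The following minimal Python sketch relies only on the fact that every ideal of $\mathbb{Z}$ is principal, so the ideal $(a_1, \ldots, a_k)$ is generated by $\gcd(a_1, \ldots, a_k)$, and that $(d) \supseteq (e)$ exactly when $d$ divides $e$. The function names and the sample integers are illustrative choices, not part of the text.

```python
from math import gcd
from itertools import accumulate

def ascending_chain(generators):
    """Generators d_k of the ideals I_k = (a_1, ..., a_k) of Z.

    Each I_k is principal, generated by gcd(a_1, ..., a_k), and
    I_1 <= I_2 <= ... is an ascending chain of ideals.
    """
    return list(accumulate(generators, gcd))

def contains(d, e):
    """True when the ideal (d) contains the ideal (e), i.e. d divides e."""
    return e % d == 0 if d != 0 else e == 0

# The ascending chain must stabilize: a generator can only lose prime factors.
chain = ascending_chain([360, 84, 90, 35, 12, 1000])
print(chain)

# Descending chain (2) > (4) > (8) > ...: every inclusion is strict.
powers = [2 ** k for k in range(1, 11)]
strict = all(contains(d, e) and not contains(e, d)
             for d, e in zip(powers, powers[1:]))
print(strict)
```

Only finitely many terms of the descending chain can be inspected this way, of course; the point of the sketch is that the ascending chain is forced to become constant while nothing stops the descending one.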

The following theorem and its corollary, which we record here for completeness, 
were proved in Section 9.6 (Theorem 21 and Corollary 22, respectively). 

Theorem 3. (Hilbert’s Basis Theorem) If R is a Noetherian ring then so is the polynomial
ring R[x].

Note that Hilbert’s Basis Theorem shows how larger Noetherian rings may be built 
from existing ones in a manner analogous to Theorem 7 of Section 9.3 (which proved 
that if R is a U.F.D., then so is R[x]).

Corollary 4. The polynomial ring $k[x_1, x_2, \ldots, x_n]$ with coefficients from a field $k$ is
a Noetherian ring.

Let $k$ be a field. Recall that a ring R is a k-algebra if $k$ is contained in the center of
R and the identity of $k$ is the identity of R.

Definition.

(1) The ring R is a finitely generated k-algebra if R is generated as a ring by $k$
together with some finite set $r_1, r_2, \ldots, r_n$ of elements of R.

(2) Let R and S be k-algebras. A map $\psi : R \to S$ is a k-algebra homomorphism
if $\psi$ is a ring homomorphism that is the identity on $k$.

If R is a k-algebra then R is both a ring and a vector space over $k$, and it is
important to distinguish the sense in which elements of R are generators for R. For
example, the polynomial ring $k[x_1, \ldots, x_n]$ in a finite number of variables over $k$ is a
finitely generated k-algebra since $x_1, \ldots, x_n$ are ring generators, but for $n > 0$ this ring
is an infinite dimensional vector space over $k$.

Corollary 5. The ring R is a finitely generated k-algebra if and only if there is some
surjective k-algebra homomorphism

\[ \varphi : k[x_1, x_2, \ldots, x_n] \to R \]

from the polynomial ring in a finite number of variables onto R that is the identity map
on $k$. Any finitely generated k-algebra is therefore Noetherian.

Proof: If R is generated as a k-algebra by $r_1, \ldots, r_n$, then we may define the map
$\varphi : k[x_1, \ldots, x_n] \to R$ by $\varphi(x_i) = r_i$ for all $i$ and $\varphi(a) = a$ for all $a \in k$. Then $\varphi$
extends uniquely to a surjective ring homomorphism. Conversely, given a surjective
homomorphism $\varphi$, the images of $x_1, \ldots, x_n$ under $\varphi$ then generate R as a k-algebra,
proving that R is finitely generated. Since $k[x_1, \ldots, x_n]$ is Noetherian by the previous
corollary, any finitely generated k-algebra is therefore the quotient of a Noetherian ring,
hence also Noetherian by Proposition 1.

Example 

Suppose the k-algebra R is finite dimensional as a vector space over $k$, for example when
$R = k[x]/(f(x))$, where $f$ is any nonzero polynomial in $k[x]$. Then in particular R is a
finitely generated k-algebra since a vector space basis also generates R as a ring. In this
case, since ideals are also k-subspaces, any ascending or descending chain of ideals has at
most $\dim_k R + 1$ distinct terms, hence R satisfies both A.C.C. and D.C.C. on ideals.

The basic idea behind “algebraic geometry” is to equate geometric questions with
algebraic questions involving ideals in rings such as $k[x_1, \ldots, x_n]$. The Noetherian
nature of these rings reduces many questions to consideration of finitely many algebraic
equations (and this was in turn one of the main original motivations for Hilbert’s Basis
Theorem). We first consider the principal geometric object, the notion of an “algebraic
set” of points.

Affine Algebraic Sets 

Recall that the set $\mathbb{A}^n$ of n-tuples of elements of the field $k$ is called affine n-space
over $k$ (cf. Section 10.1). If $x_1, x_2, \ldots, x_n$ are independent variables over $k$, then the
polynomials $f$ in $k[x_1, x_2, \ldots, x_n]$ can be viewed as k-valued functions $f : \mathbb{A}^n \to k$
on $\mathbb{A}^n$ by evaluating $f$ at the points in $\mathbb{A}^n$:

\[ f : (a_1, a_2, \ldots, a_n) \mapsto f(a_1, a_2, \ldots, a_n) \in k. \]

This gives a ring of k-valued functions on $\mathbb{A}^n$, denoted by $k[\mathbb{A}^n]$ and called the coordinate
ring of $\mathbb{A}^n$. For instance, when $k = \mathbb{R}$ and $n = 2$, the coordinate ring of Euclidean
2-space $\mathbb{R}^2$ is denoted by $\mathbb{R}[\mathbb{A}^2]$ and is the ring of polynomials in two variables, say $x$
and $y$, acting as real valued functions on $\mathbb{R}^2$ (the usual “coordinate functions”).

Each subset S of functions in the coordinate ring $k[\mathbb{A}^n]$ determines a subset $\mathcal{Z}(S)$
of affine space, namely the set of points where all functions in S are simultaneously
zero:

\[ \mathcal{Z}(S) = \{ (a_1, a_2, \ldots, a_n) \in \mathbb{A}^n \mid f(a_1, a_2, \ldots, a_n) = 0 \text{ for all } f \in S \}, \]

where $\mathcal{Z}(\emptyset) = \mathbb{A}^n$.

Definition. A subset V of $\mathbb{A}^n$ is called an affine algebraic set (or just an algebraic set)
if V is the set of common zeros of some set S of polynomials, i.e., if $V = \mathcal{Z}(S)$ for
some $S \subseteq k[\mathbb{A}^n]$. In this case $V = \mathcal{Z}(S)$ is called the locus of S in $\mathbb{A}^n$.

If $S = \{f\}$ or $\{f_1, \ldots, f_m\}$ we shall simply write $\mathcal{Z}(f)$ or $\mathcal{Z}(f_1, \ldots, f_m)$ for $\mathcal{Z}(S)$
and call it the locus of $f$ or $f_1, \ldots, f_m$, respectively. Note that the locus of a single
polynomial of the form $f - g$ is the same as the solutions in affine n-space of the
equation $f = g$, so affine algebraic sets are the solution sets to systems of polynomial
equations, and as a result occur frequently in mathematics.


Examples 


(1) If $n = 1$ then the locus of a single polynomial $f \in k[x]$ is the set of roots of $f$ in $k$.
The algebraic sets in $\mathbb{A}^1$ are $\emptyset$, any finite set, and $k$ (cf. the exercises).

(2) The one point subsets of $\mathbb{A}^n$ for any $n$ are affine algebraic since $\{(a_1, a_2, \ldots, a_n)\}$ is
$\mathcal{Z}(x_1 - a_1, x_2 - a_2, \ldots, x_n - a_n)$. More generally, any finite subset of $\mathbb{A}^n$ is an
affine algebraic set.

(3) One may define lines, planes, etc. in $\mathbb{A}^n$; these are linear algebraic sets, the loci
of sets of linear (degree 1) polynomials of $k[x_1, \ldots, x_n]$. For example, a line in
$\mathbb{A}^2$ is defined by an equation $ax + by = c$ (which is the locus of the polynomial
$f(x, y) = ax + by - c \in k[x, y]$). A line in $\mathbb{A}^3$ is the locus of two linear polynomials
of $k[x, y, z]$ that are not multiples of each other. In particular, the coordinate axes,
coordinate planes, etc. in $\mathbb{A}^n$ are all affine algebraic sets. For instance, the x-axis in
$\mathbb{A}^3$ is the zero set $\mathcal{Z}(y, z)$ and the x,y plane is the zero set $\mathcal{Z}(z)$.

(4) In general the algebraic set $\mathcal{Z}(f)$ of a nonconstant polynomial $f$ is called a hypersurface
in $\mathbb{A}^n$. Conic sections are familiar algebraic sets in the Euclidean plane $\mathbb{R}^2$.
For example, the locus of $y - x^2$ is the parabola $y = x^2$, the locus of $x^2 + y^2 - 1$
is the unit circle, and $\mathcal{Z}(xy - 1)$ is the hyperbola $y = 1/x$. The x- and y-axes are
the algebraic sets $\mathcal{Z}(y)$ and $\mathcal{Z}(x)$ respectively. Likewise, quadric surfaces such as the
ellipsoid defined by the equation $x^2 + \frac{y^2}{4} + \frac{z^2}{9} = 1$ are affine algebraic sets in $\mathbb{R}^3$.


We leave as exercises the straightforward verification of the following properties
of affine algebraic sets. Let S and T be subsets of $k[\mathbb{A}^n]$.

(1) If $S \subseteq T$ then $\mathcal{Z}(T) \subseteq \mathcal{Z}(S)$ (i.e., $\mathcal{Z}$ is inclusion reversing or contravariant).

(2) $\mathcal{Z}(S) = \mathcal{Z}(I)$, where $I = (S)$ is the ideal in $k[\mathbb{A}^n]$ generated by the subset S.

(3) The intersection of two affine algebraic sets is again an affine algebraic set, in
fact $\mathcal{Z}(S) \cap \mathcal{Z}(T) = \mathcal{Z}(S \cup T)$. More generally an arbitrary intersection of affine
algebraic sets is an algebraic set: if $\{S_j\}$ is any collection of subsets of $k[\mathbb{A}^n]$, then
$\bigcap \mathcal{Z}(S_j) = \mathcal{Z}(\bigcup S_j)$.

(4) The union of two affine algebraic sets is again an affine algebraic set, in fact
$\mathcal{Z}(I) \cup \mathcal{Z}(J) = \mathcal{Z}(IJ)$, where $I$ and $J$ are ideals and $IJ$ is their product.

(5) $\mathcal{Z}(0) = \mathbb{A}^n$ and $\mathcal{Z}(1) = \emptyset$ (here 0 and 1 denote constant functions).
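
Over a finite field the properties above can be checked by exhaustive search, which may help in getting a feel for them before proving them. The following is a minimal sketch in Python under stated assumptions: the field $\mathbb{F}_5$, the two sets of polynomials S and T, and the helper name `zero_set` are all illustrative choices, and polynomials are represented simply as Python functions evaluated modulo 5. For property (4) the sketch uses the fact that the product ideal $(S)(T)$ is generated by the products $fg$ with $f \in S$, $g \in T$, together with property (2).

```python
from itertools import product

P = 5  # illustrative choice: the finite field F_5, so A^2 has 25 points

def zero_set(polys):
    """Z(S) in A^2 over F_P: points where every polynomial in S vanishes."""
    return {pt for pt in product(range(P), repeat=2)
            if all(f(*pt) % P == 0 for f in polys)}

S = [lambda x, y: y - x * x]                        # parabola y = x^2
T = [lambda x, y: x - y, lambda x, y: x * y - 1]    # x = y and xy = 1

# Products f*g of generators generate the product ideal (S)(T).
ST = [lambda x, y, f=f, g=g: f(x, y) * g(x, y) for f in S for g in T]

Z_S, Z_T = zero_set(S), zero_set(T)
print(zero_set(S + T) <= Z_S)              # property (1): S is a subset of S u T
print(Z_S & Z_T == zero_set(S + T))        # property (3)
print(Z_S | Z_T == zero_set(ST))           # property (4)
```

A brute-force check of this kind is only a sanity test for one choice of field and generators; the general statements still require the short arguments left as exercises.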

By (2), every affine algebraic set is the algebraic set corresponding to an ideal of
the coordinate ring. Thus we may consider

\[ \mathcal{Z} : \{ \text{ideals of } k[\mathbb{A}^n] \} \to \{ \text{affine algebraic sets in } \mathbb{A}^n \}. \]

Since every ideal $I$ in the Noetherian ring $k[x_1, x_2, \ldots, x_n]$ is finitely generated, say
$I = (f_1, f_2, \ldots, f_q)$, it follows from (3) that $\mathcal{Z}(I) = \mathcal{Z}(f_1) \cap \mathcal{Z}(f_2) \cap \cdots \cap \mathcal{Z}(f_q)$, i.e.,
each affine algebraic set is the intersection of a finite number of hypersurfaces in $\mathbb{A}^n$.
Note that this “geometric” property in affine n-space is a consequence of an “algebraic”
property of the corresponding coordinate ring (namely, Hilbert’s Basis Theorem).

If V is an algebraic set in affine n-space, then there may be many ideals $I$ such
that $V = \mathcal{Z}(I)$. For example, in affine 2-space over $\mathbb{R}$ the y-axis is the locus of the
ideal $(x)$ of $\mathbb{R}[x, y]$, and also the locus of $(x^2)$, $(x^3)$, etc. More generally, the zeros
of any polynomial are the same as the zeros of all its positive powers, and it follows
that $\mathcal{Z}(I) = \mathcal{Z}(I^k)$ for all $k \geq 1$. We shall study the relationship between ideals that
determine the same affine algebraic set in the next section when we discuss radicals of
ideals.

While the ideal whose locus determines a particular algebraic set V is not unique,
there is a unique largest ideal that determines V, given by the set of all polynomials
that vanish on V. In general, for any subset A of $\mathbb{A}^n$ define

\[ \mathcal{I}(A) = \{ f \in k[x_1, \ldots, x_n] \mid f(a_1, a_2, \ldots, a_n) = 0 \text{ for all } (a_1, a_2, \ldots, a_n) \in A \}. \]

It is immediate that $\mathcal{I}(A)$ is an ideal, and is the unique largest ideal of functions that
are identically zero on A. This defines a correspondence

\[ \mathcal{I} : \{ \text{subsets in } \mathbb{A}^n \} \to \{ \text{ideals of } k[\mathbb{A}^n] \}. \]


Examples 

(1) In the Euclidean plane, $\mathcal{I}(\text{the } x\text{-axis})$ is the ideal generated by $y$ in the coordinate ring
$\mathbb{R}[x, y]$.

(2) Over any field $k$, the ideal of functions vanishing at $(a_1, a_2, \ldots, a_n) \in \mathbb{A}^n$ is a maximal
ideal since it is the kernel of the surjective ring homomorphism from $k[x_1, \ldots, x_n]$ to
the field $k$ given by evaluation at $(a_1, a_2, \ldots, a_n)$. It follows that

\[ \mathcal{I}((a_1, a_2, \ldots, a_n)) = (x_1 - a_1, x_2 - a_2, \ldots, x_n - a_n). \]

(3) Let $V = \mathcal{Z}(x^3 - y^2)$ in $\mathbb{A}^2$. If $(a, b) \in \mathbb{A}^2$ is an element of V then $a^3 = b^2$. If
$a \neq 0$, then also $b \neq 0$ and we can write $a = (b/a)^2$, $b = (b/a)^3$. It follows
that V is the set $\{(a^2, a^3) \mid a \in k\}$. For any polynomial $f(x, y) \in k[x, y]$ we can
write $f(x, y) = f_0(x) + f_1(x)y + (x^3 - y^2)g(x, y)$. For $f(x, y) \in \mathcal{I}(V)$, i.e.,
$f(a^2, a^3) = 0$ for all $a \in k$, it follows that $f_0(a^2) + f_1(a^2)a^3 = 0$ for all $a \in k$. If
$f_0(x) = a_r x^r + \cdots + a_0$ and $f_1(x) = b_s x^s + \cdots + b_0$ then

\[ f_0(x^2) + x^3 f_1(x^2) = (a_r x^{2r} + \cdots + a_0) + (b_s x^{2s+3} + \cdots + b_0 x^3) \]

and this polynomial is 0 at every $a \in k$. If $k$ is infinite, this polynomial has infinitely
many zeros, which can happen only if all of the coefficients are zero. The coefficients
of the terms of even degree are the coefficients of $f_0(x)$ and the coefficients of the
terms of odd degree are the coefficients of $f_1(x)$, so it follows that $f_0(x)$ and $f_1(x)$
are both 0. It follows that $f(x, y) = (x^3 - y^2)g(x, y)$, and so

\[ \mathcal{I}(V) = (x^3 - y^2) \subset k[x, y]. \]

If $k$ is finite, however, there may be elements in $\mathcal{I}(V)$ not lying in the ideal $(x^3 - y^2)$.
For example, if $k = \mathbb{F}_2$, then V is simply the set $\{(0, 0), (1, 1)\}$ and so $\mathcal{I}(V)$ contains
the polynomial $x(x - 1)$ (cf. Exercise 15).
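
The finite field case in example (3) is small enough to check by listing points. The sketch below, a minimal illustration in Python whose function names are hypothetical, computes $\mathcal{Z}(x^3 - y^2)$ over $\mathbb{F}_p$ by testing every point of $\mathbb{A}^2$, compares it with the parametrized set $\{(a^2, a^3)\}$ for a few small primes (chosen arbitrarily), and confirms that $x(x - 1)$ vanishes on V when $p = 2$.

```python
from itertools import product

def curve_points(p):
    """Z(x^3 - y^2) in A^2 over F_p, found by testing every point."""
    return {(a, b) for a, b in product(range(p), repeat=2)
            if (a**3 - b**2) % p == 0}

def parametrized_points(p):
    """The set {(a^2, a^3) : a in F_p}."""
    return {(a * a % p, a**3 % p) for a in range(p)}

# The two descriptions of V agree over each of these small prime fields.
agree = all(curve_points(p) == parametrized_points(p) for p in (2, 3, 5, 7))
print(agree)

V = curve_points(2)
print(sorted(V))
# x(x - 1) vanishes at every point of V over F_2.
print(all((x * (x - 1)) % 2 == 0 for x, y in V))
```

Since every nonzero multiple of $x^3 - y^2$ has total degree at least 3, the degree 2 polynomial $x(x - 1)$ cannot lie in $(x^3 - y^2)$, which is why the ideal $\mathcal{I}(V)$ is strictly larger over $\mathbb{F}_2$.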
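
The following properties of the map $\mathcal{I}$ are very easy exercises. Let A and B be
subsets of $\mathbb{A}^n$.

(6) If $A \subseteq B$ then $\mathcal{I}(B) \subseteq \mathcal{I}(A)$ (i.e., $\mathcal{I}$ is also contravariant).

(7) $\mathcal{I}(A \cup B) = \mathcal{I}(A) \cap \mathcal{I}(B)$.

(8) $\mathcal{I}(\emptyset) = k[x_1, \ldots, x_n]$ and, if $k$ is infinite, $\mathcal{I}(\mathbb{A}^n) = 0$.

Moreover, there are easily verified relations between the maps $\mathcal{Z}$ and $\mathcal{I}$:

(9) If A is any subset of $\mathbb{A}^n$ then $A \subseteq \mathcal{Z}(\mathcal{I}(A))$, and if $I$ is any ideal then $I \subseteq \mathcal{I}(\mathcal{Z}(I))$.

(10) If $V = \mathcal{Z}(I)$ is an affine algebraic set then $V = \mathcal{Z}(\mathcal{I}(V))$, and if $I = \mathcal{I}(A)$ then
$\mathcal{I}(\mathcal{Z}(I)) = I$, i.e., $\mathcal{Z}(\mathcal{I}(\mathcal{Z}(I))) = \mathcal{Z}(I)$ and $\mathcal{I}(\mathcal{Z}(\mathcal{I}(A))) = \mathcal{I}(A)$.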

The last relation shows that the maps $\mathcal{Z}$ and $\mathcal{I}$ act as inverses of each other provided
one restricts to the collection of affine algebraic sets $V = \mathcal{Z}(I)$ in $\mathbb{A}^n$ and to the set of
ideals in $k[\mathbb{A}^n]$ of the form $\mathcal{I}(V)$. In the case where the field $k$ is algebraically closed
we shall (in the following two sections) characterize those ideals $I$ that are of the form
$\mathcal{I}(V)$ for some affine algebraic set V in terms of purely ring-theoretic properties of the
ideal $I$ (this is the famous “Zeros Theorem” of Hilbert, cf. Theorem 32).

Definition. If $V \subseteq \mathbb{A}^n$ is an affine algebraic set the quotient ring $k[\mathbb{A}^n]/\mathcal{I}(V)$ is called
the coordinate ring of V, and is denoted by $k[V]$.

Note that for $V = \mathbb{A}^n$ and $k$ infinite we have $\mathcal{I}(V) = 0$, so this definition extends
the previous terminology. The polynomials in $k[\mathbb{A}^n]$ define k-valued functions on V
simply by restricting these functions on $\mathbb{A}^n$ to the subset V. Two such polynomial
functions $f$ and $g$ define the same function on V if and only if $f - g$ is identically 0
on V, which is to say that $f - g \in \mathcal{I}(V)$. Hence the cosets $\overline{f} = f + \mathcal{I}(V)$ giving the
elements of the quotient $k[V]$ are precisely the restrictions to V of ordinary polynomial
functions $f$ from $\mathbb{A}^n$ to $k$ (which helps to explain the notation $k[V]$). If $x_i$ denotes the
$i$th coordinate function on $\mathbb{A}^n$ (projecting an n-tuple onto its $i$th component), then the
restriction $\overline{x_i}$ of $x_i$ to V (which also just gives the $i$th component of the elements in V
viewed as a subset of $\mathbb{A}^n$) is an element of $k[V]$, and $k[V]$ is finitely generated as a
k-algebra by $\overline{x_1}, \ldots, \overline{x_n}$ (although this need not be a minimal generating set).

Example 

If $V = \mathcal{Z}(xy - 1)$ is the hyperbola $y = 1/x$ in $\mathbb{R}^2$, then $\mathbb{R}[V] = \mathbb{R}[x, y]/(xy - 1)$. The
polynomials $f(x, y) = x$ (the x-coordinate function) and $g(x, y) = x + (xy - 1)$, which
are different functions on $\mathbb{R}^2$, define the same function on the subset V. On the point
$(1/2, 2) \in V$, for example, both give the value 1/2. In the quotient ring $\mathbb{R}[V]$ we have
$\overline{x}\,\overline{y} = 1$, so $\mathbb{R}[V] \cong \mathbb{R}[x, 1/x]$. For any function $\overline{f} \in \mathbb{R}[V]$ and any $(a, b) \in V$ we have
$\overline{f}(a, b) = f(a, 1/a)$ for any polynomial $f \in \mathbb{R}[x, y]$ mapping to $\overline{f}$ in the quotient.
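
The statement that $f$ and $g$ are different functions on the plane but equal as elements of $\mathbb{R}[V]$ can be spot-checked numerically. The following minimal Python sketch uses exact rational arithmetic; the sample points on and off the hyperbola are arbitrary illustrative choices.

```python
from fractions import Fraction

def f(x, y):
    return x                      # the x-coordinate function

def g(x, y):
    return x + (x * y - 1)        # differs from f by a multiple of xy - 1

# Sample points (a, 1/a) on the hyperbola V = Z(xy - 1), in exact arithmetic.
points_on_V = [(a, 1 / a)
               for a in (Fraction(1, 2), Fraction(-3), Fraction(7, 5))]

agree_on_V = all(f(a, b) == g(a, b) for a, b in points_on_V)
differ_off_V = f(Fraction(1), Fraction(2)) != g(Fraction(1), Fraction(2))
value_at_half = f(Fraction(1, 2), Fraction(2))
print(agree_on_V, differ_off_V, value_at_half)
```

Agreement at finitely many points does not by itself prove equality in $\mathbb{R}[V]$; that follows from $f - g = -(xy - 1) \in \mathcal{I}(V)$, and the check merely illustrates it.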

Suppose now that $V \subseteq \mathbb{A}^n$ and $W \subseteq \mathbb{A}^m$ are two affine algebraic sets. Since V
and W are defined by the vanishing of polynomials, the most natural algebraic maps
between V and W are those defined by polynomials:

Definition. A map $\varphi : V \to W$ is called a morphism (or polynomial map or regular
map) of algebraic sets if there are polynomials $\varphi_1, \ldots, \varphi_m \in k[x_1, x_2, \ldots, x_n]$ such that

\[ \varphi((a_1, \ldots, a_n)) = (\varphi_1(a_1, \ldots, a_n), \ldots, \varphi_m(a_1, \ldots, a_n)) \]

for all $(a_1, \ldots, a_n) \in V$. The map $\varphi : V \to W$ is an isomorphism of algebraic sets if
there is a morphism $\psi : W \to V$ with $\varphi \circ \psi = 1_W$ and $\psi \circ \varphi = 1_V$.

Note that in general $\varphi_1, \varphi_2, \ldots, \varphi_m$ are not uniquely defined. For example, both
$f = x$ and $g = x + (xy - 1)$ in the example above define the same morphism from
$V = \mathcal{Z}(xy - 1)$ to $W = \mathbb{A}^1$.

Suppose F is a polynomial in $k[x_1, \ldots, x_m]$. Then $F \circ \varphi = F(\varphi_1, \varphi_2, \ldots, \varphi_m)$
is a polynomial in $k[x_1, \ldots, x_n]$ since $\varphi_1, \varphi_2, \ldots, \varphi_m$ are polynomials in $x_1, \ldots, x_n$.
If $F \in \mathcal{I}(W)$, then $F \circ \varphi((a_1, a_2, \ldots, a_n)) = 0$ for every $(a_1, a_2, \ldots, a_n) \in V$
since $\varphi((a_1, a_2, \ldots, a_n)) \in W$. Thus $F \circ \varphi \in \mathcal{I}(V)$. It follows that $\varphi$ induces
a well defined map from the quotient ring $k[x_1, \ldots, x_m]/\mathcal{I}(W)$ to the quotient ring
$k[x_1, \ldots, x_n]/\mathcal{I}(V)$:

\[ \widetilde{\varphi} : k[W] \to k[V], \qquad f \mapsto f \circ \varphi \]

where $f \circ \varphi$ is given by $F \circ \varphi + \mathcal{I}(V)$ for any polynomial $F = F(x_1, \ldots, x_m)$ with
$f = F + \mathcal{I}(W)$. It is easy to check that $\widetilde{\varphi}$ is a k-algebra homomorphism (for example,
$\widetilde{\varphi}(f + g) = (f + g) \circ \varphi = f \circ \varphi + g \circ \varphi = \widetilde{\varphi}(f) + \widetilde{\varphi}(g)$ shows that $\widetilde{\varphi}$ is additive).
Note also the contravariant nature of $\widetilde{\varphi}$: the morphism from V to W induces a k-algebra
homomorphism from $k[W]$ to $k[V]$.

Suppose conversely that $\Phi$ is any k-algebra homomorphism from the coordinate
ring $k[W] = k[x_1, \ldots, x_m]/\mathcal{I}(W)$ to $k[V] = k[x_1, \ldots, x_n]/\mathcal{I}(V)$. Let $F_i$ be a
representative in $k[x_1, \ldots, x_n]$ for the image under $\Phi$ of $\overline{x_i} \in k[W]$ (i.e., $\Phi(x_i \bmod \mathcal{I}(W))$
is $F_i \bmod \mathcal{I}(V)$). Then $\varphi = (F_1, \ldots, F_m)$ defines a polynomial map from $\mathbb{A}^n$ to $\mathbb{A}^m$,
and in fact $\varphi$ is a morphism from V to W. To see this it suffices to check that $\varphi$ maps a
point of V to a point of W since by definition $\varphi$ is already defined by polynomials. If
$g \in \mathcal{I}(W) \subseteq k[x_1, \ldots, x_m]$, then in $k[W]$ we have

\[ g(x_1 + \mathcal{I}(W), \ldots, x_m + \mathcal{I}(W)) = g(x_1, \ldots, x_m) + \mathcal{I}(W) = \mathcal{I}(W) = 0 \in k[W] \]

and so

\[ \Phi(g(x_1 + \mathcal{I}(W), \ldots, x_m + \mathcal{I}(W))) = 0 \in k[V]. \]

Since $\Phi$ is a k-algebra homomorphism, it follows that

\[ g(\Phi(x_1 + \mathcal{I}(W)), \ldots, \Phi(x_m + \mathcal{I}(W))) = 0 \in k[V]. \]

By definition, $\Phi(x_i + \mathcal{I}(W)) = F_i \bmod \mathcal{I}(V)$, so

\[ g(F_1 \bmod \mathcal{I}(V), \ldots, F_m \bmod \mathcal{I}(V)) = 0 \in k[V], \]

i.e.,

\[ g(F_1, \ldots, F_m) \in \mathcal{I}(V). \]

It follows that $g(F_1(a_1, \ldots, a_n), \ldots, F_m(a_1, \ldots, a_n)) = 0$ for every $(a_1, \ldots, a_n)$ in
V. This shows that if $(a_1, \ldots, a_n) \in V$, then every polynomial in $\mathcal{I}(W)$ vanishes
on $\varphi(a_1, \ldots, a_n)$. By property (10) of the maps $\mathcal{Z}$ and $\mathcal{I}$ above, this means that
$\varphi(a_1, \ldots, a_n) \in \mathcal{Z}(\mathcal{I}(W)) = W$, which proves that $\varphi$ maps a point in V to a point
in W. It follows that $\varphi = (F_1, \ldots, F_m)$ is a morphism from V to W. Since the $F_i$ are
well defined modulo $\mathcal{I}(V)$, this morphism from V to W does not depend on the choice
of the $F_i$. Furthermore, the morphism $\varphi$ induces the original k-algebra homomorphism
$\Phi$ from $k[W]$ to $k[V]$, i.e., $\widetilde{\varphi} = \Phi$, since both homomorphisms take the value $F_i + \mathcal{I}(V)$
on $x_i + \mathcal{I}(W) \in k[W]$. This proves the first two statements in the following theorem.

Theorem 6. Let $V \subseteq \mathbb{A}^n$ and $W \subseteq \mathbb{A}^m$ be affine algebraic sets. Then there is a bijective
correspondence

\[ \{ \text{morphisms from } V \text{ to } W \text{ as algebraic sets} \} \longleftrightarrow \{ k\text{-algebra homomorphisms from } k[W] \text{ to } k[V] \}. \]

More precisely,

(1) Every morphism $\varphi : V \to W$ induces an associated k-algebra homomorphism
$\widetilde{\varphi} : k[W] \to k[V]$ defined by $\widetilde{\varphi}(f) = f \circ \varphi$.

(2) Every k-algebra homomorphism $\Phi : k[W] \to k[V]$ is induced by a unique
morphism $\varphi : V \to W$, i.e., $\Phi = \widetilde{\varphi}$.

(3) If $\varphi : V \to W$ and $\psi : W \to U$ are morphisms of affine algebraic sets, then
$\widetilde{\psi \circ \varphi} = \widetilde{\varphi} \circ \widetilde{\psi} : k[U] \to k[V]$.

(4) The morphism $\varphi : V \to W$ is an isomorphism if and only if $\widetilde{\varphi} : k[W] \to k[V]$
is a k-algebra isomorphism.

Proof: The proof of (3) is left as an exercise and (4) is then immediate.

Example 

For any infinite field $k$ let $V = \mathbb{A}^1$ and let $W = \mathcal{Z}(x^3 - y^2) = \{(a^2, a^3) \mid a \in k\}$. The
map $\varphi : V \to W$ defined by $\varphi(a) = (a^2, a^3)$ is a morphism from V to W. Note that $\varphi$ is
a bijection. The coordinate rings are $k[V] = k[x]$ and $k[W] = k[x, y]/(x^3 - y^2)$ (by the
computations in a previous example; it is at this point we need $k$ to be infinite) and the
associated k-algebra homomorphism of coordinate rings is determined by

\[ \widetilde{\varphi} : k[W] \to k[V], \qquad x \mapsto x^2, \quad y \mapsto x^3. \]

The image of $\widetilde{\varphi}$ is the subalgebra $k[x^2, x^3] = k + x^2 k[x]$ of $k[x]$, so in particular $\widetilde{\varphi}$ is not
surjective. Hence $\widetilde{\varphi}$ is not an isomorphism of coordinate rings, and it follows that $\varphi$ is not
an isomorphism of algebraic sets, even though the morphism $\varphi$ is a bijective map. The
inverse map is given by $\psi(0, 0) = 0$ and $\psi(a, b) = b/a$ for $b \neq 0$, and this cannot be
achieved by a polynomial map.
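
The description of the image as $k + x^2 k[x]$ comes down to a statement about exponents: under $x \mapsto x^2$, $y \mapsto x^3$ the monomial $x^i y^j$ of $k[W]$ is sent to $x^{2i+3j}$ in $k[V]$, so the image is spanned by the powers $x^e$ with $e = 2i + 3j$. The following minimal Python sketch (the function name and the degree bound 12 are illustrative assumptions) lists these exponents and shows that only $e = 1$ is missed.

```python
def pullback_exponents(max_degree):
    """Exponents e <= max_degree for which x^e is the image of some
    monomial x^i y^j under x -> x^2, y -> x^3, i.e. e = 2i + 3j."""
    return sorted({2 * i + 3 * j
                   for i in range(max_degree // 2 + 1)
                   for j in range(max_degree // 3 + 1)
                   if 2 * i + 3 * j <= max_degree})

exps = pullback_exponents(12)
missing = [e for e in range(13) if e not in exps]
print(exps)
print(missing)
```

Because every element of the image is a $k$-linear combination of such powers, the absence of the exponent 1 is exactly the statement that $x$ is not in the image of the induced map on coordinate rings.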


The bijection in Theorem 6 gives a translation from maps between two geomet- 
rically defined algebraic sets V and W into algebraic maps between their coordinate 
rings. It also allows us to define a morphism intrinsically in terms of V and W without 
explicit reference to the ambient affine spaces containing them: 

Corollary 7. Suppose $\varphi : V \to W$ is a map of affine algebraic sets. Then $\varphi$ is a
morphism if and only if for every $f \in k[W]$ the composite map $f \circ \varphi$ is an element of
$k[V]$ (as a k-valued function on V). When $\varphi$ is a morphism, $\varphi(v) = w$ with $v \in V$ and
$w \in W$ if and only if $\widetilde{\varphi}^{-1}(\mathcal{I}(\{v\})) = \mathcal{I}(\{w\})$.

Proof: We first prove that if $\varphi$ is any map from V to W such that $\widetilde{\varphi}$ is a k-algebra
homomorphism then $\varphi(v) = w$ if and only if $\widetilde{\varphi}^{-1}(\mathcal{I}(\{v\})) = \mathcal{I}(\{w\})$, which will in
particular establish the second statement. Note that $\varphi(v) = w$ if and only if every
polynomial $f$ vanishing at $w$ vanishes at $\varphi(v)$ (by property (10) above: $\{w\} = \mathcal{Z}(\mathcal{I}(\{w\}))$).
Since $f$ vanishes at $\varphi(v)$ if and only if $\widetilde{\varphi}(f)$ vanishes at $v$, this is equivalent to the
statement that $\widetilde{\varphi}(f) \in \mathcal{I}(\{v\})$ for every $f \in \mathcal{I}(\{w\})$, i.e., $\widetilde{\varphi}(\mathcal{I}(\{w\})) \subseteq \mathcal{I}(\{v\})$, or
$\mathcal{I}(\{w\}) \subseteq \widetilde{\varphi}^{-1}(\mathcal{I}(\{v\}))$. Since both $\mathcal{I}(\{w\})$ and $\mathcal{I}(\{v\})$ are maximal ideals, this is
equivalent to $\widetilde{\varphi}^{-1}(\mathcal{I}(\{v\})) = \mathcal{I}(\{w\})$.

We now prove the first statement. If $\varphi$ is a morphism, then $f \circ \varphi \in k[V]$ for every
$f \in k[W]$. For the converse, observe first that composition with any map $\varphi : V \to W$
defines a k-algebra homomorphism $\widetilde{\varphi}$ from the k-algebra of k-valued functions on W to
the k-algebra of k-valued functions on V (this is immediate from the pointwise definition
of the addition and multiplication of functions). If $f \circ \varphi \in k[V]$ for every $f \in k[W]$,
then $\widetilde{\varphi}$ is a k-algebra homomorphism from $k[W]$ to $k[V]$, so by Theorem 6, $\widetilde{\varphi} = \widetilde{\Phi}$
for a unique morphism $\Phi : V \to W$. Also, since $\widetilde{\varphi}$ is a k-algebra homomorphism from
$k[W]$ to $k[V]$ it follows by what we have already shown that $\varphi(v) = w$ if and only if
$\widetilde{\varphi}^{-1}(\mathcal{I}(\{v\})) = \mathcal{I}(\{w\})$. Because $\widetilde{\varphi} = \widetilde{\Phi}$, this is equivalent to $\widetilde{\Phi}^{-1}(\mathcal{I}(\{v\})) = \mathcal{I}(\{w\})$,
and so $\Phi(v) = w$. Hence $\varphi$ and $\Phi$ define the same map on V and so $\varphi$ is a morphism,
completing the proof.

Corollary 7 and the last part of Theorem 6 show that the isomorphism type of the
coordinate ring of V (as a k-algebra) does not depend on the embedding of V in a
particular affine n-space.

Computations in Affine Algebraic Sets and k-algebras

The theory of Gröbner bases developed in Section 9.6 is very useful in computations
involving affine algebraic sets, for example in computing in the coordinate rings
$k[\mathbb{A}^n]/\mathcal{I}(V)$. When $n > 1$ it can be difficult to describe the elements in this quotient
ring explicitly. By Theorem 23 in Section 9.6, each polynomial $f$ in $k[\mathbb{A}^n]$ has a unique
remainder after general polynomial division by the elements in a Gröbner basis for
$\mathcal{I}(V)$, and this remainder therefore serves as a unique representative for the coset $\overline{f}$ of
$f$ in the quotient $k[\mathbb{A}^n]/\mathcal{I}(V)$.

Examples 

(1) In the example $W = \mathcal{Z}(x^3 - y^2)$ above, we showed $I = \mathcal{I}(W) = (x^3 - y^2)$ for any
infinite field $k$ and so $k[W] = k[x, y]/(x^3 - y^2)$. Here $x^3 - y^2$ gives a Gröbner basis for
$I$ with respect to the lexicographic monomial ordering with $y > x$, so every polynomial
$f = f(x, y)$ can be written uniquely in the form $f(x, y) = f_0(x) + f_1(x)y + f_I$ with
$f_0(x), f_1(x) \in k[x]$ and $f_I \in I$. Then $f_0(x) + f_1(x)y$ gives a unique representative
for $\overline{f}$ in $k[W]$. With respect to the lexicographic monomial ordering with $x > y$,
$x^3 - y^2$ is again a Gröbner basis for $I$, but now the remainder representing $\overline{f}$ in $k[W]$
is of the form $h_0(y) + h_1(y)x + h_2(y)x^2$.

(2) Let $V = \mathcal{Z}(xz + y^2 + z^2, xy - xz + yz - 2z^2) \subset \mathbb{C}^3$ and $W = \mathcal{Z}(u^3 - uv^2 + v^3) \subset \mathbb{C}^2$.
We shall show later that $I = \mathcal{I}(V) = (xz + y^2 + z^2, xy - xz + yz - 2z^2) \subset \mathbb{C}[x, y, z]$
and $J = \mathcal{I}(W) = (u^3 - uv^2 + v^3) \subset \mathbb{C}[u, v]$. In this case $u^3 - uv^2 + v^3$ gives a
Gröbner basis for $J$ for the lexicographic monomial ordering with $u > v$, similar to
the previous example. The situation for $I$ is more complicated. With respect to the
lexicographic monomial ordering with $x > y > z$ the reduced Gröbner basis for $I$ is
given by

\[ g_1 = xy + y^2 + yz - z^2, \quad g_2 = xz + y^2 + z^2, \quad g_3 = y^3 - y^2 z + z^3. \]

Unique representatives for $\mathbb{C}[V] = \mathbb{C}[x, y, z]/(xz + y^2 + z^2, xy - xz + yz - 2z^2)$
are given by the remainders after general polynomial division by $\{g_1, g_2, g_3\}$.
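
That $g_1$, $g_2$, $g_3$ at least lie in $I$ can be confirmed by exhibiting them as combinations of the two original generators $f_1 = xz + y^2 + z^2$ and $f_2 = xy - xz + yz - 2z^2$: one has $g_2 = f_1$, $g_1 = f_1 + f_2$ and $g_3 = (y - z) f_1 - z f_2$. The following minimal Python sketch checks the last two identities with a small hand-rolled polynomial representation (dictionaries from exponent triples to integer coefficients); the helper names `poly`, `add`, `mul` are hypothetical, and the sketch does not attempt to verify that the three polynomials form a Gröbner basis.

```python
from collections import defaultdict

def poly(*terms):
    """Build a polynomial in x, y, z from (coeff, (i, j, k)) terms."""
    p = defaultdict(int)
    for c, e in terms:
        p[e] += c
    return {e: c for e, c in p.items() if c != 0}

def add(p, q, sign=1):
    """Return p + sign*q, dropping zero coefficients."""
    r = defaultdict(int, p)
    for e, c in q.items():
        r[e] += sign * c
    return {e: c for e, c in r.items() if c != 0}

def mul(p, q):
    """Return the product p*q."""
    r = defaultdict(int)
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            r[tuple(a + b for a, b in zip(e1, e2))] += c1 * c2
    return {e: c for e, c in r.items() if c != 0}

# Exponent tuples are (deg x, deg y, deg z).
y, z = poly((1, (0, 1, 0))), poly((1, (0, 0, 1)))
f1 = poly((1, (1, 0, 1)), (1, (0, 2, 0)), (1, (0, 0, 2)))     # xz + y^2 + z^2
f2 = poly((1, (1, 1, 0)), (-1, (1, 0, 1)),
          (1, (0, 1, 1)), (-2, (0, 0, 2)))                     # xy - xz + yz - 2z^2
g1 = poly((1, (1, 1, 0)), (1, (0, 2, 0)),
          (1, (0, 1, 1)), (-1, (0, 0, 2)))                     # xy + y^2 + yz - z^2
g3 = poly((1, (0, 3, 0)), (-1, (0, 2, 1)), (1, (0, 0, 3)))     # y^3 - y^2 z + z^3

g1_check = add(f1, f2)                                   # f1 + f2
g3_check = add(mul(add(y, z, -1), f1), mul(z, f2), -1)   # (y - z) f1 - z f2
print(g1_check == g1, g3_check == g3)
```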

We saw already in Section 9.6 that Gröbner bases and elimination theory can be used
in the explicit computation of affine algebraic sets $\mathcal{Z}(S)$, or, equivalently, in explicitly
solving systems of algebraic equations. The same theory can be used to determine
explicitly a set of generators for the image and kernel of a k-algebra homomorphism

\[ \Phi : k[y_1, \ldots, y_m]/J \to k[x_1, \ldots, x_n]/I \]

where $I$ and $J$ are ideals. In the particular case when $I = \mathcal{I}(V)$ and $J = \mathcal{I}(W)$ are
the ideals associated to affine algebraic sets $V \subseteq \mathbb{A}^n$ and $W \subseteq \mathbb{A}^m$ then by Theorem
6, the k-algebra homomorphism $\Phi$ corresponds to a morphism from V to W, and we
shall apply the results here to affine algebraic sets in Section 3.

For $1 \leq i \leq m$, let $\varphi_i \in k[x_1, \ldots, x_n]$ be any polynomial representing the coset
$\Phi(\overline{y_i})$, where as usual we use a bar to denote the coset of an element in a quotient.
The polynomials $\varphi_1, \ldots, \varphi_m$ are unique up to elements of $I$. Then the image
of a coset $f(y_1, \ldots, y_m) + J$ under $\Phi$ is the coset $f(\varphi_1, \ldots, \varphi_m) + I$. Given any
$\varphi_1, \ldots, \varphi_m$, the map sending $y_i$ to $\varphi_i$ induces a k-algebra homomorphism $\Phi$ if and only
if $f(\varphi_1, \ldots, \varphi_m) \in I$ for every $f \in J$, a condition which can be checked on a set of
generators for $J$.

Proposition 8. With notation as above, let $R = k[y_1, \ldots, y_m, x_1, \ldots, x_n]$ and let A be
the ideal generated by $y_1 - \varphi_1, \ldots, y_m - \varphi_m$ together with generators for $I$. Let G be
the reduced Gröbner basis of A with respect to the lexicographic monomial ordering
$x_1 > \cdots > x_n > y_1 > \cdots > y_m$. Then

(a) The kernel of $\Phi$ is $A \cap k[y_1, \ldots, y_m]$ modulo $J$. The elements of G in
$k[y_1, \ldots, y_m]$ (taken modulo $J$) generate $\ker \Phi$.

(b) If $f \in k[x_1, \ldots, x_n]$, then $\overline{f}$ is in the image of $\Phi$ if and only if the remainder
after general polynomial division of $f$ by the elements in G is an element
$h \in k[y_1, \ldots, y_m]$, in which case $\Phi(\overline{h}) = \overline{f}$.


Proof: If we show $\ker \Phi = A \cap k[y_1, \ldots, y_m]$ modulo $J$ then (a) follows by
Proposition 30 in Section 9.6. Suppose first that $f \in A \cap k[y_1, \ldots, y_m]$. If $f_1, \ldots, f_s$
are generators for $I$ in $k[x_1, \ldots, x_n]$, then

\[ f(y_1, \ldots, y_m) = \sum_{i=1}^{m} a_i (y_i - \varphi_i) + \sum_{j=1}^{s} b_j f_j \]

as polynomials in R, where $a_1, \ldots, a_m, b_1, \ldots, b_s \in R$. Substituting $y_i = \varphi_i$ we see that
$f(\varphi_1, \ldots, \varphi_m)$ is an element of $I$. Since $\Phi(\overline{f}) = f(\varphi_1, \ldots, \varphi_m)$ modulo $I$, it follows
that $f$ represents a coset in the kernel of $\Phi$. Conversely, suppose $f \in k[y_1, \ldots, y_m]$
represents an element in $\ker \Phi$. Then $f(\varphi_1, \ldots, \varphi_m) \in I$ (in $k[x_1, \ldots, x_n]$) and so also
$f(\varphi_1, \ldots, \varphi_m) \in A$ (in R). Since $y_i - \varphi_i \in A$,

\[ f(y_1, \ldots, y_m) \equiv f(\varphi_1, \ldots, \varphi_m) \equiv 0 \bmod A \]

so $f \in A \cap k[y_1, \ldots, y_m]$.

For (b), suppose first that $f \in k[x_1, \ldots, x_n]$ represents an element in the image of
$\Phi$, i.e., $\overline{f} = \Phi(\overline{h})$ for some polynomial $h \in k[y_1, \ldots, y_m]$. Then

\[ f(x_1, \ldots, x_n) - h(\varphi_1, \ldots, \varphi_m) \in I \]

as polynomials in $k[x_1, \ldots, x_n]$, and so $f(x_1, \ldots, x_n) - h(\varphi_1, \ldots, \varphi_m) \in A$ as
polynomials in R. As before, since each $y_i - \varphi_i \in A$ it follows that

\[ f(x_1, \ldots, x_n) - h(y_1, \ldots, y_m) \in A. \]

Then $f(x_1, \ldots, x_n)$ and $h(y_1, \ldots, y_m)$ leave the same remainder after general
polynomial division by the elements in G. Since $x_1 > \cdots > x_n > y_1 > \cdots > y_m$,
the remainder of $h(y_1, \ldots, y_m)$ is again a polynomial $h_0$ only involving $y_1, \ldots, y_m$.
Note also that $h - h_0 \in A \cap k[y_1, \ldots, y_m]$ so $h$ and $h_0$ differ by an element in
$\ker \Phi$ by (a), so $\Phi(\overline{h_0}) = \Phi(\overline{h}) = \overline{f}$. For the converse, if $f$ leaves the remainder
$h \in k[y_1, \ldots, y_m]$ after general polynomial division by the elements in G then
$f(x_1, \ldots, x_n) - h(y_1, \ldots, y_m) \in A$, i.e.,

\[ f(x_1, \ldots, x_n) - h(y_1, \ldots, y_m) = \sum_{i=1}^{m} a_i (y_i - \varphi_i) + \sum_{j=1}^{s} b_j f_j \]

as polynomials in R, where $a_1, \ldots, a_m, b_1, \ldots, b_s \in R$. Substituting $y_i = \varphi_i$ we obtain

\[ f(x_1, \ldots, x_n) - h(\varphi_1, \ldots, \varphi_m) \in I \]

as polynomials in $x_1, \ldots, x_n$, and so $\overline{f} = \Phi(\overline{h})$.

It follows in particular from Proposition 8 that $\Phi$ will be a surjective homomorphism
if and only if for each $i = 1, 2, \ldots, n$, dividing $x_i$ by the elements in the Gröbner basis
G leaves a remainder $h_i$ in $k[y_1, \ldots, y_m]$. In particular, $x_n - h_n$ leaves a remainder
of 0. But this means the leading term of some element $g_n$ in G divides the leading
term of $x_n - h_n$, and since $x_1 > \cdots > x_n > y_1 > \cdots > y_m$ by the choice of the
ordering, the leading term of $x_n - h_n$ is just $x_n$. It follows that $LT(g_n) = x_n$ and so
$g_n = x_n - h_{n,0} \in G$ for some $h_{n,0} \in k[y_1, \ldots, y_m]$ (in fact $h_{n,0}$ is the remainder of $h_n$
after division by the elements in G). Next, since $x_{n-1} - h_{n-1}$ leaves a remainder of 0,
there is an element $g_{n-1}$ in G whose leading term is $x_{n-1}$. Since G is a reduced Gröbner
basis and $g_n \in G$, the leading term of $g_n$, i.e., $x_n$, does not divide any of the terms in
$g_{n-1}$ and it follows that $g_{n-1} = x_{n-1} - h_{n-1,0} \in G$ for some $h_{n-1,0} \in k[y_1, \ldots, y_m]$.
Proceeding in a similar fashion we obtain the following corollary, showing that whether
$\Phi$ is surjective can be seen immediately from the elements in the reduced Gröbner basis.

Corollary 9. The map $\Phi$ is surjective if and only if for each $i$, $1 \leq i \leq n$, the reduced
Gröbner basis G contains a polynomial $x_i - h_i$ where $h_i \in k[y_1, \ldots, y_m]$.

Examples 

(1) Let $\Phi : \mathbb{Q}[u, v] \to \mathbb{Q}[x]$ be defined by $\Phi(u) = x^2 + x$ and $\Phi(v) = x^3$. The reduced
Gröbner basis G for the ideal $A = (u - x^2 - x, v - x^3)$ with respect to the lexicographic
monomial ordering $x > u > v$ is

\[ g_1 = x^2 + x - u, \qquad g_3 = vx - x - u^2 + u + 2v, \]
\[ g_2 = ux + x - u - v, \qquad g_4 = u^3 - 3uv - v^2 - v. \]

The kernel of $\Phi$ is the ideal generated by $G \cap \mathbb{Q}[u, v] = \{g_4\}$. By Corollary 9, we
see that $\Phi$ is not surjective. The remainder after general polynomial division of $x^4$
by $\{g_1, g_2, g_3, g_4\}$ is $x + u^2 - u - 2v \notin \mathbb{Q}[u, v]$, so $x^4$ is not in the image of $\Phi$. The
remainder of $x^5 + x$ is $-u^2 + uv + u + 2v \in \mathbb{Q}[u, v]$ so $x^5 + x = \Phi(-u^2 + uv + u + 2v)$
is in the image of $\Phi$, as a quick check will confirm.

(2) Let $V = \mathcal{Z}(I) \subset \mathbb{C}^3$ and $W = \mathcal{Z}(J) \subset \mathbb{C}^2$ where $I = (xz + y^2 + z^2, xy - xz + yz - 2z^2)$
and $J = (u^3 - uv^2 + v^3)$ as in Example 2 following Corollary 7. Then the map
$\varphi : V \to W$ defined by $\varphi((a, b, c)) = (c, b)$ is a morphism from V to W. To see this,
we must check that $(c, b) \in W$ if $(a, b, c) \in V$. Equivalently, by Theorem 6, we must
check that the map

\[ \widetilde{\varphi} : \mathbb{C}[u, v]/(u^3 - uv^2 + v^3) \to \mathbb{C}[x, y, z]/(xz + y^2 + z^2, xy - xz + yz - 2z^2) \]

induced by mapping $u$ to $z$ and $v$ to $y$ is a $\mathbb{C}$-algebra homomorphism. This in turn is
equivalent to verifying that $f = z^3 - zy^2 + y^3$ is an element of the ideal $I$. In this
case $f$ is actually an element in the reduced Gröbner basis for $I$:

\[ xy + y^2 + yz - z^2, \quad xz + y^2 + z^2, \quad y^3 - y^2 z + z^3, \]

so certainly $f \in I$. (Note that dividing $f$ by the original two generators for $I$ leaves
the nonzero remainder $f$ itself, from which it is much less clear that $f \in I$, so it is
important to use a Gröbner basis when working in coordinate rings.)

(3) In the previous example, let $A = (u - z,\ v - y,\ xz + y^2 + z^2,\ xy - xz + yz - 2z^2) \subset \mathbb{C}[u, v, x, y, z]$
as in Proposition 8. With respect to the lexicographic monomial
ordering $x > y > z > u > v$ the reduced Gröbner basis $G$ for $A$ is

$xu + u^2 + v^2$, $xv - u^2 + uv + v^2$, $y - v$, $z - u$, $u^3 - uv^2 + v^3$.

By Proposition 8, we see that $\ker \tilde{\varphi}$ is generated by $u^3 - uv^2 + v^3 \equiv 0 \bmod J$, so $\tilde{\varphi}$ is
injective. Since there is no element of the form $x - h(u, v)$ in $G$, $\tilde{\varphi}$ is not surjective
(in fact $x$ is not in the image).
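One way to see where the first two basis elements come from is to substitute $z = u$ and $y = v$, which is legitimate modulo $A$, into the two polynomials $xz + y^2 + z^2$ and $xy - xz + yz - 2z^2$. The following is an illustrative sketch only, not the Buchberger computation itself.

```latex
\begin{align*}
xz + y^2 + z^2 &\equiv xu + u^2 + v^2 \pmod{A}, \\
xy - xz + yz - 2z^2 &\equiv xv - xu + uv - 2u^2 \\
  &\equiv xv + (u^2 + v^2) + uv - 2u^2 \\
  &= xv - u^2 + uv + v^2 \pmod{A},
\end{align*}
```

where the second congruence for the second polynomial uses $xu \equiv -u^2 - v^2$ from the first line.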

As a final example, we use the determination of the kernel of $k$-algebra homomorphisms
to compute minimal polynomials of elements in simple algebraic field extensions.

Proposition 10. Suppose $\alpha$ is a root of the irreducible polynomial $p(x) \in k[x]$ and
$\beta \in k(\alpha)$, say $\beta = f(\alpha)$ for the polynomial $f \in k[x]$. Let $G$ be the reduced Gröbner
basis for the ideal $(p,\ y - f)$ in $k[x, y]$ for the lexicographic monomial ordering $x > y$.
Then the minimal polynomial of $\beta$ over $k$ is the monic polynomial in $G \cap k[y]$.

Proof: The kernel of the $k$-algebra homomorphism $k[y] \to k[x]/(p) \cong k(\alpha)$
defined by mapping $y$ first to $f$ and then to $\beta$ is the principal ideal generated by the
minimal polynomial of $\beta$ in $k[y]$, and the result follows by Proposition 8.

Example

Take $k = \mathbb{Q}$, and let $\beta = 1 + \sqrt[3]{2} + 3\sqrt[3]{4} \in \mathbb{Q}(\sqrt[3]{2})$. Then the ideal $(x^3 - 2,\ y - (1 + x + 3x^2))$
in $\mathbb{Q}[x, y]$ has reduced Gröbner basis $\{53x - 3y^2 + 7y + 32,\ y^3 - 3y^2 - 15y - 93\}$
for the lexicographic monomial ordering $x > y$, so the minimal polynomial for $\beta$ is
$y^3 - 3y^2 - 15y - 93$.
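The two basis elements in this example can be sanity-checked by exact arithmetic in $\mathbb{Q}(\sqrt[3]{2})$. The sketch below assumes elements are stored as coefficient triples with respect to $1, a, a^2$ where $a^3 = 2$; the helper names are hypothetical. It only confirms that both polynomials vanish at $x = a$, $y = \beta$; it does not compute the Gröbner basis.

```python
from fractions import Fraction

# An element c0 + c1*a + c2*a^2 of Q(a), with a^3 = 2, is the triple (c0, c1, c2).

def mul(s, t):
    """Multiply two elements, reducing with a^3 = 2 and a^4 = 2a."""
    raw = [Fraction(0)] * 5
    for i, x in enumerate(s):
        for j, y in enumerate(t):
            raw[i + j] += Fraction(x) * Fraction(y)
    return (raw[0] + 2 * raw[3], raw[1] + 2 * raw[4], raw[2])

def add(s, t):
    return tuple(Fraction(x) + Fraction(y) for x, y in zip(s, t))

def scale(c, s):
    return tuple(Fraction(c) * Fraction(x) for x in s)

ONE = (1, 0, 0)
ALPHA = (0, 1, 0)          # a, the real cube root of 2
BETA = (1, 1, 3)           # beta = 1 + a + 3a^2

beta2 = mul(BETA, BETA)
beta3 = mul(beta2, BETA)

# y^3 - 3y^2 - 15y - 93 evaluated at y = beta.
minpoly_at_beta = add(add(beta3, scale(-3, beta2)),
                      add(scale(-15, BETA), scale(-93, ONE)))

# 53x - 3y^2 + 7y + 32 evaluated at x = a, y = beta.
first_at_point = add(add(scale(53, ALPHA), scale(-3, beta2)),
                     add(scale(7, BETA), scale(32, ONE)))
```

Both `minpoly_at_beta` and `first_at_point` are the zero triple. Since $\beta \notin \mathbb{Q}$ and $[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 3$ is prime, a monic cubic with root $\beta$ is necessarily its minimal polynomial.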


EXERCISES 

Let $R$ be a commutative ring with $1 \neq 0$ and let $k$ be a field.

1. Prove the converse to Hilbert’s Basis Theorem: if the polynomial ring $R[x]$ is Noetherian,
then $R$ is Noetherian.

2. Show that each of the following rings are not Noetherian by exhibiting an explicit infinite 
increasing chain of ideals: 

(a) the ring of continuous real valued functions on [0,1], 

(b) the ring of all functions from any infinite set X to Z/2Z. 

3. Prove that the field $k(x)$ of rational functions over $k$ in the variable $x$ is not a finitely
generated $k$-algebra. (Recall that $k(x)$ is the field of fractions of the polynomial ring $k[x]$.
Note that $k(x)$ is a finitely generated field extension over $k$.)

4. Prove that if $R$ is Noetherian, then so is the ring $R[[x]]$ of formal power series in the
variable $x$ with coefficients from $R$ (cf. Exercise 3, Section 7.2). [Mimic the proof of
Hilbert’s Basis Theorem.]

5. (Fitting’s Lemma) Suppose $M$ is a Noetherian $R$-module and $\varphi : M \to M$ is an $R$-module
endomorphism of $M$. Prove that $\ker(\varphi^n) \cap \operatorname{image}(\varphi^n) = 0$ for $n$ sufficiently large. Show
that if $\varphi$ is surjective, then $\varphi$ is an isomorphism. [Observe that $\ker(\varphi) \subseteq \ker(\varphi^2) \subseteq \cdots$.]

6. Suppose that $0 \to M' \to M \to M'' \to 0$ is an exact sequence of $R$-modules. Prove
that $M$ is a Noetherian $R$-module if and only if $M'$ and $M''$ are Noetherian $R$-modules.

7. Prove that submodules, quotient modules, and finite direct sums of Noetherian $R$-modules
are again Noetherian $R$-modules.

8. If $R$ is a Noetherian ring, prove that $M$ is a Noetherian $R$-module if and only if $M$ is a
finitely generated $R$-module. (Thus any submodule of a finitely generated module over a
Noetherian ring is also finitely generated.)

9. For $k$ a field show that any subring of the polynomial ring $k[x]$ containing $k$ is Noetherian.
Give an example to show such subrings need not be U.F.D.s. [If $k \subset R \subseteq k[x]$ and
$y \in R - k$ show that $k[x]$ is a finitely generated $k[y]$-module; then use the previous two
exercises. For the second, consider $k[x^2, x^3]$.]

10. Prove that the subring $k[x, x^2 y, x^3 y^2, \dots, x^i y^{i-1}, \dots]$ of the polynomial ring $k[x, y]$ is not
a Noetherian ring, hence not a finitely generated $k$-algebra. (Thus subrings of Noetherian
rings need not be Noetherian and subalgebras of finitely generated $k$-algebras need not be
finitely generated.)

11. Suppose $R$ is a commutative ring in which all the prime ideals are finitely generated. This
exercise proves that $R$ is Noetherian.
(a) Prove that if the collection of ideals of $R$ that are not finitely generated is nonempty,
then it contains a maximal element $I$, and that $R/I$ is a Noetherian ring.

(b) Prove that there are finitely generated ideals $J_1$ and $J_2$ containing $I$ with $J_1 J_2 \subseteq I$
and that $J_1 J_2$ is finitely generated. [Observe that $I$ is not a prime ideal.]

(c) Prove that $I/J_1 J_2$ is a finitely generated $R/I$-submodule of $J_1/J_1 J_2$. [Use Exercise 8.]

(d) Show that (c) implies the contradiction that $I$ would be finitely generated over $R$ and
deduce that $R$ is Noetherian.

12. Suppose $R$ is a Noetherian ring and $S$ is a finitely generated $R$-algebra. If $T \subseteq S$ is an
$R$-algebra such that $S$ is a finitely generated $T$-module, prove that $T$ is a finitely generated
$R$-algebra. [If $s_1, \dots, s_n$ generate $S$ as an $R$-algebra, and $s'_1, \dots, s'_m$ generate $S$ as a $T$-module,
show that the elements $s_i$ and $s'_i s'_j$ can be written as finite $T$-linear combinations of
the $s'_i$. If $T_0$ is the $R$-subalgebra generated by the coefficients of these linear combinations,
show $S$ (hence $T$) is finitely generated (by the $s'_i$) as a $T_0$-module, and conclude that $T$ is
finitely generated as an $R$-algebra.]

13. Verify properties (1) to (10) of the maps $\mathcal{Z}$ and $\mathcal{I}$.

14. Show that the affine algebraic sets in $\mathbb{A}^1$ over any field $k$ are $\emptyset$, $k$, and finite subsets of $k$.

15. If $k = \mathbb{F}_2$ and $V = \{(0, 0), (1, 1)\} \subset \mathbb{A}^2$, show that $\mathcal{I}(V)$ is the product ideal $m_1 m_2$ where
$m_1 = (x, y)$ and $m_2 = (x - 1, y - 1)$.

16. Suppose that $V$ is a finite algebraic set in $\mathbb{A}^n$. If $V$ has $m$ points, prove that $k[V]$ is
isomorphic as a $k$-algebra to $k^m$. [Use the Chinese Remainder Theorem.]

17. If $k$ is a finite field show that every subset of $\mathbb{A}^n$ is an affine algebraic set.

18. If $k = \mathbb{F}_q$ is the finite field with $q$ elements show that $\mathcal{I}(\mathbb{A}^1) = (x^q - x) \subset k[x]$.

19. For each nonconstant $f \in k[x]$ describe $\mathcal{Z}(f) \subseteq \mathbb{A}^1$ in terms of the unique factorization
of $f$ in $k[x]$, and then use this to describe $\mathcal{I}(\mathcal{Z}(f))$. Deduce that $\mathcal{I}(\mathcal{Z}(f)) = (f)$ if and
only if $f$ is the product of distinct linear factors in $k[x]$.

20. If $f$ and $g$ are irreducible polynomials in $k[x, y]$ that are not associates (do not divide each
other), show that $\mathcal{Z}((f, g))$ is either $\emptyset$ or a finite set in $\mathbb{A}^2$. [If $(f, g) \neq (1)$, show $(f, g)$
contains a nonzero polynomial in $k[x]$ (and similarly a nonzero polynomial in $k[y]$) by
letting $R = k[x]$, $F = k(x)$, and applying Gauss’s Lemma to show $f$ and $g$ are relatively
prime in $F[y]$.]

21. Identify each $2 \times 2$ matrix $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$ with entries from $k$ with the point $(a, b, c, d)$ in $\mathbb{A}^4$.
Show that the group $SL_2(k)$ of matrices of determinant 1 is an algebraic set in $\mathbb{A}^4$.

22. Prove that $SL_n(k)$ is an affine algebraic set in $\mathbb{A}^{n^2}$. [Generalize the preceding exercise.]

23. Let $V$ be any line in $\mathbb{R}^2$ (the zero set of any nonzero linear polynomial $ax + by - c$).
Prove that $\mathbb{R}[V]$ is isomorphic as an $\mathbb{R}$-algebra to the polynomial ring $\mathbb{R}[x]$, and give the
corresponding isomorphism from $\mathbb{A}^1$ to $V$.

24. Let $V = \mathcal{Z}(xy - z) \subseteq \mathbb{A}^3$. Prove that $V$ is isomorphic to $\mathbb{A}^2$ and provide an explicit
isomorphism $\varphi$ and associated $k$-algebra isomorphism $\tilde{\varphi}$ from $k[V]$ to $k[\mathbb{A}^2]$, along with
their inverses. Is $V = \mathcal{Z}(xy - z^2)$ isomorphic to $\mathbb{A}^2$?

25. Suppose $V \subseteq \mathbb{A}^n$ is an affine algebraic set and $f \in k[V]$. The graph of $f$ is the collection
of points $\{(a_1, \dots, a_n, f(a_1, \dots, a_n))\}$ in $\mathbb{A}^{n+1}$. Prove that the graph of $f$ is an affine
algebraic set isomorphic to $V$. [The morphism in one direction maps $(a_1, \dots, a_n)$ to
$(a_1, \dots, a_n, f(a_1, \dots, a_n))$.]

26. Let $V = \mathcal{Z}(xz - y^2,\ yz - x^3,\ z^2 - x^2 y) \subseteq \mathbb{A}^3$.

(a) Prove that the map $\varphi : \mathbb{A}^1 \to V$ defined by $\varphi(t) = (t^3, t^4, t^5)$ is a surjective morphism.
[For the surjectivity, if $(x, y, z) \neq (0, 0, 0)$, let $t = y/x$.]

(b) Describe the corresponding $k$-algebra homomorphism $\tilde{\varphi} : k[V] \to k[\mathbb{A}^1]$ explicitly.

(c) Prove that $\varphi$ is not an isomorphism.

27. Suppose $\varphi : V \to W$ is a morphism of affine algebraic sets. If $W'$ is an affine algebraic
subset of $W$ prove that the preimage $V' = \varphi^{-1}(W')$ of $W'$ in $V$ is an affine algebraic
subset of $V$. If $W' = \mathcal{Z}(I)$ show that $V' = \mathcal{Z}(\tilde{\varphi}(I))$ for the corresponding morphism
$\tilde{\varphi} : k[W] \to k[V]$.

28. Prove that if $V$ and $W$ are affine algebraic sets, then so is $V \times W$ and $k[V \times W] \cong
k[V] \otimes_k k[W]$.


The following seven exercises introduce the notion of the associated primes of an $R$-module
$M$. Cf. also Exercises 30-40 in Section 4 and Exercises 25-30 in Section 5.

Definition. A prime ideal $P$ of $R$ is said to be associated to the $R$-module $M$ (sometimes
called an assassin for $M$) if $P$ is the annihilator of some element $m$ of $M$, i.e., if $M$ contains
a submodule $Rm$ isomorphic to $R/P$. The collection of associated primes for $M$ is denoted
$\operatorname{Ass}_R(M)$.

When $M = I$ is an ideal in $R$, it is customary to abuse the terminology and refer instead to
the elements of $\operatorname{Ass}_R(R/I)$ (rather than the less interesting collection $\operatorname{Ass}_R(I)$) as the primes
associated to $I$. (Cf. Exercises 28-29 in Section 5.)


29. If $R = \mathbb{Z}$ and $M = \mathbb{Z}/n\mathbb{Z}$, show that $\operatorname{Ass}_R(M)$ consists of the prime ideals $(p)$ for the
prime divisors $p$ of $n$.

30. If $M$ is the union of some collection of submodules $M_i$, prove that $\operatorname{Ass}_R(M)$ is the union
of the collection $\operatorname{Ass}_R(M_i)$.

31. Suppose that $\operatorname{Ann}(m) = P$, i.e., that $Rm \cong R/P$. Prove that if $0 \neq m' \in Rm$ then
$\operatorname{Ann}(m') = P$. Deduce that $\operatorname{Ass}_R(R/P) = \{P\}$. [Observe that $R/P$ is an integral domain.]

32. Suppose that $M$ is an $R$-module and that $P$ is a maximal element in the collection of ideals
of the form $\operatorname{Ann}(m)$, for $m \in M$. Prove that $P$ is a prime ideal. [If $P = \operatorname{Ann}(m)$ and
$ab \in P$, show that $bm \neq 0$ implies $\operatorname{Ann}(m) \subseteq \operatorname{Ann}(bm)$ and use the maximality of $P$ to
deduce that $a \in \operatorname{Ann}(bm) = P$.]

33. Suppose $R$ is a Noetherian ring and $M \neq 0$ is an $R$-module. Prove that $\operatorname{Ass}_R(M) \neq \emptyset$.
[Use Exercise 32.]

34. If $L$ is a submodule of $M$ with quotient $N = M/L$, prove that there are containments
$\operatorname{Ass}_R(L) \subseteq \operatorname{Ass}_R(M) \subseteq \operatorname{Ass}_R(L) \cup \operatorname{Ass}_R(N)$, and show that both containments can be
proper. [If $Rm \cong R/P$, show that $Rm \cap L = 0$ implies $P \in \operatorname{Ass}_R(N)$ and if $Rm \cap L \neq 0$
then $P \in \operatorname{Ass}_R(L)$ (by Exercise 31). For the second statement, consider $n\mathbb{Z} \subset \mathbb{Z}$.]

35. Suppose $M$ is an $R$-module and let $\mathcal{S}$ be a subset of the prime ideals in $\operatorname{Ass}_R(M)$. Prove
there is a submodule $N$ of $M$ with $\operatorname{Ass}_R(N) = \mathcal{S}$ and $\operatorname{Ass}_R(M/N) = \operatorname{Ass}_R(M) - \mathcal{S}$.
[Consider the collection of submodules $N'$ of $M$ with $\operatorname{Ass}_R(N') \subseteq \mathcal{S}$. Use Exercise 30 and
Zorn’s Lemma to show that there is a maximal submodule $N$ subject to $\operatorname{Ass}_R(N) \subseteq \mathcal{S}$. If
$P \in \operatorname{Ass}_R(M/N)$, there is a submodule $M'/N \cong R/P$. Use the previous exercise to show
that $\operatorname{Ass}_R(M') \subseteq \operatorname{Ass}_R(N) \cup \{P\}$ and then use maximality of $N$ to show $P \in \operatorname{Ass}_R(M) - \mathcal{S}$,
so that $\operatorname{Ass}_R(M/N) \subseteq \operatorname{Ass}_R(M) - \mathcal{S}$ and $\operatorname{Ass}_R(N) \subseteq \mathcal{S}$. Use the previous exercise again
to conclude that equality holds in each.]

Suppose $M$ is a finitely generated module over the commutative ring $R$ with generators
$m_1, \dots, m_n$. The Fitting ideal $\mathcal{F}_R(M)$ (of level 0) of $M$ (also called a determinant ideal)
is the ideal in $R$ generated by the determinants of all $n \times n$ matrices $A = (r_{ij})$ where $r_{ij} \in R$
and $r_{i1} m_1 + \cdots + r_{in} m_n = 0$ in $M$, i.e., the rows of $A$ consist of the coefficients in $R$ of relations
among the generators $m_i$ ($A$ is called an $n \times n$ “relations matrix” for $M$). The following five
exercises outline some of the properties of the Fitting ideal.

36. (a) Show that the Fitting ideal of $M$ is also the ideal in $R$ generated by all the $n \times n$ minors
of all $p \times n$ matrices $A = (r_{ij})$ for $p \geq 1$ whose rows consist of the coefficients in $R$
of relations among the generators $m_i$.

(b) Let $A$ be a fixed $p \times n$ matrix as in (a) and let $A'$ be a $p \times n$ matrix obtained from
$A$ by any elementary row or column operation. Show that the ideal in $R$ generated
by all the $n \times n$ minors of $A$ is the same as the ideal in $R$ generated by all the $n \times n$
minors of $A'$.

37. Suppose $m_1, \dots, m_n$ and $m'_1, \dots, m'_{n'}$ are two sets of $R$-module generators for $M$. Let
$\mathcal{F}$ denote the Fitting ideal for $M$ computed using the generators $m_1, \dots, m_n$ and let $\mathcal{F}'$
denote the Fitting ideal for $M$ computed using the generators $m_1, \dots, m_n, m'_1, \dots, m'_{n'}$.

(a) Show that $m'_s = a_{s,1} m_1 + \cdots + a_{s,n} m_n$ for some $a_{s,1}, \dots, a_{s,n} \in R$, and deduce that
$(-a_{s,1}, \dots, -a_{s,n}, 0, \dots, 0, 1, 0, \dots, 0)$ is a relation among $m_1, \dots, m_n, m'_1, \dots, m'_{n'}$.

(b) If $A = (r_{ij})$ is an $n \times n$ matrix whose rows are the coefficients of relations among
$m_1, \dots, m_n$ show that $\det A = \det A'$ where $A'$ is an $(n + n') \times (n + n')$ matrix whose
rows are the coefficients of relations among $m_1, \dots, m_n, m'_1, \dots, m'_{n'}$. Deduce that
$\mathcal{F} \subseteq \mathcal{F}'$. [Use (a) to find a block upper triangular $A'$ having $A$ in the upper left block
and the $n' \times n'$ identity matrix in the lower right block.]

(c) Prove that $\mathcal{F}' \subseteq \mathcal{F}$ and conclude that $\mathcal{F}' = \mathcal{F}$. [Use the previous exercise.]

(d) Deduce from (c) that the Fitting ideal $\mathcal{F}_R(M)$ of $M$ is an invariant of $M$ that does not
depend on the choice of generators for $M$ used to compute it.

38. All modules in this exercise are assumed finitely generated.

(a) If $M$ can be generated by $n$ elements prove that $\operatorname{Ann}(M)^n \subseteq \mathcal{F}_R(M) \subseteq \operatorname{Ann}(M)$,
where $\operatorname{Ann}(M)$ is the annihilator of $M$ in $R$. [If $A$ is an $n \times n$ relations matrix for $M$,
then $AX = 0$, where $X$ is the column matrix whose entries are $m_1, \dots, m_n$. Multiply
by the adjoint of $A$ to deduce that $\det A$ annihilates $M$.]

(b) If $M = M_1 \times M_2$ is the direct product of the $R$-modules $M_1$ and $M_2$ prove that
$\mathcal{F}_R(M) = \mathcal{F}_R(M_1)\mathcal{F}_R(M_2)$.

(c) If $M = (R/I_1) \times \cdots \times (R/I_n)$ is the direct product of cyclic $R$-modules for ideals $I_i$
in $R$ prove that $\mathcal{F}_R(M) = I_1 I_2 \cdots I_n$.

(d) If $R = \mathbb{Z}$ and $M$ is a finitely generated abelian group show that $\mathcal{F}_{\mathbb{Z}}(M) = 0$ if $M$ is
infinite and $\mathcal{F}_{\mathbb{Z}}(M) = |M|\mathbb{Z}$ if $M$ is finite.

(e) If $I$ is an ideal in $R$ prove that the image of $\mathcal{F}_R(M)$ in the quotient $R/I$ is $\mathcal{F}_{R/I}(M/IM)$.

(f) Prove that $\mathcal{F}_R(M/IM) \subseteq (\mathcal{F}_R(M), I) \subseteq R$.

(g) If $\varphi : M \to M'$ is a surjective $R$-module homomorphism prove $\mathcal{F}_R(M) \subseteq \mathcal{F}_R(M')$.

(h) If $0 \to L \to M \to N \to 0$ is a short exact sequence of $R$-modules, prove that
$\mathcal{F}_R(L)\mathcal{F}_R(N) \subseteq \mathcal{F}_R(M)$.

(i) Suppose $R$ is the polynomial ring $k[x, y, z]$ over the field $k$. Let $M = R/(x, y^2, yz, z^2)$
and let $L$ be the submodule $(x, y, z)/(x, y^2, yz, z^2)$ of $M$. Prove that $\mathcal{F}_R(M)$ is
$(x, y^2, yz, z^2)$ and $\mathcal{F}_R(L)$ is $(x, y, z)^2$. (This shows that in general the Fitting ideal
of a submodule $L$ of $M$ need not contain the Fitting ideal for $M$.)

39. Suppose $M$ is an $R$-module and that $\varphi : R^n \to M$ is a surjective $R$-module homomorphism
(i.e., $M$ can be generated by $n$ elements). Let $L = \ker \varphi$. Prove that the image of the
$R$-module homomorphism from $\bigwedge^n(L) \to \bigwedge^n(R^n) \cong R$ induced by the inclusion of $L$
in $R^n$ is the Fitting ideal $\mathcal{F}_R(M)$.

40. Suppose $R$ and $S$ are commutative rings, $\varphi : R \to S$ is a ring homomorphism, $M$ is a
finitely generated $R$-module, and $M' = S \otimes_R M$ is the $S$-module obtained by extending
scalars from $R$ to $S$. Prove that the Fitting ideal $\mathcal{F}_S(M')$ for $M'$ over $S$ is the extension to
$S$ of the Fitting ideal $\mathcal{F}_R(M)$ for $M$ over $R$.

The following two exercises indicate how the remainder in Theorem 23 of Chapter 9 can be 
used to effect computations in quotients of polynomial rings. 

41. Suppose $\{g_1, \dots, g_m\}$ is a Gröbner basis for the ideal $I$ in $k[x_1, \dots, x_n]$. Prove that the
monomials $m$ not divisible by any $LT(g_i)$, $1 \leq i \leq m$, give a $k$-vector space basis for the
quotient $k[x_1, \dots, x_n]/I$.

42. Let $I = (x^3 y - xy^2 + 1,\ x^2 y^2 - y^3 - 1)$ as in Example 1 following Proposition 9.26.

(a) Use the previous exercise to show that $\{1, y, y^2, y^3\}$ is a basis for the $k$-vector space
$k[x, y]/I$.

(b) Compute the $4 \times 4$ multiplication table for the basis vectors in (a).

43. Suppose $K[x_1, \dots, x_n]$ is a polynomial ring in $n$ variables over a field $K$ and $k$ is a subfield
of $K$. If $I$ is an ideal in $k[x_1, \dots, x_n]$, let $I'$ be the ideal generated by $I$ in $K[x_1, \dots, x_n]$.

(a) If $G$ is a Gröbner basis for the ideal $I$ in $k[x_1, \dots, x_n]$ with respect to some monomial
ordering, show that $G$ is also a Gröbner basis for the ideal $I'$ in $K[x_1, \dots, x_n]$ with
respect to the same monomial ordering. [Use Buchberger’s Criterion.]

(b) Prove that the dimension of the quotient $k[x_1, \dots, x_n]/I$ as a vector space over $k$ is
the same as the dimension of the quotient $K[x_1, \dots, x_n]/I'$ as a vector space over $K$.
[One method: use (a) and Exercise 41.]

(c) Prove that $I = k[x_1, \dots, x_n]$ if and only if $I' = K[x_1, \dots, x_n]$.

44. Let $V = \mathcal{Z}(x^3 - x^2 z - y^2 z)$ and $W = \mathcal{Z}(x^2 + y^2 - z^2)$ in $\mathbb{C}^3$. Then $\mathcal{I}(V) = (x^3 - x^2 z - y^2 z)$
and $\mathcal{I}(W) = (x^2 + y^2 - z^2)$ in $\mathbb{C}[x, y, z]$ (cf. Exercise 23 in Section 3). Show that
$\varphi((a, b, c)) = (a^2 c - b^2 c,\ 2abc,\ -a^3)$ defines a morphism from $V$ to $W$.

45. Let $V = \mathcal{Z}(x^3 + y^3 + 7z^3) \subset \mathbb{C}^3$. Then $\mathcal{I}(V) = (x^3 + y^3 + 7z^3)$ in $\mathbb{C}[x, y, z]$ (cf.
Exercise 24 in Section 3).

(a) Show that

$\tilde{\varphi}(x) = x(y^3 - 7z^3)$, $\tilde{\varphi}(y) = y(7z^3 - x^3)$, $\tilde{\varphi}(z) = z(x^3 - y^3)$

defines a $\mathbb{C}$-algebra homomorphism from $k[V]$ to itself.

(b) Let $\varphi : V \to V$ be the morphism corresponding to $\tilde{\varphi}$. Observe that $(-2, 1, 1) \in V$
and compute $\varphi((-2, 1, 1)) \in V$.

(c) Prove there are infinitely many points $(a, b, c)$ on $V$ with $a, b, c \in \mathbb{Z}$ and the greatest
common divisor of $a$, $b$, and $c$ is 1.

46. Let $V = \mathcal{Z}(xz + y^2 + z^2,\ xy - xz + yz - 2z^2) \subset \mathbb{C}^3$ and $W = \mathcal{Z}(u^3 - uv^2 + v^3) \subset \mathbb{C}^2$ as in
Example 2 following Corollary 9. Show that the map $\varphi((a, b)) = (-2a^2 + ab,\ ab - b^2,\ a^2 - ab)$
defines a morphism from $W$ to $V$. Show the corresponding $\mathbb{C}$-algebra homomorphism
from $k[V]$ to $k[W]$ has a kernel generated by $x^2 - 3y^2 + yz$.

47. Define $\Phi : \mathbb{Q}[u, v, w] \to \mathbb{Q}[x, y]$ by $\Phi(u) = x^2 + y$, $\Phi(v) = x + y^2$, and $\Phi(w) = x - y$.
Show that neither $x$ nor $y$ is in the image of $\Phi$. Show that $f = 2x^3 - 4xy - 2y^3 - 4y$ is
in the image of $\Phi$ and find a polynomial in $\mathbb{Q}[u, v, w]$ mapping to $f$. Show that $\ker \Phi$ is
the ideal generated by

$u^2 - 2uv - 2uw^2 + 4uw + v^2 - 2vw^2 - 4vw + w^4 + 3w^2$.

48. Suppose $\alpha$ is a root of the irreducible polynomial $p(x) \in k[x]$ and $\beta = f(\alpha)/g(\alpha)$ with
polynomials $f(x), g(x) \in k[x]$ where $g(\alpha) \neq 0$.

(a) Show $ag + bp = 1$ for some polynomials $a, b \in k[x]$ and show $\beta = h(\alpha)$ where
$h = af$.

(b) Show that the ideals $(p,\ y - h)$ and $(p,\ gy - f)$ are equal in $k[x, y]$.

(c) Conclude that the minimal polynomial for $\beta$ is the monic polynomial in $G \cap k[y]$
where $G$ is the reduced Gröbner basis for the ideal $(p,\ gy - f)$ in $k[x, y]$ for the
lexicographic monomial ordering $x > y$.

(d) Find the minimal polynomial over $\mathbb{Q}$ of $(3 - \sqrt[3]{2} + \sqrt[3]{4})/(1 + 3\sqrt[3]{2} - 3\sqrt[3]{4})$.

15.2 RADICALS AND AFFINE VARIETIES 

Since the zeros of a polynomial $f$ are the same as the zeros of the powers $f^2, f^3, \dots$,
in general there are many different ideals in the ring $k[x_1, x_2, \dots, x_n]$ whose zero loci
define the same algebraic set $V$ in affine $n$-space. This leads to the notion of the radical
of an ideal, which can be defined in any commutative ring:

Definition. Let $I$ be an ideal in a commutative ring $R$.

(1) The radical of $I$, denoted by $\operatorname{rad} I$, is the collection of elements in $R$ some
power of which lie in $I$, i.e.,

$\operatorname{rad} I = \{a \in R \mid a^k \in I \text{ for some } k \geq 1\}$.

(2) The radical of the zero ideal is called the nilradical of $R$.

(3) An ideal $I$ is called a radical ideal if $I = \operatorname{rad} I$.

Note that $a \in R$ is in the nilradical of $R$ if and only if some power of $a$ is 0, so the
nilradical of $R$ is the set of all nilpotent elements of $R$.

Proposition 11. Let $I$ be an ideal in the commutative ring $R$. Then $\operatorname{rad} I$ is an ideal
containing $I$, and $(\operatorname{rad} I)/I$ is the nilradical of $R/I$. In particular, $R/I$ has no nilpotent
elements if and only if $I = \operatorname{rad} I$ is a radical ideal.

Proof: It is clear that $I \subseteq \operatorname{rad} I$. By definition, the nilradical of $R/I$ consists of
the elements in the quotient some power of which is 0. Under the Lattice Isomorphism
Theorem for rings this collection of elements corresponds to the elements of $R$ some
power of which lie in $I$, i.e., $\operatorname{rad} I$. It is therefore sufficient to prove that the nilradical $N$
of any commutative ring $R$ is an ideal. Since $0 \in N$, $N \neq \emptyset$. If $a \in N$ and $r \in R$, then
since $a^n = 0$ for some $n \geq 1$, the commutativity of $R$ implies that $(ra)^n = r^n a^n = 0$,
so $ra \in N$. It remains to see that if $a, b \in N$ then $a + b \in N$. Suppose $a^n = 0$ and
$b^m = 0$. Since the Binomial Theorem holds in the commutative ring $R$ (cf. Exercise 25
in Section 7.3),

$$(a + b)^{n+m} = \sum_{i=0}^{n+m} r_i\, a^i\, b^{n+m-i}$$

for some ring elements $r_i$ (the binomial coefficients in $R$). For each term in this sum
either $i \geq n$ (in which case $a^i = 0$) or $n + m - i \geq m$ (in which case $b^{n+m-i} = 0$).
Hence $(a + b)^{n+m} = 0$, which shows that $a + b$ is nilpotent, i.e., $a + b \in N$.

Proposition 12. The radical of a proper ideal $I$ is the intersection of all prime ideals
containing $I$. In particular, the nilradical is the intersection of all the prime ideals in $R$.

Proof: Passing to $R/I$, Proposition 11 shows that it suffices to prove this result for
$I = 0$, and in this case the statement is that the nilradical $N$ of $R$ is the intersection of
all the prime ideals in $R$. Let $N'$ denote the intersection of all the prime ideals in $R$.

Let $a$ be any nilpotent element in $R$ and let $P$ be any prime ideal. Since $a^k = 0$
for some $k$, there is a smallest positive power $n$ such that $a^n \in P$. Then the product
$a^{n-1} a \in P$, and since $P$ is prime, either $a^{n-1} \in P$ or $a \in P$. The former contradicts
the minimality of $n$, and so $a \in P$. Since $P$ was arbitrary, $a \in N'$, which shows that
$N \subseteq N'$.

We prove the reverse containment $N' \subseteq N$ by showing that if $a \notin N$, then $a \notin N'$.
If $a$ is an element of $R$ not contained in $N$, let $\mathcal{S}$ be the family of all proper ideals not
containing any positive power of $a$. The collection $\mathcal{S}$ is not empty since $0 \in \mathcal{S}$. Also, if
$a^k$ is not contained in any ideal in the chain $I_1 \subseteq I_2 \subseteq \cdots$, then $a^k$ is also not contained
in the union of these ideals, which shows that chains in $\mathcal{S}$ have upper bounds. By Zorn’s
Lemma, $\mathcal{S}$ has a maximal element, $P$. The ideal $P$ must in fact be a prime ideal, as
follows. Suppose for some $x$ and $y$ not contained in $P$, the product $xy$ is an element of
$P$. By the maximality of $P$, $a^n \in (x) + P$ and $a^m \in (y) + P$ for some positive integers
$n$ and $m$. Then $a^{n+m} \in (xy) + P = P$, contradicting the fact that $P$ is an element
of $\mathcal{S}$. This shows that $P$ is indeed a prime ideal not containing $a$, and hence $a \notin N'$,
completing the proof.

Note that in Noetherian rings, Theorem 2 can be used to circumvent the appeal to
Zorn’s Lemma in the preceding proof.

Corollary 13. Prime (and hence also maximal) ideals are radical. 

Proof: If P is a prime ideal, then P is clearly the intersection of all the prime ideals 
containing P, so P = rad P by the proposition. 

Examples 

(1) In the ring of integers $\mathbb{Z}$, the ideal $(a)$ is a radical ideal if and only if $a$ is square-free
or zero. More generally, if $a = p_1^{a_1} p_2^{a_2} \cdots p_r^{a_r}$ with $a_i \geq 1$ for all $i$, is the
prime factorization of the positive integer $a$, then $\operatorname{rad}(a) = (p_1 p_2 \cdots p_r)$. For instance,
$\operatorname{rad}(180) = (30)$. Note that $(p_1), (p_2), \dots, (p_r)$ are precisely the prime ideals
containing the ideal $(a)$ and that their intersection is the ideal $(p_1 p_2 \cdots p_r)$. More
generally, in any U.F.D. $R$, $\operatorname{rad}(a) = (p_1 p_2 \cdots p_r)$ if $a = p_1^{a_1} p_2^{a_2} \cdots p_r^{a_r}$ is the unique
factorization of $a$ into distinct irreducibles.

(2) The ideal $(x^3 - y^2)$ in $k[x, y]$ is a prime ideal (Exercise 14, Section 9.1), hence is
radical.

(3) If $l_1, \dots, l_m$ are linear polynomials in $k[x_1, x_2, \dots, x_n]$ then $I = (l_1, \dots, l_m)$ is either
$k[x_1, x_2, \dots, x_n]$ or a prime ideal, hence $I$ is a radical ideal.
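Example (1) can be checked by brute force for small integers. The sketch below is illustrative only and its function names are hypothetical: it computes the generator of the radical of $(a)$ in the integers from the prime divisors of $a$, and separately lists the nilpotent residues modulo $n$, which by Proposition 11 should be exactly the images of the radical of $(n)$.

```python
def prime_factors(n):
    """Return the distinct prime divisors of a positive integer n."""
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes

def radical_generator(a):
    """Generator of rad((a)) in Z for a positive integer a."""
    g = 1
    for p in prime_factors(a):
        g *= p
    return g

def nilpotents_mod(n):
    """Brute-force the nilradical of Z/nZ: residues some power of which is 0."""
    found = []
    for r in range(n):
        power = r % n
        for _ in range(n):          # far more iterations than any exponent needed
            if power == 0:
                found.append(r)
                break
            power = (power * r) % n
    return found
```

For instance, `radical_generator(180)` returns 30, and `nilpotents_mod(180)` returns the multiples of 30 below 180, in agreement with the example.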

Proposition 14. If $R$ is a Noetherian ring then for any ideal $I$ some positive power
of $\operatorname{rad} I$ is contained in $I$. In particular, the nilradical, $N$, of a Noetherian ring is a
nilpotent ideal: $N^k = 0$ for some $k \geq 1$.

Proof: For any ideal $I$, the ideal $\operatorname{rad} I$ is finitely generated since $R$ is Noetherian. If
$a_1, \dots, a_m$ are generators of $\operatorname{rad} I$, then by definition of the radical, for each $i$ we have
$a_i^{k_i} \in I$ for some positive integer $k_i$. Let $k$ be the maximum of all the $k_i$. Then the ideal
$(\operatorname{rad} I)^{km}$ is generated by elements of the form $a_1^{d_1} a_2^{d_2} \cdots a_m^{d_m}$ where $d_1 + \cdots + d_m = km$,
and each of these elements has at least one factor $a_i^{d_i}$ with $d_i \geq k$. Then $a_i^{d_i} \in I$, hence
each generator of $(\operatorname{rad} I)^{km}$ lies in $I$, and so $(\operatorname{rad} I)^{km} \subseteq I$.
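The counting step in this proof can be spelled out as follows, writing $a_1, \dots, a_m$ for the generators of the radical, $k_i$ for an exponent with $a_i^{k_i}$ in the ideal, and $k$ for the largest $k_i$. The numerical instance with the ideal $(180)$ in the integers is an added illustration, not part of the original proof.

```latex
\begin{align*}
&\text{If } d_i \le k - 1 \text{ for all } i, \text{ then }
  d_1 + \cdots + d_m \le m(k-1) < km, \\
&\text{so } d_1 + \cdots + d_m = km \text{ forces } d_i \ge k \ge k_i
  \text{ for some } i, \text{ and then }
  a_i^{d_i} = a_i^{\,d_i - k_i}\, a_i^{\,k_i} \in I. \\[4pt]
&\text{Example: } I = (180) \subset \mathbb{Z},\quad \operatorname{rad} I = (30),\quad
  m = 1,\ k = 2:\quad 30^2 = 900 = 5 \cdot 180 \in I .
\end{align*}
```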

The Zariski Topology 

We saw in the preceding section that if we restrict to the set of ideals $I$ of $k[\mathbb{A}^n]$ arising
as the ideals associated with some algebraic set $V$, i.e., with $I = \mathcal{I}(V)$, then the maps
$\mathcal{Z}$ (from such ideals to algebraic sets) and $\mathcal{I}$ (from algebraic sets to ideals) are inverses
of each other: $\mathcal{Z}(\mathcal{I}(V)) = V$ and $\mathcal{I}(\mathcal{Z}(I)) = I$. The elements of the ring $k[\mathbb{A}^n]/\mathcal{I}(V)$
give $k$-valued functions on $V$ and, since $k$ has no nilpotent elements, powers of nonzero
functions are also nonzero functions. Put another way, the ring $k[\mathbb{A}^n]/\mathcal{I}(V)$ has no
nilpotent elements, so by Proposition 11, the ideal $\mathcal{I}(V)$ is always a radical ideal.

For arbitrary fields $k$, it is in general not true that every radical ideal is the ideal
of some algebraic set, i.e., of the form $\mathcal{I}(V)$ for some algebraic set $V$. For example,
the ideal $(x^2 + 1)$ in $\mathbb{R}[x]$ is maximal, hence is a radical ideal (by Corollary 13), but
is not the ideal of any algebraic set: if it were, then $x^2 + 1$ would have to vanish on
that set, but $x^2 + 1$ has no zeros in $\mathbb{R}$. A similar construction works for any field $k$ that
is not algebraically closed: there exists an irreducible polynomial $p(x)$ of degree at
least 2 in $k[x]$, which then generates the maximal (hence radical) ideal $(p(x))$ in $k[x]$
that has no zeros in $k$. It is perhaps surprising that the presence of polynomials in one
variable that have no zeros is the only obstruction to a radical ideal (in any number
of variables) not being of the form $\mathcal{I}(V)$. This is shown by the next theorem, which
provides a fundamental connection between “geometry” and “algebra” and shows that
over an algebraically closed field (such as $\mathbb{C}$) every radical ideal is of the form $\mathcal{I}(V)$.
Over these fields the “geometrically defined” ideals $I = \mathcal{I}(V)$ are therefore the same
as the radical ideals, which is a “purely algebraic” property of the ideal $I$ (namely that
$I = \operatorname{rad} I$).

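Theorem. (Hilbert’s Nullstellensatz) Let $E$ be an algebraically closed field. Then
$\mathcal{I}(\mathcal{Z}(I)) = \operatorname{rad} I$ for every ideal $I$ of $E[x_1, x_2, \dots, x_n]$. Moreover, the maps $\mathcal{Z}$ and $\mathcal{I}$
in the correspondence

$$\{\text{affine algebraic sets}\} \;\underset{\mathcal{Z}}{\overset{\mathcal{I}}{\rightleftarrows}}\; \{\text{radical ideals}\}$$

are bijections that are inverses of each other.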

Proof: This will be proved in the next section (cf. Theorem 32). 
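A small illustration of the first statement of the theorem, chosen here and not taken from the text, over the complex numbers in two variables:

```latex
\begin{align*}
I &= (x^2 y) \subset \mathbb{C}[x, y], \\
\mathcal{Z}(I) &= \{(a, b) \in \mathbb{C}^2 : a^2 b = 0\} = \{a = 0\} \cup \{b = 0\}, \\
\mathcal{I}(\mathcal{Z}(I)) &= (xy) = \operatorname{rad} I \supsetneq I .
\end{align*}
```

Here $xy$ lies in the radical because $(xy)^2 = y \cdot x^2 y$ lies in $(x^2 y)$, and $(xy) = (x) \cap (y)$ is an intersection of prime ideals, hence radical.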

Example 

The maps $\mathcal{I}$ and $\mathcal{Z}$ in the Nullstellensatz are defined over any field $k$, and as mentioned
are not bijections if $k$ is not algebraically closed. For any field $k$, however, the map $\mathcal{Z}$ is
always surjective and the map $\mathcal{I}$ is always injective (cf. Exercise 9).

One particular consequence of the Nullstellensatz is that for any proper ideal $I$ we
have $\mathcal{Z}(I) \neq \emptyset$ since $\operatorname{rad} I \neq k[\mathbb{A}^n]$. Hence there always exists at least one common
zero (“nullstellen” in German) for all the polynomials contained in a proper ideal (over
an algebraically closed field).

We next see that the affine algebraic sets define a topology on affine $n$-space. Recall
that a topological space is any set $X$ together with a collection of subsets $\mathcal{T}$ of $X$, called
the closed sets in $X$, satisfying the following axioms:

(i) an arbitrary intersection of closed sets is closed: if $S_i \in \mathcal{T}$ for $i$ in any index set,
then $\bigcap S_i \in \mathcal{T}$,

(ii) a finite union of closed sets is closed: if $S_1, \dots, S_q \in \mathcal{T}$ then $S_1 \cup \cdots \cup S_q \in \mathcal{T}$,
and

(iii) the empty set and the whole space are closed: $\emptyset, X \in \mathcal{T}$.

A subset $U$ of $X$ is called open if its complement, $X - U$, is closed (i.e., $X - U \in \mathcal{T}$).
The axioms for a topological space are often (equivalently) phrased in terms of the
collection of open sets in $X$.

There are many examples of topological spaces, and a wealth of books on topology. 
A fixed set X may have a number of different topologies on it, and the collections of 
closed sets need not be related in these different structures. On any set X there are 
always at least two topologies: the so-called discrete topology in which every subset
of $X$ is closed (i.e., $\mathcal{T}$ is the collection of all subsets of $X$), and the so-called trivial
topology in which the only closed sets are $\emptyset$ and $X$ required by axiom (iii).

Suppose now that $X = \mathbb{A}^n$ is affine $n$-space over an arbitrary field $k$. Then the
collection $\mathcal{T}$ consisting of all the affine algebraic sets in $\mathbb{A}^n$ satisfies the three axioms
for a topological space; these are precisely properties (3), (4) and (5) of algebraic sets
in the preceding section. It follows that these sets can be taken to be the closed sets in
a topology on $\mathbb{A}^n$:

Definition. The Zariski topology on affine $n$-space over an arbitrary field $k$ is the
topology in which the closed sets are the affine algebraic sets in $\mathbb{A}^n$.

The Zariski topology is quite “coarse” in the sense that there are “relatively few”
closed (or open) sets. For example, for the Zariski topology on $\mathbb{A}^1$ the only closed sets
are $\emptyset$, $k$ and the finite sets (cf. Exercise 14 in Section 1), and so the nonempty open
sets are the complements of finite sets. If $k$ is an infinite field it follows that in the
Zariski topology any two nonempty open sets in $\mathbb{A}^1$ have nonempty intersection. In
the language of point-set topology, the Zariski topology is always $T_1$ (points are closed
sets), but for infinite fields the Zariski topology is never $T_2$ (Hausdorff), i.e., two distinct
points never belong to two disjoint open sets (cf. the exercises). For example, when
$k = \mathbb{R}$, a nonempty Zariski open set is just the real line $\mathbb{R}$ with some finite number of
points removed, and any two such sets have (infinitely many) points in common. Note
also that the Zariski open (respectively, closed) sets in $\mathbb{R}$ are also open (respectively,
closed) sets with respect to the usual Euclidean topology. The converse is not true; for
example the interval $[0,1]$ is closed in the Euclidean topology but is not closed in the
Zariski topology. In this sense the Euclidean topology on $\mathbb{R}$ is much “finer”; there are
many more open sets in the Euclidean topology, in fact the collection of Euclidean open
(respectively, closed) sets properly contains the collection of Zariski open (respectively,
closed) sets.
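The claim that two nonempty Zariski open subsets of the affine line always meet when $k$ is infinite is a one-line computation. Here $S$ and $T$ denote finite subsets of $k$, notation introduced only for this sketch.

```latex
\[
(\mathbb{A}^1 - S) \cap (\mathbb{A}^1 - T) = \mathbb{A}^1 - (S \cup T) \neq \emptyset,
\qquad \text{since } S \cup T \text{ is finite and } k \text{ is infinite.}
\]
```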

The Zariski topology on $\mathbb{A}^n$ is defined so that the affine algebraic subsets of $\mathbb{A}^n$
are closed. In other words, the topology is defined by the zero sets of the ideals in the
coordinate ring of $\mathbb{A}^n$. A similar definition can be used to define a Zariski topology
on any algebraic set $V$ in $\mathbb{A}^n$, as follows. If $k[V]$ is the coordinate ring of $V$, then the
distinct elements of $k[V]$ define distinct $k$-valued functions on $V$ and there is a natural
way of defining

$\mathcal{Z} : \{\text{ideals in } k[V]\} \to \{\text{algebraic subsets of } V\}$

$\mathcal{I} : \{\text{subsets of } V\} \to \{\text{ideals in } k[V]\}$

just as for the case $V = \mathbb{A}^n$. For example, if $\bar{J}$ is an ideal in $k[V]$, then $\mathcal{Z}(\bar{J})$ is the set
of elements in $V$ that are common zeros of all the functions in the ideal $\bar{J}$. It is easy to
verify that the resulting zero sets in $V$ satisfy the three axioms for a topological space,
defining a Zariski topology on $V$, where the closed sets are the algebraic subsets, $\mathcal{Z}(\bar{J})$,
for any ideal $\bar{J}$ of $k[V]$. By the Lattice Isomorphism Theorem, the ideals of $k[V]$ are
the ideals of $k[x_1, \dots, x_n]$ that contain $\mathcal{I}(V)$ taken mod $\mathcal{I}(V)$. If $J$ is the complete
preimage in $k[x_1, \dots, x_n]$ of $\bar{J}$, then the locus of $J$ in $\mathbb{A}^n$ is the same as the locus of $\bar{J}$
in $V$. It follows that this definition of the Zariski topology on $V$ is just the subspace
topology for $V \subseteq \mathbb{A}^n$. (Recall that in a topological space $X$, the closed sets with respect
to the subspace topology of a subspace $Y$ are defined to be the sets $C \cap Y$, where $C$ is
a closed set in $X$.) The advantage to the definition of the Zariski topology on $V$ above
is that it is defined intrinsically in terms of the coordinate ring $k[V]$ of $V$, and since the
isomorphism type of $k[V]$ does not depend on the affine space $\mathbb{A}^n$ containing $V$, the
Zariski topology on $V$ also depends only on $V$ and not on the ambient affine space in
which $V$ may be embedded.

If $V$ and $W$ are two affine algebraic spaces, then since a morphism $\varphi : V \to W$ 
is defined by polynomial functions, it is easy to see that $\varphi$ is continuous with respect 
to the Zariski topologies on $V$ and $W$ (cf. Exercise 27 in Section 1, which shows that 
the inverse image of a Zariski closed set under a morphism is Zariski closed). In fact 
the Zariski topology is the coarsest topology in which points are closed and for which 
polynomial maps are continuous. There exist maps that are continuous with respect to 
the Zariski topology that are not morphisms, however (cf. Exercise 17). 

We have the usual topological notions of closure and density with respect to the 
Zariski topology. 

Definition. For any subset $A$ of $\mathbb{A}^n$, the Zariski closure of $A$ is the smallest algebraic 
set containing $A$. If $A \subseteq V$ for an algebraic set $V$ then $A$ is Zariski dense in $V$ if the 
Zariski closure of $A$ is $V$. 

For example, if $k = \mathbb{R}$, the algebraic sets in $\mathbb{A}^1$ are $\emptyset$, $\mathbb{R}$, and finite subsets of $\mathbb{R}$ by 
Exercise 14 in Section 1. The Zariski closure of any infinite set $A$ of real numbers is 
then all of $\mathbb{A}^1$ and $A$ is Zariski dense in $\mathbb{A}^1$. 

Proposition 15. The Zariski closure of a subset $A$ in $\mathbb{A}^n$ is $\mathcal{Z}(\mathcal{I}(A))$. 


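Proof: Certainly $A \subseteq \mathcal{Z}(\mathcal{I}(A))$. Suppose $V$ is any algebraic set containing $A$: 
$A \subseteq V$. Then $\mathcal{I}(V) \subseteq \mathcal{I}(A)$ and $\mathcal{Z}(\mathcal{I}(A)) \subseteq \mathcal{Z}(\mathcal{I}(V)) = V$, so $\mathcal{Z}(\mathcal{I}(A))$ is the 
smallest algebraic set containing $A$. 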

If $\varphi : V \to W$ is a morphism of algebraic sets, the image $\varphi(V)$ of $V$ need not be an 
algebraic subset of $W$, i.e., need not be Zariski closed in $W$. For example the projection 
of the hyperbola $V = \mathcal{Z}(xy - 1)$ in $\mathbb{R}^2$ onto the $x$-axis has image $\mathbb{R}^1 - \{0\}$, which as 
we have just seen is not an affine algebraic set. 

The next result shows that the Zariski closure of the image of a morphism is deter- 
mined by the kernel of the associated $k$-algebra homomorphism. 

Proposition 16. Suppose $\varphi : V \to W$ is a morphism of algebraic sets and $\tilde{\varphi} : k[W] \to k[V]$ 
is the associated $k$-algebra homomorphism of coordinate rings. Then 

(1) The kernel of $\tilde{\varphi}$ is $\mathcal{I}(\varphi(V))$. 

(2) The Zariski closure of $\varphi(V)$ is the zero set in $W$ of $\ker \tilde{\varphi}$. In particular, the 
homomorphism $\tilde{\varphi}$ is injective if and only if $\varphi(V)$ is Zariski dense in $W$. 

Proof: Since $\tilde{\varphi}(f) = f \circ \varphi$, we have $\tilde{\varphi}(f) = 0$ if and only if $(f \circ \varphi)(P) = 0$ for 
all $P \in V$, i.e., $f(Q) = 0$ for all $Q = \varphi(P) \in \varphi(V)$, which is the statement that 
$f \in \mathcal{I}(\varphi(V))$, proving the first statement. Since the Zariski closure of $\varphi(V)$ is the zero 
set of $\mathcal{I}(\varphi(V))$ by the previous proposition, the first statement in (2) follows. 

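If $\tilde{\varphi}$ is injective then the Zariski closure of $\varphi(V)$ is $\mathcal{Z}(0) = W$ and so $\varphi(V)$ is Zariski 
dense. Conversely, suppose $\varphi(V)$ is Zariski dense in $W$, i.e., $\mathcal{Z}(\mathcal{I}(\varphi(V))) = W$. Then 
$\mathcal{I}(\varphi(V)) = \mathcal{I}(\mathcal{Z}(\mathcal{I}(\varphi(V)))) = \mathcal{I}(W) = 0$ and so $\ker \tilde{\varphi} = 0$. 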

By Proposition 16 the ideal of polynomials defining the Zariski closure of the 
image of a morphism $\varphi$ is the kernel of the corresponding $k$-algebra homomorphism $\tilde{\varphi}$ 
in Theorem 6. Proposition 8(1) allows us to compute this kernel using Gröbner bases. 
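
As a small illustration of Proposition 16, the following sketch revisits the projection of the hyperbola $xy = 1$ onto the $x$-axis. It assumes the SymPy library is available and works over $\mathbb{Q}$ rather than $\mathbb{R}$; the coordinate name `u` on the target line and the variable names are ours. Eliminating $x$ and $y$ from the ideal $(u - x, xy - 1)$ should leave no nonzero polynomial in $u$ alone, so the kernel is zero and the image is Zariski dense in $\mathbb{A}^1$ even though it misses the origin.

```python
from sympy import symbols, groebner

x, y, u = symbols("x y u")

# Ideal generated by the graph relation u - x and the hyperbola equation xy - 1.
# The lexicographic order x > y > u eliminates x and y first.
G = groebner([u - x, x*y - 1], x, y, u, order="lex")

# Basis elements involving only u generate the kernel of k[u] -> k[V].
kernel_gens = [g for g in G.exprs if not (g.free_symbols & {x, y})]

print(list(G.exprs))   # the full reduced Groebner basis
print(kernel_gens)     # generators of the kernel (expected to be empty)
```

An empty list of kernel generators corresponds to the zero ideal, whose zero set is all of $\mathbb{A}^1$.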

Example: (Implicitization) 

A morphism $\varphi : \mathbb{A}^n \to \mathbb{A}^m$ is just a map 

$$\varphi((a_1, a_2, \dots, a_n)) = (\varphi_1(a_1, a_2, \dots, a_n), \dots, \varphi_m(a_1, a_2, \dots, a_n))$$

where $\varphi_i$ is a polynomial. If $k$ is an infinite field, then $\mathcal{I}(\mathbb{A}^m)$ and $\mathcal{I}(\mathbb{A}^n)$ are both 0, 
so we may write $k[\mathbb{A}^m] = k[y_1, \dots, y_m]$ and $k[\mathbb{A}^n] = k[x_1, \dots, x_n]$. The $k$-algebra 
homomorphism $\tilde{\varphi} : k[\mathbb{A}^m] \to k[\mathbb{A}^n]$ corresponding to $\varphi$ is then defined by mapping $y_i$ to 
$\varphi_i = \varphi_i(x_1, \dots, x_n)$. The image $\varphi(\mathbb{A}^n)$ consists of the set of points $(b_1, \dots, b_m)$ with 

$$\begin{aligned}
b_1 &= \varphi_1(a_1, a_2, \dots, a_n) \\
b_2 &= \varphi_2(a_1, a_2, \dots, a_n) \\
&\;\;\vdots \\
b_m &= \varphi_m(a_1, a_2, \dots, a_n)
\end{aligned}$$

where $a_i \in k$. This is the collection of points in $\mathbb{A}^m$ parametrized by the functions 
$\varphi_1, \dots, \varphi_m$ (with the $a_i$ as parameters). In general such a parametrized collection of points 
is not an algebraic set. Finding the equations for the smallest algebraic set containing these 
points is referred to as implicitization, since it amounts to finding a (‘smallest’) collection 
of equations satisfied by the $b_i$ (the ‘implicit’ algebraic relations). 

By Proposition 16, this algebraic set is the Zariski closure of $\varphi(\mathbb{A}^n)$ and is the zero set 
of $\ker \tilde{\varphi}$. By Proposition 8 this kernel is given by $\mathcal{A} \cap k[y_1, \dots, y_m]$, where $\mathcal{A}$ is the ideal 
in $k[x_1, \dots, x_n, y_1, \dots, y_m]$ generated by the polynomials $y_1 - \varphi_1, \dots, y_m - \varphi_m$. If we 
compute the reduced Gröbner basis $G$ for $\mathcal{A}$ with respect to the lexicographic monomial 
ordering $x_1 > \cdots > x_n > y_1 > \cdots > y_m$, then the polynomials of $G$ lying in $k[y_1, \dots, y_m]$ 
generate $\ker \tilde{\varphi}$. The zero set of these polynomials is the Zariski closure of $\varphi(\mathbb{A}^n)$, and 
the polynomials therefore give the implicitization. 

For an explicit example, consider the points $A = \{(a^2, a^3) \mid a \in \mathbb{R}\}$ in $\mathbb{R}^2$. Using 
coordinates $x, y$ for $\mathbb{R}^2$ and $t$ for $\mathbb{R}^1$, the ideal $\mathcal{A}$ in $\mathbb{R}[x, y, t]$ is $(x - t^2, y - t^3)$. The only 
element of the reduced Gröbner basis for $\mathcal{A}$ for the ordering $t > x > y$ lying in $\mathbb{R}[x, y]$ is 
$x^3 - y^2$, so $\mathcal{Z}(x^3 - y^2)$ is the smallest algebraic set in $\mathbb{R}^2$ containing $A$. 
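
The computation in this example can be reproduced in a few lines. The sketch below assumes the SymPy library is available and computes over $\mathbb{Q}$ (the reduced Gröbner basis is unchanged on passing to $\mathbb{R}$); the variable names are ours. It builds the ideal $(x - t^2, y - t^3)$, computes the reduced Gröbner basis for the lexicographic ordering $t > x > y$, and keeps the elements not involving $t$.

```python
from sympy import symbols, groebner

t, x, y = symbols("t x y")

# Ideal (x - t^2, y - t^3) with lexicographic order t > x > y,
# so that the parameter t is eliminated first.
G = groebner([x - t**2, y - t**3], t, x, y, order="lex")

# Elements of the reduced basis that lie in Q[x, y] (no t appears).
implicit = [g for g in G.exprs if t not in g.free_symbols]

print(list(G.exprs))
print(implicit)
```

The list `implicit` should consist of the single cuspidal cubic $x^3 - y^2$ found above; the remaining basis elements all involve $t$.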

Example: (Projections of Algebraic Sets) 

Suppose $V \subseteq \mathbb{A}^n$ is an algebraic set and $m < n$. Let $\pi : V \to \mathbb{A}^m$ be the morphism 
projecting onto the first $m$ coordinates: 

$$\pi((a_1, a_2, \dots, a_n)) = (a_1, a_2, \dots, a_m).$$

If we use coordinates $x_1, \dots, x_n$ in $k[V]$ and coordinates $y_1, \dots, y_m$ in $k[\mathbb{A}^m]$, the $k$-algebra 
homomorphism corresponding to $\pi$ is given by the map 

$$\tilde{\pi} : k[y_1, \dots, y_m] \longrightarrow k[x_1, \dots, x_n]/\mathcal{I}(V), \qquad y_i \longmapsto \bar{x}_i.$$

Suppose $V = \mathcal{Z}(I)$ and $I = (f_1, \dots, f_s)$. The Zariski closure of $\pi(V)$ is the zero set of 
$\ker \tilde{\pi} = \mathcal{A} \cap k[y_1, \dots, y_m]$ where $\mathcal{A}$ is the ideal in $k[x_1, \dots, x_n, y_1, \dots, y_m]$ generated 
by the polynomials $y_1 - x_1, \dots, y_m - x_m$ together with a set of generators for $\mathcal{I}(V)$. The 
polynomials involving only $y_1, \dots, y_m$ in the reduced Gröbner basis $G$ for $\mathcal{A}$ with respect 
to the lexicographic monomial ordering $x_1 > \cdots > x_n > y_1 > \cdots > y_m$ generate the ideal 
defining the Zariski closure of $\pi(V)$. 

If $k$ is algebraically closed we can actually do better with the help of the Nullstellensatz, 
which gives $\mathcal{I}(V) = \operatorname{rad} I$. Then it is straightforward to see that we obtain the same zero 
set if in the ideal $\mathcal{A}$ we replace the generators for $\mathcal{I}(V)$ by the generators $f_1, \dots, f_s$ of $I$ 
(cf. Exercise 46). 

For an explicit example, consider projection onto the first two coordinates of $V = 
\mathcal{Z}(xy - z^2, xz - y, x^2 - z)$ in $\mathbb{C}^3$. Using $u, v$ as coordinates in $\mathbb{C}^2$, we find the reduced 
Gröbner basis $G$ for the ideal $(u - x, v - y, xy - z^2, xz - y, x^2 - z)$ for the ordering 
$x > y > z > u > v$ contains only the polynomial $u^3 - v$ in $\mathbb{C}[u, v]$. The smallest algebraic 
set containing $\pi(V)$ is then the cubic $v = u^3$. 
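
The same elimination can be packaged as a small helper. This is a sketch under stated assumptions: it uses the SymPy library, computes over $\mathbb{Q}$ instead of $\mathbb{C}$ (which does not change the reduced Gröbner basis), and the function name `elimination_generators` is hypothetical, introduced only for this illustration.

```python
from sympy import symbols, groebner

x, y, z, u, v = symbols("x y z u v")

def elimination_generators(gens, eliminate, keep):
    """Return the elements of the reduced lex Groebner basis of (gens),
    for the ordering eliminate > keep, that involve only the `keep` variables."""
    G = groebner(gens, *eliminate, *keep, order="lex")
    drop = set(eliminate)
    return [g for g in G.exprs if not (g.free_symbols & drop)]

# Graph relations u - x, v - y together with the generators of the ideal defining V.
closure_eqs = elimination_generators(
    [u - x, v - y, x*y - z**2, x*z - y, x**2 - z],
    eliminate=(x, y, z),
    keep=(u, v),
)
print(closure_eqs)
```

Here `closure_eqs` should contain only the cubic $u^3 - v$, in agreement with the example.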

Affine Varieties 

We next consider the question of whether an algebraic set can be decomposed into 
smaller algebraic sets and the corresponding algebraic formulation in terms of its co- 
ordinate ring. 

Definition. A nonempty affine algebraic set $V$ is called irreducible if it cannot be 
written as $V = V_1 \cup V_2$, where $V_1$ and $V_2$ are proper algebraic sets in $V$. An irreducible 
affine algebraic set is called an affine variety. 

Equivalently, an algebraic set (which is a closed set in the Zariski topology) is 
irreducible if it cannot be written as the union of two proper, closed subsets. 

Proposition 17. 

(1) The affine algebraic set $V$ is irreducible if and only if $\mathcal{I}(V)$ is a prime ideal. 

(2) Every nonempty affine algebraic set $V$ may be written uniquely in the form 

$$V = V_1 \cup V_2 \cup \cdots \cup V_q$$

where each $V_i$ is irreducible, and $V_i \not\subseteq V_j$ for all $j \ne i$ (i.e., the decomposition 
is “minimal” or “irredundant”). 

Proof: Let $I = \mathcal{I}(V)$ and suppose first that $V = V_1 \cup V_2$ is reducible, where $V_1$ and 
$V_2$ are proper closed subsets. Since $V_1 \ne V$, there is some function $f_1$ that vanishes on 
$V_1$ but not on $V$, i.e., $f_1 \in \mathcal{I}(V_1) - I$. Similarly, there is a function $f_2 \in \mathcal{I}(V_2) - I$. Then 
$f_1 f_2$ vanishes on $V_1 \cup V_2 = V$, so $f_1 f_2 \in I$, which shows that $I$ is not a prime ideal. 
Conversely, if $I$ is not a prime ideal, there exist $f_1, f_2 \in k[\mathbb{A}^n]$ such that $f_1 f_2 \in I$ 
but neither $f_1$ nor $f_2$ belongs to $I$. Let $V_1 = \mathcal{Z}(f_1) \cap V$ and $V_2 = \mathcal{Z}(f_2) \cap V$. Since 
the intersection of closed sets is closed, $V_1$ and $V_2$ are algebraic sets. Since neither $f_1$ 
nor $f_2$ vanishes on $V$, both $V_1$ and $V_2$ are proper subsets of $V$. Because $f_1 f_2 \in I$, 
$V \subseteq \mathcal{Z}(f_1 f_2) = \mathcal{Z}(f_1) \cup \mathcal{Z}(f_2)$, so $V = V_1 \cup V_2$ and $V$ is reducible. This proves (1). 

To prove (2), let $\mathcal{S}$ be the collection of nonempty algebraic sets that cannot be written 
as a finite union of irreducible algebraic sets, and suppose by way of contradiction that 
$\mathcal{S} \ne \emptyset$. Let $I_0$ be a maximal element of the corresponding set of ideals, $\{\mathcal{I}(V) \mid V \in \mathcal{S}\}$, 
which exists (by Theorem 2) since $k[\mathbb{A}^n]$ is Noetherian. Then $V_0 = \mathcal{Z}(I_0)$ is a minimal 
element of $\mathcal{S}$. Since $V_0 \in \mathcal{S}$, it cannot be irreducible by the definition of $\mathcal{S}$. On the 
other hand, if $V_0 = V_1 \cup V_2$ for some proper, closed subsets $V_1$, $V_2$ of $V_0$, then by 
the minimality of $V_0$ both $V_1$ and $V_2$ may be written as finite unions of irreducible 
algebraic sets. Then $V_0$ may be written as a finite union of irreducible algebraic sets, a 
contradiction. This proves $\mathcal{S} = \emptyset$, i.e., every affine algebraic set has a decomposition 
into affine varieties. 

To prove uniqueness, suppose $V$ has two decompositions into affine varieties (where 
redundant terms have been removed from each decomposition): 

$$V = V_1 \cup V_2 \cup \cdots \cup V_r = U_1 \cup U_2 \cup \cdots \cup U_s.$$

Then $V_1$ is contained in the union of the $U_i$. Since $V_1 \cap U_i$ is an algebraic set for each 
$i$, we obtain a decomposition of $V_1$ into algebraic subsets: 

$$V_1 = (V_1 \cap U_1) \cup (V_1 \cap U_2) \cup \cdots \cup (V_1 \cap U_s).$$

Since $V_1$ is irreducible, we must have $V_1 = V_1 \cap U_j$ for some $j$, i.e., $V_1 \subseteq U_j$. By 
the symmetric argument we have $U_j \subseteq V_{j'}$ for some $j'$. Thus $V_1 \subseteq V_{j'}$, so $j' = 1$ 
and $V_1 = U_j$. Applying a similar argument for each $V_i$ it follows that $r = s$ and that 
$\{V_1, \dots, V_r\} = \{U_1, \dots, U_s\}$. This completes the proof. 

Corollary 18. An affine algebraic set $V$ is a variety if and only if its coordinate ring 
$k[V]$ is an integral domain. 

Proof: This follows immediately since $\mathcal{I}(V)$ is a prime ideal if and only if the 
quotient $k[V] = k[\mathbb{A}^n]/\mathcal{I}(V)$ is an integral domain (Proposition 13 of Chapter 7). 

Definition. If $V$ is a variety, then the field of fractions of the integral domain $k[V]$ is 
called the field of rational functions on $V$ and is denoted by $k(V)$. The dimension of a 
variety $V$, denoted $\dim V$, is defined to be the transcendence degree of $k(V)$ over $k$. 

Examples 

(1) Single points in $\mathbb{A}^n$ are affine varieties since their corresponding ideals in $k[\mathbb{A}^n]$ are 
maximal ideals. The coordinate ring of a point is isomorphic to $k$, which is also the 
field of rational functions. The dimension of a single point is 0. Any finite set is 
the union of its single point subsets, and this is its unique decomposition into affine 
subvarieties. 

(2) The $x$-axis in $\mathbb{R}^2$ is irreducible since it has coordinate ring $\mathbb{R}[x, y]/(y) \cong \mathbb{R}[x]$, which 
is an integral domain. Similarly, the $y$-axis and, more generally, lines in $\mathbb{R}^2$ are also 
irreducible (cf. Exercise 23 in Section 1). Linear sets in $\mathbb{R}^n$ are affine varieties. The 
field of rational functions on the $x$-axis is the quotient field $\mathbb{R}(x)$ of $\mathbb{R}[x]$, which is 
why $\mathbb{R}(x)$ is called a rational function field. The dimension of the $x$-axis (or, more 
generally, any line) is 1. 

(3) The union of the $x$ and $y$ axes in $\mathbb{R}^2$, namely $\mathcal{Z}(xy)$, is not a variety: $\mathcal{Z}(xy) = \mathcal{Z}(x) \cup 
\mathcal{Z}(y)$ is its unique decomposition into subvarieties. The corresponding coordinate 
ring $\mathbb{R}[x, y]/(xy)$ contains zero divisors. 

(4) The hyperbola $xy = 1$ in $\mathbb{R}^2$ is a variety since we saw in Section 1 that its coordinate 
ring is the integral domain $\mathbb{R}[x, 1/x]$. Note that the two disjoint branches of the 
hyperbola (defined by $x > 0$ and $x < 0$) are not subvarieties (cf. also Exercises 
12-13). 

(5) If $V = \mathcal{Z}(l_1, l_2, \dots, l_m)$ is the zero set of linear polynomials $l_1, \dots, l_m$ in $k[x_1, \dots, x_n]$ 
and $V \ne \emptyset$, then $V$ is an affine variety (called a linear variety). Note that determining 
whether $V \ne \emptyset$ is a linear algebra problem. 

We end this section with some general ring-theoretic results that were originally 
motivated by their connection with decomposition questions in geometry. 

Primary Decomposition of Ideals in Noetherian Rings 

The second statement in Proposition 17 shows that any ideal of the form $\mathcal{I}(V)$ in 
$k[\mathbb{A}^n]$ may be written uniquely as a finite intersection of prime ideals, and by Hilbert’s 
Nullstellensatz this applies in particular to all radical ideals when $k$ is algebraically 
closed. In a large class of commutative rings (including all Noetherian rings) every 
ideal has a primary decomposition, which is a similar decomposition but allows ideals 
that are analogous to “prime powers” (but see the examples below). This decomposition 
can be considered as a generalization of the factorization of an integer $n \in \mathbb{Z}$ into the 
product of prime powers. We shall be primarily concerned with the case of Noetherian 
rings. 

Definition. A proper ideal $Q$ in the commutative ring $R$ is called primary if whenever 
$ab \in Q$ and $a \notin Q$, then $b^n \in Q$ for some positive integer $n$. Equivalently, if $ab \in Q$ 
and $a \notin Q$, then $b \in \operatorname{rad} Q$. 
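
For ideals of $\mathbb{Z}$ the definition can be tested by brute force, since membership in $(n)$ depends only on residues mod $n$. The following sketch (plain Python, with hypothetical helper names chosen for this illustration) checks the defining condition for $(n)$ and compares the outcome with the condition that $n$ is a prime power, consistent with the description of the primary ideals of $\mathbb{Z}$ in the examples that follow.

```python
def is_primary_in_Z(n):
    """Check the definition of a primary ideal for (n) in Z, n >= 2,
    by testing all residues a, b modulo n."""
    for a in range(n):
        for b in range(n):
            if (a * b) % n == 0 and a % n != 0:
                # ab is in (n) and a is not; we need some power b^k in (n).
                # If b is nilpotent mod n, then already b^k = 0 for some k <= n.
                if all(pow(b, k, n) != 0 for k in range(1, n + 1)):
                    return False
    return True

def is_prime_power(n):
    """Return True if n >= 2 is a power of a single prime."""
    p = next(d for d in range(2, n + 1) if n % d == 0)  # smallest prime factor
    while n % p == 0:
        n //= p
    return n == 1

results = {n: is_primary_in_Z(n) for n in range(2, 40)}
print([n for n, ok in results.items() if ok])
```

For instance $(12)$ fails the test because $4 \cdot 3 \in (12)$ and $4 \notin (12)$, while no power of 3 is divisible by 12.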

Some of the basic properties of primary ideals are given in the following proposition. 

Proposition 19. Let $R$ be a commutative ring with 1. 

(1) Prime ideals are primary. 

(2) The ideal $Q$ is primary if and only if every zero divisor in $R/Q$ is nilpotent. 

(3) If $Q$ is primary then $\operatorname{rad} Q$ is a prime ideal, and is the unique smallest prime 
ideal containing $Q$. 

(4) If $Q$ is an ideal whose radical is a maximal ideal, then $Q$ is a primary ideal. 

(5) Suppose $M$ is a maximal ideal and $Q$ is an ideal with $M^n \subseteq Q \subseteq M$ for some 
$n \ge 1$. Then $Q$ is a primary ideal with $\operatorname{rad} Q = M$. 

Proof: The first two statements are immediate from the definition of a primary 
ideal. For (3), suppose $ab \in \operatorname{rad} Q$. Then $a^m b^m = (ab)^m \in Q$ for some $m \ge 1$, and since $Q$ is primary, 
either $a^m \in Q$, in which case $a \in \operatorname{rad} Q$, or $(b^m)^n \in Q$ for some positive integer $n$, in 
which case $b \in \operatorname{rad} Q$. This proves that $\operatorname{rad} Q$ is a prime ideal, and it follows that $\operatorname{rad} Q$ 
is the smallest prime ideal containing $Q$ (Proposition 12). 

To prove (4) we pass to the quotient ring $R/Q$; by (2), it suffices to show that every 
zero divisor in this quotient ring is nilpotent. We are reduced to the situation where 
$Q = (0)$ and $M = \operatorname{rad} Q = \operatorname{rad}(0)$, which is the nilradical, is a maximal ideal. Since 
the nilradical is contained in every prime ideal (Proposition 12), it follows that $M$ is 
the unique prime ideal, so also the unique maximal ideal. If $d$ were a zero divisor, then 
the ideal $(d)$ would be a proper ideal, hence contained in a maximal ideal. This implies 
that $d \in M$, hence every zero divisor is indeed nilpotent. 

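Finally, suppose $M^n \subseteq Q \subseteq M$ for some $n \ge 1$ where $M$ is a maximal ideal. Then 
$Q \subseteq M$ so $\operatorname{rad} Q \subseteq \operatorname{rad} M = M$. Conversely, $M^n \subseteq Q$ shows that $M \subseteq \operatorname{rad} Q$, so 
$\operatorname{rad} Q = M$ is a maximal ideal, and $Q$ is primary by (4). 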

Definition. If Q is a primary ideal, then the prime ideal P = rad Q is called the 
associated prime to Q, and Q is said to belong to P (or to be P -primary). 

It is easy to check that a finite intersection of $P$-primary ideals is again a $P$-primary 
ideal (cf. the exercises). 

Examples 

(1) The primary ideals in $\mathbb{Z}$ are 0 and the ideals $(p^m)$ for $p$ a prime and $m \ge 1$. 

(2) For any field $k$, the ideal $(x)$ in $k[x, y]$ is primary since it is a prime ideal. For any 
$n \ge 1$, the ideal $(x, y)^n$ is primary since it is a power of the maximal ideal $(x, y)$. 

(3) The ideal $Q = (x^2, y)$ in the polynomial ring $k[x, y]$ is primary since we have 
$(x, y)^2 \subseteq (x^2, y) \subseteq (x, y)$. Similarly, $Q' = (4, x)$ in $\mathbb{Z}[x]$ is a $(2, x)$-primary ideal. 

(4) Primary ideals need not be powers of prime ideals. For example, the primary ideal $Q$ 
in the previous example is not the power of a prime ideal, as follows. If $(x^2, y) = P^k$ 
for some prime ideal $P$ and some $k \ge 1$, then $x^2, y \in P^k \subseteq P$ so $x, y \in P$. Then 
$P = (x, y)$, and since $y \notin (x, y)^2$, it would follow that $k = 1$ and $Q = (x, y)$. Since 
$x \notin (x^2, y)$, this is impossible. 

(5) If $R$ is Noetherian, and $Q$ is a primary ideal belonging to the prime ideal $P$, then 

$$P^m \subseteq Q \subseteq P$$

for some $m \ge 1$ by Proposition 14. If $P$ is a maximal ideal, then the last statement 
in Proposition 19 shows that the converse also holds. This is not necessarily true if $P$ 
is a prime ideal that is not maximal. For example, consider the ideal $I = (x^2, xy)$ in 
$k[x, y]$. Then $(x^2) \subseteq I \subseteq (x)$, and $(x)$ is a prime ideal, but $I$ is not primary: $xy \in I$ 
and $x \notin I$, but no positive power of $y$ is an element of $I$. This example also shows 
that an ideal whose radical is prime (but not maximal as in (4) of the proposition) is 
not necessarily primary. 

(6) Powers of prime ideals need not be primary. For example, consider the quotient ring 
$R = \mathbb{R}[x, y, z]/(xy - z^2)$, the coordinate ring of the cone $z^2 = xy$ in $\mathbb{R}^3$, and let 
$P = (\bar{x}, \bar{z})$ be the ideal generated by $\bar{x}$ and $\bar{z}$ in $R$. This is a prime ideal in $R$ since the 
quotient is $R/(\bar{x}, \bar{z}) \cong \mathbb{R}[x, y, z]/(x, z) \cong \mathbb{R}[y]$ (because $(xy - z^2) \subseteq (x, z)$). The 
ideal 

$$P^2 = (\bar{x}^2, \bar{x}\bar{z}, \bar{z}^2) = (\bar{x}^2, \bar{x}\bar{z}, \bar{x}\bar{y}) = \bar{x}(\bar{x}, \bar{y}, \bar{z}),$$

however, is not primary: $\bar{x}\bar{y} = \bar{z}^2 \in P^2$, but $\bar{x} \notin P^2$, and no power of $\bar{y}$ is in $P^2$. Note 
that $P^2$ is another example of an ideal that is not primary whose radical is prime. 

(7) Suppose $R$ is a U.F.D. If $\pi$ is an irreducible element of $R$ then it is easy to see that 
the powers $(\pi^n)$ for $n = 1, 2, \dots$ are $(\pi)$-primary ideals. Conversely, suppose $Q$ is 
a $(\pi)$-primary ideal, and let $n$ be the largest integer with $Q \subseteq (\pi^n)$ (such an integer 
exists since, for example, $\pi^k \in Q$ for some $k \ge 1$, so $n \le k$). If $q$ is an element of $Q$ 
not contained in $(\pi^{n+1})$, then $q = r\pi^n$ for some $r \in R$ and $r \notin (\pi)$. Since $r \notin (\pi)$ 
and $Q$ is $(\pi)$-primary, it follows that $\pi^n \in Q$. This shows that $Q = (\pi^n)$. 

In the examples above, the ideal $(x^2, xy)$ in $k[x, y]$ is not a primary ideal, but it 
can be written as the intersection of primary ideals: $(x^2, xy) = (x) \cap (x, y)^2$. 
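
Since all the ideals in this identity are generated by monomials, the intersection can be checked mechanically. The sketch below is plain Python and relies on a standard fact about monomial ideals that is not proved in the text: the intersection of two monomial ideals is generated by the least common multiples of pairs of generators. A monomial $x^a y^b$ is stored as the exponent pair `(a, b)`; the helper names are ours.

```python
from itertools import product

def lcm(m1, m2):
    """Least common multiple of two monomials given as exponent tuples."""
    return tuple(max(a, b) for a, b in zip(m1, m2))

def divides(m1, m2):
    """True if the monomial m1 divides the monomial m2."""
    return all(a <= b for a, b in zip(m1, m2))

def minimal_generators(gens):
    """Discard generators that are proper multiples of other generators."""
    gens = set(gens)
    return sorted(g for g in gens
                  if not any(h != g and divides(h, g) for h in gens))

def intersect(I, J):
    """Intersection of two monomial ideals via lcms of pairs of generators."""
    return minimal_generators(lcm(m, n) for m, n in product(I, J))

ideal_x = [(1, 0)]                      # (x)
ideal_m2 = [(2, 0), (1, 1), (0, 2)]     # (x, y)^2 = (x^2, xy, y^2)
ideal_I = minimal_generators([(2, 0), (1, 1)])  # (x^2, xy)

print(intersect(ideal_x, ideal_m2), ideal_I)
```

The generator $xy^2$ produced by the pair $x$, $y^2$ is a multiple of $xy$ and is discarded, leaving the generators $x^2$ and $xy$.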

Definition. 

(1) An ideal $I$ in $R$ has a primary decomposition if it may be written as a finite 
intersection of primary ideals: 

$$I = \bigcap_{i=1}^{m} Q_i, \qquad Q_i \text{ a primary ideal.}$$

(2) The primary decomposition above is minimal and the $Q_i$ are called the primary 
components of $I$ if 

(a) no primary ideal contains the intersection of the remaining primary 
ideals, i.e., $Q_i \not\supseteq \bigcap_{j \ne i} Q_j$ for all $i$, and 

(b) the associated prime ideals are all distinct: $\operatorname{rad} Q_i \ne \operatorname{rad} Q_j$ for $i \ne j$. 

We now prove that in a Noetherian ring every proper ideal has a minimal primary 
decomposition. This result is often called the Lasker-Noether Decomposition Theorem, 
since it was first proved for polynomial rings by the chess master Emanuel Lasker and 
the proof was later greatly simplified and generalized by Emmy Noether. 

Definition. A proper ideal $I$ in the commutative ring $R$ is said to be irreducible if $I$ 
cannot be written nontrivially as the intersection of two other ideals, i.e., if $I = J \cap K$ 
with ideals $J$, $K$ implies that $I = J$ or $I = K$. 

It is easy to see that a prime ideal is irreducible (see Exercise 11 in Section 7.4). 
The ideal $(x, y)^2$ in $k[x, y]$ in Example 2 earlier shows that primary ideals need not 
be irreducible since it is the intersection of the ideals $(x) + (x, y)^2 = (x, y^2)$ and 
$(y) + (x, y)^2 = (y, x^2)$. In a Noetherian ring, however, irreducible ideals are necessarily 
primary: 

Proposition 20. Let R be a Noetherian ring. Then 

(1) every irreducible ideal is primary, and 

(2) every proper ideal in R is a finite intersection of irreducible ideals. 

Proof: To prove (1) let $Q$ be an irreducible ideal and suppose that $ab \in Q$ and 
$b \notin Q$. It is easy to check that for any fixed $n$ the set of elements $x \in R$ with $a^n x \in Q$ 
is an ideal, $A_n$, in $R$. Clearly $A_1 \subseteq A_2 \subseteq \cdots$ and since $R$ is Noetherian this ascending 
chain of ideals must stabilize, i.e., $A_n = A_{n+1} = \cdots$ for some $n > 0$. Consider the 
two ideals $I = (a^n) + Q$ and $J = (b) + Q$ of $R$, each containing $Q$. If $y \in I \cap J$ then 
$y = a^n z + q$ for some $z \in R$ and $q \in Q$. Since $ab \in Q$, it follows that $aJ \subseteq Q$, and 
in particular $ay \in Q$. Then $a^{n+1} z = ay - aq \in Q$, so $z \in A_{n+1} = A_n$. But $z \in A_n$ 
means that $a^n z \in Q$, so $y \in Q$. It follows that $I \cap J = Q$. Since $Q$ is irreducible and 
$(b) + Q \ne Q$ (since $b \notin Q$), we must have $(a^n) + Q = Q$, i.e., $a^n \in Q$, which shows that $Q$ is primary. 

The proof of (2) is the same as the proof of the second statement in Proposition 
17. Let $\mathcal{S}$ be the collection of proper ideals of $R$ that cannot be written as a finite intersection 
of irreducible ideals. If $\mathcal{S}$ is not empty, then since $R$ is Noetherian, there is a maximal 
element $I$ in $\mathcal{S}$. Then $I$ is not itself irreducible, so $I = J \cap K$ for some ideals $J$ and $K$ 
distinct from $I$. Then $I \subset J$ and $I \subset K$ and the maximality of $I$ implies that neither $J$ 
nor $K$ is in $\mathcal{S}$. But this means that both $J$ and $K$ can be written as finite intersections 
of irreducible ideals, hence the same would be true for $I$. This is a contradiction, so 
$\mathcal{S} = \emptyset$, which completes the proof of the proposition. 

It is immediate from the previous proposition that in a Noetherian ring every proper 
ideal has a primary decomposition. If any of the primary ideals in this decomposition 
contains the intersection of the remaining primary ideals, then we may simply remove 
this ideal since this will not change the intersection. Hence we may assume the de- 
composition satisfies (a) in the definition of a minimal decomposition. Since a finite 
intersection of $P$-primary ideals is again $P$-primary (Exercise 31), replacing the primary 
ideals in the decomposition with the intersections of all those primary ideals belonging 
to the same prime, we may also assume the decomposition satisfies (b) in the definition 
of a minimal decomposition. This proves the first statement of the following: 

Theorem 21. (Primary Decomposition Theorem) Let $R$ be a Noetherian ring. Then 
every proper ideal $I$ in $R$ has a minimal primary decomposition. If 

$$I = \bigcap_{i=1}^{m} Q_i = \bigcap_{i=1}^{n} Q'_i$$

are two minimal primary decompositions for $I$ then the sets of associated primes in the 
two decompositions are the same: 

$$\{\operatorname{rad} Q_1, \operatorname{rad} Q_2, \dots, \operatorname{rad} Q_m\} = \{\operatorname{rad} Q'_1, \operatorname{rad} Q'_2, \dots, \operatorname{rad} Q'_n\}.$$

Moreover, the primary components $Q_i$ belonging to the minimal elements in this set of 
associated primes are uniquely determined by $I$. 

Proof: The proof of the uniqueness of the set of associated primes is outlined in 
the exercises, and the proof of the uniqueness of the primary components associated to 
the minimal primes will be given in Section 4. 

Definition. If $I$ is an ideal in the Noetherian ring $R$ then the associated prime ideals 
in any primary decomposition of $I$ are called the associated prime ideals of $I$. If an 
associated prime ideal $P$ of $I$ does not contain any other associated prime ideal of $I$ 
then $P$ is called an isolated prime ideal; the remaining associated prime ideals of $I$ are 
called embedded prime ideals. 

The prime ideals associated to an ideal $I$ provide a great deal of information about 
the ideal $I$ (cf. for example Exercises 41 and 43): 

Corollary 22. Let $I$ be a proper ideal in the Noetherian ring $R$. 

(1) A prime ideal $P$ contains the ideal $I$ if and only if $P$ contains one of the 
associated primes of $I$, hence if and only if $P$ contains one of the isolated 
primes of $I$, i.e., the isolated primes of $I$ are precisely the minimal elements 
in the set of all prime ideals containing $I$. In particular, there are only finitely 
many minimal elements among the prime ideals containing $I$. 

(2) The radical of $I$ is the intersection of the associated primes of $I$, hence also the 
intersection of the isolated primes of $I$. 

(3) There are prime ideals $P_1, \dots, P_n$ (not necessarily distinct) containing $I$ such 
that $P_1 P_2 \cdots P_n \subseteq I$. 

Proof: The first statement in (1) is an exercise (cf. Exercise 37), and the remainder 
of (1) follows. Then (2) follows from (1) and Proposition 12, and (3) follows from (2) 
and Proposition 14. 
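
As a quick illustration of part (3), take the ideal $I = (x^2, xy)$ in $k[x, y]$ considered earlier, with the primes $P_1 = (x)$ and $P_2 = (x, y)$ (the radicals of the primary ideals $(x)$ and $(x, y)^2$ whose intersection is $I$). The following worked computation, written out as a sketch, shows that here the product of the two primes is not merely contained in $I$ but equal to it:

```latex
\[
  P_1 P_2 \;=\; (x)\,(x, y) \;=\; (x \cdot x,\; x \cdot y) \;=\; (x^2,\, xy) \;=\; I ,
  \qquad I \subseteq P_1 = (x), \quad I \subseteq P_2 = (x, y).
\]
```

In general one only obtains the containment $P_1 P_2 \cdots P_n \subseteq I$, and repeated primes may be needed.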

The last statement in Theorem 21 states that not only the isolated primes, but also 
the primary components belonging to the isolated primes, are uniquely determined by 
$I$. In general the primary decomposition of an ideal $I$ is itself not unique. 

Examples 

(1) Let $I = (x^2, xy)$ in $\mathbb{R}[x, y]$. Then 

$$(x^2, xy) = (x) \cap (x, y)^2 = (x) \cap (x^2, y)$$

are two minimal primary decompositions for $I$. The associated primes for $I$ are $(x)$ and 
$\operatorname{rad}((x, y)^2) = \operatorname{rad}((x^2, y)) = (x, y)$. The prime $(x)$ is the only isolated prime since 
$(x) \subset (x, y)$, and $(x, y)$ is an embedded prime. A prime ideal $P$ contains $I$ if and only if 
$P$ contains $(x)$. The $(x)$-primary component of $I$ corresponding to this isolated prime 
is just $(x)$ and occurs in both primary decompositions; the $(x, y)$-primary component 
of $I$ corresponding to this embedded prime is not uniquely determined: it is $(x, y)^2$ 
in the first decomposition and is $(x^2, y)$ in the second. The radical of $I$ is the isolated 
prime $(x)$. 

This example illustrates the origin of the terminology: in general the irreducible 
components of the algebraic space $\mathcal{Z}(I)$ defined by $I$ are the zero sets of the isolated 
primes for $I$, and the zero sets of the embedded primes are irreducible subspaces of 
these components (so are “embedded” in the irreducible components). In this example, 
$\mathcal{Z}(I)$ is the set of points with $x^2 = xy = 0$, which is just the $y$-axis in $\mathbb{R}^2$. There is 
only one irreducible component of this algebraic space (namely the $y$-axis), which is 
the locus for the isolated prime $(x)$. The locus for the embedded prime $(x, y)$ is the 
origin $(0, 0)$, which is an irreducible subspace embedded in the $y$-axis. 

(2) Suppose $R$ is a U.F.D. If $a = p_1^{e_1} \cdots p_m^{e_m}$ is the unique factorization into distinct 
prime powers of the element $a \in R$, then $(a) = (p_1)^{e_1} \cap \cdots \cap (p_m)^{e_m}$ is the minimal 
primary decomposition of the principal ideal $(a)$. The associated primes to $(a)$ are 
$(p_1), \dots, (p_m)$ and are all isolated. The primary decomposition of ideals is a gener- 
alization of the factorization of elements into prime powers. See also Exercise 44 for 
a characterization of U.F.D.s in terms of minimal primary decompositions. 
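
For $R = \mathbb{Z}$ the second example can be made completely concrete. The sketch below is plain Python with hypothetical helper names and the arbitrarily chosen integer 360. It factors $a$ into prime powers, uses the fact that $(m) \cap (n) = (\operatorname{lcm}(m, n))$ in $\mathbb{Z}$, and checks that the prime power ideals intersect to $(a)$ with no component redundant.

```python
from math import gcd
from functools import reduce

def prime_power_factors(a):
    """Return the prime powers p**e exactly dividing a (for a >= 2)."""
    factors, p = [], 2
    while a > 1:
        if a % p == 0:
            q = 1
            while a % p == 0:
                a //= p
                q *= p
            factors.append(q)
        p += 1
    return factors

def ideal_intersection(gens):
    """Generator of the intersection of the ideals (g) in Z: the lcm of the g."""
    return reduce(lambda m, n: m * n // gcd(m, n), gens)

a = 360
components = prime_power_factors(a)     # generators of the primary components
others = [ideal_intersection(components[:i] + components[i + 1:])
          for i in range(len(components))]

print(components, ideal_intersection(components), others)
```

Minimality condition (a) says that no component contains the intersection of the others; in $\mathbb{Z}$ this means that no $p_i^{e_i}$ divides the least common multiple of the remaining prime powers.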


For any Noetherian ring, an ideal $I$ is radical if and only if the primary components of 
a minimal primary decomposition of $I$ are all prime ideals (in which case this primary 
decomposition is unique), cf. Exercise 43. This generalizes the observation made 
previously that Proposition 17 together with Hilbert’s Nullstellensatz shows that any 
radical ideal in $k[\mathbb{A}^n]$ may be written uniquely as a finite intersection of prime ideals 
when the field $k$ is algebraically closed. This is the algebraic statement that an algebraic 
set can be decomposed uniquely into the union of irreducible algebraic sets. 

EXERCISES 

1. Prove (3) of Corollary 22 directly by considering the collection $\mathcal{S}$ of ideals that do not 
contain a finite product of prime ideals. [If $I$ is a maximal element in $\mathcal{S}$, show that since 
$I$ is not prime there are ideals $J$, $K$ properly containing $I$ (hence not in $\mathcal{S}$) with $JK \subseteq I$.] 

2. Let $I$ and $J$ be ideals in the ring $R$. Prove the following statements: 

(a) If $I^k \subseteq J$ for some $k \ge 1$ then $\operatorname{rad} I \subseteq \operatorname{rad} J$. 

(b) If $I^k \subseteq J \subseteq I$ for some $k \ge 1$ then $\operatorname{rad} I = \operatorname{rad} J$. 

(c) $\operatorname{rad}(IJ) = \operatorname{rad}(I \cap J) = \operatorname{rad} I \cap \operatorname{rad} J$. 

(d) $\operatorname{rad}(\operatorname{rad} I) = \operatorname{rad} I$. 

(e) $\operatorname{rad} I + \operatorname{rad} J \subseteq \operatorname{rad}(I + J)$ and $\operatorname{rad}(I + J) = \operatorname{rad}(\operatorname{rad} I + \operatorname{rad} J)$. 

3. Prove that the intersection of two radical ideals is again a radical ideal. 

4. Let $I = \mathfrak{m}_1 \mathfrak{m}_2$ be the product of the ideals $\mathfrak{m}_1 = (x, y)$ and $\mathfrak{m}_2 = (x - 1, y - 1)$ in $\mathbb{F}_2[x, y]$. 
Prove that $I$ is a radical ideal. Prove that the ideal $(x^3 - y^2)$ is a radical ideal in $\mathbb{F}_2[x, y]$. 

5. If $I = (xy, (x - y)z) \subseteq k[x, y, z]$ prove that $\operatorname{rad} I = (xy, xz, yz)$. For this ideal prove 
directly that $\mathcal{Z}(I) = \mathcal{Z}(\operatorname{rad} I)$, that $\mathcal{Z}(I)$ is not irreducible, and that $\operatorname{rad} I$ is not prime. 

6. Give an example to show that over a field $k$ that is not algebraically closed the containment 
$I \subseteq \mathcal{I}(\mathcal{Z}(I))$ can be proper even when $I$ is a radical ideal. 

7. Suppose $R$ and $S$ are rings and $\varphi : R \to S$ is a ring homomorphism. If $I$ is an ideal of $R$ 
show that $\varphi(\operatorname{rad} I) \subseteq \operatorname{rad}(\varphi(I))$. If in addition $\varphi$ is surjective and $I$ contains the kernel of 
$\varphi$ show that $\varphi(\operatorname{rad} I) = \operatorname{rad}(\varphi(I))$. 

8. Suppose the prime ideal $P$ contains the ideal $I$. Prove that $P$ contains the radical of $I$. 

9. Prove that for any field $k$ the map $\mathcal{Z}$ in the Nullstellensatz is always surjective and the map 
$\mathcal{I}$ in the Nullstellensatz is always injective. [Use property (10) of the maps $\mathcal{Z}$ and $\mathcal{I}$ in 
Section 1.] Give examples (over a field $k$ that is not algebraically closed) where $\mathcal{Z}$ is not 
injective and $\mathcal{I}$ is not surjective. 

10. Prove that for $k$ a finite field the Zariski topology is the same as the discrete topology: 
every subset is closed (and open). 

11. Let $V$ be a variety in $\mathbb{A}^n$ and let $U_1$ and $U_2$ be two subsets of $\mathbb{A}^n$ that are open in the Zariski 
topology. Prove that if $V \cap U_1 \ne \emptyset$ and $V \cap U_2 \ne \emptyset$ then $V \cap U_1 \cap U_2 \ne \emptyset$. Conclude that 
any nonempty open subset of a variety is everywhere dense in the Zariski topology (i.e., 
its closure is all of $V$). 

12. Use the fact that nonempty open sets of an affine variety are everywhere dense to prove that 
an affine variety is connected in the Zariski topology. (A topological space is connected 
if it is not the union of two disjoint, proper, open subsets.) 

13. Prove that the affine algebraic set $V$ is connected in the Zariski topology if and only if $k[V]$ 
is not a direct sum of two nonzero ideals. Deduce from this that a variety is connected in 
the Zariski topology. 

14. Prove that if $k$ is an infinite field, then the varieties in $\mathbb{A}^1$ are the empty set, the whole 
space, and the one point subsets. What are the varieties in $\mathbb{A}^1$ in the case of a finite field 
$k$? 

15. Suppose $V$ is a hypersurface in $\mathbb{A}^n$ and $\mathcal{I}(V) = (f)$ for some nonconstant polynomial 
$f \in k[x_1, x_2, \dots, x_n]$. Prove that $V$ is a variety if and only if $f$ is irreducible. 

16. Suppose $V \subseteq \mathbb{A}^n$ is an affine variety and $f \in k[V]$. Prove that the graph of $f$ (cf. Exercise 
25 in Section 1) is an affine variety. 

17. Prove that any permutation of the elements of a field $k$ is a continuous map from $\mathbb{A}^1$ to 
itself in the Zariski topology on $\mathbb{A}^1$. Deduce that if $k$ is an infinite field, there are Zariski 
continuous maps from $\mathbb{A}^1$ to itself that are not polynomials. 

18. Let $V$ be an affine algebraic set in $\mathbb{A}^n$ over $k = \mathbb{C}$. 

(a) Prove that morphisms of algebraic sets over $\mathbb{C}$ are continuous in the Euclidean topol- 
ogy (the topology on $\mathbb{C}^n$ obtained by identifying $\mathbb{C}^n$ with $\mathbb{R}^{2n}$ with its usual Euclidean 
topology). 

(b) Prove that $V$ is a closed set in the Euclidean topology on $\mathbb{C}^n$ (so the Zariski closed 
sets of $\mathbb{A}^n$ over $\mathbb{C}$ are also Euclidean closed). 

(c) Give an example of a set that is closed in the Euclidean topology but is not closed in 
the Zariski topology, i.e., is not an affine algebraic set (so the Euclidean topology is 
“finer” than the Zariski topology). 

19. Give an example of an injective $k$-algebra homomorphism $\tilde{\varphi} : k[W] \to k[V]$ whose 
associated morphism $\varphi : V \to W$ is not surjective. 

20. Suppose $\varphi : V \to W$ is a surjective morphism of affine algebraic sets. Prove that if $V$ is 
a variety then $W$ is a variety. 

21. Let $V$ be an algebraic set in $\mathbb{A}^n$ and let $f \in k[V]$. Define $V_f = \{v \in V \mid f(v) \ne 0\}$. 

(a) Show that $V_f$ is a Zariski open set in $V$ (called a principal open set in $V$). 

(b) Let $J$ be the ideal in $k[x_1, \dots, x_n, x_{n+1}]$ generated by $\mathcal{I}(V)$ and $x_{n+1} f - 1$, and let 
$W = \mathcal{Z}(J) \subseteq \mathbb{A}^{n+1}$. Show that $J = \mathcal{I}(W)$ and that the map $\pi : \mathbb{A}^{n+1} \to \mathbb{A}^n$ by 
projection onto the first $n$ coordinates is a Zariski continuous injection from $W$ onto 
$V_f$ (so the principal open set $V_f$ in $V$ may be embedded as a closed set in some (larger) 
affine space). 

(c) If $U$ is any open set in $V$ show that $U = V_{f_1} \cup \cdots \cup V_{f_m}$ for some $f_1, \dots, f_m \in k[V]$. 
(This shows that the principal open sets form a base for the Zariski topology.) 

22. Prove that GL_n(k) is an open affine algebraic set in A^{n^2} and can be embedded as a closed
affine algebraic set in A^{n^2+1}. In particular, deduce that the set k^× of nonzero elements in
A^1 embeds into A^2 as the hyperbola xy = 1. [Use the preceding exercise.]

23. Show that if k is infinite then {(a, a^2, a^3) | a ∈ k} ⊂ A^3 is an affine algebraic variety. If k
is finite show that this set is always reducible.

24. Let V = Z(xz − y^2, yz − x^3, z^2 − x^2 y) ⊆ A^3. Show that if k is infinite then V is an affine
variety. [Use Exercise 26 of Section 1 and Exercise 20.]

25. Suppose f(x) = x^3 + ax^2 + bx + c is an irreducible cubic in Q[x] of discriminant D. Let
I = (x + y + z + a, xy + xz + yz − b, xyz + c) in Q[x, y, z].

(a) Prove that I is a prime ideal if and only if D is not a square in Q, in which case I is a
maximal ideal and Q[x, y, z]/I is a splitting field for f(x) over Q.

(b) If D = r^2, prove that the primary decomposition of I is I = Q_+ ∩ Q_− where
Q_± = (I, (x − y)(x − z)(y − z) ± r). Prove Q_+ and Q_− are maximal ideals, and
Q[x, y, z] modulo Q_+ or Q_− is a splitting field for f(x) over Q.

26. A topological space X is called quasicompact if whenever any collection of closed subsets
V_i of X has empty intersection, then some finite number of these also has empty
intersection, i.e.,

\[ \text{whenever } \bigcap_{i} V_i = \emptyset \text{ there exist } V_{j_1}, V_{j_2}, \dots, V_{j_N} \text{ such that } \bigcap_{i=1}^{N} V_{j_i} = \emptyset. \]

Prove that every affine algebraic set is quasicompact. [Translate the definition into a property
of ideals in k[x_1, ..., x_n].] (A quasicompact and Hausdorff space is called compact.)

27. When k is an infinite field prove that the Zariski topology on k^2 is not the same as taking
the Zariski topology on k and then forming the product topology on k × k. [By Exercise
14 of Section 1, in the product topology on k × k the Zariski closed sets in k × k are finite
unions of sets of the form {a} × {b}, {a} × k and k × {b}, for any a, b ∈ k.]

28. Prove that each of the following rings have infinitely many minimal prime ideals, and that 
(0) is not the intersection of any finite number of these (so (0) does not have a primary 
decomposition in these rings): 

(a) the infinite direct product ring Z/2Z × Z/2Z × ⋯ (which is a Boolean ring, cf.
Exercise 23 in Section 7.4).

(b) k[x_1, x_2, ...]/(x_1x_2, x_3x_4, ..., x_{2i−1}x_{2i}, ...), where x_1, x_2, ... are independent
variables over the field k.

29. Suppose that A and B are ideals with AB ⊆ Q for a primary ideal Q. Prove that if A ⊄ Q
then B ⊆ rad Q.

30. Let Q be a P-primary ideal and suppose A is an ideal not contained in Q. Define
A′ = {r ∈ R | rA ⊆ Q} to be the elements of R that when multiplied by elements of
A give elements of Q. Prove that A′ is a P-primary ideal.

31. Prove that if Q_1 and Q_2 are primary ideals belonging to the same prime ideal P, then
Q_1 ∩ Q_2 is a primary ideal belonging to P. Conclude that a finite intersection of P-primary
ideals is again P-primary.

32. Prove that if Q_1 and Q_2 are primary ideals belonging to the same maximal ideal M, then
Q_1 + Q_2 and Q_1Q_2 are primary ideals belonging to M. Conclude that finite sums and
finite products of M-primary ideals are again M-primary.

33. Let I = (x^2, xy, xz, yz) in k[x, y, z]. Prove that a primary decomposition of I is
I = (x, y) ∩ (x, z) ∩ (x, y, z)^2, determine the isolated and embedded primes of I, and
find rad I.

34. Suppose φ : R → S is a surjective ring homomorphism. Prove that an ideal Q in R
containing the kernel of φ is primary if and only if φ(Q) is primary in S, and when this is
the case the prime associated to φ(Q) is the image φ(P) of the prime P associated to Q.

35. Suppose φ : R → S is a ring homomorphism.

(a) Suppose I is an ideal of R containing ker φ with minimal primary decomposition
I = Q_1 ∩ ⋯ ∩ Q_m with rad Q_i = P_i. If φ is a surjective homomorphism prove that
φ(I) = φ(Q_1) ∩ ⋯ ∩ φ(Q_m), where rad φ(Q_i) is given by φ(P_i), is a minimal primary
decomposition of φ(I). [Use the previous exercise.]

(b) Suppose I is an ideal of S with minimal primary decomposition I = Q_1 ∩ ⋯ ∩ Q_m
with rad Q_i = P_i. Prove that φ^{−1}(I) = φ^{−1}(Q_1) ∩ ⋯ ∩ φ^{−1}(Q_m), where rad φ^{−1}(Q_i)
is given by φ^{−1}(P_i), is a primary decomposition of φ^{−1}(I), and is minimal if φ is
surjective.

36. Let I = (xy, x − yz) in k[x, y, z]. Prove that (x, z) ∩ (y^2, x − yz) is a minimal primary
decomposition of I. [Consider the ring homomorphism φ : k[x, y, z] → k[y, z] given by
mapping x to yz, y to y, and z to z and use the previous exercise.]

37. Prove that a prime ideal P contains the ideal I if and only if P contains one of the
associated primes of a minimal primary decomposition of I. [Use Exercise 3 and Exercise
11 in Section 7.4.]

38. Show that every associated prime ideal for a radical ideal is isolated. [Suppose that
P_2 = rad Q_2 ⊂ P_1 = rad Q_1 in the decomposition of Theorem 21 for the radical ideal I.
Show that if a ∈ Q_2 ∩ ⋯ ∩ Q_m ⊆ P_2 then a^n ∈ I for some n ≥ 1, conclude that a ∈ Q_1
and derive a contradiction to the minimality of the primary decomposition.]

39. Fix an element a in the ring R. For any ideal I in the ring R let I_a = {r ∈ R | ar ∈ I}.

(a) Prove that I_a is an ideal and I_a = R if and only if a ∈ I.

(b) Prove that (I ∩ J)_a = I_a ∩ J_a for ideals I and J.

(c) Suppose that Q is a P-primary ideal and that a ∉ Q. Prove that Q_a is a P-primary
ideal and that Q_a = Q if a ∉ P.

40. With notation as in the previous exercise, suppose I = Q_1 ∩ ⋯ ∩ Q_m is a minimal primary
decomposition of the ideal I and let P_i be the prime ideal associated to Q_i.

(a) Prove that I_a = (Q_1)_a ∩ ⋯ ∩ (Q_m)_a and that rad(I_a) = rad((Q_1)_a) ∩ ⋯ ∩ rad((Q_m)_a).

(b) Prove that rad(I_a) is the intersection of the prime ideals P_i for which a ∉ Q_i. [Use
the previous exercise.]

(c) Prove that if rad(I_a) is a prime ideal then rad(I_a) = P_j for some j. [Use the fact that
prime ideals are irreducible.]

(d) For each i = 1, ..., m, prove that rad(I_a) = P_i for some a ∈ R. [Show there exists
an a ∈ R with a ∉ Q_i but a ∈ Q_j for all j ≠ i.]

(e) Show from (c) and (d) that the associated primes for a minimal primary decomposition
are precisely the collection of prime ideals among the ideals rad(I_a) for a ∈ R, and
conclude that they are uniquely determined by I independent of the minimal primary
decomposition.

41. Let P_1, ..., P_m be the associated prime ideals of the ideal (0) in the Noetherian ring R.

(a) Show that P_1 ∩ ⋯ ∩ P_m is the collection of nilpotent elements in R. [Apply Corollary
22 to I = (0).]

(b) Show that P_1 ∪ ⋯ ∪ P_m is the collection of zero divisors in R. [Let I = (0)
in the previous exercise and show that the set of zero divisors is given by the set
∪_{a ∈ R−{0}} (0)_a = ∪_{a ∈ R−{0}} rad((0)_a).]

42. Suppose R is a Noetherian ring. Prove that R is either an integral domain, has nonzero 
nilpotent elements, or has at least two minimal prime ideals. [Use the previous exercise.] 

43. Prove that the ideal I in the Noetherian ring R is radical if and only if the primary components
of a minimal primary decomposition are all prime ideals, and conclude that in this
case the minimal primary decomposition is unique. [If I = Q_1 ∩ ⋯ ∩ Q_m is radical with
Q_i a P_i-primary component of a minimal decomposition, show that if a ∈ P_1 ∩ ⋯ ∩ P_m
then some power of a is in I, hence a ∈ I since I is radical. Deduce that I = P_1 ∩ ⋯ ∩ P_m
and show that this is also a minimal primary decomposition, i.e., for any i there exists b
with b ∉ P_i, but b ∈ P_j for j ≠ i. If a ∈ P_i, show that ab ∈ Q_i, and that a ∈ Q_i.
Conclude that Q_i = P_i.]

44. Prove that a Noetherian integral domain R is a U.F.D. if and only if for every a ∈ R the
isolated primes associated to the principal ideal (a) are principal ideals. [See Example 2
following Corollary 22. To prove R is a U.F.D., show that an irreducible a ∈ R is prime
and then follow the proof of Theorem 14 in Section 8.3.]

45. Let R be the ring of all real valued functions on the open interval (−1, 1) that have
derivatives of all orders (the ring of C^∞ functions). Let

F(x) = …   if x ≠ 0
F(x) = …   if x = 0

(you may assume F ∈ R and F^{(n)}(0) = 0 for all n ≥ 0). Let (F) be the principal ideal
generated by F and let A = rad((F)). Let M be the (maximal) ideal of all functions in R
that are zero at x = 0 and let P = ∩_{n=1}^{∞} M^n.

(a) Prove that M = (x) is the ideal generated by the function x in R and that M^n = (x^n)
consists of the functions whose first n − 1 derivatives vanish at the origin.

(b) Prove that R is not Noetherian (compare Exercise 33 in Section 7.4). [One approach
is the following: Let G(x) be the function that is 0 for x ≤ 0 and is equal to F(x) for
x > 0. Let I_n be the ideal of functions in R vanishing for all x ≤ 1/n. Use translates
of G(x) to show that I_1 ⊂ I_2 ⊂ I_3 ⊂ ⋯ is an infinite ascending chain.]

(c) Prove that P consists of the functions all of whose derivatives are zero at x = 0 (i.e.,
the functions whose associated Taylor series at x = 0 is identically zero), and that P
is a prime ideal.

(d) Prove that F ∈ P and deduce that A ⊆ P.

(e) Prove that A ≠ P. [Let G(x) = e^{−1/x^2} when x ≠ 0 and G(0) = 0. Show that G ∈ P
but G ∉ A.]

(f) Show that there is a prime ideal Q containing (F) with Q ≠ P, M. Prove that Q ⊂ P,
i.e., there are nonzero prime ideals properly contained in P.

46. Let A be any ideal in R = k[x_1, ..., x_n, y_1, ..., y_m].

(a) Show that rad(A ∩ k[y_1, ..., y_m]) = rad A ∩ k[y_1, ..., y_m].

(b) Suppose (f_1, ..., f_s) is an ideal in k[x_1, ..., x_n]. Let F_1, ..., F_t be generators for
the radical of (f_1, ..., f_s), computed in k[x_1, ..., x_n]. Suppose J is an ideal in R
and let A = J + (f_1, ..., f_s), B = J + (F_1, ..., F_t) as ideals in R. Prove that
rad A = rad B.

(c) Conclude from (a) and (b) that A = (y_1 − x_1, ..., y_m − x_m, f_1, ..., f_s) ∩ k[y_1, ..., y_m]
and B = (y_1 − x_1, ..., y_m − x_m, F_1, ..., F_t) ∩ k[y_1, ..., y_m] have the same zero sets
over an algebraically closed field k. [Use Hilbert’s Nullstellensatz.]

47. Determine the Zariski closure in C^3 of the points on the curve {(a^2, a^3, a^4) | a ∈ C}.

48. Show that Z(x^3 − xyz + z^2) is the smallest algebraic set in R^3 containing the points
{(st, s + t, s^2 t) | s, t ∈ R}.

49. Show that Z(x^3 z^2 − 3xy^2 z^2 − y^6 − z^4) is the smallest algebraic set in R^3 containing the
points {(s^2 + t^2, st, s^3) | s, t ∈ R}.

50. Find equations defining the Zariski closure of the set of points {(s^4, s^3 t,

51. Show that V = Z(x^2 − y^2 z) (the Whitney umbrella surface) is the smallest algebraic set
in R^3 containing the points S = {(st, s, t^2) | s, t ∈ R}. Show that S is not Zariski closed
in V (the missing points explain the name for the surface). Do the same over C, but show
that in this case S = V is closed.

52. Let V = Z(xz^2 − w^3, xw^2 − y^4, y^4 z^2 − w^5) ⊆ C^4. Determine the Zariski closure of the
image of V under the projection π((x, y, z, w)) = (x, y, z).

53. Let V = Z(xy − 1) in A^2 and let S be the projection of V onto the x-axis in A^1.

(a) If k = R, show that I(V) = (xy − 1) ⊂ R[x, y] and that (u − x, xy − 1) ∩ R[u] = 0
in R[x, y, u]. Use Propositions 8 and 16 to conclude that the Zariski closure of S is
A^1 and show that S is not itself closed.

(b) If k = F_3, show that I(V) = (xy − 1, x^3 − x, y^3 − y) ⊂ F_3[x, y] and that
(u − x, xy − 1, x^3 − x, y^3 − y) ∩ F_3[u] = (u^2 − 1) in F_3[x, y, u]. Use Propositions 8 and
16 to conclude that S is Zariski closed in A^1.

54. Recall the ideal quotient (I : J) = {r ∈ R | rJ ⊆ I} of two ideals I, J in a ring R (cf.
Exercise 34 in Section 9.6). Clearly I ⊆ (I : J).

(a) Show that Z(I) − Z(J), the set of elements of Z(I) not lying in Z(J), is contained
in Z((I : J)) and conclude that the Zariski closure of Z(I) − Z(J) is contained in
Z((I : J)).

(b) Show that if k is algebraically closed and I is a radical ideal then Z((I : J)) is
precisely the Zariski closure of Z(I) − Z(J).

(c) Show that if V and W are affine algebraic sets then (I(V) : I(W)) = I(V − W).


15.3 INTEGRAL EXTENSIONS AND HILBERT'S NULLSTELLENSATZ

In this section we consider the important concept of an integral extension of rings, 
which is a generalization to rings of algebraic extensions of fields. This leads to the 
definition of the “integers” in finite extensions of Q (the basic subject of the branch 
of mathematics called algebraic number theory) and is also related to the existence of 
tangent lines for algebraic curves. 

Definition. Suppose R is a subring of the commutative ring S with 1 = 1_S ∈ R.

(1) An element s ∈ S is integral over R if s is the root of a monic polynomial in
R[x].

(2) The ring S is an integral extension of R or just integral over R if every s ∈ S is
integral over R.

(3) The integral closure of R in S is the set of elements of S that are integral over 
R. 

(4) The ring R is said to be integrally closed in S if R is equal to its integral closure 
in S . The integral closure of an integral domain R in its field of fractions is 
called the normalization of R. An integral domain is called integrally closed or 
normal if it is integrally closed in its field of fractions. 

Before giving some examples of integral extensions we prove some basic properties 
of integral elements analogous to those of algebraic elements over fields. 
Proposition 23. Let R be a subring of the commutative ring S with 1 ∈ R and let s ∈ S.
Then the following are equivalent:

(1) s is integral over R,

(2) R[s] is a finitely generated R-module (where R[s] is the ring of all R-linear
combinations of powers of s), and

(3) s ∈ T for some subring T, R ⊆ T ⊆ S, that is a finitely generated R-module.

Proof: Suppose first that (1) holds and let s be a root of the monic polynomial
x^n + a_{n−1}x^{n−1} + ⋯ + a_0 ∈ R[x]. Then

\[ s^n = -(a_{n-1}s^{n-1} + a_{n-2}s^{n-2} + \cdots + a_0) \]

and so s^n, and then all higher powers of s, can be expressed as R-linear combinations
of s^{n−1}, ..., s, 1. Hence R[s] = R1 + Rs + ⋯ + Rs^{n−1} is finitely generated as an
R-module, which gives (2).

If (2) holds, then (3) holds with T = R[s].

Suppose that (3) holds and let v_1, v_2, ..., v_n be a finite generating set for T. Then
for i = 1, 2, ..., n the element sv_i is an element of T since T is a ring, and so can be
written as an R-linear combination of v_1, ..., v_n:

\[ s v_i = \sum_{j=1}^{n} a_{ij} v_j, \]

i.e.,

\[ 0 = \sum_{j=1}^{n} (\delta_{ij} s - a_{ij}) v_j, \qquad i = 1, 2, \dots, n, \]

where δ_{ij} is the Kronecker delta. If B is the n × n matrix whose i, j entry is δ_{ij}s − a_{ij},
and v is the n × 1 column vector whose entries are v_1, ..., v_n, then these equations are
simply Bv = 0. It follows from Cramer’s Rule that (det B)v_i = 0 for all i (cf. Exercise
3, Section 11.4). Since 1 ∈ T is an R-linear combination of v_1, ..., v_n, it follows that
det B = 0. But B = sI − A, where A is the matrix (a_{ij}). Thus s is a root of the monic
polynomial det(xI − A) ∈ R[x] (the characteristic polynomial of A), and so s is a root
of a monic polynomial with coefficients in R, which gives (1), completing the proof.
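
The determinant argument can be tried on a small case. The following is only an illustrative sketch, not taken from the text: it assumes R = Z, s = √2 + √3 and T = Z[√2, √3] with generating set v = (1, √2, √3, √6), so that row i of A records sv_i = Σ a_{ij}v_j. The helper names `matmul` and `char_poly` are hypothetical, and the Faddeev-LeVerrier recursion used to expand det(xI − A) is a convenience chosen here, not the method of the proof.

```python
from fractions import Fraction
from math import sqrt


def matmul(X, Y):
    """Product of two square matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def char_poly(A):
    """Return [c_0, ..., c_n] with det(xI - A) = c_0 + c_1 x + ... + c_n x^n.

    Uses the Faddeev-LeVerrier recursion; A is assumed to have integer entries.
    """
    n = len(A)
    A = [[Fraction(e) for e in row] for row in A]
    M = [[Fraction(0)] * n for _ in range(n)]
    c = [Fraction(0)] * n + [Fraction(1)]          # c_n = 1 (monic)
    for k in range(1, n + 1):
        AM = matmul(A, M)
        # M_k = A * M_{k-1} + c_{n-k+1} * I
        M = [[AM[i][j] + (c[n - k + 1] if i == j else 0) for j in range(n)]
             for i in range(n)]
        AMk = matmul(A, M)
        c[n - k] = -sum(AMk[i][i] for i in range(n)) / k
    return [int(x) for x in c]


# Multiplication by s = sqrt(2) + sqrt(3) on v = (1, sqrt2, sqrt3, sqrt6):
#   s*1     = sqrt2 + sqrt3
#   s*sqrt2 = 2 + sqrt6
#   s*sqrt3 = 3 + sqrt6
#   s*sqrt6 = 3*sqrt2 + 2*sqrt3
A = [[0, 1, 1, 0],
     [2, 0, 0, 1],
     [3, 0, 0, 1],
     [0, 3, 2, 0]]

coeffs = char_poly(A)                  # coefficients of det(xI - A), low degree first
s = sqrt(2) + sqrt(3)
residual = sum(ck * s ** k for k, ck in enumerate(coeffs))   # should be ~0
```

Under these assumptions det(xI − A) is a monic integer polynomial having s as a root, which is exactly what the implication (3) ⇒ (1) asserts.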

Corollary 24. Let R ⊆ S be as in Proposition 23 and let s, t ∈ S.

(1) If s and t are integral over R then so are s ± t and st. 

(2) The integral closure of R in S is a subring of S containing R. 

(3) Integrality is transitive: let S be a subring of T ; if T is integral over S and S is 
integral over R, then T is integral over R. 

Proof: Let s and t be integral over R. By Proposition 23 both R[s] and R[t] are
finitely generated R-modules, say

\[ R[s] = Rs_1 + Rs_2 + \cdots + Rs_n, \qquad R[t] = Rt_1 + Rt_2 + \cdots + Rt_m. \]

Then

\[ R[s, t] = Rs_1t_1 + \cdots + Rs_it_j + \cdots + Rs_nt_m \]

is a ring containing s ± t and st that is also a finitely generated R-module. Hence s ± t
and st are also integral over R, which proves (1) and also (2).

To prove (3), let t ∈ T. Since t is integral over S, it is the root of some monic
polynomial p(x) = x^n + a_{n−1}x^{n−1} + ⋯ + a_0 ∈ S[x]. Since each a_i ∈ S is integral over R, each
ring R[a_i] is a finitely generated R-module and so the ring R_1 = R[a_0, a_1, ..., a_{n−1}] is
also a finitely generated R-module. Since the monic polynomial p(x) has its coefficients
in R_1, t is integral over R_1 and it follows that the ring R_1[t] = R[a_0, a_1, ..., a_{n−1}, t] is
a finitely generated R-module. By the proposition, this means that t is integral over R,
which gives (3).

The second statement in Corollary 24 shows that taking the elements of S that are 
integral over R gives a (possibly larger) subring of S, and the last statement in the 
corollary shows that the process of taking the integral closure stops after one step: 

Corollary 25. Let R be a subring of the commutative ring S with 1 ∈ R. Then the
integral closure of R in S is integrally closed in S.

Examples 

(1) If R and S are fields then S is integral over R if and only if S is algebraic over R:
if s ∈ S is a root of the polynomial p(x) with coefficients in R then it is a root of the
monic polynomial obtained by dividing by the (nonzero) leading coefficient of p(x).

(2) Suppose S is an integral extension of R and I is an ideal in S. Then S/I is an integral
ring extension of R/(R ∩ I) (reducing the monic polynomial over R satisfied by s ∈ S
modulo I gives a monic polynomial satisfied by s̄ ∈ S/I over R/(R ∩ I)).

(3) If R is a U.F.D. then R is integrally closed, as follows. Suppose a/b is an element in
the field of fractions of R (with b ≠ 0 and a and b having no common factors) and
satisfies (a/b)^n + r_{n−1}(a/b)^{n−1} + ⋯ + r_1(a/b) + r_0 = 0 with r_0, ..., r_{n−1} ∈ R. Then

\[ a^n = b(-r_{n-1}a^{n-1} - \cdots - r_1 a b^{n-2} - r_0 b^{n-1}) \]

shows that any irreducible element dividing b divides a^n, hence divides a. Since a/b
is in lowest terms, this shows that b must be a unit, i.e., a/b ∈ R.

(4) The polynomial ring k[x, y] over the field k is integrally closed in its fraction field
k(x, y) by example (3) above. The ideal (x^2 − y^3) is prime (cf. Exercise 14, Section
9.1), so the quotient ring R = k[x, y]/(x^2 − y^3) = k[x̄, ȳ] is an integral domain. This
domain is not integrally closed, however, since x̄/ȳ is an element of the fraction field
of R that is integral over R (since (x̄/ȳ)^3 − x̄ = 0), but is not an element of R. In
particular, R is not a U.F.D. by the previous example.

We next consider the behavior of ideals in integral ring extensions. 

Definition. Let φ : R → S be a homomorphism of commutative rings.

(a) If I is an ideal in R then the extension of I to S is the ideal φ(I)S of S generated
by the image of I.

(b) If J is an ideal of S, then the contraction in R of J is the ideal φ^{−1}(J).

In the special case where R is a subring of S and φ is the natural injection, the
extension of I ⊆ R is the ideal IS in S and the contraction of J ⊆ S is the ideal J ∩ R
of R.

It is immediate from the definition that

(1) I ⊆ IS ∩ R, more generally, I is contained in the contraction of its extension
to S, and

(2) (J ∩ R)S ⊆ J, more generally, J contains the extension of its contraction in
R.

In general equality need not hold in either situation (cf. the exercises).

If Q is a prime ideal in S, then its contraction is prime in R (although the contraction
of a maximal ideal need not be maximal). On the other hand, if P is a prime ideal in
R, its extension need not be prime (or even proper) in S; moreover, it is not generally
true that P is the contraction of a prime ideal of S (cf. the exercises). For integral ring
extensions, however, the situation is more controlled:

Theorem 26. Let R be a subring of the commutative ring S with 1 ∈ R and suppose
that S is integral over R.

(1) Assume that S is an integral domain. Then R is a field if and only if S is a field.

(2) Let P be a prime ideal in R. Then there is a prime ideal Q in S with P = Q ∩ R.
Moreover, P is maximal if and only if Q is maximal.

(3) (The Going-up Theorem) Let P_1 ⊆ P_2 ⊆ ⋯ ⊆ P_n be a chain of prime ideals
in R and suppose there are prime ideals Q_1 ⊆ Q_2 ⊆ ⋯ ⊆ Q_m of S with
P_i = Q_i ∩ R, 1 ≤ i ≤ m and m < n. Then the ascending chain of ideals
can be completed: there are prime ideals Q_{m+1} ⊆ ⋯ ⊆ Q_n in S such that
P_i = Q_i ∩ R for all i.

(4) (The Going-down Theorem) Assume that S is an integral domain and R is
integrally closed in S. Let P_1 ⊇ P_2 ⊇ ⋯ ⊇ P_n be a chain of prime ideals
in R and suppose there are prime ideals Q_1 ⊇ Q_2 ⊇ ⋯ ⊇ Q_m of S with
P_i = Q_i ∩ R, 1 ≤ i ≤ m and m < n. Then the descending chain of ideals
can be completed: there are prime ideals Q_{m+1} ⊇ ⋯ ⊇ Q_n in S such that
P_i = Q_i ∩ R for all i.

Proof: To prove (1) assume first that R is a field and let s be a nonzero element of
S. Then s is integral over R, so

\[ s^n + a_{n-1}s^{n-1} + \cdots + a_1 s + a_0 = 0 \]

for some a_0, a_1, ..., a_{n−1} in R. Since S is an integral domain, we may assume a_0 ≠ 0
(otherwise cancel factors of s). Then

\[ s(s^{n-1} + a_{n-1}s^{n-2} + \cdots + a_1) = -a_0 \]

and since (−1/a_0) ∈ R, this shows that (−1/a_0)(s^{n−1} + a_{n−1}s^{n−2} + ⋯ + a_1) is an
inverse for s in S, so S is a field. Conversely, suppose S is a field and r is a nonzero
element of R. Since r^{−1} ∈ S is integral over R we have

\[ r^{-m} + a_{m-1}r^{-m+1} + \cdots + a_1 r^{-1} + a_0 = 0 \]

for some a_0, ..., a_{m−1} ∈ R. Then r^{−1} = −(a_{m−1} + ⋯ + a_1 r^{m−2} + a_0 r^{m−1}) ∈ R, so R
is a field.
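
The inverse formula in the first half of this argument can be checked numerically. The sketch below is an illustration only: it assumes R = Q, uses floating point in place of exact arithmetic, and takes s = 1 + √2, which satisfies s^2 − 2s − 1 = 0. The function name `inverse_from_relation` is hypothetical.

```python
from math import sqrt


def inverse_from_relation(a, s):
    """Inverse of s read off from a monic relation.

    a = [a_0, ..., a_{n-1}] with s^n + a_{n-1} s^{n-1} + ... + a_0 = 0 and a_0 != 0.
    Returns (-1/a_0) * (s^{n-1} + a_{n-1} s^{n-2} + ... + a_1).
    """
    n = len(a)
    if a[0] == 0:
        raise ValueError("cancel factors of s first so that a_0 is nonzero")
    total = s ** (n - 1)
    for i in range(1, n):
        total += a[i] * s ** (i - 1)
    return -total / a[0]


s = 1 + sqrt(2)                           # root of x^2 - 2x - 1
inv = inverse_from_relation([-1, -2], s)  # equals s - 2 = sqrt(2) - 1
```

Here the relation gives s(s − 2) = 1, so the inverse of s is again a polynomial in s with coefficients in the base field.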

The proof of the first statement in (2) is given in Corollary 50. For the second state- 
ment, observe that the integral domain S/Q is an integral extension of R/P (Example 
2 following Corollary 25). By (1), S/Q is a field if and only if R/P is a field, i.e., Q 
is maximal if and only if P is maximal. 

To prove (3), it suffices by induction to prove that if P_1 ⊆ P_2 and Q_1 is a prime of
S with Q_1 ∩ R = P_1 then there is a prime Q_2 of S with Q_1 ⊆ Q_2 and Q_2 ∩ R = P_2.
Since S̄ = S/Q_1 is an integral extension of R̄ = R/P_1, the first part of (2) shows that
there exists a prime Q̄_2 of S̄ with Q̄_2 ∩ R̄ = P_2/P_1. Then the preimage Q_2 of Q̄_2 in S
is a prime ideal containing Q_1 with Q_2 ∩ R = P_2.

The proof of (4) is outlined in Exercise 24 in Section 4. 

Corollary 27. Suppose R is a subring of the ring S with 1 ∈ R and assume S is integral
and finitely generated (as a ring) over R. If P is a maximal ideal in R then there is a
nonzero and finite number of maximal ideals Q of S with Q ∩ R = P.

Proof: There exists at least one maximal ideal Q lying over P by (2) of the theorem,
so we must see why there are only finitely many such maximal ideals in S. If Q is a
maximal ideal of S with Q ∩ R = P then S/Q is a field containing the field R/P.
To prove that there are only finitely many possible Q it suffices to prove that there are
only finitely many homomorphisms from S to a field containing R/P that extend the
homomorphism from R to R/P. Let S = R[s_1, ..., s_n], where the elements s_i are
integral over R by assumption, and let p_i(x) be a monic polynomial with coefficients
in R satisfied by s_i. If Q is a maximal ideal of S then S/Q = (R/P)[s̄_1, ..., s̄_n] is
the field extension of the field R/P with generators s̄_1, ..., s̄_n. The element s̄_i is a
root of the monic polynomial p̄_i(x) with coefficients in R/P obtained by reducing the
coefficients of p_i(x) mod P. There are only a finite number of possible roots of this
monic polynomial (in a fixed algebraic closure of R/P), and so only finitely many
possible field extensions of the form (R/P)[s̄_1, ..., s̄_n], which proves the corollary.

Algebraic Integers 

We can use the concept of an integral ring extension to define the “integers” in extension 
fields of the rational numbers Q: 

Definition. Let K be an extension field of Q.

(1) An element α ∈ K is called an algebraic integer if α is integral over Z, i.e., if
α is the root of some monic polynomial with coefficients in Z.

(2) The integral closure of Z in K is called the ring of integers of K, and is denoted
by O_K.


An algebraic integer is clearly algebraic over Q, so the ring of all algebraic integers
is the ring of integers in Q̄, an algebraic closure of Q. Examples of algebraic integers
include √2, ∛5, etc. since these elements are certainly roots of monic polynomials
with coefficients in Z. The definition of an algebraic integer α is that α be a root
of some monic polynomial in Z[x], a condition which seems difficult to check. The
next proposition gives a simple criterion for α to be an algebraic integer in terms of the
minimal polynomial for α.

Proposition 28. An element α in some field extension of Q is an algebraic integer
if and only if α is algebraic over Q and its minimal polynomial m_{α,Q}(x) has integer
coefficients. In particular, the algebraic integers in Q are the integers Z, i.e., O_Q = Z.

Proof: If α is algebraic over Q with m_{α,Q}(x) ∈ Z[x], then by definition α is integral
over Z. Conversely, assume α is integral over Z, and let f(x) be a monic polynomial in
Z[x] of minimum degree having α as a root. If f were reducible in Q[x], then by Gauss’
Lemma f(x) = g(x)h(x) for some monic polynomials g(x), h(x) in Z[x] of degree
smaller than the degree of f. But then α would be a root of either g or h, contradicting
the minimality of f. Hence f is irreducible in Q[x], so f(x) = m_{α,Q}(x) and so the
minimal polynomial for α has coefficients in Z. Finally, α = a/b ∈ Q (a/b reduced to
lowest terms and b > 0) is a root of bx − a, so its minimal polynomial x − a/b has integer
coefficients if and only if b = 1, so α ∈ Q is an algebraic integer if and only if α ∈ Z.

Because the integers Z are the algebraic integers in Q, for emphasis (and clarity)
the elements of Z are sometimes referred to as the “rational integers” to distinguish
them from the “integers” in extensions of finite degree over Q (called number fields).
The next result gives some of the basic structure of the ring of integers in a general
number field.

Theorem 29. Let K be a number field of degree n over Q.

(1) The ring O_K of integers in K is a Noetherian ring and is a free Z-module of
rank n.

(2) For every β ∈ K there is some nonzero d ∈ Z such that dβ is an algebraic
integer. In particular, K is the field of fractions of O_K.

(3) If β_1, β_2, ..., β_n is any Q-basis of K, then there is an integer d such that
dβ_1, dβ_2, ..., dβ_n is a basis for a free Z-submodule of O_K of rank n. Any basis
of the Z-module O_K is also a basis for K as a vector space over Q.

Proof: Note first that any Z-linear dependence relation among elements in O_K is a
Q-linear dependence relation in K, and multiplying a Q-linear dependence relation of
elements of O_K in K by a common denominator for the coefficients yields a Z-linear
dependence relation in O_K. Let β be any element of K and let x^k + a_{k−1}x^{k−1} + ⋯ + a_0 be
the minimal polynomial of β over Q. If d is a common denominator for the coefficients,
then multiplying through by d^k shows that

\[ (d\beta)^k + d a_{k-1}(d\beta)^{k-1} + \cdots + d^{k-1}a_1(d\beta) + d^k a_0 = 0, \]

and d^k a_0, d^{k−1}a_1, ..., da_{k−1} ∈ Z. Hence dβ is an algebraic integer, which proves the
first part of (2) and then the second statement in (2) follows immediately.
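
The denominator-clearing step is mechanical and can be sketched as follows. This is an illustration under stated assumptions, not part of the text: the input is the list of rational coefficients a_0, ..., a_{k−1} of a monic polynomial satisfied by β, d is taken to be the least common multiple of the denominators (any common denominator would do), and the function name `clear_denominators` is hypothetical.

```python
from fractions import Fraction
from math import lcm   # available in Python 3.9+


def clear_denominators(a):
    """Scale a monic rational polynomial to a monic integer one.

    a = [a_0, ..., a_{k-1}] are the rational coefficients of
    x^k + a_{k-1} x^{k-1} + ... + a_0, satisfied by beta.
    Returns (d, b) where b = [b_0, ..., b_{k-1}] are integers with
    y^k + b_{k-1} y^{k-1} + ... + b_0 = 0 for y = d * beta,
    namely b_i = d^(k-i) * a_i.
    """
    a = [Fraction(x) for x in a]
    k = len(a)
    d = lcm(*(x.denominator for x in a))
    b = [d ** (k - i) * a[i] for i in range(k)]
    return d, [int(x) for x in b]


# Example: beta a root of x^2 - (1/2)x + 1/3 (irreducible over Q).
d, b = clear_denominators([Fraction(1, 3), Fraction(-1, 2)])
# d * beta is then a root of y^2 + b[1]*y + b[0].
```

In this example d = 6, and 6β satisfies a monic polynomial with integer coefficients, so 6β is an algebraic integer by definition.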

If β_1, ..., β_n are a Q-basis for K over Q, then there is a nonzero integer d such that
dβ_1, ..., dβ_n all lie in O_K. These elements are still linearly independent over Q, so in
particular are independent over Z, hence generate a free submodule of O_K of rank n,
which proves the first statement in (3).

Since O_K is a subring of the field K, it is a torsion free Z-module. If O_K were
contained in some finitely generated Z-module it would follow that O_K is also finitely
generated over Z, hence is a free Z-module. If L is the Galois closure of K, then
O_K ⊆ O_L and so it suffices to see that O_L is contained in a finitely generated Z-module.
Let α_1, ..., α_m be a Q-basis for L. Multiplying by an integer d ∈ Z, if
necessary, we may assume that each α_i is an algebraic integer, i.e., α_1, ..., α_m ∈ O_L.
For each fixed θ ≠ 0 in L, the map

\[ T_\theta : L \to \mathbb{Q} \quad \text{defined by} \quad T_\theta(\alpha) = \operatorname{Tr}_{L/\mathbb{Q}}(\theta\alpha) \]

(where Tr_{L/Q} denotes the trace map from L to Q, cf. Exercise 18 in Section 14.2) is
a Q-linear transformation from L to Q. This linear transformation is nonzero because
T_θ(θ^{−1}) = Tr_{L/Q}(1) = m. It follows that the map from L to Hom_Q(L, Q) mapping θ
to T_θ is an injective homomorphism of vector spaces over Q. Since both spaces have
the same dimension over Q, the map is an isomorphism. Put another way, every linear
functional on L is of the form T_θ for some θ ∈ L. In particular, there are elements
α′_1, ..., α′_m in L whose corresponding linear transformations T_{α′_i} give the dual basis of
α_1, ..., α_m, i.e.,

\[ \operatorname{Tr}_{L/\mathbb{Q}}(\alpha'_i \alpha_j) = \begin{cases} 1, & \text{if } i = j \\ 0, & \text{otherwise.} \end{cases} \]

Since α′_1, ..., α′_m are linearly independent, they give a basis for L over Q. Hence every
element β ∈ O_L can be written

\[ \beta = a_1\alpha'_1 + \cdots + a_i\alpha'_i + \cdots + a_m\alpha'_m \]

with a_1, ..., a_m ∈ Q. Multiplying by α_j and taking the trace shows that

\[ \operatorname{Tr}_{L/\mathbb{Q}}(\beta\alpha_j) = a_1\operatorname{Tr}_{L/\mathbb{Q}}(\alpha'_1\alpha_j) + \cdots + a_i\operatorname{Tr}_{L/\mathbb{Q}}(\alpha'_i\alpha_j) + \cdots + a_m\operatorname{Tr}_{L/\mathbb{Q}}(\alpha'_m\alpha_j) = a_j. \]

But β and α_j are both elements of O_L, so βα_j also is an element of O_L, and this implies
that a_j = Tr_{L/Q}(βα_j) is an element of Z (cf. Exercise 18(d) of Section 14.2). It follows
that

\[ \mathcal{O}_L \subseteq \mathbb{Z}\alpha'_1 + \cdots + \mathbb{Z}\alpha'_m \]

so that O_L is contained in a finitely generated Z-module, proving that O_L (and also
O_K) is a free Z-module.

Since K has dimension n as a vector space over Q, it follows that O_K is a free Z-module
of rank at most n (by Theorem 5 of Section 12.1). Because O_K also contains a
free Z-submodule of rank n, it follows that the Z-rank of O_K is precisely n, proving (1),
and then the second statement in (3) follows by the remarks on Z-linear and Q-linear
dependence relations.

Finally, any ideal I in O_K is a Z-submodule of a free Z-module of rank n, so is a
free Z-module of rank at most n, and a set of Z-module generators for I is also a set
of O_K-generators. Hence every ideal of O_K can be generated by at most n elements,
which implies that O_K is a Noetherian ring and completes the proof.

Definition. An integral basis for the number field K is a basis of the ring of integers
in K considered as a free Z-module of rank [K : Q].


If P is a nonzero prime ideal in the ring of integers O_K of a number field K then
P ∩ Z is a prime ideal in Z. If α ∈ P is nonzero, then the constant term of the minimal
polynomial for α over Q is a nonzero element in P ∩ Z, which shows that P ∩ Z = pZ is also a
nonzero prime ideal in Z. By Theorem 26, every prime ideal (p) in Z arises in this
way. Since pZ is a maximal ideal, it also follows from (2) in Theorem 26 that nonzero
prime ideals in O_K are maximal, and then by Corollary 27, there are finitely many
prime ideals P in O_K with P ∩ Z = pZ. We shall see later (Corollary 16 in Section
16.3) that every nonzero ideal in the ring of integers of a number field can be written
uniquely as the product of prime ideals, and in the case of the ideal pO_K the distinct
prime factors are precisely the finitely many ideals P in O_K with P ∩ Z = pZ. This
property replaces the unique factorization of elements in O_K into primes (which need
not hold since O_K need not be a U.F.D.). We shall also see that primary ideals in O_K
are powers of prime ideals (in fact this is equivalent to the unique factorization of ideals
of O_K into products of prime ideals, cf. the exercises).


Example: (The Ring of Integers in Quadratic Extensions of Q) 

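If $K$ is a quadratic extension of $\mathbb{Q}$ then $K = \mathbb{Q}(\sqrt{D})$ for some squarefree integer $D$. Then

$\mathcal{O}_K = \mathbb{Z}[\omega] = \mathbb{Z} \cdot 1 + \mathbb{Z} \cdot \omega,$

with integral basis $1, \omega$, where

$\omega = \sqrt{D}$ if $D \equiv 2, 3 \pmod 4$,
$\omega = (1 + \sqrt{D})/2$ if $D \equiv 1 \pmod 4$.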


This is the quadratic integer ring introduced in Section 7.1. Since $\omega$ satisfies $\omega^2 - D = 0$
(respectively, $\omega^2 - \omega + (1 - D)/4 = 0$) for $D \equiv 2, 3 \pmod 4$ (respectively, $D \equiv 1 \pmod 4$), it
follows that $\omega$ is an algebraic integer in $K$ and so $\mathbb{Z}[\omega] \subseteq \mathcal{O}_K$. To prove that this is the full
ring of integers in $K$, let $\alpha = a + b\sqrt{D}$ with $a, b \in \mathbb{Q}$, and suppose that $\alpha$ is an algebraic
integer. If $b = 0$, then $\alpha \in \mathbb{Q}$ and so $\alpha \in \mathbb{Z}$. If $b \ne 0$, the minimal polynomial of $\alpha$ is
$x^2 - 2ax + (a^2 - b^2D)$. Then Proposition 28 shows that $2a$ and $a^2 - b^2D$ are elements
of $\mathbb{Z}$. Then $4(a^2 - b^2D) = (2a)^2 - (2b)^2D \in \mathbb{Z}$, hence $4b^2D \in \mathbb{Z}$. Since $D$ is squarefree
it follows that $2b$ is an integer. Write $a = x/2$ and $b = y/2$ for some integers $x, y$. Since
$a^2 - b^2D$ is an integer, $x^2 - y^2D \equiv 0 \pmod 4$. Since 0 and 1 are the only squares mod 4
and $D$ is not divisible by 4, it is easy to check that the only possibilities are the following:

(i) $D \equiv 2$ or $3 \pmod 4$ and $x, y$ are both even, or

(ii) $D \equiv 1 \pmod 4$ and $x, y$ are both even or both odd.

In case (i), $a, b \in \mathbb{Z}$ and $\alpha \in \mathbb{Z}[\omega]$. In case (ii), $a + b\sqrt{D} = r + s\omega$ where $r = (x - y)/2$
and $s = y$ are both integers, so again $\alpha \in \mathbb{Z}[\omega]$.
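
The "easy to check" step in this example can be confirmed by brute force. The following is a minimal sketch, not part of the original text: the function name `parity_patterns` is hypothetical, and the check relies only on the fact that $x^2 - y^2D$ modulo 4 depends on $x$, $y$ and $D$ modulo 4.

```python
from itertools import product

def parity_patterns(D):
    """Return the parities (x mod 2, y mod 2) for which x^2 - D*y^2
    is divisible by 4. Residues mod 4 suffice because squares mod 4
    depend only on the residue of the base."""
    return sorted({
        (x % 2, y % 2)
        for x, y in product(range(4), repeat=2)
        if (x * x - D * y * y) % 4 == 0
    })

# D = 2, 3 mod 4: only "both even" survives (case (i)).
# D = 1 mod 4: "both even" or "both odd" (case (ii)).
for D in (2, 3, -1, 5, -3):
    print(D, parity_patterns(D))
```

Here `(0, 0)` stands for "both even" and `(1, 1)` for "both odd"; the mixed parities never occur for squarefree $D$.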


Example: (The Ring of Integers in Cyclotomic Fields) 

The ring of integers in the cyclotomic field $\mathbb{Q}(\zeta_n)$ of $n$-th roots of unity is $\mathbb{Z}[\zeta_n]$, where
$\zeta_n$ is any primitive $n$-th root of 1. The elements $1, \zeta_n, \dots, \zeta_n^{\varphi(n)-1}$ are an integral basis. It is
clear that $\zeta_n$ is an algebraic integer since it is a root of $x^n - 1$, so the ring $\mathbb{Z}[\zeta_n]$ is contained
in the ring of integers. The proof that this is the full ring of algebraic integers in $\mathbb{Q}(\zeta_n)$
involves techniques from algebraic number theory beyond the scope of the material here.
Noether’s Normalization Lemma and Hilbert’s Nullstellensatz

We now apply some of the techniques from the algebraic theory of integral ring exten- 
sions to affine geometry. 

Definition. If $k$ is a field the elements $y_1, y_2, \dots, y_q$ in some $k$-algebra are called
algebraically independent over $k$ if there is no nonzero polynomial $p$ in $q$ variables
over $k$ such that $p(y_1, y_2, \dots, y_q) = 0$.


Thus $y_1, y_2, \dots, y_q$ are algebraically independent if and only if the $k$-algebra
homomorphism from the polynomial ring $k[x_1, \dots, x_q]$ to $k[y_1, \dots, y_q]$ defined by $x_i \mapsto y_i$
is an isomorphism. Elements in a field extension of $k$ are algebraically independent if
and only if they are independent transcendentals over $k$.


Theorem 30. (Noether’s Normalization Lemma) Let $k$ be a field and suppose that
$A = k[r_1, r_2, \dots, r_m]$ is a finitely generated $k$-algebra. Then for some $q$, $0 \le q \le m$,
there are algebraically independent elements $y_1, y_2, \dots, y_q \in A$ such that $A$ is integral
over $k[y_1, y_2, \dots, y_q]$.


Proof: Proceed by induction on $m$. If $r_1, \dots, r_m$ are algebraically independent
over $k$ then take $y_i = r_i$, $i = 1, \dots, m$. Otherwise, there exists a nonzero polynomial
$f(x_1, \dots, x_m) \in k[x_1, \dots, x_m]$ such that $f(r_1, \dots, r_m) = 0$. The polynomial $f$ is a sum of
monomials of the form $a x_1^{e_1} x_2^{e_2} \cdots x_m^{e_m}$, where the degree of this monomial is $e_1 + \cdots + e_m$ and
the degree, $d$, of $f$ is the maximum of the degrees of its monomials. Renumbering the
variables if necessary, we may assume that $f$ is a nonconstant polynomial in $x_m$ with
coefficients in the ring $k[x_1, x_2, \dots, x_{m-1}]$. We now perform a change of variables that
transforms (or “normalizes”) $f$ into a monic polynomial in $x_m$ with coefficients from a
subring of $A$ which is generated over $k$ by $m - 1$ elements, at which point we shall be
able to apply induction.

Define integers $\alpha_i = (1 + d)^i$ and new variables $X_i = x_i - x_m^{\alpha_i}$ for $1 \le i \le m - 1$.
Let

$g(X_1, X_2, \dots, X_{m-1}, x_m) = f(X_1 + x_m^{\alpha_1}, X_2 + x_m^{\alpha_2}, \dots, X_{m-1} + x_m^{\alpha_{m-1}}, x_m)$

so $g \in k[X_1, \dots, X_{m-1}, x_m]$. Each monomial term of $f$ contributes a single term of the
form a constant times $x_m^e$ to $g$, where $e = e_m + e_1\alpha_1 + \cdots + e_{m-1}\alpha_{m-1}$. It is also easy to
check that the choice of $\alpha_i$ ensures that distinct monomials in $f$ give different values of $e$
(for example by viewing the degrees of the monomials in the new variables as integers
expressed in base $b = d + 1$). If $N$ is the highest power of $x_m$ that occurs, then it follows that

$g = c\,x_m^N + \sum_{i=0}^{N-1} h_i(X_1, \dots, X_{m-1})\,x_m^i$

for some nonzero $c \in k$. If now $s_i = r_i - r_m^{\alpha_i}$ then

$\frac{1}{c}\,g(s_1, s_2, \dots, s_{m-1}, r_m) = \frac{1}{c}\,f(r_1, r_2, \dots, r_{m-1}, r_m) = 0,$

which shows that $r_m$ is integral over $B = k[s_1, \dots, s_{m-1}]$. Each $r_i$ for $1 \le i \le m - 1$
is integral over $B[r_m]$ since $r_i$ is a root of the monic polynomial $x - s_i - r_m^{\alpha_i}$, so $A$ is
integral over $B[r_m]$. By transitivity of integrality, $A$ is therefore integral over $B$. Since
$B$ is a $k$-algebra generated by $m - 1$ elements, induction completes the proof.
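
The base-$(d+1)$ argument in the proof can be tried on small cases. The sketch below is illustrative only: the function name `pure_power_exponents` and the representation of $f$ by its list of exponent tuples $(e_1, \dots, e_m)$ are assumptions made here, not notation from the text. It computes, for each monomial, the exponent $e = e_m + e_1(1+d) + \cdots + e_{m-1}(1+d)^{m-1}$ of the pure power of $x_m$ that the monomial contributes after the substitution.

```python
def pure_power_exponents(exponents):
    """Map each exponent tuple (e_1, ..., e_m) of f to the exponent
    e = e_m + e_1*(1+d) + ... + e_{m-1}*(1+d)**(m-1) of the pure
    x_m-power it contributes after substituting
    x_i = X_i + x_m**((1+d)**i) for 1 <= i <= m-1."""
    d = max(sum(e) for e in exponents)   # total degree of f
    base = d + 1                         # every e_i is a digit < base
    return {
        e: e[-1] + sum(e[i] * base ** (i + 1) for i in range(len(e) - 1))
        for e in exponents
    }

# Hypothetical example: f = x1*x2 - 1, with monomials x1*x2 and 1.
hyperbola = pure_power_exponents([(1, 1), (0, 0)])
print(hyperbola)
```

For this $f$ one has $d = 2$ and $\alpha_1 = 3$, and indeed $g = (X_1 + x_2^3)x_2 - 1 = x_2^4 + X_1x_2 - 1$ is monic in $x_2$, with top exponent 4 coming from the monomial $x_1x_2$.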

A more “geometric” interpretation of Noether’s Normalization Lemma is indicated
in Exercise 15. We next use the Normalization Lemma to prove that if $k$ is an
algebraically closed field then the maximal ideals of the polynomial ring $k[x_1, x_2, \dots, x_n]$
are of the form $(x_1 - a_1, \dots, x_n - a_n)$ for some $a_1, \dots, a_n \in k$. Viewing $k[x_1, x_2, \dots, x_n]$
as the ring of polynomial functions on $\mathbb{A}^n$, this says that the maximal ideals correspond
to the kernels of evaluation maps at points of $\mathbb{A}^n$, similar to the corresponding result
for rings of continuous functions on a compact set (cf. Exercises 33, 34 in Section 7.4).

Theorem 31. (Hilbert's Nullstellensatz, Weak Form) Let $k$ be an algebraically closed
field. Then $M$ is a maximal ideal in the polynomial ring $k[x_1, x_2, \dots, x_n]$ if and only if
$M = (x_1 - a_1, \dots, x_n - a_n)$ for some $a_1, \dots, a_n \in k$. Equivalently, the maps $\mathcal{Z}$ and $\mathcal{I}$
give a bijective correspondence

$\{\text{points in } \mathbb{A}^n\} \;\underset{\mathcal{Z}}{\overset{\mathcal{I}}{\rightleftarrows}}\; \{\text{maximal ideals in } k[\mathbb{A}^n]\}.$

Moreover, if $I$ is any proper ideal in $k[x_1, x_2, \dots, x_n]$ then $\mathcal{Z}(I) \ne \emptyset$.

Proof: Certainly $(x_1 - a_1, \dots, x_n - a_n)$ is a maximal ideal in $k[x_1, x_2, \dots, x_n]$.
Conversely, for any maximal ideal $M$ in $k[x_1, x_2, \dots, x_n]$, let $E = k[x_1, x_2, \dots, x_n]/M$.
Then $E$ is a field containing $k$ that is finitely generated over $k$ (by the images
$\bar{x}_1, \dots, \bar{x}_n$ of $x_1, \dots, x_n$). By Noether’s Normalization Lemma, $E$ is integral over a
polynomial ring $k[y_1, \dots, y_q]$. Then $k[y_1, \dots, y_q]$ is a field by Theorem 26(1), and since a
polynomial ring in one or more variables is never a field, it follows that $q = 0$. Hence $E$ is
integral over $k$, so $E$ is algebraic over $k$. Because $k$ is algebraically closed, $E = k$, i.e.,
$\bar{x}_i \in k$ for $1 \le i \le n$. Hence for $i = 1, \dots, n$ there is some $a_i \in k$ such that $x_i - a_i \in M$.
This means that the maximal ideal $(x_1 - a_1, \dots, x_n - a_n)$ is contained in $M$, so
$M = (x_1 - a_1, \dots, x_n - a_n)$.
Finally, if $I$ is any proper ideal in $k[x_1, x_2, \dots, x_n]$ then $I$ is contained in a maximal
ideal $M = (x_1 - a_1, \dots, x_n - a_n)$, and so $(a_1, \dots, a_n) \in \mathcal{Z}(I)$.

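Theorem 32. (Hilbert's Nullstellensatz) Let $k$ be an algebraically closed field. Then
$\mathcal{I}(\mathcal{Z}(I)) = \mathrm{rad}\, I$ for every ideal $I$ of $k[x_1, x_2, \dots, x_n]$. Moreover, the maps $\mathcal{Z}$ and $\mathcal{I}$
define inverse bijections

$\{\text{affine algebraic sets}\} \;\underset{\mathcal{Z}}{\overset{\mathcal{I}}{\rightleftarrows}}\; \{\text{radical ideals}\}.$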

Proof: Since $\mathrm{rad}\, I \subseteq \mathcal{I}(\mathcal{Z}(I))$ it remains to prove the reverse inclusion. By
Hilbert’s Basis Theorem, $I = (f_1, f_2, \dots, f_m)$. Let $g \in \mathcal{I}(\mathcal{Z}(I))$. Introduce a new
variable $x_{n+1}$ and consider the ideal $I'$ generated by $f_1, \dots, f_m$ and $x_{n+1}g - 1$ in
$k[x_1, \dots, x_n, x_{n+1}]$. At any point of $\mathbb{A}^{n+1}$ where $f_1, \dots, f_m$ vanish the polynomial $g$
also vanishes since $g \in \mathcal{I}(\mathcal{Z}(I))$, so that $x_{n+1}g - 1$ is nonzero. Hence $\mathcal{Z}(I') = \emptyset$ in
$\mathbb{A}^{n+1}$. By the Weak Form of the Nullstellensatz, $I'$ cannot be a proper ideal, i.e., $1 \in I'$.
Write

$1 = a_1f_1 + \cdots + a_mf_m + a_{m+1}(x_{n+1}g - 1)$ for some $a_i \in k[x_1, \dots, x_{n+1}]$.

Letting $y = 1/x_{n+1}$ and multiplying by a high power of $y$ in this equation shows that

$y^N = c_1f_1 + \cdots + c_mf_m + c_{m+1}(g - y)$ for some $c_i \in k[x_1, \dots, x_n, y]$.

Substituting $g$ for $y$ in this polynomial equation shows that $g^N \in I$ (in $k[x_1, \dots, x_n]$),
i.e., $g \in \mathrm{rad}\, I$. Hence $\mathcal{I}(\mathcal{Z}(I)) \subseteq \mathrm{rad}\, I$ and so $\mathcal{I}(\mathcal{Z}(I)) = \mathrm{rad}\, I$, completing the proof.
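
To see the three steps of this argument on a concrete case, here is a small worked instance. It is an illustration chosen for this purpose, not an example from the text: take $n = 1$, $I = (f_1)$ with $f_1 = x^2$, and $g = x$, so the new variable is $x_2$.

```latex
% Assumed illustrative case: n = 1, I = (x^2), g = x, new variable x_2.
% Step 1: 1 lies in I' = (x^2, x_2 x - 1) inside k[x, x_2]:
1 = x_2^2 \cdot x^2 - (x_2 x + 1)(x_2 x - 1)
% Step 2: set y = 1/x_2 and multiply by y^2 (so N = 2):
y^2 = 1 \cdot x^2 - (x + y)(x - y)
% Step 3: substitute y = g = x, so the last term vanishes:
g^2 = x^2 \in I
```

In the notation of the proof, $a_1 = x_2^2$, $a_2 = -(x_2x + 1)$, $c_1 = 1$ and $c_2 = -(x + y)$, and the conclusion $g^2 \in I$ recovers $x \in \mathrm{rad}\,(x^2)$.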

It follows directly from Proposition 12 and Theorem 26(2) that if $S$ is an integral
extension of $R$ with $1 \in R$ and if $I$ is an ideal of $R$, then

$(\mathrm{rad}_S\, IS) \cap R = \mathrm{rad}_R\, I$

where $IS$ is the ideal generated by $I$ in $S$, and the subscript indicates the ring in which
the radicals are being computed. This has the following geometric interpretation.

Corollary 33. (Variant of Hilbert's Nullstellensatz) If $k$ is any field with algebraic
closure $\bar{k}$ and $I$ is an ideal in $k[x_1, x_2, \dots, x_n]$, then $\mathcal{I}_k(\mathcal{Z}_{\bar{k}}(I)) = \mathrm{rad}\, I$, where $\mathcal{Z}_{\bar{k}}(I)$
is the zero set in $\bar{k}^n$ of the polynomials in $I$ and $\mathcal{I}_k(\mathcal{Z}_{\bar{k}}(I))$ is the ideal of polynomials
in $k[x_1, x_2, \dots, x_n]$ vanishing at all the points in $\mathcal{Z}_{\bar{k}}(I)$. In particular, $I = (1)$ if and
only if there are no common zeros in $\bar{k}^n$ of the polynomials in $I$.

Proof: Since $\bar{k}[x_1, x_2, \dots, x_n]$ is an integral extension of $k[x_1, x_2, \dots, x_n]$
(generated by the integral elements $\bar{k}$), the corollary follows immediately from Theorem 32
and the remarks on radicals above.

From the Nullstellensatz we now have a dictionary between geometric and ring- 
theoretic objects over the algebraically closed field k: 


Geometry                          Algebra

affine algebraic set $V$          coordinate ring $k[V]$
points of $V$                     maximal ideals of $k[V]$
affine algebraic subsets in $V$   radical ideals of $k[V]$
subvarieties in $V$               prime ideals in $k[V]$
morphism $\varphi : V \to W$      $k$-algebra homomorphism $\tilde{\varphi} : k[W] \to k[V]$

Computing Radicals 

There are algorithms for computing radicals and primary decompositions in polynomial
rings using Gröbner bases. While they are relatively elementary, they are somewhat
technical and so we limit our discussion here to some preliminary results.

For hypersurfaces $V = \mathcal{Z}(f)$ defined by a single polynomial $f \in k[x_1, \dots, x_n]$,
determining $\mathcal{I}(V) = \mathrm{rad}\,(f)$ is straightforward. Since $k[x_1, \dots, x_n]$ is a U.F.D., $f$
factors uniquely as the product of powers of nonassociate irreducibles: $f = p_1^{a_1} \cdots p_s^{a_s}$
and then $\mathrm{rad}\,(f)$ is generated by $p_1 \cdots p_s$ (the ‘squarefree part’ of $f$).
Example

Suppose $W = \mathcal{Z}(J)$ with $J = (u^3 - uv^2 + v^3) \subseteq \mathbb{Q}[u, v]$. The polynomial $x^3 - x + 1$ is
irreducible over $\mathbb{Q}$, so $f = u^3 - uv^2 + v^3$ is irreducible in $\mathbb{Q}[u, v]$. Hence $\mathrm{rad}\, J = J$ and
$\mathcal{I}(W) = J$.


For nonprincipal ideals $I$, determining $\mathrm{rad}\, I$ is more complicated. The following
proposition (based on Hilbert’s Nullstellensatz) gives a criterion determining when an
element is contained in $\mathrm{rad}\, I$.

Proposition 34. Suppose $k$ is any field. If $I = (f_1, \dots, f_s)$ is a proper ideal in
$k[x_1, \dots, x_n]$, then $f \in \mathrm{rad}\, I$ if and only if $(f_1, \dots, f_s, 1 - yf) = k[x_1, \dots, x_n, y]$.


Proof: By Corollary 33, $(f_1, \dots, f_s, 1 - yf) = k[x_1, \dots, x_n, y]$ if and only if the
equations

$1 - yf(x_1, \dots, x_n) = 0, \quad f_1(x_1, \dots, x_n) = 0, \quad \dots, \quad f_s(x_1, \dots, x_n) = 0$

have no common zero over the algebraic closure $\bar{k}$ of $k$. For a given $(a_1, \dots, a_n) \in \bar{k}^n$,
the equation $1 - yf(a_1, \dots, a_n) = 0$ has a solution $y$ unless $f(a_1, \dots, a_n) = 0$. Hence,
the system of equations has no common zero if and only if for every $(a_1, \dots, a_n) \in \bar{k}^n$
with $f_1(a_1, \dots, a_n) = \cdots = f_s(a_1, \dots, a_n) = 0$ we also have $f(a_1, \dots, a_n) = 0$.
Equivalently, if $(a_1, \dots, a_n) \in \mathcal{Z}_{\bar{k}}(I)$, then also $f(a_1, \dots, a_n) = 0$, i.e., we have
$f \in \mathcal{I}_k(\mathcal{Z}_{\bar{k}}(I)) = \mathrm{rad}\, I$, by Corollary 33.

Since the reduced Gröbner basis (with respect to any fixed monomial ordering)
for an ideal is unique, we immediately obtain the following algorithmic method for
determining when a polynomial lies in the radical of an ideal.

Corollary 35. Suppose $I = (f_1, \dots, f_s)$ in $k[x_1, \dots, x_n]$. Then $f \in \mathrm{rad}\, I$ if and only
if $\{1\}$ is the reduced Gröbner basis for the ideal $(f_1, \dots, f_s, 1 - yf)$ in $k[x_1, \dots, x_n, y]$
with respect to any monomial ordering.

Example 

Consider $I = (x^2 - y^2, xy)$ in $k[x, y]$. The reduced Gröbner basis for $(x^2 - y^2, xy, 1 - tx)$
in $k[x, y, t]$ with respect to the order $x > y > t$ is $\{1\}$, showing $x \in \mathrm{rad}(I)$. To determine
the smallest power of $x$ lying in $I$, we find that the ideal $(x^2 - y^2, xy, x^3)$ in $k[x, y]$ has
the same reduced Gröbner basis as $I$ (namely $\{x^2 - y^2, xy, y^3\}$), but $(x^2 - y^2, x^2, xy)$ has
basis $\{x^2, xy, y^2\}$. It follows that $x^3 \in I$ and $x^2 \notin I$ (alternatively, $x^2$ leaves a nonzero
remainder after general polynomial division by $\{x^2 - y^2, xy, y^3\}$, but $x^3$ has a remainder
of 0). By a similar computation (or by symmetry), $y \in \mathrm{rad}\, I$, with $y^3 \in I$ but $y^2 \notin I$.
Since $(x, y) \subseteq \mathrm{rad}\, I$ and $(x, y)$ is a maximal ideal, it follows that $\mathrm{rad}\, I = (x, y)$.
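
The memberships $x^3 \in I$ and $y^3 \in I$ in this example can also be checked by exhibiting explicit combinations of the generators. The sketch below does this with a tiny hand-rolled polynomial arithmetic; the helper names `mul` and `add` and the dictionary representation `{(deg_x, deg_y): coeff}` are assumptions made for illustration, and the cofactors were found by inspection rather than by a Gröbner basis routine.

```python
from collections import defaultdict

def mul(p, q):
    """Multiply polynomials in x, y stored as {(deg_x, deg_y): coeff}."""
    out = defaultdict(int)
    for (a, b), c in p.items():
        for (d, e), f in q.items():
            out[(a + d, b + e)] += c * f
    return {m: c for m, c in out.items() if c != 0}

def add(p, q):
    """Add two polynomials, dropping zero coefficients."""
    out = defaultdict(int)
    for poly in (p, q):
        for m, c in poly.items():
            out[m] += c
    return {m: c for m, c in out.items() if c != 0}

x, y, minus_y = {(1, 0): 1}, {(0, 1): 1}, {(0, 1): -1}
g1 = {(2, 0): 1, (0, 2): -1}   # x^2 - y^2
g2 = {(1, 1): 1}               # xy

x_cubed = add(mul(x, g1), mul(y, g2))        # x*(x^2 - y^2) + y*(xy)
y_cubed = add(mul(x, g2), mul(minus_y, g1))  # x*(xy) - y*(x^2 - y^2)
print(x_cubed, y_cubed)
```

Because the coefficients used are integers, the two identities hold over any field $k$. That $x^2 \notin I$ is not shown by this computation; it follows from the Gröbner basis argument in the example.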


Some additional results for computing radicals are presented in the exercises. 
EXERCISES


Let $R$ be a subring of the commutative ring $S$ with $1 \in R$.

1. Use the fact that a U.F.D. is integrally closed to prove that the Gaussian integers, $\mathbb{Z}[i]$, is
the ring of integers in $\mathbb{Q}(i)$.

2. Suppose $k$ is a field and let $t = x/y$ in the field of fractions of the integral domain
$R = k[x, y]/(x^2 - y^3)$. Prove that $K = k(t)$ is the fraction field of $R$ and $k[t]$ is the
integral closure of $R$ in $K$.

3. Suppose $k$ is a field and $i$ and $j$ are relatively prime positive integers. Find the normalization
of the integral domain $R = k[x, y]/(x^i - y^j)$ (cf. Exercise 14, Section 9.1).

4. Suppose $k$ is a field and let $P$ be the ideal $(y^2 - x^3 - x^2)$ in the polynomial ring $k[x, y]$. Prove
that $P$ is a prime ideal and find the normalization of the integral domain $R = k[x, y]/P$.
[To prove $P$ is prime, show that $y^2 - x^3 - x^2$ is irreducible in the U.F.D. $k[x, y]$. Then
consider $t = y/x$ in the field of fractions of $R$.]

5. If $R$ is an integral domain with field of fractions $F$, show that $F$ is a finitely generated
$R$-module if and only if $R = F$.

6. For each of the following give specific rings $R \subseteq S$ and explicit ideals in these rings that
exhibit the specified relation:

(a) an ideal $I$ of $R$ such that $I \ne SI \cap R$ (so the contraction of the extension of an ideal
$I$ need not equal $I$)

(b) a prime ideal $P$ of $R$ such that there is no prime ideal $Q$ of $S$ with $P = Q \cap R$

(c) a maximal ideal $M$ of $S$ such that $M \cap R$ is not maximal in $R$

(d) a prime ideal $P$ of $R$ whose extension $PS$ to $S$ is not a prime ideal in $S$

(e) an ideal $J$ of $S$ such that $J \ne (J \cap R)S$ (so the extension of the contraction of an ideal
$J$ need not equal $J$).

7. Let $\mathcal{O}_K$ be the ring of integers in a number field $K$.

(a) Suppose that every nonzero ideal $I$ of $\mathcal{O}_K$ can be written as the product of powers
of prime ideals. Prove that an ideal $Q$ of $\mathcal{O}_K$ is $P$-primary if and only if $Q = P^m$
for some $m \ge 1$. [Show first that since nonzero primes in $\mathcal{O}_K$ are maximal that
$P_1^{m_1} \subseteq P_2^{m_2}$ for distinct nonzero primes $P_1, P_2$ implies $P_1 = P_2$.]

(b) Suppose that an ideal $Q$ of $\mathcal{O}_K$ is $P$-primary if and only if $Q = P^m$ for some $m \ge 1$.
Assuming all of Theorem 21, prove that every nonzero ideal $I$ of $\mathcal{O}_K$ can be written
uniquely as the product of powers of prime ideals. [Prove that $P_1^{m_1}$ and $P_2^{m_2}$ are
comaximal ideals if $P_1$ and $P_2$ are distinct nonzero prime ideals and use the Chinese
Remainder Theorem.]

8. Prove that if $s_1, \dots, s_n \in S$ are integral over $R$, then the ring $R[s_1, \dots, s_n]$ is a finitely
generated $R$-module.

9. Suppose that $S$ is integral over $R$ and that $P$ is a prime ideal in $R$. Prove that every element $s$
in the ideal $PS$ generated by $P$ in $S$ satisfies an equation $s^n + a_{n-1}s^{n-1} + \cdots + a_1s + a_0 = 0$
where the coefficients $a_0, a_1, \dots, a_{n-1}$ are elements of $P$. [If $s = p_1s_1 + \cdots + p_ms_m \in PS$,
show that $T = R[s_1, \dots, s_m]$ satisfies the hypotheses in Proposition 23(3). Follow the
proof in Proposition 23 that $s$ is integral, noting that $s \in PT$ so that the $a_{ij}$ are elements
of $P$.]

10. Prove the following generalization of Proposition 28: Suppose $R$ is an integrally closed
integral domain with field of fractions $k$ and $\alpha$ is an element of an extension field $K$
of $k$. Show that $\alpha$ is integral over $R$ if and only if $\alpha$ is algebraic over $k$ and the minimal
polynomial $m_{\alpha,k}(x)$ for $\alpha$ over $k$ has coefficients in $R$. [If $\alpha$ is integral prove the conjugates
of $\alpha$, i.e., the roots of $m_{\alpha,k}(x)$, are also integral, so the elementary symmetric functions of
the conjugates are elements of $k$ that are integral over $R$.]

11. Suppose $R$ is an integrally closed integral domain with field of fractions $k$ and $p(x) \in R[x]$
is a monic polynomial. Show that if $p(x) = a(x)b(x)$ with monic polynomials
$a(x), b(x) \in k[x]$ then $a(x), b(x) \in R[x]$ (compare to Gauss’ Lemma, Proposition 5,
Section 9.3). [See the previous exercise.]

12. Suppose $S$ is an integral domain that is integral over a ring $R$ as in the previous exercise.
If $P$ is a prime ideal in $R$, let $s$ be any element in the ideal $PS$ generated by $P$ in $S$. Prove
that, with the exception of the leading term, the coefficients of the minimal polynomial
$m_{s,k}(x)$ for $s$ over $k$ are elements of $P$. [By Exercise 10, $m_{s,k}(x) \in R[x]$. Exercise 9
shows that $s$ is a root of a monic polynomial $p(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_0$ with
$a_0, \dots, a_{n-1} \in P$. Use the previous exercise to show that $p(x) = m_{s,k}(x)b(x)$ with $b(x)$
in $R[x]$, and consider this equation in the integral domain $(R/P)[x]$.]

The next two exercises extend Exercise 6 in Section 7.5 by characterizing fields that are not 
fields of fractions of any of their proper subrings. 

13. Let $K$ be a field of characteristic 0 and let $A$ be a subring of $K$ maximal with respect to
$1/2 \notin A$. (Such $A$ exists by Zorn’s Lemma.) Let $F$ be the field of fractions of $A$ in $K$.

(a) Show that $K$ is algebraic over $F$. [If $t$ is transcendental over $F$, show that $1/2 \notin A[t]$.]

(b) Show that $A$ is integrally closed in $K$. [Show that $1/2$ is not in the integral closure of
$A$ in $K$.]

(c) Deduce from (a) and (b) that $K = F$.

14. Show that a field $K$ is the field of fractions of some proper subring of $K$ if and only if $K$
is not a subfield of the algebraic closure of a finite field. [If $K$ contains $t$ transcendental
over $\mathbb{F}_p$ argue as in the preceding exercise with $1/t$ in place of $1/2$ to show that $K$ is the
quotient field of some proper subring.]

The next exercise gives a “geometric” interpretation of Noether’s Normalization Lemma,
showing that every affine algebraic set is a finite covering of some affine $n$-space.

15. Let $V$ be an affine algebraic set over an algebraically closed field $k$. Prove that for some
$n$ there is a surjective morphism from $V$ onto $\mathbb{A}^n$ with finite fibers, and that if $V$ is a
variety, then $n$ can be taken to be the dimension of $V$. [By Noether’s Normalization
Lemma the finitely generated $k$-algebra $S = k[V]$ contains a polynomial subalgebra
$R = k[x_1, x_2, \dots, x_n]$ such that $S$ is integral over $R$. Apply Theorem 6 to the inclusion of $R$ in
$S$ to obtain a morphism $\varphi$ from $V$ to $\mathbb{A}^n$. To see that $\varphi$ is surjective with finite fibers, apply
Corollary 27 to the maximal ideal $(x_1 - a_1, \dots, x_n - a_n)$ of $R$ corresponding to a point
$(a_1, \dots, a_n)$ of $\mathbb{A}^n$.]

16. Let $V$ be an affine algebraic set in $\mathbb{C}^n$. Prove that $V$ is compact in the Euclidean topology
(i.e., closed and bounded) if and only if it is finite. [Use Exercise 18 in Section 2, the
previous exercise, and the behavior of compact sets with respect to continuous functions.]

17. Let $R$ be a subring of the commutative ring $S$ with $1_S \in R$ and suppose that $S$ is integral
over $R$. This exercise proves that $R$ and $S$ have the same Krull dimension, cf. Section 16.1.

(a) If $P_1 \subset P_2 \subset \cdots \subset P_n$ is a chain of distinct prime ideals in $R$ prove that there is a
chain $Q_1 \subset Q_2 \subset \cdots \subset Q_n$ of distinct prime ideals in $S$ with $Q_i \cap R = P_i$.

(b) Prove conversely that if $Q_1 \subset Q_2 \subset \cdots \subset Q_n$ is a chain of distinct prime ideals in $S$
and $P_i = Q_i \cap R$ then $P_1 \subset P_2 \subset \cdots \subset P_n$ is a chain of distinct prime ideals in $R$.
[To prove the $P_i$ are distinct, pass to a quotient and reduce the problem to showing that
if $Q$ is a nonzero prime ideal in the integral domain $S$ then $Q \cap R$ is a nonzero prime
ideal in $R$. In this case, if $s \in Q$ is nonzero, show that the constant coefficient of a
polynomial of minimal degree in $R[x]$ satisfied by $s$ is a nonzero element in $Q \cap R$.]

18. Let V = Z(I ) and W = Z(J) where 7 is the ideal ( uv + u) c C[«, u] and J is the ideal 
(— 2y — y 2 + 2z + z 2 , 2x - yz — z 2 ) C C[x, y, z]. 

(a) Show that $I$ and $J$ are prime ideals. Conclude that $I = \mathcal{I}(V)$ and $J = \mathcal{I}(W)$ and that
$V$ and $W$ are varieties.

(b) Show that the map <p : V — ► W defined by <p{(a \ , 02 )) = (a 2 + « 2 , + 02 . &\ — ) 

is an isomorphism. 

19. Let $I = (x^3 + y^3 + z^3, x^2 + y^2 + z^2, (x + y + z)^3) \subset k[x, y, z]$. Use Gröbner bases to
show that $x, y, z \in \mathrm{rad}\, I$ if $\mathrm{ch}(k) \ne 2, 3$.

20. Let $I = (x^3 + y^3 + z^3, xy + xz + yz, xyz) \subset k[x, y, z]$. Use Gröbner bases to show that
$x, y, z \in \mathrm{rad}\, I$.

21. Let $I = (x^4 + y^4 + z^4, x + y + z) \subset k[x, y, z]$.

(a) Use Gröbner bases to show that $xy + xz + yz \in \mathrm{rad}\, I$ if $\mathrm{ch}(k) \ne 2$ and determine
the smallest power of $xy + xz + yz$ contained in $I$. Show that none of $x$, $y$ or $z$ is
contained in $\mathrm{rad}\, I$.

(b) If $J = (x^4 + y^4 + z^4, x + y + z, xy + xz + yz)$ show that the reduced Gröbner basis
of $J$ relative to the lexicographic ordering $x > y > z$ is $\{x + y + z, y^2 + yz + z^2\}$.
Deduce that $k[x, y, z]/J \cong k[y, z]/(y^2 + yz + z^2)$ and that $J$ is radical if $\mathrm{ch}(k) \ne 3$.

(c) If $\mathrm{ch}(k) \ne 2, 3$, show that $\mathrm{rad}\, I = J$.

(d) If $\mathrm{ch}(k) = 3$, show that $\mathrm{rad}\, I = (x - y, y - z)$.

(e) If $\mathrm{ch}(k) = 2$, show that $I = (x + y + z)$ is a prime, hence radical, ideal.

22. Let $I = (x^2y + z^3, x + y^3 - z, 2y^4z - yz^2 - z^3) \subset k[x, y, z]$. Use Gröbner bases to
show that $x, y, z \in \mathrm{rad}\, I$ and conclude that $\mathrm{rad}\, I = (x, y, z)$. Show that $x^9$, $y^7$, $z^9$ are the
smallest powers of $x, y, z$, respectively, lying in $I$.

23. Let $V = \mathcal{Z}(x^3 - x^2z - y^2z)$ and $W = \mathcal{Z}(x^2 + y^2 - z^2)$ in $\mathbb{C}^3$. Show that $\mathcal{I}(V) =
(x^3 - x^2z - y^2z)$ and $\mathcal{I}(W) = (x^2 + y^2 - z^2)$ in $\mathbb{C}[x, y, z]$.

24. Let $V = \mathcal{Z}(x^3 + y^3 + 7z^3) \subset \mathbb{C}^3$. Show that $\mathcal{I}(V) = (x^3 + y^3 + 7z^3)$ in $\mathbb{C}[x, y, z]$.

25. Let $I = (xz + y^2 + z^2, xy - xz + yz - 2z^2)$ and let $K = I + (x^2 - 3y^2 + yz) \subset \mathbb{C}[x, y, z]$.

(a) By Exercise 46 in Section 1, there is an injective $\mathbb{C}$-algebra homomorphism from
$\mathbb{C}[x, y, z]/K$ to $\mathbb{C}[u, v]/(u^3 - uv^2 + v^3)$. Use this together with the example preceding
Proposition 34 to prove that $K$ is a radical ideal and deduce that $\mathrm{rad}\, I \subseteq K$.

(b) Show that $\mathrm{rad}\, I \subseteq (y, z)$.

(c) Show that $K \cap (y, z) = I$ and deduce that $I$ is radical, so that $\mathcal{I}(V) = I$ if $V = \mathcal{Z}(I)$.

(d) Show that $y(x^2 - 3y^2 + yz)$ and $z(x^2 - 3y^2 + yz)$ are elements of $I$ but none of $y$,
$z$, or $x^2 - 3y^2 + yz$ is contained in $I$.

26. Let $I$ be an ideal in $k[x_1, \dots, x_n]$. Prove that the following are equivalent (an ideal
satisfying any of these conditions is called a zero-dimensional ideal because of (d)):

(a) The quotient $k[x_1, \dots, x_n]/I$ has finite dimension as a vector space over $k$.

(b) $I \cap k[x_i] \ne 0$ for each $i = 1, 2, \dots, n$.

(c) If $G$ is any reduced Gröbner basis for $I$ then for each $i = 1, \dots, n$, there is a $g_i \in G$
with leading term $x_i^{n_i}$ for some $n_i \ge 1$.

(d) The set of common zeros $\mathcal{Z}_{\bar{k}}(I)$ of the polynomials in $I$ in an algebraic closure $\bar{k}$ of
$k$ is finite.

[For (a) implies (b) use the injection $k[x_i]/(I \cap k[x_i]) \hookrightarrow k[x_1, \dots, x_n]/I$. For (b) implies
(c) note some $LT(g_i)$ divides the leading term of a generator for $I \cap k[x_i]$. For (c) implies (a)
use Exercise 37 in Section 9.6. Show (b) implies (d). For (d) implies (b) show the product
$m_{a_1,k}(x_i) \cdots m_{a_N,k}(x_i)$ of the minimal polynomials of the $i$-th coordinates $a_1, \dots, a_N$ of
the points in $\mathcal{Z}_{\bar{k}}(I)$ is a nonzero polynomial in $\mathcal{I}(\mathcal{Z}_{\bar{k}}(I))$ and apply Corollary 33.]

27. Let $I$ be a zero-dimensional ideal in $k[x_1, \dots, x_n]$ and let $I'$ be the ideal generated by $I$
in $\bar{k}[x_1, \dots, x_n]$ where $\bar{k}$ is the algebraic closure of $k$. Let $\mathcal{Z}_k(I)$ be the zero set of $I$ in $k^n$
and let $\mathcal{Z}_{\bar{k}}(I)$ be the zero set of $I$ (equivalently, of $I'$) in $\bar{k}^n$.

(a) Prove that $|\mathcal{Z}_{\bar{k}}(I)| = \dim_{\bar{k}} \bar{k}[x_1, \dots, x_n]/\mathrm{rad}\, I'$. [Show that $\mathrm{rad}\, I'$ is the product of
the maximal ideals corresponding to the points in $\mathcal{Z}_{\bar{k}}(I)$ and use the Chinese Remainder
Theorem.]

(b) Show $|\mathcal{Z}_k(I)| \le \dim_k k[x_1, \dots, x_n]/I$. [One approach: use Exercise 43 in Section 1
and observe that $\dim_{\bar{k}} \bar{k}[x_1, \dots, x_n]/\mathrm{rad}\, I' \le \dim_{\bar{k}} \bar{k}[x_1, \dots, x_n]/I'$.]

28. Suppose $I$ is a zero-dimensional ideal in $k[x_1, \dots, x_n]$, and suppose $I \cap k[x_i]$ is generated
by the nonzero polynomial $h_i$ (cf. Exercise 26). Let $r_i$ be the product of the irreducible
factors of $h_i$ (the ‘squarefree part’ of $h_i$).

(a) Prove that $I + (r_1, \dots, r_n) \subseteq \mathrm{rad}\, I$.

(b) (Radicals of zero-dimensional ideals for perfect fields) If $k$ is a perfect field, prove
that $\mathrm{rad}\, I = I + (r_1, \dots, r_n)$. [Use induction on $n$. Write $r_1 = p_1 \cdots p_t$ with distinct
irreducibles $p_i$ in $k[x_1]$. If $J = I + (r_1, \dots, r_n)$ show that $J = J_1 \cap \cdots \cap J_t$ where
$J_i = J + (p_i)$. Show for each $i$ that reduction modulo $p_i$ induces an isomorphism
$k[x_1, \dots, x_n]/J_i \cong K[x_2, \dots, x_n]/J_i'$ where $K$ is the extension field $k[x_1]/(p_i)$ and
$J_i' \subseteq K[x_2, \dots, x_n]$ is the reduction of the ideal $J_i$ modulo $(p_i)$. Use Exercise 11 of
Section 13.5 to show that the image of $r_j$ in $J_i' \cap K[x_j]$ remains a nonzero squarefree
polynomial for each $j = 2, \dots, n$ since $k$ is perfect. Conclude by induction that $J_i'$ is
a radical ideal. Deduce that $J_i$ is a radical ideal, and finally that $J$ is a radical ideal.]

(c) Find the radicals of (x 7 + jt + y 3 , x 4 + y 3 + y), (x 3 — xy 1 + x,x 2 y + y 3 ), and 
(x 4 + y 3 , x 3 - xy + y 2 ) in Q[x, y] and of (x 2 + y 2 z, x 2 y 2 + z 3 , y 2 + z 2 ) in Q[x, y , z]. 

(d) Let $k = \mathbb{F}_p(t)$. Show that $I = (x^p + t, y^p - t)$ is a zero-dimensional ideal in $k[x, y]$
such that both $I \cap k[x]$ and $I \cap k[y]$ contain nonzero squarefree polynomials, but that
$I$ is not a radical ideal (so the result in (b) need not hold if $k$ is not perfect). [Show
that $x + y \in \mathrm{rad}\, I$ but $x + y \notin I$.]


15.4 LOCALIZATION 

The idea of “localization at a prime” in a ring is an extremely powerful and pervasive
tool in algebra for isolating the behavior of the ideals in a ring. It is an algebraic
analogue of the familiar idea of localizing at a point when considering questions of,
for example, the differentiability of a function $f(x)$ on the real line. In fact one of the
important applications (and also one of the original motivations for the development) of
this technique is to translate such “local” properties in the geometry of affine algebraic
spaces to corresponding properties of their coordinate rings.

We first consider a very general construction of “rings of fractions.” Let $D$ be a
multiplicatively closed subset of $R$ containing 1 (i.e., $1 \in D$ and $ab \in D$ if $a, b \in D$).
The next result constructs a new ring $D^{-1}R$ which is the “smallest” ring in which the
elements of $D$ become units. This generalizes the construction of rings of fractions in
Section 7.5 by allowing $D$ to contain zero or zero divisors, and so in this case $R$ need
not embed as a subring of $D^{-1}R$.
Theorem 36. Let $R$ be a commutative ring with 1 and let $D$ be a multiplicatively
closed subset of $R$ containing 1. Then there is a commutative ring $D^{-1}R$ and a ring
homomorphism $\pi : R \to D^{-1}R$ satisfying the following universal property: for any
homomorphism $\psi : R \to S$ of commutative rings that sends 1 to 1 such that $\psi(d)$ is a
unit in $S$ for every $d \in D$, there is a unique homomorphism $\Psi : D^{-1}R \to S$ such that
$\Psi \circ \pi = \psi$.

Proof: The proof is very similar to the proof of Theorem 15 in Section 7.5. In this
case we define a relation on $R \times D$ by

$(r, d) \sim (s, e)$ if and only if $x(er - ds) = 0$ for some $x \in D$.

This relation is clearly reflexive and symmetric. If $(r, d) \sim (s, e)$ and $(s, e) \sim (t, f)$
then $x(er - ds) = 0$ and $y(fs - et) = 0$ for some $x, y \in D$. Multiplying the first
equation by $fy$ and the second by $dx$ and adding gives $exy(fr - dt) = 0$. Since $D$ is
closed under multiplication, $(r, d) \sim (t, f)$ and so $\sim$ is transitive.

Let $r/d$ denote the equivalence class of $(r, d)$ under $\sim$ and let $D^{-1}R$ be the set of
these equivalence classes. Define addition and multiplication in $D^{-1}R$ by

$\frac{a}{b} + \frac{c}{d} = \frac{ad + bc}{bd}$ and $\frac{a}{b} \times \frac{c}{d} = \frac{ac}{bd}$.

It is an exercise to check that these operations are well defined and make $D^{-1}R$ into a
commutative ring with $1 = 1/1$. For each $d \in D$, $d/1$ is a unit in $D^{-1}R$ (even in the
degenerate case when $D^{-1}R$ is the zero ring).

Finally, define $\pi : R \to D^{-1}R$ by $\pi(r) = r/1$. It follows easily that $\pi$ is a ring
homomorphism. Suppose that $\psi : R \to S$ is a homomorphism of commutative rings
that sends 1 to 1 such that $\psi(d)$ is a unit in $S$ for every $d \in D$. Define

$\Psi : D^{-1}R \to S$ by $\Psi(r/d) = \psi(r)\psi(d)^{-1}$.

This map is well defined because if $r/d = s/e$ then $x(er - ds) = 0$ for some $x \in D$.
Then $\psi(x)(\psi(er) - \psi(ds)) = 0$ in $S$, so $\psi(er) - \psi(ds) = 0$ since $\psi(x)$ is a unit
in $S$, and therefore $\psi(r)\psi(d)^{-1} = \psi(s)\psi(e)^{-1}$. It is immediate that $\Psi$ is a ring
homomorphism and $\Psi \circ \pi = \psi$.

Finally, $\Psi$ is unique because every element of $D^{-1}R$ can be written as a product
$(r/1)(d/1)^{-1}$. The value of $\Psi$ on each element of the form $x/1$ is uniquely determined
by $\psi$, namely $\Psi(x/1) = \Psi(\pi(x)) = \psi(x)$. Since $\Psi$ is a ring homomorphism, its value
on $u^{-1}$ for any unit $u$ is uniquely determined by $\Psi(u)$. Thus $\Psi$ is uniquely determined
on every element of $D^{-1}R$, completing the proof.

Corollary 37. In the notation of Theorem 36,

(1) $\ker \pi = \{r \in R \mid xr = 0 \text{ for some } x \in D\}$; in particular, $\pi : R \to D^{-1}R$ is an
injection if and only if $D$ contains no zero divisors of $R$, and

(2) $D^{-1}R = 0$ if and only if $0 \in D$, hence if and only if $D$ contains nilpotent
elements.

Proof: By definition, we have $\pi(r) = 0$ if and only if $(r, 1) \sim (0, 1)$, i.e., if and
only if $xr = 0$ for some $x \in D$, which is (1). For (2), note that $D^{-1}R = 0$ if and only
if the 1 of this ring is zero, i.e., $(1, 1) \sim (0, 1)$. This occurs if and only if $x \cdot 1 = 0$ for
some $x \in D$, i.e., if and only if $0 \in D$.
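
For a finite ring the construction in Theorem 36 and the kernel description in Corollary 37 can be enumerated directly. The following is a minimal sketch under the assumption $R = \mathbb{Z}/n\mathbb{Z}$; the function names `localize` and `kernel` are hypothetical, and the set `D` passed in is assumed to be multiplicatively closed modulo `n` and to contain 1 (the code does not verify this).

```python
def localize(n, D):
    """Equivalence classes of pairs (r, d) in Z/n x D under
    (r, d) ~ (s, e)  iff  x*(e*r - d*s) = 0 in Z/n for some x in D."""
    def related(p, q):
        (r, d), (s, e) = p, q
        return any((x * (e * r - d * s)) % n == 0 for x in D)

    classes = []
    for pair in ((r, d) for r in range(n) for d in sorted(D)):
        for cls in classes:
            # ~ is an equivalence relation, so one representative suffices
            if related(pair, cls[0]):
                cls.append(pair)
                break
        else:
            classes.append([pair])
    return classes

def kernel(n, D):
    """Elements r of Z/n with x*r = 0 for some x in D (Corollary 37(1))."""
    return {r for r in range(n) if any((x * r) % n == 0 for x in D)}

# R = Z/6, D = powers of 2 mod 6 = {1, 2, 4}
print(len(localize(6, {1, 2, 4})), kernel(6, {1, 2, 4}))
```

With $D = \{1, 2, 4\}$ in $\mathbb{Z}/6\mathbb{Z}$ the element 2 is a zero divisor, so $\pi$ is not injective: its kernel is $\{0, 3\}$ and the localization has three elements. Taking $D = \{0, 1\}$ collapses everything to a single class, the zero ring of part (2).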

Definition. The ring $D^{-1}R$ is called the ring of fractions of $R$ with respect to $D$ or the
localization of $R$ at $D$.

Examples 

(1) Let R be an integral domain and let D = R — {0}. Thai D~ l R is the field of fractions, 
Q, of R described in Section 7.5. More generally, if D is any multiplicatively closed 
subset of R — {0}, then D~ 1 R is the subring of Q consisting of elements r/d with 
r e R and d e D. 

(2) Let $R$ be any commutative ring with 1 and let $f$ be any element of $R$. Let $D$ be the
multiplicative set $\{f^n \mid n \ge 0\}$ of nonnegative powers of $f$ in $R$. Define $R_f = D^{-1}R$.
Note that $R_f = 0$ if and only if $f$ is nilpotent. If $f$ is not nilpotent, then $f$ becomes a
unit in $R_f$. It is not difficult to see that

\[ R_f \cong R[x]/(xf - 1), \]

where $R[x]$ is the polynomial ring in the variable $x$ (cf. the exercises). Note also that
$R_f$ and $R_{f^n}$ are naturally isomorphic for any $n \ge 1$ since both $f$ and $f^n$ are units
in both rings. If $f$ is a zero divisor then $\pi : R \to R_f$ does not embed $R$ into $R_f$.
For example, let $R = k[x, y]/(xy)$, and take $f = x$. Then $x$ is a unit in $R_x$ and $y$ is
mapped to 0 by the first part of the corollary (explicitly: $y = xy/x = 0$ in $R_x$). In this
case $\pi(R) = k[x] \subset R_f = k[x, x^{-1}]$.

(3) (Localizing at a Prime) Let $P$ be a prime ideal in any ring $R$ and let $D = R - P$.
By definition of a prime ideal $D$ is multiplicatively closed. Passing to the ring $D^{-1}R$
in this case is called localizing $R$ at $P$ and the ring $D^{-1}R$ is denoted by $R_P$. Every
element of $R$ not in $P$ becomes a unit in $R_P$. For example, if $R = \mathbb{Z}$ and $P = (p)$ is
a prime ideal, then

\[ \mathbb{Z}_{(p)} = \left\{ \frac{a}{b} \in \mathbb{Q} \;\middle|\; p \nmid b \right\} \subseteq \mathbb{Q} \]

and every integer $b$ not divisible by $p$ is a unit.

(4) If $V$ is any nonempty set and $k$ is a field, let $R$ be any ring of $k$-valued functions on $V$
containing the constant functions (for instance, the ring of all continuous real valued
functions on the closed interval $[0, 1]$). For any $a \in V$ let $M_a$ be the ideal of functions
in $R$ that vanish at $a$. Then $M_a$ is the kernel of the ring homomorphism from $R$ to
the field $k$ given by evaluating each function in $R$ at $a$. Since $R$ contains the constant
functions, evaluation is surjective and so $M_a$ is a maximal (hence also prime) ideal.
The localization of $R$ at this prime ideal is then

\[ R_{M_a} = \left\{ \frac{f}{g} \;\middle|\; f, g \in R,\ g(a) \ne 0 \right\}. \]

Each function in $R_{M_a}$ can then be evaluated at $a$ by $(f/g)(a) = f(a)/g(a)$, and
this value does not depend on the choice of representative for the class $f/g$, so $R_{M_a}$
becomes a ring of $k$-valued “rational functions” defined at $a$.
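As a small illustration of Example 3, membership and invertibility in $\mathbb{Z}_{(p)}$ can be tested on reduced fractions. This is a minimal sketch using Python's standard `fractions` module; the function names are hypothetical.

```python
from fractions import Fraction


def in_localization_at_p(q: Fraction, p: int) -> bool:
    """True if q lies in Z_(p): in lowest terms a/b, p does not divide b."""
    # Fraction always stores numerator and denominator in lowest terms.
    return q.denominator % p != 0


def is_unit_at_p(q: Fraction, p: int) -> bool:
    """True if q is a unit of Z_(p): p divides neither a nor b in lowest terms."""
    return in_localization_at_p(q, p) and q.numerator % p != 0
```

The nonunits of $\mathbb{Z}_{(p)}$ found this way are exactly the fractions whose reduced numerator is divisible by $p$, i.e., the elements of the extension of $(p)$.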

We next consider extensions and contractions of ideals with respect to the map
$\pi : R \to D^{-1}R$ in Theorem 36. To ease some of the notation, if $I$ is an ideal of $R$, let
${}^{e}I$ denote the extension of $I$ to $D^{-1}R$ (instead of the more cumbersome $D^{-1}R\,\pi(I)$),
and if $J$ is an ideal of $D^{-1}R$, let ${}^{c}J$ denote the contraction of $J$ to $R$.
If $I$ is an ideal of $R$ then it is easy to see that every element of ${}^{e}I$ can be written
in the form $a/d$ for some $a \in I$ and $d \in D$, so the extension of $I$ to $D^{-1}R$ is also
frequently denoted by $D^{-1}I$.

Proposition 38. In the preceding notation we have

(1) For any ideal $J$ of $D^{-1}R$ we have $J = {}^{e}({}^{c}J)$. In particular, every ideal of $D^{-1}R$
is the extension of some ideal of $R$, and distinct ideals of $D^{-1}R$ have distinct
contractions in $R$.

(2) For any ideal $I$ of $R$ we have

\[ {}^{c}({}^{e}I) = \{r \in R \mid dr \in I \text{ for some } d \in D\}. \]

Also, ${}^{e}I = D^{-1}R$ if and only if $I \cap D \ne \emptyset$.

(3) Extension and contraction give a bijective correspondence

\[ \{\text{prime ideals } P \text{ of } R \text{ with } P \cap D = \emptyset\}
   \;\overset{e}{\underset{c}{\rightleftarrows}}\;
   \{\text{prime ideals of } D^{-1}R\}. \]

(4) If $R$ is Noetherian (or Artinian) then $D^{-1}R$ is Noetherian (Artinian, respectively).

Proof: We always have ${}^{e}({}^{c}J) \subseteq J$. For the reverse inclusion let $a/d \in J$. Then
$a/1 = d(a/d) \in J$, and so $a \in \pi^{-1}(J) = {}^{c}J$. Thus $a/1 \in {}^{e}({}^{c}J)$, so we also have
$(a/1)(1/d) = a/d \in {}^{e}({}^{c}J)$, hence $J = {}^{e}({}^{c}J)$. This proves the first statement in (1)
and the second statement follows immediately.

Let $I' = \{r \in R \mid dr \in I \text{ for some } d \in D\}$. We first show $I' \subseteq {}^{c}({}^{e}I)$. If $r \in I'$
then there is some $d \in D$ such that $dr = a \in I$. Then $r/1 = a/d \in {}^{e}I$, so $r \in {}^{c}({}^{e}I)$.
To show the reverse containment ${}^{c}({}^{e}I) \subseteq I'$, let $r \in {}^{c}({}^{e}I)$ so that $r/1 = a/d$ for some
$a \in I$ and $d \in D$. Then $x(dr - a) = 0$ for some $x \in D$, so $xdr = xa \in I$, and because
$xd \in D$ it follows that $r \in I'$. This proves the first assertion of (2). Now ${}^{e}I = D^{-1}R$
if and only if $1/1 \in {}^{e}I$, if and only if $1 \in {}^{c}({}^{e}I) = I'$. The second assertion of (2) then
follows from the definition of $I'$.

To prove (3) observe first that if $Q$ is a prime ideal in $D^{-1}R$, then its preimage
under any homomorphism sending 1 to 1 is a prime ideal (cf. Exercise 13, Section 7.4),
so $c$ maps prime ideals of $D^{-1}R$ to prime ideals of $R$ disjoint from $D$. In the reverse
direction, let $P$ be a prime ideal of $R$ disjoint from $D$ and let $Q = {}^{e}P$ and suppose
$(a/d_1)(b/d_2) \in Q$. Then $(ab)/(d_1d_2) \in Q$, so $ab/(d_1d_2) = c/d$ for some $c \in P$ and
$d \in D$. Then $x(dab - d_1d_2c) = 0$ for some $x \in D$. Since $c \in P$ we have $xdab \in P$,
and since $P$ is a prime ideal disjoint from $D$ we have $ab \in P$. Since $P$ is prime, either
$a \in P$ or $b \in P$, hence $a/d_1$ or $b/d_2$ is in $Q$. This proves $Q$ is a prime ideal and shows
that $e$ maps prime ideals of $R$ disjoint from $D$ to prime ideals of $D^{-1}R$. Finally, it
follows immediately from (2) that $P = {}^{c}({}^{e}P)$ for every prime ideal of $R$ disjoint from
$D$. Thus $c$ and $e$ are inverse correspondences, hence are bijections between these sets
of prime ideals. This establishes (3).

By (1) every ascending (respectively, descending) chain of distinct ideals in $D^{-1}R$
contracts to an ascending (respectively, descending) chain of distinct ideals in $R$, giving
(4) and completing the proof.

Because $1 \in D$, first localizing the ideal $I$ and then contracting that localization as
in (2) results in an ideal in $R$ containing $I$: $I \subseteq {}^{c}({}^{e}I)$.

Definition. Suppose $R$ is a commutative ring with 1 and $D$ is a multiplicatively closed
subset containing 1. The saturation of the ideal $I$ in $R$ with respect to $D$ is the ideal ${}^{c}({}^{e}I)$
in $R$, where contraction and extension are computed with respect to $\pi : R \to D^{-1}R$.
If $I = {}^{c}({}^{e}I)$ then $I$ is said to be saturated with respect to $D$.

Loosely speaking, (2) of Proposition 38 shows that the saturation of $I$ consists of
elements of $R$ that would lie in $I$ if we allowed denominators from $D$. The ideal is
saturated with respect to $D$ if we don't obtain any additional elements even if we allow
denominators from $D$.
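A concrete case may help. The following sketch assumes $R = \mathbb{Z}$, $I = (n)$ with $n \ge 1$, and $D = \{1, f, f^2, \dots\}$ with $f \ge 1$; these choices and the function names are illustrative only. By Proposition 38(2) the saturation is $\{r \mid f^k r \in (n) \text{ for some } k\}$, and its positive generator is obtained by removing from $n$ every prime factor it shares with $f$.

```python
from math import gcd


def saturation_generator(n, f):
    """Positive generator of the saturation of (n) in Z w.r.t. the powers of f.

    Assumes n >= 1 and f >= 1. Each pass divides out the part of n that is
    shared with f, so the loop stops once n is coprime to f.
    """
    g = gcd(n, f)
    while g > 1:
        n //= g
        g = gcd(n, f)
    return n


def is_saturated(n, f):
    """True if (n) equals its own saturation with respect to the powers of f."""
    return saturation_generator(n, f) == n
```

For instance, the saturation of $(12)$ with respect to the powers of 2 is $(3)$: the element 3 is not in $(12)$, but $4 \cdot 3$ is.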

We can apply our results on localization to give an algorithm for determining
whether an ideal $P$ in the polynomial ring $k[x_1, \dots, x_n]$ with coefficients in the field
$k$ is prime. The basic idea is to use the fact that $k[x_1, \dots, x_i] = k[x_1, \dots, x_{i-1}][x_i]$ to
consider inductively whether the ideals $P_i = P \cap k[x_1, \dots, x_i]$ are prime.

In general, suppose $R$ is a commutative ring. If $P$ is a prime ideal in $R[x]$ then
$P \cap R$ is a prime ideal in $R$ and so $S = R/(P \cap R)$ is an integral domain. Let $F$ denote
its quotient field. We then have two natural ring homomorphisms:

\[ R[x] \longrightarrow (R/(P \cap R))[x] = S[x] \longrightarrow F[x] \]

where the first is the natural projection homomorphism and the second is the natural
inclusion induced by $S \subseteq F$. Note that $F[x]$ is the localization of $S[x]$ with respect
to the multiplicatively closed set $D = S - \{0\}$. The next proposition shows that the
image of $P$ under the first homomorphism is a prime ideal in $S[x]$ that is saturated
with respect to $D$ and extends to a prime ideal in $F[x]$, and that, conversely, we can
determine whether an ideal is prime in $R[x]$ by these properties.

Proposition 39. Suppose $R$ is a commutative ring with 1 and $I$ is an ideal in $R[x]$.
Then $I$ is a prime ideal in $R[x]$ if and only if

i. $J = I \cap R$ is a prime ideal in $R$, i.e., $S = R/J$ is an integral domain, and

ii. if $\bar{I}$ is the image of $I$ in $S[x]$ then $\bar{I}F[x]$ is a prime ideal in $F[x]$ satisfying
$\bar{I}F[x] \cap S[x] = \bar{I}$.

Proof: Suppose $I$ is a prime ideal in $R[x]$, so that $J = I \cap R$ is a prime ideal in
$R$ and $S = R/J$ is an integral domain. By Proposition 2 in Chapter 9, the kernel of
the reduction homomorphism $R[x] \to S[x] = (R/J)[x]$ is $J[x]$, which is contained
in $I$, so we have a ring isomorphism $R[x]/I \cong S[x]/\bar{I}$. Since $R[x]/I$ is an integral
domain, it follows that $\bar{I}$ is a prime ideal in the integral domain $S[x]$. The elements of
$\bar{I} \cap S$ are the images of the elements in $R \cap I$, so $\bar{I} \cap S = 0$. Since the ring $F[x]$ is the
localization of $S[x]$ with respect to the multiplicatively closed set $S - \{0\}$, condition
(ii) follows by Proposition 38(3).

Conversely, if $I$ is not prime, then either $J$ is not prime in $R$ or $J$ is prime in $R$
but $\bar{I}$ is not prime in $S[x]$. In the latter case either $\bar{I}F[x]$ is not prime in $F[x]$ or, again
by Proposition 38(3), $\bar{I}$ is not saturated. Thus, if $I$ is not prime, either (i) or (ii) fails,
completing the proof.

Since $F[x]$ is a Euclidean Domain, the ideal $\bar{I}F[x] = (h(x))$ in Proposition 39 is
principal, and is prime if and only if $h(x)$ is either 0 or is irreducible in $F[x]$. Suppose
$h(x)$ is an element in $I$ whose image in $S[x]$ has leading coefficient $a \in S$. The next
proposition shows that $a$ gives a bound on the denominators necessary for the saturation
$\bar{I}F[x] \cap S[x]$ and can be used to compute this saturation.

Proposition 40. Let $S$ be an integral domain with fraction field $F$ and let $A$ be a
nonzero ideal in $S[x]$. Suppose $AF[x] = (h(x))$ where $h(x)$ is a polynomial in $S[x]$
with leading coefficient $a \in S$. Let $S_a$ be the localization of $S$ with respect to the powers
of $a$. Then

(1) $AF[x] \cap S[x] = AS_a[x] \cap S[x]$, and

(2) if $\tilde{A}$ denotes the ideal generated by $A$ and $1 - at$ in the polynomial ring $S[x, t]$,
then $AF[x] \cap S[x] = \tilde{A} \cap S[x]$.

Proof: We first show $AF[x] \cap S_a[x] = AS_a[x]$. Since $S_a \subseteq F$, the containment
$AS_a[x] \subseteq AF[x] \cap S_a[x]$ is immediate. Suppose now that $f(x) \in AF[x] \cap S_a[x]$.
If the leading term of $f(x)$ is $sx^N$ and the leading term of $h(x)$ is $ax^m$, then since
$AF[x] = (h(x))$ we have $N \ge m$. Then the polynomial $f(x) - (s/a)x^{N-m}h(x)$ is
again in $AF[x] \cap S_a[x]$ and is of lower degree than $f(x)$. Iterating, we see that $f(x)$
can be written as a polynomial in $S_a[x]$ times $h(x)$, so $f(x) \in AS_a[x]$. Intersecting
both sides of $AF[x] \cap S_a[x] = AS_a[x]$ with $S[x]$ gives the first statement in the
proposition.

To prove the second statement, suppose first that $f(x) \in \tilde{A} \cap S[x]$. Then we
can write $f(x) = f_1(x, t)b(x) + f_2(x, t)(1 - at)$ for some polynomials $b(x) \in A$,
$f_1, f_2 \in S[x, t]$. Substituting $t = 1/a$ gives $f(x) = f_1(x, 1/a)b(x)$, and since
$f_1(x, 1/a) \in S_a[x]$, we obtain $f(x) \in AS_a[x] \cap S[x]$. Conversely, suppose that
$f(x) = b(x)g(x) \in S[x]$ where $g(x) \in S_a[x]$ and $b(x) \in A$. If $a^N$ is the largest power
of $a$ appearing in the denominators of the coefficients of $g(x)$ then $a^N g(x) \in S[x]$.
Writing $f(x) = (at)^N f(x) + (1 - (at)^N)f(x) = b(x)t^N(a^N g(x)) + (1 - (at)^N)f(x)$
we see that $f(x) \in \tilde{A} \cap S[x]$, giving the reverse containment and completing the proof.

Suppose now that $P$ is an ideal in $k[x_1, \dots, x_n]$. Let $P_i$ for $i = 1, \dots, n$ be
the intersection of $P$ with $k[x_1, \dots, x_i]$. We use Propositions 39 and 40 to determine
inductively whether $P_1, P_2, \dots, P_n = P$ are prime ideals in their respective polynomial
rings.

The ideal $P_1$ will be prime in the Euclidean Domain $k[x_1]$ if and only if it is 0
or is generated by an irreducible polynomial. Suppose now that $i \ge 2$ and we have
already proved that $P_{i-1}$ is a prime ideal in $k[x_1, \dots, x_{i-1}]$, so that the quotient ring
$S = k[x_1, \dots, x_{i-1}]/P_{i-1}$ is an integral domain. If $F$ denotes the quotient field of $S$,
then by Proposition 39, $P_i$ is a prime ideal in $k[x_1, \dots, x_i]$ if and only if its image in
$(k[x_1, \dots, x_{i-1}]/P_{i-1})[x_i] = S[x_i]$ is a saturated ideal whose extension to the Euclidean
Domain $F[x_i]$ is a prime ideal. Suppose $h(x_i) \in S[x_i]$ is a generator for this ideal and
$a$ is the leading coefficient of $h(x_i)$. Then $(h(x_i))$ is a prime ideal in $F[x_i]$ if and only if
$h(x_i) = 0$ or $h(x_i)$ is an irreducible polynomial. By Proposition 40, the image of $P_i$ in
$S[x_i]$ will be saturated if and only if it equals $\tilde{A} \cap S[x_i]$ where $\tilde{A}$ is the ideal generated
by $P_i$ and $1 - at$ in $S[x_i, t]$. This latter condition can be checked in $k[x_1, \dots, x_i, t]$: it
is equivalent to checking that the intersection of the ideal generated by $P_i$ and $1 - at$
in $k[x_1, \dots, x_i, t]$ with $k[x_1, \dots, x_i]$ is just $P_i$ (cf. Exercise 3).

Combining these observations with our results on Gröbner bases from Chapter 9
we obtain the following algorithm for determining whether the ideal $P$ in $k[x_1, \dots, x_n]$
is prime (or, equivalently, whether the associated affine algebraic set is a variety).

Algorithm for Determining when an Ideal in $k[x_1, \dots, x_n]$ is Prime

(1) Compute the reduced Gröbner basis $G = \{g_1, \dots, g_m\}$ for $P$ with respect to the
lexicographic monomial ordering $x_n > \cdots > x_1$.

By Proposition 29 in Section 9.6 the elements of $G$ lying in $k[x_1, \dots, x_i]$ will be the
reduced Gröbner basis $\{g_1, \dots, g_{m_i}\}$ for $P_i = P \cap k[x_1, \dots, x_i]$.

(2) Determine whether $P_1$ is a prime ideal in $k[x_1]$ by checking that $P_1 = 0$ or the
nonzero generator of $P_1$ is irreducible in $k[x_1]$.

For each $i \ge 2$, suppose $P_{i-1}$ has been determined to be a prime ideal in $k[x_1, \dots, x_{i-1}]$
(otherwise, $P$ is not a prime ideal in $k[x_1, \dots, x_n]$). Let $S = k[x_1, \dots, x_{i-1}]/P_{i-1}$ and
let $F$ be the fraction field of $S$. Apply steps (3) and (4) to determine whether $P_i$ is a
prime ideal in $k[x_1, \dots, x_i]$.

(3) If $m_i = m_{i-1}$ then $P_i$ maps to the zero ideal in $S[x_i]$, hence is prime. Otherwise the
image of $P_i$ in $S[x_i]$ and in $F[x_i]$ is a nonzero ideal, and is generated by the images
of $g_{m_{i-1}+1}, \dots, g_{m_i}$. Apply the Euclidean algorithm in $F[x_i]$ to these generators
to find an element $h(x_i)$ in $P_i$ whose image in $F[x_i]$ generates the image of $P_i$ in
$F[x_i]$. Determine whether $h(x_i)$ is irreducible in $F[x_i]$; if not then $P_i$ and $P$ are
not prime ideals.

(Note that after applying the Euclidean algorithm to the generators of the image of
$P_i$ in $F[x_i]$ we can multiply by a single element of $S$ to ‘clear denominators’ in each
equation so that all remainders (and in particular the last nonzero remainder $h(x_i)$) will
be elements in the image of $P_i$.)

(4) Let $a \in k[x_1, \dots, x_{i-1}]$ be the leading coefficient of $h(x_i)$ (as a polynomial in $x_i$).
Compute the reduced Gröbner basis in $k[x_1, \dots, x_i, t]$ for the ideal generated by $P_i$
and $1 - at$ with respect to the lexicographic monomial ordering $t > x_i > \cdots > x_1$.
Determine whether the elements of this reduced basis that lie in $k[x_1, \dots, x_i]$ are
$\{g_1, \dots, g_{m_i}\}$; if so, then $P_i$ is a prime ideal in $k[x_1, \dots, x_i]$ and if not then $P_i$ and
$P$ are not prime ideals.

Finally, we note that similar ideas (together with some minor modifications to
extend results on Gröbner bases to polynomial rings $R[x_1, \dots, x_n]$ with coefficients in
an integral domain $R$) can be used to provide algorithms for determining when an ideal
in, for example, $\mathbb{Z}[x_1, \dots, x_n]$ is prime.

Examples

(1) Consider the ideal $P = (xz - y^2,\ yz - x^3,\ z^2 - x^2y)$ in $k[x, y, z]$ for any infinite field
$k$. It follows from Exercise 26 in Section 1 that $P$ is a prime ideal since there is an
injection of $k[x, y, z]/P$ into the integral domain $k[\mathbb{A}^1]$ (cf. Exercise 24 in Section 2).
Here we prove $P \subset \mathbb{Q}[x, y, z]$ is prime using the ideas in this section. The reduced
Gröbner basis for $P$ with respect to the lexicographic monomial ordering $x > y > z$
is $\{x^3 - yz,\ x^2y - z^2,\ xy^3 - z^3,\ xz - y^2,\ y^5 - z^4\}$. Hence $P_1 = P \cap \mathbb{Q}[z] = (0)$, and
$P_2 = P \cap \mathbb{Q}[y, z] = (y^5 - z^4)$. Since $P_1 = 0$, the ideal $P_1$ is prime in $\mathbb{Q}[z]$.

We next check $P_2$ is prime in $\mathbb{Q}[y, z]$, which can be done directly (cf. Exercise 4
or Exercise 14 in Section 9.1). In this case $S = \mathbb{Q}[z]$ and $F = \mathbb{Q}(z)$. The image of $P_2$
in $F[y]$ is generated by $h(y) = y^5 - z^4$, which is irreducible in $\mathbb{Q}(z)[y]$. The leading
coefficient of $h(y)$ is 1, and the reduced Gröbner basis for $(y^5 - z^4,\ 1 - t)$ in $\mathbb{Q}[y, z, t]$
with respect to the lexicographic monomial ordering $t > y > z$ is $\{y^5 - z^4,\ 1 - t\}$.
The element in the reduced Gröbner basis for $P_2$ is the only element of this basis lying
in $\mathbb{Q}[y, z]$ so $P_2$ is a prime ideal in $\mathbb{Q}[y, z]$.

We now use the fact that $P_2$ is prime to prove that $P$ is prime. In this case $S$ is
the integral domain $\mathbb{Q}[y, z]/P_2 = \mathbb{Q}[y, z]/(y^5 - z^4)$ with quotient field $F$ given by

\[ S = \mathbb{Q}[z] + \mathbb{Q}[z]y + \mathbb{Q}[z]y^2 + \mathbb{Q}[z]y^3 + \mathbb{Q}[z]y^4 \]

\[ F = \mathbb{Q}(z) + \mathbb{Q}(z)y + \mathbb{Q}(z)y^2 + \mathbb{Q}(z)y^3 + \mathbb{Q}(z)y^4 \]

where $y^5 = z^4$. The image of $P$ in $S[x]$ is the ideal $\bar{P}$ generated by the elements
$g_1 = x^3 - yz$, $g_2 = yx^2 - z^2$, $g_3 = y^3x - z^3$, $g_4 = zx - y^2$, and $y^5 - z^4 = 0$.

The greatest common divisor in $F[x]$ of $g_1, g_2, g_3, g_4$ generating the image of $P$
in $F[x]$ is the irreducible polynomial $x - y^2/z$. The polynomial $h(x) = zx - y^2$ in
$P$ has image generating the same ideal in $F[x]$, so we may take $a = z$ in (4) of the
algorithm. The reduced Gröbner basis for $(xz - y^2,\ yz - x^3,\ z^2 - x^2y,\ 1 - zt)$ with
respect to the lexicographic monomial ordering $t > x > y > z$ consists of the reduced
Gröbner basis for $P$ together with the elements $ty^2 - x$ and $tz - 1$ involving $t$, so $P$
is a prime ideal in $\mathbb{Q}[x, y, z]$.

(2) Consider the ideal $P = (xz - y^3,\ xy - z^2)$ in $\mathbb{Q}[x, y, z]$, with reduced Gröbner basis for
the lexicographic monomial ordering $x > y > z$ given by $\{xy - z^2,\ xz - y^3,\ y^4 - z^3\}$.
Here $P_1 = 0$ and $P_2 = P \cap \mathbb{Q}[y, z] = (y^4 - z^3)$ are prime ideals as in Example 1. In
this case $S = \mathbb{Q}[y, z]/P_2$ is given by

\[ S = \mathbb{Q}[z] + \mathbb{Q}[z]y + \mathbb{Q}[z]y^2 + \mathbb{Q}[z]y^3 \]

with $y^4 = z^3$, with quotient field $F$ similar to the previous example, and $\bar{P} = (g_1, g_2)$
in $S[x]$ where $g_1 = yx - z^2$ and $g_2 = zx - y^3$. The extension of $\bar{P}$ to $F[x]$ is generated
by the irreducible polynomial $yx - z^2$, and $h(x) = yx - z^2$ is an element of $P$ having
the same image in $F[x]$, with leading coefficient $a = y$. The reduced Gröbner basis
for the ideal $(xz - y^3,\ xy - z^2,\ 1 - yt)$ in $\mathbb{Q}[x, y, z, t]$ using the lexicographic ordering
$t > x > y > z$ is $\{x^2 - y^2z,\ xy - z^2,\ xz - y^3,\ y^4 - z^3,\ ty - 1,\ tz^2 - x\}$, containing
the element $x^2 - y^2z$ not in the reduced Gröbner basis for $P$, so $P$ is not a prime ideal
in $\mathbb{Q}[x, y, z]$. This computation not only shows $P$ is not a prime ideal, it does so by
explicitly showing the image of $P$ in $S[x]$ is not saturated using the localization $S_a$.
The computation of $a = y$ allows us to find an explicit pair of elements not in $P$ whose
product is in $P$: $f = x^2 - y^2z \notin P$ and $y \notin P$, but some power of $y$ times $f$ lies in
$P$. In this case a quick computation verifies that $yf \in P$.
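The "quick computation" at the end of Example 2 can be made explicit. One possible certificate, worked out here by hand and not taken from the text, is the identity $y f = x\,(xy - z^2) + z\,(xz - y^3)$. The sketch below checks it with a small dictionary-based polynomial arithmetic; the representation and the helper names `add` and `mul` are illustrative assumptions.

```python
from collections import defaultdict

# A polynomial in x, y, z is stored as {(i, j, k): coeff} for x^i y^j z^k.


def add(p, q):
    """Sum of two polynomials, dropping monomials whose coefficient cancels."""
    out = defaultdict(int)
    for poly in (p, q):
        for mono, c in poly.items():
            out[mono] += c
    return {m: c for m, c in out.items() if c != 0}


def mul(p, q):
    """Product of two polynomials, adding exponent vectors monomial by monomial."""
    out = defaultdict(int)
    for (a, b, c), u in p.items():
        for (d, e, g), v in q.items():
            out[(a + d, b + e, c + g)] += u * v
    return {m: c for m, c in out.items() if c != 0}


x = {(1, 0, 0): 1}
y = {(0, 1, 0): 1}
z = {(0, 0, 1): 1}

g1 = {(1, 1, 0): 1, (0, 0, 2): -1}  # xy - z^2, a generator of P
g2 = {(1, 0, 1): 1, (0, 3, 0): -1}  # xz - y^3, a generator of P
f = {(2, 0, 0): 1, (0, 2, 1): -1}   # x^2 - y^2 z, the extra basis element

lhs = mul(y, f)                       # y * f
rhs = add(mul(x, g1), mul(z, g2))     # x * g1 + z * g2, visibly an element of P
```

Both sides expand to $x^2y - y^3z$, so $yf$ is a combination of the generators of $P$. That $f$ itself is not in $P$ still relies on the Gröbner basis computation quoted in the example.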
Localizations of Modules


Suppose now that $M$ is an $R$-module and $D$ is a multiplicatively closed subset of $R$
containing 1 as above. Then the ideas used in the construction of $D^{-1}R$ can be used to
construct a $D^{-1}R$-module $D^{-1}M$ from $M$ in a similar fashion, as follows. Define the
relation on $D \times M$ by

\[ (d, m) \sim (e, n) \quad \text{if and only if} \quad x(dn - em) = 0 \text{ for some } x \in D, \]

which is easily checked to be an equivalence relation. Let $m/d$ denote the equivalence
class of $(d, m)$ and let $D^{-1}M$ denote the set of equivalence classes. It is then
straightforward to verify that the operations

\[ \frac{m}{d} + \frac{n}{e} = \frac{em + dn}{de} \qquad \text{and} \qquad \frac{r}{d} \cdot \frac{m}{e} = \frac{rm}{de} \]

are well defined and give $D^{-1}M$ the structure of a $D^{-1}R$-module.


Definition. The $D^{-1}R$-module $D^{-1}M$ is called the module of fractions of $M$ with
respect to $D$ or the localization of $M$ at $D$.


Note that the localization $D^{-1}M$ is also an $R$-module (since each $r \in R$ acts by $r/1$
on $D^{-1}M$), and there is an $R$-module homomorphism

\[ \pi : M \longrightarrow D^{-1}M \quad \text{defined by} \quad \pi(m) = \frac{m}{1}. \]

It follows directly from the definition of the equivalence relation that
$\ker \pi = \{m \in M \mid dm = 0 \text{ for some } d \in D\}$.


The homomorphism $\pi$ has a universal property analogous to that in Theorem 36. Suppose
$N$ is an $R$-module with the property that left multiplication on $N$ by $d$ is a bijection
of $N$ for every $d \in D$. If $\psi : M \to N$ is any $R$-module homomorphism then there is a
unique $R$-module homomorphism $\Psi : D^{-1}M \to N$ such that $\Psi \circ \pi = \psi$.

If $M$ and $N$ are $R$-modules and $\varphi : M \to N$ is an $R$-module homomorphism, then
for any multiplicative set $D$ in $R$ it is easy to check that there is an induced $D^{-1}R$-module
homomorphism from $D^{-1}M$ to $D^{-1}N$ defined by mapping $m/d$ to $\varphi(m)/d$.

The next result shows that the localization of M at D is related to the tensor product. 


Proposition 41. Let $D$ be a multiplicatively closed subset of $R$ containing 1 and let $M$
be an $R$-module. Then $D^{-1}M \cong D^{-1}R \otimes_R M$ as $D^{-1}R$-modules, i.e., $D^{-1}M$ is the
$D^{-1}R$-module obtained by extension of scalars from the $R$-module $M$.

Proof: The map from $D^{-1}R \times M$ to $D^{-1}M$ defined by mapping $(r/d, m)$ to $rm/d$ is
well defined and $R$-balanced, so induces a homomorphism from $D^{-1}R \otimes_R M$ to $D^{-1}M$.
The map sending $m/d$ to $(1/d) \otimes m$ gives a well defined inverse homomorphism (if
$m/d = m'/d'$ in $D^{-1}M$ then $x(d'm - dm') = 0$ for some $x \in D$, and then $(1/d) \otimes m$
can be written as $(1/xd'd) \otimes (xd'm) = (1/xd'd) \otimes (xdm') = (1/d') \otimes m'$). Hence
$D^{-1}M$ is isomorphic to $D^{-1}R \otimes_R M$ as an $R$-module, and in fact as a $D^{-1}R$-module,
since these inverse isomorphisms are also $D^{-1}R$-module homomorphisms.

Localizing a ring $R$ or an $R$-module $M$ at $D$ behaves very well with respect to
algebraic operations on rings and modules, as the following proposition shows:

Proposition 42. Let $R$ be a commutative ring with 1 and let $D^{-1}R$ be its localization
with respect to the multiplicatively closed subset $D$ of $R$ containing 1.

(1) Localization commutes with finite sums and intersections of ideals: If $I$ and $J$
are ideals of $R$, then

\[ D^{-1}(I + J) = D^{-1}(I) + D^{-1}(J) \quad \text{and} \quad D^{-1}(I \cap J) = D^{-1}(I) \cap D^{-1}(J). \]

Localization commutes with quotients:

\[ D^{-1}R / D^{-1}I \cong D^{-1}(R/I) \]

(where the localization on the right is with respect to the image of $D$ in the
quotient $R/I$).

(2) Localization commutes with taking radicals: If $N$ is the nilradical of $R$, then
$D^{-1}N$ is the nilradical of $D^{-1}R$. If $I$ is an ideal in $R$, then $\operatorname{rad}(D^{-1}I)$ is
$D^{-1}(\operatorname{rad} I)$.

(3) Primary ideals correspond to primary ideals in the correspondence (3) of Proposition 38.
More precisely, suppose $Q$ is a $P$-primary ideal in $R$. If $D \cap P \ne \emptyset$
then $D^{-1}Q = D^{-1}R$. If $D \cap P = \emptyset$ then $D^{-1}P$ is a prime ideal, the extension
$D^{-1}Q$ of $Q$ is a $D^{-1}P$-primary ideal in $D^{-1}R$, and the contraction back to $R$ of
$D^{-1}Q$ is $Q$.

(4) Localization commutes with finite sums, intersections and quotients of modules:
If $L$ and $N$ are submodules of the $R$-module $M$, then

(a) $D^{-1}(L + N) = D^{-1}L + D^{-1}N$ and $D^{-1}(L \cap N) = D^{-1}L \cap D^{-1}N$,

(b) $D^{-1}N$ is a submodule of $D^{-1}M$ and $D^{-1}M / D^{-1}N \cong D^{-1}(M/N)$.

(5) Localization commutes with finite direct sums of modules: If $M$ and $N$ are
$R$-modules, then $D^{-1}(M \oplus N) \cong D^{-1}M \oplus D^{-1}N$.

(6) Localization is exact (i.e., $D^{-1}R$ is a flat $R$-module): If
$0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a short exact sequence of $R$-modules,
then the induced sequence
$0 \to D^{-1}L \xrightarrow{\psi'} D^{-1}M \xrightarrow{\varphi'} D^{-1}N \to 0$ of $D^{-1}R$-modules
is also exact.

Proof: We first prove (6). Suppose that $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ is a short exact
sequence of $R$-modules. Every element of $D^{-1}N$ is of the form $n/d$ for some $n \in N$ and
$d \in D$. Since $\varphi$ is surjective, $n = \varphi(m)$ for some $m \in M$, so $\varphi'(m/d) = \varphi(m)/d = n/d$
and $\varphi' : D^{-1}M \to D^{-1}N$ is surjective. If $m/d$ is in the kernel of $\varphi'$ then $d_1\varphi(m) = 0$ for
some $d_1 \in D$. Then $\varphi(d_1m) = 0$ implies $d_1m = \psi(l)$ for some $l \in L$ by the exactness
of the original sequence at $M$, so $m/d = d_1m/(d_1d) = \psi(l)/(d_1d) = \psi'(l/(d_1d))$
and $\ker(\varphi') \subseteq \operatorname{image}(\psi')$. If $\psi(l)/d \in \operatorname{image}(\psi')$ then $\varphi'(\psi(l)/d) = \varphi(\psi(l))/d = 0$,
which shows the reverse inclusion $\operatorname{image}(\psi') \subseteq \ker(\varphi')$, and we have exactness of the
induced sequence at $D^{-1}M$. Finally, suppose $\psi'(l/d) = 0$. Then $d_2\psi(l) = 0$ for some
$d_2 \in D$, i.e., $\psi(d_2l) = 0$, so $d_2l = 0$ by the injectivity of $\psi$. Hence $l/d = d_2l/(d_2d) = 0$
and $\psi'$ is injective. This proves that the sequence
$0 \to D^{-1}L \xrightarrow{\psi'} D^{-1}M \xrightarrow{\varphi'} D^{-1}N \to 0$ is exact.

To prove the first statement in (1), note that $(i + j)/d = i/d + j/d$ for $i \in I$, $j \in J$
and $d \in D$ shows $D^{-1}(I + J) \subseteq D^{-1}(I) + D^{-1}(J)$; and $i/d_1 + j/d_2 = (d_2i + d_1j)/(d_1d_2)$
for $i \in I$, $j \in J$ and $d_1, d_2 \in D$ shows $D^{-1}(I) + D^{-1}(J) \subseteq D^{-1}(I + J)$. For the
second statement, the inclusion $D^{-1}(I \cap J) \subseteq D^{-1}(I) \cap D^{-1}(J)$ is immediate. If
$a/d \in D^{-1}(I) \cap D^{-1}(J)$ then $d_1a \in I$ and $d_2a \in J$ for some $d_1, d_2 \in D$. Then
$d_1d_2a \in I \cap J$ and $a/d = (d_1d_2a)/(d_1d_2d)$ gives the inclusion $D^{-1}(I) \cap D^{-1}(J) \subseteq
D^{-1}(I \cap J)$. The last statement in (1) follows by applying (6) to the exact sequence
$0 \to I \to R \to R/I \to 0$.

To prove (2), suppose first that $a \in \operatorname{rad} I$, so that $a^n \in I$ for some $n \ge 1$. Then
$(a/d)^n = a^n/d^n \in D^{-1}I$ so $D^{-1}(\operatorname{rad} I) \subseteq \operatorname{rad}(D^{-1}I)$. Conversely, if $a/d \in \operatorname{rad}(D^{-1}I)$
then $(a/d)^n \in D^{-1}I$ for some $n \ge 1$, i.e., $d_1a^n \in I$ for some $d_1 \in D$. Hence
$(d_1a)^n = d_1^{n-1}(d_1a^n) \in I$, so $d_1a \in \operatorname{rad} I$ and then $a/d = d_1a/(d_1d) \in D^{-1}(\operatorname{rad} I)$
shows that $\operatorname{rad}(D^{-1}I) \subseteq D^{-1}(\operatorname{rad} I)$. This proves the second statement in (2), and the
first statement follows by applying this to the ideal $I = (0)$.

For (3), note first that $D \cap P = \emptyset$ if and only if $D \cap Q = \emptyset$ (one inclusion is
obvious and the other follows since $d \in D \cap P$ implies $d^n \in D \cap Q$ for some $n$). The
statement for $D \cap P \ne \emptyset$ and the fact that $D^{-1}P$ is a prime ideal for $D \cap P = \emptyset$ were
proved in Proposition 38. To see that $D^{-1}Q$ is a primary ideal in $D^{-1}R$, suppose that
$(a/d_1)(b/d_2) \in D^{-1}Q$ and $a/d_1 \notin D^{-1}Q$. Then there is some element $d \in D$ so that
$dab \in Q$, and since $a \notin Q$ and $Q$ is primary, we have $(db)^n \in Q$ for some $n \ge 1$.
Then $(b/d_2)^n = d^nb^n/(d^nd_2^n) \in D^{-1}Q$, so that $D^{-1}Q$ is primary. The radical of $D^{-1}Q$
is $D^{-1}P$ by (2). Finally, by (2) of Proposition 38, the contraction of $D^{-1}Q$ is an ideal
of $R$ containing $Q$ and consists precisely of the elements $r \in R$ with $dr \in Q$ for some
$d \in D$. Since $Q$ is $P$-primary, the definition of primary implies that if $dr \in Q$ and
$d \notin P$, then $r \in Q$, hence the contraction of $D^{-1}Q$ is $Q$.

The proof of (4) is essentially the same as the proof of (1) and is left as an exercise.

It is easy to see that if the exact sequence $0 \to L \xrightarrow{\psi} M \xrightarrow{\varphi} N \to 0$ of $R$-modules
splits, then the exact sequence $0 \to D^{-1}L \xrightarrow{\psi'} D^{-1}M \xrightarrow{\varphi'} D^{-1}N \to 0$ of $D^{-1}R$-modules
also splits, which gives (5).

Proposition 38 shows that localizing at the multiplicatively closed set $D$ emphasizes
the ideals of $R$ not containing any elements of $D$ since the other ideals of $R$ become
trivial when extended to $D^{-1}R$. The following proposition provides a more precise
statement in terms of the effect of localization on primary decomposition of ideals.

Proposition 43. Let $R$ be a Noetherian ring and let

\[ I = Q_1 \cap \cdots \cap Q_m \]

be a minimal primary decomposition of the proper ideal $I$, where $Q_i$ is a $P_i$-primary
ideal. Suppose $D$ is a multiplicatively closed set of $R$ containing 1 and the primary
ideals $Q_1, \dots, Q_m$ are numbered so that $D \cap P_i = \emptyset$ for $1 \le i \le t$ and $D \cap P_i \ne \emptyset$ for
$t + 1 \le i \le m$. Then

\[ D^{-1}I = D^{-1}Q_1 \cap \cdots \cap D^{-1}Q_t \]

is a minimal primary decomposition of $D^{-1}I$ in $D^{-1}R$ and $D^{-1}Q_i$ is a $D^{-1}P_i$-primary
ideal. Further, the contraction of $D^{-1}Q_i$ back to $R$ is $Q_i$ for $1 \le i \le t$ and

\[ {}^{c}(D^{-1}I) = Q_1 \cap \cdots \cap Q_t \]

is a minimal primary decomposition of the contraction of $D^{-1}I$ back to $R$.

Proof: By (3) of Proposition 42, $D^{-1}Q_i = D^{-1}R$ for $t + 1 \le i \le m$, and $D^{-1}Q_i$ is
a $D^{-1}P_i$-primary ideal with pullback $Q_i$ for $1 \le i \le t$. By (1) of the same proposition,
$D^{-1}I = D^{-1}Q_1 \cap \cdots \cap D^{-1}Q_t$, and (3) shows that this is a primary decomposition.
Contracting to $R$ shows that ${}^{c}(D^{-1}I) = Q_1 \cap \cdots \cap Q_t$, which also implies that the
decompositions are minimal.
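Proposition 43 is easy to see in action for $R = \mathbb{Z}$, where $(n) = \bigcap (p^e)$ over the prime powers $p^e$ exactly dividing $n$ and each $(p^e)$ is $(p)$-primary. The sketch below assumes $n \ge 2$ and $D = \{1, f, f^2, \dots\}$ with $f \ge 1$, so that $D \cap (p) \ne \emptyset$ exactly when $p$ divides $f$; the function names are illustrative.

```python
def prime_power_components(n):
    """Return {p: p**e} with n = product of the p**e (trial division, n >= 2)."""
    comps = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            comps[p] = comps.get(p, 1) * p
            n //= p
        p += 1
    if n > 1:
        comps[n] = comps.get(n, 1) * n
    return comps


def surviving_components(n, f):
    """Primary components (p**e) of (n) whose prime (p) is disjoint from D."""
    return {p: q for p, q in prime_power_components(n).items() if f % p != 0}


def contraction_generator(n, f):
    """Generator of the contraction to Z of the localized ideal D^{-1}(n)."""
    g = 1
    for q in surviving_components(n, f).values():
        g *= q
    return g
```

With $n = 360 = 2^3 \cdot 3^2 \cdot 5$ and $f = 6$, only the component $(5)$ survives, so the contraction of $D^{-1}(360)$ is $(5)$.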

In particular we can finish the proof of Theorem 21:


Corollary 44. The primary ideals belonging to the isolated primes in a minimal primary
decomposition of $I$ are uniquely defined by $I$.


Proof: Let $P$ be a minimal element in the set $\{P_1, \dots, P_m\}$ of primes belonging
to $I$, and take $D = R - P$ in Proposition 43. Then $D \cap P_i = \emptyset$ only for $P = P_i$, so
the contraction of the localization of $I$ at $D$ is precisely the primary ideal $Q$ belonging
to the minimal prime $P$. Since the set $\{P_1, \dots, P_m\}$ of primes belonging to
$I$ is uniquely determined by $I$, it follows that the primary ideals $Q$ belonging to the
isolated primes of $I$ are also uniquely determined by $I$.

The effect of isolating certain prime ideals by localization is particularly
precise in the case of localizing at a prime $P$ (considered in Example 3 following
Corollary 37 above). We first recall the definition of an important type of ring (cf.
Exercises 37-39 in Section 7.4).

Definition. A commutative ring with 1 that has a unique maximal ideal is called a 
local ring. 


Proposition 45. Let $R$ be a commutative ring with 1. Then the following are equivalent:

(1) $R$ is a local ring with unique maximal ideal $M$

(2) if $M$ is the set of elements of $R$ that are not units, then $M$ is an ideal

(3) there is a maximal ideal $M$ of $R$ such that every element $1 + m$ with $m \in M$ is
a unit in $R$.


Proof: If $a \in R$ then the ideal $(a)$ is either $R$, in which case $a$ is a unit, or is a proper
ideal, in which case $(a)$ is contained in a maximal ideal (Proposition 11 of Section 7.4).
It follows that if $R$ is a local ring and $M$ is its unique maximal ideal then every $a \notin M$
is a unit, so $M$ consists precisely of the set of nonunits in $R$, showing that (1) implies
(2). It also follows that if the set $M$ of nonunits in $R$ is an ideal then this ideal must be
the unique maximal ideal in $R$, so that (2) implies (1).

Suppose now that (3) is satisfied. If $a$ is an element of $R$ not contained in the
maximal ideal $M$, then $(a) + M = R$, so that $ab + m = 1$ for some $b \in R$ and $m \in M$.
Then $ab = 1 - m$ is a unit by assumption, so $a$ is also a unit. This shows that $M$ is
the unique maximal ideal in $R$, so (3) implies (1). Conversely, if $R$ is a local ring, then
$1 + m \notin M$ for any $m \in M$, so $1 + m$ is a unit, so (1) implies (3).
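Criterion (2) of Proposition 45 lends itself to a direct check in the rings $\mathbb{Z}/n\mathbb{Z}$. The sketch below assumes $n \ge 2$ and uses the standard fact that $a$ is a unit mod $n$ exactly when $\gcd(a, n) = 1$; the function names are illustrative.

```python
from math import gcd


def nonunits(n):
    """Residues a in Z/nZ (n >= 2) that are not units, i.e. gcd(a, n) != 1."""
    return [a for a in range(n) if gcd(a, n) != 1]


def nonunits_form_ideal(n):
    """Criterion (2) of Proposition 45 for Z/nZ.

    A multiple of a nonunit is again a nonunit, so only closure under
    addition needs to be tested.
    """
    s = set(nonunits(n))
    return all((a + b) % n in s for a in s for b in s)
```

The check succeeds for $n = 8$ and $n = 9$ and fails for $n = 12$, where $2 + 3 = 5$ is a unit; this matches the fact that $\mathbb{Z}/n\mathbb{Z}$ is local exactly when $n$ is a prime power.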
Proposition 46. For any commutative ring $R$ with 1, let $R_P$ be the localization of $R$ at
the prime ideal $P$ and let ${}^{e}P$ be the extension of $P$ to $R_P$.

(1) The ring $R_P$ is a local ring with unique maximal ideal ${}^{e}P$. The contraction of
${}^{e}P$ to $R$ is $P$, i.e., ${}^{c}({}^{e}P) = P$, and the map from $R$ to $R_P$ induces an injection
of the integral domain $R/P$ into $R_P/{}^{e}P$. The quotient $R_P/{}^{e}P$ is a field and is
isomorphic to the fraction field of the integral domain $R/P$.

(2) If $R$ is an integral domain, then $R_P$ is an integral domain. The ring $R$ injects into
the local ring $R_P$, and, identifying $R$ with its image in $R_P$, the unique maximal
ideal of $R_P$ is $PR_P$.

(3) The prime ideals in $R_P$ are in bijective correspondence with the prime ideals of
$R$ contained in $P$.

(4) If $P$ is a minimal nonzero prime ideal of $R$ then $R_P$ has a unique nonzero prime
ideal.

(5) If $P = M$ is a maximal ideal and $I$ is any $M$-primary ideal of $R$ then
$R_M/{}^{e}I \cong R/I$. In particular, $R_M/{}^{e}M \cong R/M$ and $({}^{e}M)/({}^{e}M)^n \cong M/M^n$
for all $n \ge 1$.

Proof: If $P'$ is a prime ideal of $R$, then $P' \cap (R - P) = \emptyset$ if and only if $P' \subseteq P$,
so (3) is immediate from (3) in Proposition 38, and (4) follows. Since ${}^{e}P \ne R_P$ by (2)
of Proposition 38, it follows from (3) that $R_P$ is a local ring with unique maximal ideal
${}^{e}P$, which proves the first statement in (1).

By Proposition 38(2) the contraction ${}^{c}({}^{e}P)$ is the set $\{r \in R \mid dr \in P \text{ for some } d \in
R - P\}$, and since $P$ is prime, $dr \in P$ with $d \notin P$ implies $r \in P$. This shows that
${}^{c}({}^{e}P) = P$, which is the second statement in (1).

The kernel of the map from $R$ to $R_P/{}^{e}P$ is ${}^{c}({}^{e}P) = P$, so the induced map from
$R/P$ into $R_P/{}^{e}P$ is injective. The quotient $R_P/{}^{e}P$ is a field by the first part of (1), so
there is an induced homomorphism from the fraction field of the integral domain $R/P$
into $R_P/{}^{e}P$. The universal property of the localization $R_P$ shows there is an inverse
homomorphism from $R_P/{}^{e}P$ to the fraction field of $R/P$ (since every element of $R$ not
in $P$ maps to a nonzero element of $R/P$, hence to a unit in its fraction field). It follows
that $R_P/{}^{e}P$ is isomorphic to the fraction field of $R/P$.

If $R$ is an integral domain, then $R - P$ has no zero divisors, so $R$ injects into $R_P$ by
Corollary 37; if $R$ is identified with its image in $R_P$ then ${}^{e}P = PR_P$, so (2) follows.

To prove (5), by Proposition 42(1) we may pass to the quotient $R/I$ and so reduce
to the case $I = 0$. In this case the maximal ideal $P = M$ in $R$ is the nilradical of $R$,
hence is the unique maximal ideal of $R$. By Proposition 45 every element of $R - M$ is
a unit, so $R_P = R$, and each of the statements in (5) follows immediately, completing
the proof of the proposition.

Example 

The results of (5) of the proposition are not true in general if P is a prime ideal that is not
maximal. For example, P = (0) in R = Z has R/P = Z and R_P/PR_P = Q; in this case
(PR_P)/(PR_P)^n = P/P^n = 0 for all n ≥ 1 (cf. the exercises).

Definition. Let M be an R-module, let P be a prime ideal of R and set D = R − P.
The R_P-module D^{-1}M is called the localization of M at P, and is denoted by M_P.

By Proposition 41, M_P can also be identified with the tensor product R_P ⊗_R M.
When R is an integral domain and P = (0), then M_(0) is a module over the field of
fractions F of R, i.e., is a vector space over F.

The element m/1 is zero in M_P if and only if rm = 0 for some r ∈ R − P, so
localizing at P annihilates the P'-torsion elements of M for primes P' not contained
in P. In particular, localizing at (0) over an integral domain annihilates the torsion
subgroup of M.

Definition. If R is an integral domain, then the rank of the R-module M is the
dimension of the localization M_(0) as a vector space over the field of fractions of R.

It is easy to see that this definition of rank agrees with the notion of rank introduced
in Chapter 12.

Example 

Let R = Z and let Z_(p) be the localization of Z at the nonzero prime ideal (p). Any abelian
group M is a Z-module so we may localize M at (p) by forming M_(p). This abelian group
is the same as the quotient of M with respect to the subgroup of elements whose order is
finite and not divisible by p. If M is a finite (or, more generally, torsion) abelian group,
then M_(p) is a p-group, and is the Sylow p-subgroup or p-primary component of M. In
this case the localization M_(0) of M at (0) is the trivial group. For a specific example, let
M = Z/6Z be the cyclic group of order 6, considered as a Z-module. Then the localization
of M at p = 2 is Z/2Z, at p = 3 is Z/3Z, and reduces to 0 at all other prime ideals of Z.
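
The computation for the cyclic group of order 6 can be checked numerically. The following Python sketch is only an illustration: the function names localize_cyclic and killed_elements are hypothetical, and the code assumes M is the cyclic group of order n written additively as the integers modulo n. It uses the criterion that m/1 is zero in the localization at (p) exactly when rm = 0 for some integer r not divisible by p, i.e., when the additive order of m is prime to p.

```python
from math import gcd

def localize_cyclic(n, p):
    """Order of the localization of Z/nZ at (p): the largest power of p dividing n."""
    order = 1
    while n % p == 0:
        n //= p
        order *= p
    return order

def killed_elements(n, p):
    """Elements m of Z/nZ that become 0 after localizing at (p).

    The additive order of m is n // gcd(n, m); m/1 = 0 exactly when
    this order is not divisible by p.
    """
    return [m for m in range(n) if (n // gcd(n, m)) % p != 0]

# Z/6Z at p = 2, 3, 5: the quotient by the killed subgroup has order 2, 3, 1.
for p in (2, 3, 5):
    killed = killed_elements(6, p)
    print(p, killed, 6 // len(killed), localize_cyclic(6, p))
```

The order of the quotient by the killed subgroup agrees with the order of the p-primary component, as the example asserts.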

Localization of a module M at a prime P in general produces a simpler module
M_P whose properties are easier to determine. It is then of interest to translate these
“local” properties of M_P back into “global” information about the module M itself. For
example, the most basic question of whether a module M is 0 can be answered locally:

Proposition 47. Let M be an R-module. Then the following are equivalent:

(1) M = 0,

(2) M_P = 0 for all prime ideals P of R, and

(3) M_𝔪 = 0 for all maximal ideals 𝔪 of R.

Proof: The implications (1) implies (2) implies (3) are obvious, so it remains to
prove that (3) implies (1). Suppose m is a nonzero element in M, and consider the
annihilator I of m in R, i.e., the ideal of elements r ∈ R with rm = 0. Since m is
nonzero, I is a proper ideal in R. Let 𝔪 be a maximal ideal of R containing I and
consider the element m/1 in the corresponding localization M_𝔪 of M. If this element
were 0, then rm = 0 for some r ∈ R − 𝔪. But then r would be an element in I not
contained in 𝔪, a contradiction. Hence M_𝔪 ≠ 0, which proves that (3) implies (1).

It is not in general true that a property shared by all of the localizations of a module
M is also shared by M. For example, all of the localizations of a ring R can be
integral domains without R itself being an integral domain (for example, Z/6Z above).
Nevertheless, a great deal of information can be ascertained from studying the various
possible localizations, and this is what makes this technique so useful. If R is an integral
domain, for example, then each of the localizations R_P can be considered as a subring
of the fraction field F of R that contains R; the next proposition shows that the elements
of R are the only elements of F contained in every localization.

Proposition 48. Let R be an integral domain. Then R is the intersection of the
localizations of R: R = ∩_P R_P. In fact, R = ∩_𝔪 R_𝔪 is the intersection of the
localizations of R at the maximal ideals 𝔪 of R.

Proof: As mentioned, R ⊆ ∩_𝔪 R_𝔪. Suppose now that a is an element of the fraction
field F of R that is contained in R_𝔪 for every maximal ideal 𝔪 of R, and consider

I_a = {d ∈ R | da ∈ R}.

It is easy to check that I_a is an ideal of R, and that a ∈ R if and only if 1 ∈ I_a, i.e.,
I_a = R. Suppose that I_a ≠ R. Then there is a maximal ideal 𝔪 containing I_a, and
since a ∈ R_𝔪 we have a = r/d for some r ∈ R and d ∈ R − 𝔪. But then d ∈ I_a and
d ∉ 𝔪, a contradiction. Hence a ∈ R, so ∩_𝔪 R_𝔪 ⊆ R, and we have proved the second
assertion in the proposition. The first is then immediate.
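
For the integers the argument in this proof can be traced concretely. The sketch below is a hedged illustration only, assuming the integral domain is Z with fraction field Q; the helper names in_localization, denominator_ideal_generator and excluding_primes are hypothetical. For a rational number a, the ideal of integers d with da an integer is generated by the reduced denominator of a, and the primes dividing that denominator are exactly the maximal ideals (p) whose localization fails to contain a.

```python
from fractions import Fraction

def in_localization(a, p):
    """True if the rational a lies in the localization of Z at (p)."""
    return Fraction(a).denominator % p != 0

def denominator_ideal_generator(a):
    """Positive generator of the ideal {d in Z : d*a in Z}."""
    return Fraction(a).denominator

def excluding_primes(a):
    """Primes p with a not in the localization of Z at (p)."""
    d = denominator_ideal_generator(a)
    primes = []
    q = 2
    while d > 1:
        if d % q == 0:          # q is prime here: smaller factors were removed
            primes.append(q)
            while d % q == 0:
                d //= q
        q += 1
    return primes

a = Fraction(5, 12)
print(denominator_ideal_generator(a), excluding_primes(a))
print(excluding_primes(Fraction(7)))   # an integer lies in every localization
```

A rational number with no excluding primes has denominator 1, which is the statement of the proposition in this special case.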

Another important property of a ring R that can be detected locally is normality: 

Proposition 49. Let R be an integral domain. Then the following are equivalent: 

(1) R is normal, i.e., R is integrally closed (in its field of fractions),

(2) R_P is normal for all prime ideals P of R, and

(3) R_𝔪 is normal for all maximal ideals 𝔪 of R.

Proof: Let F be the field of fractions of R, so all of the various localizations of R
may be considered as subrings of F.

Assume first that R is integrally closed and suppose y ∈ F is integral over R_P.
Then y is a root of a monic polynomial of degree n with coefficients of the form a_i/d_i for
some a_i ∈ R and d_i ∉ P. The element y' = y(d_0 d_1 ⋯ d_{n−1}) is then a root of a monic
polynomial of degree n with coefficients from R, i.e., y' is integral over R. Since R is assumed
normal, this implies y' ∈ R, and so y = y'/(d_0 d_1 ⋯ d_{n−1}) ∈ R_P, which proves that (1)
implies (2). The implication (2) implies (3) is trivial. Suppose now that R_𝔪 is normal
for all maximal ideals 𝔪 of R and let y be an element of F that is integral over R. Since
R ⊆ R_𝔪, y is in particular also integral over R_𝔪 and so y ∈ R_𝔪 for every maximal
ideal 𝔪 by assumption. Then y ∈ R by the previous proposition, which proves that (3)
implies (1).

We now may easily prove the first part of the Going-up Theorem (cf. Section 3) 
that was used in the proof of Corollary 27. 

Corollary 50. Let R be a subring of the commutative ring S with 1 ∈ R, and assume
that S is integral over R. If P is a prime ideal in R, then there is a prime ideal Q of S
with P = Q ∩ R.

Proof: Let D = R − P so that D is a multiplicatively closed subset of both R and
S. Then the following diagram commutes:

    R ───→ D^{-1}R = R_P
    │          │
    ↓          ↓
    S ───→ D^{-1}S

where the vertical maps are inclusions. It is easy to see that D^{-1}S is integral over R_P
(Exercise 20). Let 𝔪 be any maximal ideal of D^{-1}S. Then 𝔪 ∩ R_P is a maximal ideal in
R_P by the second statement in Theorem 26(2) (note that the first part of Theorem 26(2)
was not used in the proof of the second statement). By Proposition 38(1), 𝔪 ∩ R_P is
the extension of P to the local ring R_P, and the contraction of this ideal to R is just P.
Put another way, the preimage of 𝔪 by the maps along the top and right of the diagram
above is P. If Q ⊆ S denotes the preimage of 𝔪 by the map along the bottom of the
diagram, then Q is a prime ideal by Proposition 38(3). Since Q ∩ R is the pullback of
Q by the map along the left of the diagram above, the commutativity of the diagram
shows that Q ∩ R = P.

Local Rings of Affine Algebraic Varieties 

For the remainder of this section, let k be an algebraically closed field and let V be an
affine variety over k with coordinate ring k[V]. Then k[V] is an integral domain, so we
may form its field of fractions:

k(V) = {f/g | f, g ∈ k[V], g ≠ 0}.

The elements of k(V) are called rational functions on V and k(V) is called the field
of rational functions on V. When k[V] is a Unique Factorization Domain there is an
essentially unique representative for f/g that is in “lowest terms,” but in general each
fraction f/g ∈ k(V) has many representations as a ratio of two elements of k[V]. Since
k[V] is an integral domain, f/g = f_1/g_1 if and only if f g_1 = f_1 g.

The elements of k[V] can be considered as k-valued functions on V, and if the
denominator doesn’t vanish the same is true for an element of k(V) (which helps to
explain the terminology for this field). Since the same element of k(V) may be written
in the form f/g in several ways, we make the following definition:

Definition. We say f/g is regular at v or defined at the point v ∈ V if there are some
f_1, g_1 ∈ k[V] with f/g = f_1/g_1 and g_1(v) ≠ 0.

If f_2, g_2 is another such pair with g_2(v) ≠ 0, then f_1(v)/g_1(v) = f_2(v)/g_2(v) as
elements of k, so whenever f/g is regular at v there is a well defined way of specifying
its value in k at v.

Example 

The variety V = Z(xz − yw) in A^4 has coordinate ring k[V] = k[x, y, z, w]/(xz − yw).
Consider the element f = x/y in the quotient field k(V) of k[V]. Since xz = yw in k[V],
the element f can also be written as w/z. From the first expression for f it follows that f
is regular at all points of V where y ≠ 0, and from the second expression it follows that f
is regular at all points of V where z ≠ 0. It is not too difficult to show that these are all the
points of V where f is regular. Furthermore, there is no single expression f = a/b for f
with a, b ∈ k[V] such that b(v) ≠ 0 for every v where f is regular (cf. Exercise 25).
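
The two expressions x/y and w/z can be compared at sample points. The sketch below is only an illustration under a simplifying assumption: it works with points having integer coordinates rather than over an algebraically closed field, and the helper names on_V and value_of_f are hypothetical. It evaluates the rational function with whichever denominator is nonzero and reports None when both denominators vanish.

```python
from fractions import Fraction

def on_V(pt):
    """True if the point (x, y, z, w) satisfies xz = yw."""
    x, y, z, w = pt
    return x * z == y * w

def value_of_f(pt):
    """Value of f = x/y = w/z at a point of V, or None if y = z = 0."""
    x, y, z, w = pt
    if not on_V(pt):
        raise ValueError("point is not on V")
    if y != 0:
        return Fraction(x, y)   # first expression is defined here
    if z != 0:
        return Fraction(w, z)   # second expression is defined here
    return None                 # neither expression is defined

print(value_of_f((2, 4, 6, 3)))   # both expressions apply and agree
print(value_of_f((0, 0, 5, 7)))   # only w/z applies
print(value_of_f((1, 0, 0, 1)))   # a point with y = z = 0
```

At a point where both y and z are nonzero, the relation xz = yw forces the two fractions to be equal, which is why the value is well defined.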

If f/g ∈ k(V) is regular at the point v, say f/g = f_1/g_1 with g_1(v) ≠ 0, then
f/g is also regular at all the points in the Zariski open neighborhood V_{g_1} of v where
g_1 ≠ 0. As a k-valued function on V this means that if f/g is defined at v, then it is
also defined in a (Zariski open) neighborhood of v. Since any nonempty open set of an
affine variety is Zariski dense (cf. Exercise 11 in Section 2), we see that every rational
function on V is defined at a dense set of points in V (so “almost everywhere” in a
suitable sense). Also, each pair f_1/g_1 and f_2/g_2 representing f/g agree as functions
on the open neighborhood V_{g_1} ∩ V_{g_2} of v, but the “size” of this neighborhood depends
on g_1 and g_2: there is in general not a common open neighborhood of v where all
representatives of f/g with nonzero denominator at v are simultaneously defined.

If v is a fixed point in V, then a rational function f/g is regular at v if and only if
f/g = f_1/g_1 for some f_1, g_1 ∈ k[V] with g_1 ∉ ℐ(v), the ideal of functions on V that
are zero at v. This means that the set of rational functions that are defined at v is the
same as the localization of k[V] at the maximal ideal ℐ(v):

Definition. For each point v ∈ V the collection of rational functions on V that are
defined at v,

O_{v,V} = {f/g ∈ k(V) | f/g is regular at v},

is called the local ring of V at v. Equivalently, the local ring of V at v is the localization
of k[V] at the maximal ideal ℐ(v).

In particular, O_{v,V} is a local ring with unique maximal ideal 𝔪_{v,V}, where

𝔪_{v,V} = {f/g ∈ k(V) | f/g = f_1/g_1 with f_1(v) = 0, g_1(v) ≠ 0}

is the set of rational functions on V that are defined and equal to 0 at v. Since O_{v,V}
is a localization of the Noetherian integral domain k[V] at a prime ideal, O_{v,V} is
also a Noetherian integral domain. Note also that O_{v,V}/𝔪_{v,V} ≅ k[V]/ℐ(v) ≅ k by
Proposition 46(5).

Recall that the polynomial maps from V to k are also referred to as the regular
maps of V to k. This is because these are precisely the rational functions on V that are
regular everywhere:

Proposition 51. If V is an affine variety over an algebraically closed field k then the
rational functions on V that are regular at all points of V are precisely the polynomial
functions k[V].

Proof: This follows from Proposition 48, which shows that the intersection (in
k(V)) of all of the localizations of k[V] at the maximal ideals of k[V] is precisely k[V].

Since the maximal ideals of k[V] are in bijective correspondence with the points of
V, the fact that the local ring O_{v,V} is the same as the localization of k[V] at the maximal
ideal corresponding to v shows that O_{v,V} depends intrinsically on the ring k[V] and is
independent of the embedding of V in a particular affine space.

Suppose φ : V → W is a morphism of affine varieties with associated k-algebra
homomorphism φ̃ : k[W] → k[V]. If v ∈ V is mapped to w ∈ W by φ, then it is
straightforward to show that φ induces a homomorphism (also denoted by φ̃) between
the corresponding local rings:

φ̃ : O_{w,W} → O_{v,V} where φ̃(h/k) = φ̃(h)/φ̃(k),

and that under this homomorphism, φ̃^{-1}(𝔪_{v,V}) = 𝔪_{w,W} (a homomorphism of local
rings having this property is called a local homomorphism). Note that φ̃ does not in
general extend to a field homomorphism from all of k(W) into k(V) since elements of
k[W] lying in the kernel of φ̃ do not map to invertible elements in k(V). It is also easy to
check that if ψ ∘ φ is a composition of morphisms then on the local rings the
homomorphism induced by ψ ∘ φ is φ̃ ∘ ψ̃.

The local ring O_{v,V} can be used to provide an algebraic definition of the “smoothness”
(in the sense of the existence of tangents) of V at v, as we now indicate. Suppose
first that V = Z(f) is the hypersurface variety in A^n defined by the zeros of an
irreducible polynomial f in k[x_1, ..., x_n]. For any point v = (v_1, ..., v_n) on V let
D_v(f)(x_1, ..., x_n) be the linear polynomial:

D_v(f)(x_1, ..., x_n) = Σ_{i=1}^{n} (∂f/∂x_i)(v) x_i

where the partial derivative of f with respect to x_i is given by the usual formal rule
for the derivative of a polynomial in x_i (with all other variables considered constant).
The polynomial D_v(f)(x_1 − v_1, ..., x_n − v_n) is the first order Taylor polynomial of the
function f at v, so gives the best linear approximation to f(x_1, ..., x_n) ∈ k[x_1, ..., x_n]
at v. It follows that if T is the linear variety Z(D_v(f)(x_1, ..., x_n)) consisting of those
points where D_v(f) is zero, then the translate v + T is “tangent” to the hypersurface
Z(f) at v.


Example 

Suppose f = x^2 − y ∈ k[x, y], so that V = Z(f) is just the parabola y = x^2. We have
∂f/∂x = 2x and ∂f/∂y = −1, which at v = (3, 9) are equal to 6 and −1, respectively.
Then

D_{(3,9)}(f)(x, y) = 6x − y,

and the corresponding linear variety T is the line y = 6x through the origin. The translate
(3, 9) + T is the usual tangent line to the parabola at (3, 9). The Taylor expansion of
x^2 − y at (3, 9) is x^2 − y = [6(x − 3) − (y − 9)] + (x − 3)^2. The first order terms are
D_{(3,9)}(f)(x − 3, y − 9) and give the best linear approximation to x^2 − y near (3, 9).
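
The linear polynomial attached to a point can be computed mechanically from the formal partial derivatives. The following Python sketch is a minimal illustration, assuming a polynomial is stored as a dictionary from exponent tuples to integer coefficients; the representation and the names partial, evaluate and linear_part are hypothetical choices made for this example. It recovers the coefficients 6 and −1 for the parabola at (3, 9) and checks the stated Taylor expansion at a sample point.

```python
def partial(poly, i):
    """Formal partial derivative with respect to the i-th variable."""
    out = {}
    for exps, c in poly.items():
        if exps[i] > 0:
            new = list(exps)
            new[i] -= 1
            key = tuple(new)
            out[key] = out.get(key, 0) + c * exps[i]
    return out

def evaluate(poly, v):
    """Value of the polynomial at the point v."""
    total = 0
    for exps, c in poly.items():
        term = c
        for e, coord in zip(exps, v):
            term *= coord ** e
        total += term
    return total

def linear_part(poly, v):
    """Coefficients of the linear polynomial: the partials evaluated at v."""
    return [evaluate(partial(poly, i), v) for i in range(len(v))]

f = {(2, 0): 1, (0, 1): -1}        # x^2 - y
v = (3, 9)
coeffs = linear_part(f, v)
print(coeffs)

# Compare f with its Taylor expansion at v, at the sample point (5, 2).
x, y = 5, 2
taylor = coeffs[0] * (x - 3) + coeffs[1] * (y - 9) + (x - 3) ** 2
print(evaluate(f, (x, y)), taylor)
```

Storing polynomials as exponent dictionaries keeps the example free of external libraries; a computer algebra system would serve equally well.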

It is straightforward to extend these notions to any affine variety V in A^n.

Definition. Define the tangent space to V at v to be the linear variety
T_{v,V} = Z({D_v(f)(x_1, ..., x_n) | f ∈ ℐ(V)}).

The formal partial derivatives are k-linear and obey the usual product rule for
derivatives, so the tangent space may be computed from the generators for ℐ(V):

if ℐ(V) = (f_1, ..., f_m), then T_{v,V} = ∩_{i=1}^{m} Z(D_v(f_i)).

Note that T_{v,V} is an intersection of vector spaces, so is a vector subspace of k^n.

This definition of the tangent space T_{v,V}, while making apparent the connection
with tangents to the variety V, seems to depend on the embedding of V in A^n. In fact
the tangent space can be defined entirely in terms of the local ring O_{v,V}, as the next
proposition proves.

Proposition 52. Let V be an affine variety over the algebraically closed field k and let
v be a point on V with local ring O_{v,V} and corresponding maximal ideal 𝔪_{v,V}. Then
there is a k-vector space isomorphism

(T_{v,V})* ≅ 𝔪_{v,V}/𝔪_{v,V}^2

where (T_{v,V})* denotes the vector space dual (cf. Section 11.3) of the tangent space T_{v,V}
to V at v.

Proof: Let (k^n)* denote the n-dimensional vector space dual to k^n. Since each
D_v(f) is a linear function, D_v is a linear transformation from k[x_1, ..., x_n] to (k^n)*.
Let M_v be the maximal ideal in k[x_1, ..., x_n] generated by the set x_i − v_i for
1 ≤ i ≤ n. The image M_v/ℐ(V) of M_v in k[V] is the ideal ℐ(v) of functions on V that
are zero at v and ℐ(v)^2 = (M_v^2 + ℐ(V))/ℐ(V). Then O_{v,V} is the localization of k[V] at
ℐ(v), and identifying ℐ(v) with its image in O_{v,V} we have 𝔪_{v,V} = ℐ(v)O_{v,V}
(Proposition 46(2)). By definition of D_v we have D_v(x_i − v_i) = x_i, and since these linear
functions form a basis of (k^n)*, it follows that D_v maps M_v surjectively onto (k^n)*. The
kernel of D_v on M_v consists of the elements of k[x_1, ..., x_n] whose Taylor expansion at v
starts in degree at least 2 and these are just the elements in M_v^2. Hence D_v defines an
isomorphism

D_v : M_v/M_v^2 ≅ (k^n)*.

The tangent space T_{v,V} is a vector subspace of k^n, so every linear function on k^n restricts
to a linear function on T_{v,V}. Composing D_v with this restriction map gives a linear
transformation

D : M_v → (k^n)* → (T_{v,V})*

which is surjective since the individual maps are each surjective. We have already
seen that ℐ(v)^2 = (M_v^2 + ℐ(V))/ℐ(V), so ℐ(v)/ℐ(v)^2 ≅ M_v/(M_v^2 + ℐ(V)). It follows
by Proposition 46(5) that 𝔪_{v,V}/𝔪_{v,V}^2 ≅ ℐ(v)/ℐ(v)^2. To prove the proposition it is
therefore sufficient to show that ker D = M_v^2 + ℐ(V), since then

𝔪_{v,V}/𝔪_{v,V}^2 ≅ M_v/(M_v^2 + ℐ(V)) = M_v/ker D ≅ (T_{v,V})*.

The polynomial f is in ker D if and only if D_v(f) is zero on T_{v,V}, i.e., if and only if
the linear term of the Taylor polynomial of f expanded about v lies in ℐ(T_{v,V}). Since
the linear terms of the functions in ℐ(V) generate the ideal ℐ(T_{v,V}), it follows that f
is in ker D if and only if f − g has zero linear term for some g in ℐ(V). But this is
equivalent to f ∈ ℐ(V) + M_v^2, so ker D = ℐ(V) + M_v^2, completing the proof of the
proposition.

Recall that the dimension of a variety V is by definition the transcendence degree
of the field k(V) over k. Since each local ring O_{v,V} has k(V) as its field of fractions,
the dimension of V is determined by the transcendence degree over k of the field of
fractions of any of its local rings.

Definition. We say V is nonsingular at the point v ∈ V (or v is a nonsingular point of
V) if the dimension of the k-vector space T_{v,V} is dim V. Equivalently (by Proposition
52), v is a nonsingular point of V if dim_k(𝔪_{v,V}/𝔪_{v,V}^2) = dim V. Otherwise the point
v is called a singular point. The variety V is nonsingular or smooth if it is nonsingular
at every point.

The geometric picture is that at a nonsingular point v there are as many independent
tangents as one would expect: a tangent line on a curve, a tangent plane on a surface,
etc.

Whether a variety V is nonsingular at a point v can be determined from properties of
the local ring O_{v,V}, namely whether dim_k(𝔪_{v,V}/𝔪_{v,V}^2) = dim O_{v,V}. A local ring having
this property is said to be a regular local ring. In particular, the notion of singularity
does not depend on the embedding of V in a specific affine space. This algebraic
interpretation can be used to define smoothness for abstract algebraic varieties, where
the geometric intuition of tangent planes to surfaces (for example) is not as obvious.

If f_1, ..., f_m are generators for ℐ(V) defining V in A^n, then the dimension of V
can be determined from a Gröbner basis for ℐ(V) (cf. Exercise 29). Determining the
dimension of the tangent space T_{v,V} as a vector space over k is a linear algebra problem:
this vector space is the set of solutions of the m linear equations D_v(f_i)(x_1, ..., x_n) = 0.
If r is the rank of the m × n matrix of coefficients (∂f_i/∂x_j)(v) of this system of equations,
then T_{v,V} is a vector space of dimension n − r. Using this it is not too difficult to establish
the following:

1. We have dim V ≤ dim_k(T_{v,V}) ≤ n for every point v in V ⊆ A^n.

2. The set of singular points of V is a proper Zariski closed subset of V. The set
of nonsingular points of V is a nonempty open subset of V; in particular the
nonsingular points of V are dense in V (so “most” points of V are nonsingular).
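
The rank computation described above is easy to carry out by machine. The sketch below is a hedged illustration, not part of the text: it assumes the matrix of partial derivatives has already been evaluated at the point and has rational entries, and the names rank, tangent_dim and cusp_jacobian are hypothetical. The sample curve is the plane cubic defined by y^2 − x^3, chosen here as an assumed example of a curve with one singular point.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix with rational entries, by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def tangent_dim(jacobian_at_v, n):
    """Dimension n - r of the tangent space, r the rank of the matrix of partials."""
    return n - rank(jacobian_at_v)

def cusp_jacobian(v):
    """Partials of y^2 - x^3 at v = (x, y): one row, two columns."""
    x, y = v
    return [[-3 * x * x, 2 * y]]

print(tangent_dim(cusp_jacobian((1, 1)), 2))   # tangent line at a nonsingular point
print(tangent_dim(cusp_jacobian((0, 0)), 2))   # tangent space is all of the plane
```

At the origin the tangent space has dimension 2, larger than the dimension 1 of the curve, so the origin is a singular point; every other point of this curve gives rank 1.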

We also state without proof the following result which further relates the local geometry
of V to the algebraic properties of the local rings of V:

3. If v is a nonsingular point, then the local ring O_{v,V} is a Unique Factorization
Domain; in particular, O_{v,V} is integrally closed (cf. Example 3 following Corollary
25).

The variety V is said to be factorial if O_{v,V} is a U.F.D. for every point v ∈ V, and
is said to be a normal variety if O_{v,V} is integrally closed for every v ∈ V (which by
Proposition 49 is equivalent to k[V] being integrally closed). By (3) above we have

smooth varieties ⊆ factorial varieties ⊆ normal varieties.

In general each of the above containments is proper. In the case when V has dimension
1, i.e., V is an affine curve, however, these three properties are in fact equivalent: we
shall prove later that an irreducible affine curve is smooth if and only if it is normal or
factorial (cf. Corollary 13 in Section 16.2). It follows that over an algebraically closed
field k,

an irreducible affine curve C is smooth if and only if k[C] is integrally closed.

For any irreducible affine curve C the integral closure, S, of k[C] in k(C) is also the
coordinate ring of an irreducible affine curve C̃. Then S is integral over k[C] and,
by Theorem 30 and Corollary 27 it follows that there is a morphism from the smooth
curve C̃ onto C that has finite fibers. The curve C̃ is called the normalization or the
nonsingular model of C, and one can show that it is unique up to isomorphism. Note
how the existence of a smooth curve mapping finitely to C (a problem in “geometry”) is
solved by the existence of integral closures in ring extensions (a problem in “algebra”).

We shall give another characterization of smoothness for irreducible affine curves
at the end of Section 16.2.


EXERCISES 

As usual R is a commutative ring with 1 and D is a multiplicatively closed set in R. 

1. Suppose M is a finitely generated R-module. Prove that D^{-1}M = 0 if and only if dM = 0
for some d ∈ D.

2. Let I be an ideal in R, let D be a multiplicatively closed subset of R with ring of fractions
D^{-1}R, and let ^c(^eI) = ^eI ∩ R be the saturation of I with respect to D.

(a) Prove that ^c(^eI) = R if and only if ^eI = D^{-1}R if and only if I ∩ D ≠ ∅.

(b) Prove that I = ^c(^eI) is saturated if and only if for every d ∈ D, if da ∈ I then a ∈ I.

(c) Prove that extension and contraction define inverse bijections between the ideals of
R saturated with respect to D and the ideals of D^{-1}R.

(d) Let I = (2x, 3y) ⊆ Z[x, y]. Show the saturation of I with respect to Z − {0} is (x, y).

3. If I is an ideal in the commutative ring R let φ : R[x_1, ..., x_n] → (R/I)[x_1, ..., x_n] be
the ring homomorphism with kernel I[x_1, ..., x_n] given by reducing coefficients modulo
I. If Ā is an ideal in (R/I)[x_1, ..., x_n], let A denote the inverse image of Ā under φ.

(a) For any i ≥ 1 show that the inverse image under φ of the subring (R/I)[x_1, ..., x_i]
is R[x_1, ..., x_i] + I[x_1, ..., x_n].

(b) Prove that φ(A ∩ R[x_1, ..., x_i]) = Ā ∩ (R/I)[x_1, ..., x_i].

4. Let f = y^5 − z^4, viewed as a polynomial in y with coefficients in Q[z].

(a) Prove that f has no roots in Q[z].

(b) Suppose f = (y^2 + ay + b)(y^3 + cy^2 + dy + e). Show that a, b, c, d, e satisfy the
system of equations

a + c = 0, ac + b + d = 0, ad + bc + e = 0, ae + bd = 0, be − z^4 = 0.

Deduce that e^5 = z^12 and conclude that f is irreducible in Q[y, z]. [Use elimination.]

5. Suppose R is a U.F.D. with field of fractions F and p ∈ R[x] is a monic polynomial.

(a) Show that the ideal pR[x] generated by p in R[x] is prime if and only if the ideal
pF[x] generated by p in F[x] is prime. [Use Gauss’ Lemma.]

(b) Show that pR[x] is saturated, i.e., that pF[x] ∩ R[x] = pR[x].

6. Show that I = (y^3 − xz, xy^2 − z^2) is not a prime ideal in Q[x, y, z] and find explicit
elements a, b ∈ Q[x, y, z] with ab ∈ I but a ∉ I and b ∉ I.

7. Show that P = (y^3 − xz, xy^2 − z^2, x^2 − yz) is a prime ideal in Q[x, y, z].

8. Show that P = (x^2 − yz, w^2 − x^4 z) is a prime ideal in Q[x, y, z, w].

9. Show that P = (xz^2 − w^3, xw^2 − y^4, y^4 z^2 − w^5) is a prime ideal in Q[x, y, z, w].

10. Show that I = (xy − w^3, y^2 − zw) is not a prime ideal in Q[x, y, z, w] and find a, b with
ab ∈ I but a, b ∉ I.

11. Let R_P be the localization of R at the prime P. Prove that if Q is a P-primary ideal of R
then Q = ^c(^eQ) with respect to the extension and contraction of Q to R_P. Show the same
result holds if Q is P'-primary for some prime P' contained in P.

12. Let R = ℝ[x, y, z]/(xy − z^2), let P = (x̄, z̄) be the prime ideal generated by the images
of x and z in R, and let R_P be the localization of R at P. Prove that P^2 R_P ∩ R = (x̄) and
is strictly larger than P^2.

13. Prove that if N and N' are two R-submodules of an R-module M with N_P = N'_P in
the localization M_P for every prime ideal P of R (or just for every maximal ideal) then
N = N'.

14. Suppose φ : M → N is an R-module homomorphism. Prove that φ is injective
(respectively, surjective) if and only if the induced R_P-module homomorphism φ : M_P → N_P is
injective (respectively, surjective) for every prime ideal P of R (or just for every maximal
ideal of R).

15. Let R = Z[√−5] be the ring of integers in the quadratic field Q(√−5) and let I be the
prime ideal (2, 1 + √−5) of R generated by 2 and 1 + √−5 (cf. Exercise 5, Section 8.2).
Recall that every nonzero prime ideal P of R contains a prime p ∈ Z.

(a) If P is a prime ideal of R not containing 2 prove that I_P = R_P.

(b) If P is a prime ideal of R containing 2 prove that P = I and that I_P = (1 + √−5)R_P.

(c) Prove that I_P ≅ R_P as R_P-modules for every prime ideal P of R but that I and R are
not isomorphic R-modules. (This example shows that it is important in Exercise 14
to be given the R-module homomorphism φ.) [Observe that I ≅ R as R-modules if
and only if I is a principal ideal.]

16. Prove that localization commutes with tensor products: there is a unique isomorphism of
D^{-1}R-modules φ : (D^{-1}M) ⊗_{D^{-1}R} (D^{-1}N) ≅ D^{-1}(M ⊗_R N) with φ((m/d) ⊗ (n/d'))
given by (m ⊗ n)/dd' for any R-modules M, N, and multiplicatively closed set D in R.

17. Prove that the R-module A is a flat R-module if and only if A_P is a flat R_P-module for
every prime ideal P of R (or just for every maximal ideal of R). [Use Proposition 41,
Exercises 14 and 16, and the exactness properties of localization.]

18. In the notation of Example 2 following Corollary 37, prove that R_f ≅ R[x]/(fx − 1)
if f is not nilpotent in R. [Show that the map φ : R[x] → R_f defined by φ(r) = r/1
and φ(x) = 1/f gives a surjective ring homomorphism and the universal property in
Theorem 36 gives an inverse.]

19. Prove that if R is an integrally closed integral domain and D is any multiplicatively closed
subset of R containing 1, then D^{-1}R is integrally closed.

20. Suppose that R is a subring of the ring S with 1 ∈ R and that S is integral over R. If D is
any multiplicatively closed subset of R, prove that D^{-1}S is integral over D^{-1}R.

21. Suppose φ : R → S is a ring homomorphism and D' is a multiplicatively closed subset of
S. Let D = φ^{-1}(D'). Prove that D is a multiplicatively closed subset of R and that the
map φ' : D^{-1}R → D'^{-1}S given by φ'(r/d) = φ(r)/φ(d) is a ring homomorphism.

22. Suppose P ⊆ Q are prime ideals in R and let R_Q be the localization of R at Q. Prove
that the localization R_P is isomorphic to the localization of R_Q at the prime ideal PR_Q
(cf. the preceding exercise).

23. Let φ : A → B be a homomorphism of commutative rings with φ(1_A) = 1_B, and let P be
a prime ideal of A. Let contraction and extension of ideals with respect to φ be denoted
by superscripts c and e respectively. Prove that P is the contraction of a prime ideal in B
if and only if P = (P^e)^c. [Localize B at φ(A − P).]

24. (The Going-down Theorem) Let S be an integral domain, let R be an integrally closed
subring of S containing 1_S, and let k be the field of fractions of R. Suppose that P_2 ⊆ P_1
are prime ideals in R and that Q_1 is a prime ideal in S with Q_1 ∩ R = P_1. Let S_{Q_1} be the
localization of S at Q_1.

(a) Show that P_2 ⊆ P_2 S_{Q_1} ∩ R.

(b) Suppose that a ∈ P_2 S_{Q_1} ∩ R and write a = s/d with s ∈ P_2 S and d ∈ S, d ∉ Q_1.
If the minimal polynomial of s over k is x^n + a_{n−1}x^{n−1} + ⋯ + a_1 x + a_0 with
a_0, ..., a_{n−1} ∈ P_2 (cf. Exercise 12 in Section 3) show that the minimal polynomial
of d over k is x^n + b_{n−1}x^{n−1} + ⋯ + b_1 x + b_0 where b_i = a_i/a^{n−i} and conclude that
b_i ∈ R. [Use Exercise 10 in Section 3.]

(c) Show that a ∈ P_2 and conclude that P_2 S_{Q_1} ∩ R = P_2. [Show a ∉ P_2 implies b_i ∈ P_2
for i = 0, 1, ..., n − 1, which would imply d^n ∈ P_2 S ⊆ P_1 S ⊆ Q_1 and so d ∈ Q_1.]

(d) Prove that P_2 S_{Q_1} is contained in a prime ideal P of S_{Q_1} with P ∩ R = P_2. [Use (c)
and the previous exercise for φ : R → S_{Q_1}.]

(e) Let Q_2 = P ∩ S. Prove that Q_2 ⊆ Q_1 and that Q_2 ∩ R = P_2.

(f) Use induction together with the previous result to prove the Going-down Theorem
(Theorem 26(4)).

25. Let k be an algebraically closed field and let V = Z(xz − yw) ⊆ A^4. Prove that the set
of points v where f = x/y ∈ k(V) is regular is precisely the set of points (x, y, z, w)
where y ≠ 0 or z ≠ 0. [If f = a/b show that ay − bx ∈ (xz − yw) as polynomials in
k[x, y, z, w] and conclude that b ∈ (y, z).] Prove that there is no function a/b ∈ k(V)
with b(v) ≠ 0 for every v where f is regular.

26. (Differentials of Morphisms) Let φ : V → W be a morphism of affine varieties over the
algebraically closed field k and suppose φ(v) = w.

(a) Show that φ induces a linear map from the k-vector space M_w/M_w^2 to the k-vector
space M_v/M_v^2, and use this to show that φ induces a linear map dφ (called the
differential of φ) from the k-vector space T_{v,V} to the k-vector space T_{w,W}.

(b) Prove that if V ⊆ A^n, W ⊆ A^m and φ = (F_1(x_1, ..., x_n), ..., F_m(x_1, ..., x_n)) then
dφ : T_{v,V} → T_{w,W} is given explicitly by

(dφ)(a_1, ..., a_n) = (D_v(F_1)(a_1, ..., a_n), ..., D_v(F_m)(a_1, ..., a_n)).

[If g = g(y_1, ..., y_m) show that the chain rule implies

(∂(g ∘ φ)/∂x_i)(v) = Σ_{j=1}^{m} (∂g/∂y_j)(w) (∂F_j/∂x_i)(v),

so that D_v(g ∘ φ)(a_1, ..., a_n) = D_w(g)(b_1, ..., b_m) where b_j = D_v(F_j)(a_1, ..., a_n).
Then use the fact that g ∘ φ ∈ ℐ(V) if g ∈ ℐ(W).]

(c) If ψ : U → V is another morphism with ψ(u) = v, prove that the associated
d(φ ∘ ψ) : T_{u,U} → T_{w,W} is the same as dφ ∘ dψ.

(d) Prove that if φ is an isomorphism then dφ is a vector space isomorphism from T_{v,V}
to T_{w,W} for every φ(v) = w.

27. Let V = A^1 and W = Z(xz − y^2, yz − x^3, z^2 − x^2 y) ⊆ A^3. Let φ : V → W be the
surjective morphism φ(t) = (t^3, t^4, t^5) (cf. Exercise 26 in Section 1). For each t ∈ A^1
describe the differential dφ : T_{t,A^1} → T_{(t^3,t^4,t^5),W} in the previous exercise explicitly; in
particular prove that dφ is an isomorphism of vector spaces for all t ≠ 0 and is the zero
map for t = 0. Use this to prove that V and W are not isomorphic.

28. If k is a field, the quotient k[x]/(x^2) is called the ring of dual numbers over k. If V is an
affine algebraic set over k, show that a k-algebra homomorphism from k[V] to k[x]/(x^2)
is equivalent to specifying a point v ∈ V with O_{v,V}/𝔪_{v,V} = k (called a k-rational point
of V) together with an element in the tangent space T_{v,V} of V at v.

29. (Computing the dimension of a variety) Let P be a prime ideal in k[x_1, ..., x_n], set P_0 = 0
and let P_i = P ∩ k[x_1, ..., x_i]. Define the varieties V_i = Z(P_i) ⊆ A^i with V_0 the zero
dimensional variety consisting of a single point and coordinate ring k.

(a) Show that dim V_{i−1} ≤ dim V_i ≤ dim V_{i−1} + 1. [First exhibit an injection from k[V_{i−1}]
into k[V_i]; then show that k[V_i] is a k-algebra generated by k[V_{i−1}] and one additional
generator.]

(b) If the ideal generated by P_{i−1} in k[x_1, ..., x_i] equals P_i, show that V_i = V_{i−1} × A^1
and deduce that dim V_i = dim V_{i−1} + 1.

(c) If the ideal generated by P_{i−1} in k[x_1, ..., x_i] is properly contained in P_i, show that
dim V_i = dim V_{i−1}.

(d) Show that dim V equals the number of i ∈ {1, 2, ..., n} such that the ideal generated
by P_{i−1} in k[x_1, ..., x_i] equals the ideal P_i. Deduce that if G is the reduced Gröbner
basis for P with respect to the lexicographic monomial ordering x_n > ⋯ > x_1 and
G_i = G ∩ k[x_1, ..., x_i] where G_0 = ∅, and N is the number of i with G_i ≠ G_{i−1} for
1 ≤ i ≤ n, then dim V = n − N.

The following eleven exercises introduce the notion of the support of an $R$-module $M$ and its
relation to the associated primes of $M$. Cf. also Exercises 29 to 35 in Section 1 and Exercises
25 to 30 in Section 5.

Definition. If $M$ is an $R$-module, then the set of prime ideals $P$ of $R$ for which the localization
$M_P$ is nonzero is called the support of $M$, denoted $\operatorname{Supp}(M)$.

30. Prove that $M = 0$ if and only if $\operatorname{Supp}(M) = \emptyset$. [Use Proposition 47.]

31. If $0 \to L \to M \to N \to 0$ is an exact sequence of $R$-modules, prove that the localization
$M_P$ is nonzero if and only if one of the localizations $N_P$ and $L_P$ is nonzero and deduce
that $\operatorname{Supp}(M) = \operatorname{Supp}(L) \cup \operatorname{Supp}(N)$. In particular, if $M = M_1 \oplus \cdots \oplus M_n$ prove that
$\operatorname{Supp}(M) = \operatorname{Supp}(M_1) \cup \cdots \cup \operatorname{Supp}(M_n)$.

32. Suppose $P \subseteq Q$ are prime ideals in $R$ and that $M$ is an $R$-module. Prove that the
localization of the $R$-module $M_Q$ at $P$ is the localization $M_P$, i.e., $(M_Q)_P = M_P$. [Argue
directly, or use Proposition 41 and the associativity of the tensor product.]

33. Suppose $P \subseteq Q$ are prime ideals in $R$ and that $M$ is an $R$-module. Prove that if
$P \in \operatorname{Supp}(M)$ then $Q \in \operatorname{Supp}(M)$. [Use the previous exercise.]

34. (a) Suppose $M = Rm$ is a cyclic $R$-module. Prove that $M_P = 0$ if and only if there is
an element $r \in R$, $r \notin P$ with $rm = 0$. Deduce that $P \in \operatorname{Supp}(M)$ if and only if $P$
contains the annihilator of $m$ in $R$ (cf. Exercise 10 in Section 10.1).

(b) If $M = Rm_1 + \cdots + Rm_n$ is a finitely generated $R$-module prove that $P \in \operatorname{Supp}(M)$ if
and only if $P$ is contained in $\operatorname{Supp}(Rm_i)$ for some $i = 1, \dots, n$. [Use Proposition 42.]
Deduce that $P \in \operatorname{Supp}(M)$ if and only if $P$ contains the annihilator $\operatorname{Ann}(M)$ of $M$ in
$R$. [Note $\operatorname{Ann}(M) = \bigcap_{i=1}^{n} \operatorname{Ann}(Rm_i)$, then use (a) and Exercise 11 of Section 7.4.]

35. Suppose $P$ is a prime ideal of $R$ with $P \cap D = \emptyset$. Prove that if $P \in \operatorname{Ass}_R(M)$ then
$D^{-1}P \in \operatorname{Ass}_{D^{-1}R}(D^{-1}M)$. [Use Proposition 38(3) and Proposition 42.]

36. Suppose $D^{-1}P \in \operatorname{Ass}_{D^{-1}R}(D^{-1}M)$ where $P = (a_1, \dots, a_n)$ is a finitely generated prime
ideal in $R$ with $P \cap D = \emptyset$.

(a) Suppose $m/d \in D^{-1}M$ has annihilator $D^{-1}P$ in $D^{-1}R$. Show that $d_i a_i m = 0 \in M$
for some $d_1, \dots, d_n \in D$.

(b) Let $d' = d_1 d_2 \cdots d_n$. Show that $P = \operatorname{Ann}(d'm)$ and conclude that $P \in \operatorname{Ass}_R(M)$.
[The inclusion $P \subseteq \operatorname{Ann}(d'm)$ is immediate. For the reverse inclusion, show that
$b \in \operatorname{Ann}(d'm)$ implies that $b/1$ annihilates $m/d$ in $D^{-1}M$, hence $b/1 \in D^{-1}P$, and
conclude $b \in P$.]

37. Suppose $M$ is a module over the Noetherian ring $R$. Use the previous two exercises to
show that under the bijection of Proposition 38(3) the prime ideals $P$ of $\operatorname{Ass}_R(M)$ with
$P \cap D = \emptyset$ correspond bijectively with the prime ideals of $\operatorname{Ass}_{D^{-1}R}(D^{-1}M)$.

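38. Suppose $M$ is a module over the Noetherian ring $R$ and $D$ is a multiplicatively closed subset
of $R$. Let $\mathcal{S}$ be the subset of prime ideals $P$ in $\operatorname{Ass}_R(M)$ with $P \cap D \neq \emptyset$. This exercise
proves that the kernel $N$ of the localization map $M \to D^{-1}M$ is the unique submodule $N$
of $M$ with $\operatorname{Ass}_R(N) = \mathcal{S}$ and $\operatorname{Ass}_R(M/N) = \operatorname{Ass}_R(M) - \mathcal{S}$.

(a) If $N'$ is a submodule of $M$ with $\operatorname{Ass}_R(N') = \mathcal{S}$ and $\operatorname{Ass}_R(M/N') = \operatorname{Ass}_R(M) - \mathcal{S}$ as
in Exercise 35 in Section 1, prove that the diagram

$$\begin{array}{ccc}
M & \xrightarrow{\ \pi\ } & M/N' \\
{\scriptstyle\varphi}\downarrow & & \downarrow{\scriptstyle\varphi'} \\
D^{-1}M & \xrightarrow{\ \pi'\ } & D^{-1}(M/N')
\end{array}$$

is commutative, where $\pi$ and $\pi'$ are the natural projections (cf. Proposition 42(6))
and $\varphi$, $\varphi'$ are the localization homomorphisms.

(b) Show that $\operatorname{Ass}_{D^{-1}R}(D^{-1}N') = \emptyset$ and conclude that $D^{-1}N' = 0$ and that $\pi'$ is injective.
[Use the previous exercise, the definition of $\mathcal{S}$, and Exercise 34 in Section 1.]

(c) If $x$ is in the kernel $K$ of $\varphi'$ show that $\operatorname{Ann}(x) \cap D \neq \emptyset$ and that $\operatorname{Ass}_R(K) \subseteq \mathcal{S}$. Show
that $\operatorname{Ass}_R(K) \subseteq \operatorname{Ass}_R(M/N')$ implies that $\operatorname{Ass}_R(K) = \emptyset$, and deduce that $K = 0$.

(d) Prove $\varphi$ and $\pi$ have the same kernel, i.e., $N = N'$, and this submodule of $M$ is unique.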

The next two exercises establish a fundamental relation between the sets $\operatorname{Ass}_R(M)$ and $\operatorname{Supp}(M)$
of prime ideals related to the $R$-module $M$.

39. Prove that $\operatorname{Ass}_R(M) \subseteq \operatorname{Supp}(M)$. [If $Rm \cong R/P$ use Proposition 42(4) and Proposition
46(1) to show that $0 \neq (Rm)_P \subseteq M_P$.]

40. Suppose that $R$ is Noetherian and $M$ is an $R$-module.

(a) If $P \in \operatorname{Supp}(M)$ prove that $P$ contains a prime ideal $Q$ with $Q \in \operatorname{Ass}_R(M)$.

(b) If $P$ is a minimal prime in $\operatorname{Supp}(M)$, show that $P \in \operatorname{Ass}_R(M)$. [Use Exercise 33 in
Section 1 to show that $\operatorname{Ass}_{R_P}(M_P) \neq \emptyset$ and then use Exercise 37.]

(c) Conclude that $\operatorname{Ass}_R(M) \subseteq \operatorname{Supp}(M)$ and that these two sets have the same minimal
elements.

15.5 THE PRIME SPECTRUM OF A RING


Throughout this section the term “ring” will mean commutative ring with 1 and all ring
homomorphisms $\varphi : R \to S$ will be assumed to map $1_R$ to $1_S$.

We have seen that most of the geometric properties of affine algebraic sets $V$ over
$k$ can be translated into algebraic properties of the associated coordinate rings $k[V]$
of $k$-valued functions on $V$. For example, the morphisms from $V$ to $W$ correspond to
$k$-algebra homomorphisms from $k[W]$ to $k[V]$. When the field $k$ is algebraically
closed this translation is particularly precise: Hilbert’s Nullstellensatz establishes
a bijection between the points $v$ of $V$ and the maximal ideals $M = \mathcal{I}(v)$ of $k[V]$, and if
$\varphi : V \to W$ is a morphism then $\varphi(v) \in W$ corresponds to the maximal ideal $\tilde\varphi^{-1}(M)$
in $k[W]$. In this development we have generally started with geometric properties of the
affine algebraic sets and then seen that many of the algebraic properties common to the
associated coordinate rings can be defined for arbitrary commutative rings. Suppose
now we try to reverse this, namely start with a general commutative ring as the algebraic
object and attempt to define a corresponding “geometric” object by analogy with $k[V]$
and $V$.

Given a commutative ring $R$, perhaps the most natural analogy with $k[V]$ and $V$
would suggest defining the collection of maximal ideals $M$ of $R$ as the “points” of the
associated geometric object. Under this definition, if $\varphi : R' \to R$ is a ring homomorphism,
then $\varphi^{-1}(M)$ should correspond to the maximal ideal $M$. Unfortunately, the
inverse image of a maximal ideal by a ring homomorphism in general need not be a
maximal ideal: under the inclusion $\mathbb{Z} \to \mathbb{Q}$, the inverse image of the maximal ideal $(0)$
of $\mathbb{Q}$ is the ideal $(0)$ of $\mathbb{Z}$, which is prime but not maximal. Since the inverse image of a
prime ideal under a ring homomorphism (that maps 1 to 1) is prime, this suggests that a
better definition might include the prime ideals of $R$. This leads to the following:

Definition. Let R be a commutative ring with 1. The spectrum or prime spectrum of 
R , denoted Spec R, is the set of all prime ideals of R. The set of all maximal ideals of 
R, denoted mSpec R , is called the maximal spectrum of R. 

Examples 

(1) If R is a field then Spec R = mSpec R = {(0)}. 

(2) The points in Spec $\mathbb{Z}$ are the prime ideal $(0)$ and the prime ideals $(p)$ where $p > 0$ is
a prime, and mSpec $\mathbb{Z}$ consists of all the prime ideals of Spec $\mathbb{Z}$ except $(0)$.

(3) The elements of Spec $\mathbb{Z}[x]$ are the following:

(a) $(0)$

(b) $(p)$ where $p$ is a prime in $\mathbb{Z}$

(c) $(f)$ where $f \neq 1$ is a polynomial of content 1 (i.e., the g.c.d. of its coefficients is
equal to 1) that is irreducible in $\mathbb{Q}[x]$

(d) $(p, g)$ where $p$ is a prime in $\mathbb{Z}$ and $g$ is a monic polynomial that is irreducible
mod $p$.

The elements of mSpec $\mathbb{Z}[x]$ are the primes in (d) above.

In the analogy with $k[V]$ and $V$ when $k$ is algebraically closed, the elements $f \in k[V]$
are functions on $V$ with values in $k$, obtained by evaluating $f$ at the point $v$ in
$V$. Note that “evaluation at $v$” defines a homomorphism from $k[V]$ to $k$ with kernel
$\mathcal{I}(v)$, and that the value of $f$ at $v$ is the element of $k$ representing $f$ in the quotient
$k[V]/\mathcal{I}(v) = k$. Put another way, the value of $f \in k[V]$ at $v \in V$ can be viewed as the
element $\bar f \in k[V]/\mathcal{I}(v) = k$. A similar definition can be made in general:

Definition. If $f \in R$ then the value of $f$ at the point $P \in \operatorname{Spec} R$ is the element
$f(P) = \bar f \in R/P$.

Note that the values of $f$ at different points $P$ in general lie in different integral domains.
Note also that in general $f \in R$ is not uniquely determined by its values, rather $f$ is
determined only up to an element in the nilradical of $R$ (cf. Exercise 3).
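As a small illustration of the last remark, the following sketch works in the ring $R = \mathbb{Z}/12\mathbb{Z}$ (a choice made here for illustration; it is not an example from the text). Its prime ideals are $(2)$ and $(3)$, so the value of $f$ at these points is $f \bmod 2$ and $f \bmod 3$ respectively. The helper names `prime_divisors`, `values` and `is_nilpotent` are hypothetical.

```python
def prime_divisors(n):
    """Primes dividing the integer n >= 2, found by trial division."""
    primes, p = [], 2
    while p * p <= n:
        if n % p == 0:
            primes.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.append(n)
    return primes


def values(f, n):
    """Values of f in Z/nZ at each point (p) of Spec Z/nZ, as f mod p in Z/pZ."""
    return {p: f % p for p in prime_divisors(n)}


def is_nilpotent(f, n):
    """f is nilpotent in Z/nZ exactly when some power of f is 0 mod n.

    The exponent n is always large enough, since the multiplicity of
    any prime in n is at most log2(n) <= n.
    """
    return pow(f, n, n) == 0


# 1 and 7 differ by the nilpotent element 6, so they take the same values.
print(values(1, 12), values(7, 12), is_nilpotent(7 - 1, 12))
```

Here 1 and 7 are different elements of $\mathbb{Z}/12\mathbb{Z}$ with the same value at every point of the spectrum, because their difference 6 satisfies $6^2 = 0$.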

There are analogues of the maps $\mathcal{Z}$ and $\mathcal{I}$ and also for the Zariski topology. For
any subset $A$ of $R$ define

$$\mathcal{Z}(A) = \{P \in X \mid A \subseteq P\} \subseteq \operatorname{Spec} R,$$

the collection of prime ideals containing $A$. It is immediate that $\mathcal{Z}(A) = \mathcal{Z}(I)$, where
$I = (A)$ is the ideal generated by $A$, so there is no loss simply in considering $\mathcal{Z}(I)$
where $I$ is an ideal of $R$. Note that, by definition, $P \in \mathcal{Z}(I)$ if and only if $I \subseteq P$,
which occurs if and only if $f \in P$ for every $f \in I$. Viewing $f \in R$ as a function on
Spec $R$ as above, this says that $P \in \mathcal{Z}(I)$ if and only if $f(P) = f \bmod P = 0 \in R/P$
for all $f \in I$. In this sense, $\mathcal{Z}(I)$ consists of the points in Spec $R$ at which all the
functions in $I$ have the value 0.

For any subset $Y$ of Spec $R$ define

$$\mathcal{I}(Y) = \bigcap_{P \in Y} P,$$

the intersection of the prime ideals in $Y$.

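Proposition 53. Let $R$ be a commutative ring with 1. The maps $\mathcal{Z}$ and $\mathcal{I}$ between $R$
and Spec $R$ defined above satisfy

(1) for any ideal $I$ of $R$, $\mathcal{Z}(I) = \mathcal{Z}(\operatorname{rad}(I)) = \mathcal{Z}(\mathcal{I}(\mathcal{Z}(I)))$, and $\mathcal{I}(\mathcal{Z}(I)) = \operatorname{rad} I$,

(2) for any ideals $I$, $J$ of $R$, $\mathcal{Z}(I \cap J) = \mathcal{Z}(IJ) = \mathcal{Z}(I) \cup \mathcal{Z}(J)$, and

(3) if $\{I_j\}$ is an arbitrary collection of ideals of $R$, then $\mathcal{Z}(\cup I_j) = \cap \mathcal{Z}(I_j)$.

Proof: If $P$ is a prime ideal containing the ideal $I$ then $P$ contains $\operatorname{rad} I$ (Exercise
8, Section 2), which implies $\mathcal{Z}(I) = \mathcal{Z}(\operatorname{rad}(I))$. Since $\operatorname{rad} I$ is the intersection of all the
prime ideals containing $I$ (Proposition 12), the definition of $\mathcal{I}(\mathcal{Z}(I))$ gives $\mathcal{Z}(\operatorname{rad}(I)) =
\mathcal{Z}(\mathcal{I}(\mathcal{Z}(I)))$. Similarly,

$$\mathcal{I}(\mathcal{Z}(I)) = \bigcap_{P \in \mathcal{Z}(I)} P = \bigcap_{I \subseteq P} P = \operatorname{rad} I,$$

which completes the proof of (1). Since $IJ \subseteq I \cap J$ and $I \cap J$ is contained in both $I$ and $J$,
it is immediate that $\mathcal{Z}(I) \cup \mathcal{Z}(J) \subseteq \mathcal{Z}(I \cap J) \subseteq \mathcal{Z}(IJ)$.
Suppose the prime ideal $P$ contains $IJ$. If $P$ does not contain $I$ then there is some
element $i \in I$ with $i \notin P$. Since $iJ \subseteq P$, it follows that $J \subseteq P$. This proves
$\mathcal{Z}(IJ) \subseteq \mathcal{Z}(I) \cup \mathcal{Z}(J)$, so the three sets are equal, which completes the proof of (2).
The proof of (3) is immediate.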
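For a concrete check of statements (1) and (2), one can take $R = \mathbb{Z}$, where every nonzero ideal is $(n)$ and the primes containing $(n)$ are the $(p)$ with $p$ dividing $n$. The sketch below is only an illustration under that assumption; the function names `Z` and `rad` are chosen here to mirror the notation and are not from the text.

```python
from math import gcd


def Z(n):
    """The closed set Z((n)) in Spec Z for n >= 2, as the set of primes p with p | n."""
    primes, p = set(), 2
    while p * p <= n:
        if n % p == 0:
            primes.add(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        primes.add(n)
    return primes


def rad(n):
    """Generator of rad((n)): the product of the distinct primes dividing n."""
    r = 1
    for p in Z(n):
        r *= p
    return r


i, j = 4, 6                       # the ideals I = (4) and J = (6)
meet = i * j // gcd(i, j)         # I ∩ J is generated by lcm(4, 6) = 12
print(Z(12) == Z(rad(12)))        # statement (1): Z(I) = Z(rad I)
print(Z(meet) == Z(i * j) == Z(i) | Z(j))   # statement (2)
```

Both comparisons come out true here: $(12)$ and $(24)$ have the same prime divisors 2 and 3, which are exactly the primes dividing 4 or 6.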

The first statement in the proposition shows that every set $\mathcal{Z}(I)$ in Spec $R$ occurs
for some radical ideal $I$, and since $\mathcal{I}(\mathcal{Z}(I)) = \operatorname{rad} I$, this radical ideal is unique.
The second two statements in the proposition show that the collection

$$\mathcal{T} = \{\mathcal{Z}(I) \mid I \text{ is an ideal of } R\}$$

satisfies the three axioms for the closed sets of a topology on Spec $R$ as in Section 2.

Definition. The topology on Spec $R$ defined by the closed sets $\mathcal{Z}(I)$ for the ideals $I$
of $R$ is called the Zariski topology on Spec $R$.


By definition, the closure in the Zariski topology of the singleton set {P} in Spec R 
consists of all the prime ideals of R that contain P. In particular, a point P in Spec R 
is closed in the Zariski topology if and only if the prime ideal P is not contained in any 
other prime ideals of R, i.e., if and only if P is a maximal ideal (so the Zariski topology
on Spec R is not generally Hausdorff). These points are given a name: 


Definition. The maximal ideals of R are called the closed points in Spec R. 


In terms of the terminology above, the points in Spec R that are closed in the Zariski 
topology are precisely the points in mSpec R . 

A closed subset of a topological space is irreducible if it is not the union of two 
proper closed subsets, or, equivalently, if every nonempty open set is dense. Arguments 
similar to those used to prove Proposition 17 show that the closed subset $Y = \mathcal{Z}(I)$ in
Spec $R$ is irreducible if and only if $\mathcal{I}(Y) = \operatorname{rad} I$ is prime (cf. Exercise 16).

The following proposition summarizes some of these results: 


Proposition 54. The maps $\mathcal{Z}$ and $\mathcal{I}$ define inverse bijections

$$\{\text{Zariski closed subsets of } \operatorname{Spec} R\} \ \underset{\mathcal{Z}}{\overset{\mathcal{I}}{\rightleftarrows}}\ \{\text{radical ideals of } R\}.$$

Under this correspondence the closed points in Spec $R$ correspond to the maximal ideals
in $R$, and the irreducible subsets of Spec $R$ correspond to the prime ideals in $R$.

Examples 

(1) If $X = \operatorname{Spec} \mathbb{Z}$ then $X$ is irreducible and the nonzero primes give closed points in $X$.
The point $(0)$ is not a closed point, in fact the closure of $(0)$ is all of $X$, i.e., $(0)$ is dense
in Spec $\mathbb{Z}$. For this reason the element $(0)$ is called a generic point in Spec $\mathbb{Z}$.

Since every ideal of $\mathbb{Z}$ is principal, the Zariski closed sets in Spec $\mathbb{Z}$ are $\emptyset$, Spec $\mathbb{Z}$
and any finite set of nonzero prime ideals in $\mathbb{Z}$.

(2) Suppose $X = \operatorname{Spec} \mathbb{Z}[x]$ as in Example 3 previously. For each integer prime $p$ the
Zariski closure of the element $(p) \in X$ consists of $(p)$ together with the maximal ideals
$(p, g)$ of type (d). Likewise for each $\mathbb{Q}$-irreducible polynomial $f$ of type (c), the Zariski
closure of the element $(f)$ consists of $(f)$ together with the prime ideals $(p, g)$ of type (d)
where $g$ is some divisor of $f$ in $\mathbb{Z}/p\mathbb{Z}[x]$.

Example: (Affine $k$-algebras)

Suppose $R = k[V]$ is the coordinate ring of some affine algebraic set $V \subseteq \mathbb{A}^n$ over an
algebraically closed field $k$. Then $R = k[x_1, \dots, x_n]/\mathcal{I}(V)$ where $\mathcal{I}(V)$ is a radical ideal
in $k[x_1, \dots, x_n]$. In particular $R$ is a finitely generated $k$-algebra and since $\mathcal{I}(V)$ is radical,
$R$ contains no nonzero nilpotent elements.

Definition. A finitely generated algebra over an algebraically closed field $k$ having no
nonzero nilpotent elements is called an affine $k$-algebra.

If $R$ is an affine $k$-algebra, then by Corollary 5 there is a surjective $k$-algebra homomorphism
$\pi : k[x_1, \dots, x_n] \to R$ whose kernel $I = \ker \pi$ must be a radical ideal since $R$ has no
nonzero nilpotent elements. Let $V = \mathcal{Z}(I) \subseteq \mathbb{A}^n$. Then $R = k[x_1, \dots, x_n]/I \cong k[V]$ is
the coordinate ring of an affine algebraic set over $k$. Hence affine $k$-algebras are precisely
the rings arising as the rings of functions on affine algebraic sets over algebraically closed
fields.

By the Nullstellensatz, the points of mSpec $R$ are in bijective correspondence with $V$,
and the points of Spec $R$ are in bijective correspondence with the subvarieties of $V$. By
Theorem 6, morphisms between two affine algebraic sets correspond bijectively with
($k$-algebra) homomorphisms of affine $k$-algebras. In the language of categories these results
show that over an algebraically closed field $k$ there is an equivalence of categories

$$\left\{\begin{array}{c}\text{affine algebraic sets}\\ \text{morphisms of algebraic sets}\end{array}\right\}
\longleftrightarrow
\left\{\begin{array}{c}\text{affine } k\text{-algebras}\\ k\text{-algebra homomorphisms}\end{array}\right\}$$

The map from left to right sends the affine algebraic set $V$ to its coordinate ring $k[V]$. The
map from right to left sends the affine $k$-algebra $R$ to mSpec $R$. The pair (mSpec $R$, $R$) is
sometimes called the canonical model of the affine $k$-algebra $R$.

Over an algebraically closed field $k$, a $k$-algebra homomorphism $\varphi : R \to S$
between two affine $k$-algebras as in the previous example has the property (by the
Nullstellensatz) that the inverse image of a maximal ideal in $S$ is a maximal ideal
in $R$. As previously mentioned, one reason for considering Spec $R$ rather than just
mSpec $R$ for more general rings is that inverse images of maximal ideals under ring
homomorphisms are not in general maximal ideals. When $R$ is an affine $k$-algebra
corresponding to an affine algebraic set $V$, the space Spec $R$ contains not only the
“geometric points” of $V$ (in the form of the closed points in Spec $R$), but also the
non-closed points corresponding to all of the subvarieties of $V$ (in the form of the non-closed
points in Spec $R$, i.e., the prime ideals $P$ of $R$ that are not maximal).

In general, if $\varphi : R \to S$ is a ring homomorphism mapping $1_R$ to $1_S$ and $P$ is a prime
ideal in $S$ then $\varphi^{-1}(P)$ is a prime ideal in $R$. This defines a map $\varphi^* : \operatorname{Spec} S \to \operatorname{Spec} R$
with $\varphi^*(P) = \varphi^{-1}(P)$. If $\mathcal{Z}(I) \subseteq \operatorname{Spec} R$ is a Zariski closed subset of Spec $R$, then
it is easy to show that $(\varphi^*)^{-1}(\mathcal{Z}(I))$ is the Zariski closed subset $\mathcal{Z}(\varphi(I)S)$ defined by
the ideal generated by $\varphi(I)$ in $S$. Since the inverse image of a closed subset in Spec $R$
is a closed subset in Spec $S$, the induced map $\varphi^*$ is continuous in the Zariski topology.
This proves the following proposition.

Proposition 55. Every ring homomorphism $\varphi : R \to S$ mapping $1_R$ to $1_S$ induces a
map $\varphi^* : \operatorname{Spec} S \to \operatorname{Spec} R$ that is continuous with respect to the Zariski topologies on
Spec $R$ and Spec $S$.

While the generalization from affine algebraic sets to Spec R for general rings R has
made matters slightly more complicated, there are (at least) two very important benefits
gained by this more general setting. The first is that Spec $R$ can be considered even for
commutative rings $R$ containing nilpotent elements; the second is that $R$ need not
be a $k$-algebra for any field $k$, and even when it is, the field $k$ need not be algebraically
closed. The fact that many of the properties found in the situation of affine $k$-algebras
hold in more general settings then allows the application of “geometric” ideas to these
situations (for example, to Spec $R$ when $R$ is finite).


Examples 

(1) The natural inclusion $\varphi : \mathbb{Z} \to \mathbb{Z}[i]$ induces a map $\varphi^* : \operatorname{Spec} \mathbb{Z}[i] \to \operatorname{Spec} \mathbb{Z}$. The fiber
of $\varphi^*$ over the nonzero prime $P$ in $\mathbb{Z}$ consists of the prime ideals of $\mathbb{Z}[i]$ containing $P$.
If $P = (p)$ where $p = 2$ or $p$ is a prime congruent to 3 mod 4, then there is only one
element in this fiber; if $p$ is a prime congruent to 1 mod 4, then there are two elements
in the fiber: the primes $(\pi)$ and $(\pi')$ where $p = \pi\pi'$ in $\mathbb{Z}[i]$, cf. Proposition 18 in
Section 8.3. This can be represented pictorially in the following figure:


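[Figure: Spec $\mathbb{Z}[i]$ lying over Spec $\mathbb{Z}$ via $\varphi^*$. The points $(0)$, $(2)$, $(3)$, $(5)$, ..., $(p)$ with
$p \equiv 3 \pmod 4$, and $(p)$ with $p \equiv 1 \pmod 4$ are marked in Spec $\mathbb{Z}$; above them lie $(0)$, $(1+i)$, $(3)$,
the pair $(2+i)$, $(2-i)$, the single point $(p)$, and the pair $(\pi)$, $(\pi')$, respectively.]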
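The description of the fibers of $\varphi^* : \operatorname{Spec} \mathbb{Z}[i] \to \operatorname{Spec} \mathbb{Z}$ in this example can be sketched computationally. The function below is a minimal illustration, assuming its input is a rational prime; the name `gaussian_primes_over` and the encoding of $a + bi$ as the pair `(a, b)` are choices made here, not notation from the text. For $p \equiv 1 \pmod 4$ it searches for a representation $p = a^2 + b^2$, which gives $\pi = a + bi$ and $\pi' = a - bi$.

```python
from math import isqrt


def gaussian_primes_over(p):
    """Generators (a, b), meaning a + b*i, of the primes of Z[i] in the fiber over (p).

    Assumes p is a rational prime.
    """
    if p == 2:
        return [(1, 1)]                 # 2 = -i * (1 + i)^2, a single point
    if p % 4 == 3:
        return [(p, 0)]                 # p remains prime in Z[i], a single point
    # p ≡ 1 (mod 4): write p = a^2 + b^2 with a > b > 0, so p = (a + bi)(a - bi)
    b = 1
    while True:
        a = isqrt(p - b * b)
        if a * a + b * b == p:
            return [(a, b), (a, -b)]
        b += 1


for p in (2, 3, 5, 7, 13):
    print(p, gaussian_primes_over(p))
```

For $p = 5$ this returns the pairs for $2 + i$ and $2 - i$, matching the two points drawn over $(5)$ in the figure.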


(2) If $k$ is an algebraically closed field then Spec $k[x]$ consists of $(0)$ and the ideals $(x - a)$
for $a \in k$; the natural inclusion $\varphi : k[x] \to k[x, y]$ induces the Zariski continuous
map $\varphi^* : \operatorname{Spec} k[x, y] \to \operatorname{Spec} k[x]$. The elements of Spec $k[x, y]$ are

(a) $(0)$,

(b) $(f)$ where $f$ is an irreducible polynomial in $k[x, y]$, and

(c) $(x - a, y - b)$ with $a, b \in k$

(cf. Exercise 4). The prime $(0)$ is Zariski dense in Spec $k[x, y]$; the Zariski closure of
the primes in (b) consists of the primes $(x - a, y - b)$ in (c) with $f(a, b) = 0$; the
closed points, i.e., the elements of mSpec $k[x, y]$, are the primes in (c).

By the Nullstellensatz, each prime ideal $P$ in Spec $k[x, y]$ is uniquely determined
by the corresponding zero set $\mathcal{Z}(P)$. The prime $(0) \in \operatorname{Spec} k[x, y]$ corresponds to $\mathbb{A}^2$.
The prime $(f)$ corresponds to the points where $f(x, y) = 0$, and $P = (f)$ is the
intersection of all the maximal ideals containing $P$. The maximal ideal $(x - a, y - b)$
corresponds to the point $(a, b) \in \mathbb{A}^2$. Fibered over Spec $k[x]$ by the map $\varphi^*$ these
primes can be pictured geometrically as in the diagram on the following page.

In this diagram, the prime $(x - a)$ in Spec $k[x]$ is identified with the element
$a \in k$. The prime $(x) \in \operatorname{Spec} k[x, y]$ corresponds to the points in $\mathbb{A}^2$ with $x = 0$, i.e.,
with the $y$-axis in $\mathbb{A}^2$; the prime $(y) \in \operatorname{Spec} k[x, y]$ similarly corresponds to the
$x$-axis. The prime $(f) \in \operatorname{Spec} k[x, y]$ corresponds to the irreducible curve $f(x, y) = 0$
in $\mathbb{A}^2$; the points $(a, b) \in \mathbb{A}^2$ lying on this curve correspond to the maximal ideals
$(x - a, y - b) \in \operatorname{Spec} k[x, y]$ containing $(f)$. The closed point $(x - a, y - b) \in
\operatorname{Spec} k[x, y]$ corresponds to the “geometric point” $(a, b) \in \mathbb{A}^2$.

[Figure: Spec $k[x, y]$ fibered over Spec $k[x]$.]

Note that Spec $k[x, y]$ captures all of the geometry of algebraic sets in $\mathbb{A}^2$: every
algebraic set in $\mathbb{A}^2$ is the finite union of some subset of the irreducible algebraic sets
corresponding to the elements of Spec $k[x, y]$ pictured above. With the exception of
the everywhere dense point $(0)$, the “geometric” picture of Spec $k[x, y]$ is precisely
the usual geometry of the affine plane $\mathbb{A}^2$. When $k$ is not algebraically closed the
situation is slightly more complicated, but the picture is similar, cf. Exercise 4.


(3) The situation for Spec $\mathbb{Z}[x]$, viewed as fibered over Spec $\mathbb{Z}$ by the natural inclusion
$\mathbb{Z} \to \mathbb{Z}[x]$, is very similar to the situation of Spec $k[x, y]$ in the previous example. The
elements of Spec $\mathbb{Z}[x]$ were discussed in Example 2 following Proposition 54 and can
be pictured as in the diagram on the following page.

The element $(0)$ is Zariski dense in Spec $\mathbb{Z}[x]$. The closure of $(p)$ consists of
$(p)$ and all the closed points $(p, g)$ where $g$ is a monic polynomial in $\mathbb{Z}[x]$ that is
irreducible mod $p$. The closure of $(f)$ consists of $(f)$ together with the maximal
ideals $(p, g)$ that contain $(f)$, which is the same as saying that the image of $f$ in the
quotient $\mathbb{Z}[x]/(p, g)$ is 0, i.e., the irreducible polynomial $g$ is a factor of $f$ mod $p$.
The closed points, mSpec $\mathbb{Z}[x]$, are the maximal ideals $(p, g)$.

Note that the maximal ideals $(p, g)$ containing $(f)$ are precisely the closed points
in mSpec $\mathbb{Z}[x]$ in the diagram above where the “function” $f$ on Spec $\mathbb{Z}[x]$ (taking
the prime $P$ to $f(P) = f \bmod P \in \mathbb{Z}[x]/P$) is zero. For example, the polynomial
$f = x^3 - 4x^2 + x - 9 \in \mathbb{Z}[x]$ fits the diagram above: $f$ is irreducible in $\mathbb{Z}[x]$, and
over $\mathbb{F}_p$ factors into irreducibles as follows:

$$\begin{aligned}
f &\equiv x^3 + x + 1 \pmod 2\\
f &\equiv x(x + 1)^2 \pmod 3\\
f &\equiv (x + 1)(x + 2)(x + 3) \pmod 5.
\end{aligned}$$

There is one point in the fiber over $(2)$ intersecting $(f)$, namely the closed point
$(2, x^3 + x + 1)$. There are two closed points in the fiber over $(3)$ given by $(3, x)$
and $(3, x + 1)$ (with some “multiplicity” at the latter point). Over $(5)$ there are three
closed points: $(5, x + 1)$, $(5, x + 2)$, and $(5, x + 3)$. For the diagram above, the
prime $p$ might be $p = 53$, since this is the first prime $p$ greater than 5 for which
this polynomial has three irreducible factors mod $p$. Note that while the prime $(f)$ is
drawn as a smooth curve in this diagram to emphasize the geometric similarity with
the structure of Spec $k[x, y]$ in the previous example, the fibers above the primes in
Spec $\mathbb{Z}$ are discrete, so some care should be exercised. For example, since $f$ factors
as $(x + 2)(x^2 + x + 6)$ mod 7, the intersection of $(f)$ with the fiber above $(7)$ contains
only the two points $(7, x + 2)$ and $(7, x^2 + x + 6)$, each with multiplicity one.

The possible number of closed points in $(f)$ lying in a fiber over $(p) \in \operatorname{Spec} \mathbb{Z}$
is controlled by the Galois group of the polynomial $f$ over $\mathbb{Q}$ (cf. Section 14.8). For
example, $f = x^4 + 1$ has one closed point in the fiber above $(2)$ and either two or four
closed points in a fiber above $(p)$ for $p$ odd (cf. Exercise 8).
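The closed points $(p, g)$ of the closure of $(f)$ in the fiber over $(p)$ correspond to the distinct monic irreducible factors $g$ of $f$ mod $p$, so the fibers in this example can be explored by factoring mod $p$. The following is a minimal brute-force sketch by trial division, suitable only for small $p$ and small degree; the function names are hypothetical, $f$ is assumed monic, and polynomials are encoded as coefficient lists from the highest degree down.

```python
from itertools import product


def divmod_poly(f, g, p):
    """Quotient and remainder of f by the monic polynomial g over F_p."""
    f = [c % p for c in f]
    q = []
    while len(f) >= len(g):
        lead = f[0]
        q.append(lead)
        for i in range(len(g)):
            f[i] = (f[i] - lead * g[i]) % p
        f.pop(0)                        # leading coefficient is now 0
    return q, f


def irreducible_factors(f, p):
    """Distinct monic irreducible factors of the monic polynomial f mod p."""
    f = [c % p for c in f]
    factors = []
    d = 1
    while len(f) - 1 >= 2 * d:          # a reducible f has a factor of degree <= deg(f)/2
        for tail in product(range(p), repeat=d):
            g = [1, *tail]              # monic candidate of degree d
            found = False
            while len(f) >= len(g):
                q, r = divmod_poly(f, g, p)
                if any(r):
                    break
                f, found = q, True      # divide g out completely
            if found:
                factors.append(g)       # lower-degree factors are gone, so g is irreducible
        d += 1
    if len(f) > 1:
        factors.append(f)               # whatever remains is irreducible
    return factors


cubic = [1, -4, 1, -9]                  # x^3 - 4x^2 + x - 9
quartic = [1, 0, 0, 0, 1]               # x^4 + 1
for p in (2, 3, 5, 7):
    print(p, irreducible_factors(cubic, p), len(irreducible_factors(quartic, p)))
```

Each returned factor `g` gives one closed point $(p, g)$; multiplicities, such as the double factor $x + 1$ mod 3, are deliberately discarded. Scanning larger primes with the same function is one way to examine the text's remark about the first prime beyond 5 at which the cubic splits into three factors.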

The space Spec $R$ together with its Zariski topology gives a geometric generalization
for arbitrary commutative rings of the points in a variety $V$. We now consider the
question of generalizing the ring of rational functions on $V$.

When $V$ is a variety over the algebraically closed field $k$ the elements in the quotient
field $k(V)$ of the coordinate ring $k[V]$ define the rational functions on $V$. Each element
$\alpha$ in $k(V)$ can in general be written as a quotient $a/f$ of elements $a, f \in k[V]$ in
many different ways. The set of points $U$ at which $\alpha$ is regular is an open subset of
$V$; by definition, it consists of all the points $v \in V$ where $\alpha$ can be represented by
some quotient $a/f$ with $f(v) \neq 0$, and then the representative $a/f$ defines an element
in the local ring $\mathcal{O}_{v,V}$. Note also that the same representative $a/f$ defines $\alpha$ not only
at $v$, but also at all the other points where $f$ is nonzero, namely on the open subset
$V_f = \{w \in V \mid f(w) \neq 0\}$ of $V$. These open sets $V_f$ (called principal open sets,
cf. Exercise 21 in Section 2) for the various possible representatives $a/f$ for $\alpha$ give an
open cover of $U$. The example of the function $\alpha = x/y$ for $V = \mathcal{Z}(xz - yw) \subset \mathbb{A}^4$
preceding Proposition 51 shows that in general a single representative for $\alpha$ does not
suffice to determine all of $U$: for this example, $U = V_y \cup V_z$, and $U$ is not covered
by any single $V_f$ (cf. Exercise 25 of Section 4).

This interpretation of rational functions as functions that are regular on open subsets
of $V$ can be generalized to Spec $R$. We first define the analogues $X_f$ in $X = \operatorname{Spec} R$ of
the sets $V_f$ and establish their basic properties.

Definition. For any $f \in R$ let $X_f$ denote the collection of prime ideals in $X = \operatorname{Spec} R$
that do not contain $f$. Equivalently, $X_f$ is the set of points of Spec $R$ at which the value
of $f \in R$ is nonzero. The set $X_f$ is called a principal (or basic) open set in Spec $R$.

Since $X_f$ is the complement of the Zariski closed set $\mathcal{Z}(f)$ it is indeed an open set
in Spec $R$ as the name implies. Some basic properties of the principal open sets are
indicated in the next proposition. Recall that a map between topological spaces is a
homeomorphism if it is continuous and bijective with continuous inverse.

Proposition 56. Let $f \in R$ and let $X_f$ be the corresponding principal open set in
$X = \operatorname{Spec} R$. Then

(1) $X_f = X$ if and only if $f$ is a unit, and $X_f = \emptyset$ if and only if $f$ is nilpotent,

(2) $X_f \cap X_g = X_{fg}$,

(3) $X_f \subseteq X_{g_1} \cup \cdots \cup X_{g_n}$ if and only if $f \in \operatorname{rad}(g_1, \dots, g_n)$; in particular $X_f = X_g$
if and only if $\operatorname{rad}(f) = \operatorname{rad}(g)$,

(4) the principal open sets form a basis for the Zariski topology on Spec $R$, i.e.,
every Zariski open set in $X$ is the union of some collection of principal open
sets $X_f$,

(5) the natural map from $R$ to $R_f$ induces a homeomorphism from Spec $R_f$ to $X_f$,
where $R_f$ is the localization of $R$ at $f$,

(6) the spectrum of any ring is quasicompact (i.e., every open cover has a finite
subcover); in particular, $X_f$ is quasicompact, and

(7) if $\varphi : R \to S$ is any homomorphism of rings (with $\varphi(1_R) = 1_S$) then under
the induced map $\varphi^* : Y = \operatorname{Spec} S \to \operatorname{Spec} R$ the full preimage of the principal
open set $X_f$ in $X$ is the principal open set $Y_{\varphi(f)}$ in $Y$.

Proof: Parts (1), (2) and (7) are left as easy exercises. For (3), observe that, by
definition, $X_{g_1} \cup \cdots \cup X_{g_n}$ consists of the primes $P$ not containing at least one of $g_1, \dots, g_n$.
Hence $X_{g_1} \cup \cdots \cup X_{g_n}$ is the complement of the closed set $\mathcal{Z}((g_1, \dots, g_n))$ consisting of
the primes $P$ that contain the ideal generated by $g_1, \dots, g_n$. If $(g_1, \dots, g_n) = R$ then
$X_{g_1} \cup \cdots \cup X_{g_n} = X$ and there is nothing to prove. Otherwise, $X_f \subseteq X_{g_1} \cup \cdots \cup X_{g_n}$
if and only if every prime $P$ with $f \notin P$ also satisfies $P \notin \mathcal{Z}((g_1, \dots, g_n))$. This
latter condition is equivalent to the statement that if the prime $P$ contains the ideal
$(g_1, \dots, g_n)$ then $P$ also contains $f$, i.e., $f$ is contained in the intersection of all the
prime ideals $P$ containing $(g_1, \dots, g_n)$. Since this intersection is $\operatorname{rad}(g_1, \dots, g_n)$ by
Proposition 12, this proves (3).

If $U = X - \mathcal{Z}(I)$ is a Zariski open subset of $X$, then $U$ is the union of the sets $X_f$
with $f \in I$, which proves (4).

The natural ring homomorphism from $R$ to the localization $R_f$ establishes a
bijection between the prime ideals in $R_f$ and the prime ideals in $R$ not containing $(f)$
(Proposition 38). The corresponding map from Spec $R_f$ to Spec $R$
is therefore a continuous bijection onto $X_f$. Since every ideal of $R_f$ is the extension of some
ideal of $R$ (cf. Proposition 38(1)), it follows that the inverse map is also continuous,
which proves (5).

In (6), every open set is the union of principal open sets by (4), so it suffices to
prove that if $X$ is covered by principal open sets $X_{g_i}$ (for $i$ in some index set $J$) then $X$
is a finite union of some of the $X_{g_i}$. If the ideal $I$ generated by the $g_i$ were a proper ideal
in $R$, then $I$ would be contained in some maximal ideal $P$. But in this case the element
$P$ in $X = \operatorname{Spec} R$ would not be contained in any principal open set $X_{g_i}$, contradicting
the assumption that $X$ is covered by the $X_{g_i}$. Hence $I = R$ and so $1 \in R$ can be written
as a finite sum $1 = a_1 g_{i_1} + \cdots + a_n g_{i_n}$ with $i_1, \dots, i_n \in J$. Consider the finite union
$X_{g_{i_1}} \cup \cdots \cup X_{g_{i_n}}$. Any point $P$ in $X$ not contained in this union would be a prime in
$R$ that contains $g_{i_1}, \dots, g_{i_n}$, hence would contain 1, a contradiction. It follows that
$X = X_{g_{i_1}} \cup \cdots \cup X_{g_{i_n}}$ as needed. The second part of (6) follows from (5).
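The key step in the proof of (6), writing 1 as a finite combination of the $g_i$, can be made concrete for $R = \mathbb{Z}$. The sketch below assumes positive integer inputs and uses the extended Euclidean algorithm; the name `bezout` and the choice of 6, 10, 15 are illustrative only. Since $\gcd(6, 10, 15) = 1$, the sets $X_6$, $X_{10}$, $X_{15}$ cover Spec $\mathbb{Z}$ even though no two of them do.

```python
def bezout(gs):
    """Return (g, coeffs) with g = gcd(gs) = sum(c * x for c, x in zip(coeffs, gs))."""
    def ext_gcd(a, b):
        # returns (g, x, y) with a*x + b*y == g
        if b == 0:
            return a, 1, 0
        g, x, y = ext_gcd(b, a % b)
        return g, y, x - (a // b) * y

    g, coeffs = gs[0], [1]
    for h in gs[1:]:
        g, x, y = ext_gcd(g, h)
        coeffs = [c * x for c in coeffs] + [y]   # rescale old coefficients, add the new one
    return g, coeffs


gs = [6, 10, 15]
g, coeffs = bezout(gs)
print(g, coeffs, sum(c * x for c, x in zip(coeffs, gs)))

# No prime can divide all of 6, 10, 15, so every point (p) lies in some X_{g_i}.
print(all(any(x % p != 0 for x in gs) for p in (2, 3, 5, 7, 11)))
```

A prime $P$ containing all three generators would contain the combination equal to 1, which is exactly the contradiction used in the proof.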

We now define an analogue for $X = \operatorname{Spec} R$ of the rational functions on a variety
$V$. As we observed, for the variety $V$ a rational function $\alpha \in k(V)$ is a regular function
on some open set $U$. At each point $v \in U$ there is a representative $a/f$ for $\alpha$ with
$f(v) \neq 0$, and this representative is an element in the localization $\mathcal{O}_{v,V} = k[V]_{\mathcal{I}(v)}$.
In this way the regular function $\alpha$ on $U$ can be considered as a function from $U$ to the
disjoint union of these localizations: the point $v \in U$ is mapped to the representative
$a/f \in k[V]_{\mathcal{I}(v)}$. Furthermore the same representative can be used simultaneously not
only at $v$ but on the whole Zariski neighborhood $V_f$ of $v$ (so, “locally near $v$” $\alpha$ is
given by a single quotient of elements from $k[V]$). Note that $a/f$ is an element in the
localization $k[V]_f$, which is contained in each of the localizations $k[V]_{\mathcal{I}(w)}$ for $w \in V_f$.

We now generalize this to Spec $R$ by considering the collection of functions $s$ from
the Zariski open subset $U$ of Spec $R$ to the disjoint union of the localizations $R_P$ for
$P \in U$ such that $s(P) \in R_P$ and such that $s$ is given locally by quotients of elements
of $R$. More precisely:

Definition. Suppose $U$ is a Zariski open subset of Spec $R$. If $U = \emptyset$, define $\mathcal{O}(U) = 0$.
Otherwise, define $\mathcal{O}(U)$ to be the set of functions $s : U \to \bigsqcup_{Q \in U} R_Q$ from $U$ to the
disjoint union of the localizations $R_Q$ for $Q \in U$ with the following two properties:

(1) $s(Q) \in R_Q$ for every $Q \in U$, and

(2) for every $P \in U$ there is an open neighborhood $X_f \subseteq U$ of $P$ in $U$ and an
element $a/f^n$ in the localization $R_f$ defining $s$ on $X_f$, i.e., $s(Q) = a/f^n \in R_Q$
for every $Q \in X_f$.

If $s$, $t$ are elements in $\mathcal{O}(U)$ then $s + t$ and $st$ are also elements in $\mathcal{O}(U)$ (cf.
Exercise 18), so each $\mathcal{O}(U)$ is a ring. Also, every $a \in R$ gives an element in $\mathcal{O}(U)$
defined by $s(Q) = a \in R_Q$, and in particular $1 \in R$ gives an identity for the ring $\mathcal{O}(U)$.
If $U'$ is an open subset of $U$, then there is a natural restriction map from $\mathcal{O}(U)$ to $\mathcal{O}(U')$
which is a homomorphism of rings (cf. Exercise 19).

Definition. Let $R$ be a commutative ring with 1, and let $X = \operatorname{Spec} R$.

(1) The collection of rings $\mathcal{O}(U)$ for the Zariski open sets of $X$ together with the
restriction maps $\mathcal{O}(U) \to \mathcal{O}(U')$ for $U' \subseteq U$ is called the structure sheaf on
$X$, and is denoted simply by $\mathcal{O}$ (or $\mathcal{O}_X$).

(2) The elements $s$ of $\mathcal{O}(U)$ are called the sections of $\mathcal{O}$ over $U$. The elements of
$\mathcal{O}(X)$ are called the global sections of $\mathcal{O}$.


The next proposition generalizes the result of Proposition 51 that the only rational
functions on a variety $V$ that are regular everywhere are the elements of the coordinate
ring $k[V]$.


Proposition 57. Let $X = \operatorname{Spec} R$ and let $\mathcal{O} = \mathcal{O}_X$ be its structure sheaf. The global
sections of $\mathcal{O}$ are the elements of $R$, i.e., $\mathcal{O}(X) = R$. More generally, if $X_f$ is a principal
open set in $X$ for some $f \in R$, then $\mathcal{O}(X_f)$ is isomorphic to the localization $R_f$.


Proof: Suppose that $a/f^n$ is an element of the localization $R_f$. Then the map defined
by $s(Q) = a/f^n \in R_Q$ for $Q \in X_f$ gives an element in $\mathcal{O}(X_f)$, and it is immediate
that the resulting map $\psi$ from $R_f$ to $\mathcal{O}(X_f)$ is a ring homomorphism. Suppose that
$a/f^n = b/f^m$ in $R_Q$ for every $Q \in X_f$, i.e., $g(af^m - bf^n) = 0$ in $R$ for some $g \notin Q$.
If $I$ is the ideal in $R$ of elements $r \in R$ with $r(af^m - bf^n) = 0$, it follows from $g \in I$
that $I$ is not contained in $Q$ for any $Q \in X_f$. Put another way, every prime ideal of $R$
containing $I$ also contains $f$. Hence $f$ is contained in the intersection of all the prime
ideals of $R$ containing $I$, which is to say that $f \in \operatorname{rad} I$. Then $f^N \in I$ for some integer
$N \ge 0$, and so $f^N(af^m - bf^n) = 0$ in $R$. But this shows that $a/f^n = b/f^m$ in $R_f$ and
so the map $\psi$ is injective. Suppose now that $s \in \mathcal{O}(X_f)$. Then by definition $X_f$ can be
covered by principal open sets $X_{g_i}$ on which $s(Q) = a_i/g_i^{n_i} \in R_Q$ for every $Q \in X_{g_i}$.
By (6) of Proposition 56, we may take a finite number of the $g_i$ and then by taking
different $a_i$ we may assume all the $n_i$ are equal (since $a_i/g_i^{n_i} = (a_i g_i^{n-n_i})/g_i^n$ if $n$ is the
maximum of the $n_i$). Since $s(Q) = a_i/g_i^n = a_j/g_j^n$ in $R_Q$ for all $Q \in X_{g_i g_j} = X_{g_i} \cap X_{g_j}$,
the injectivity of $\psi$ (applied to $R_{g_i g_j}$) shows that $a_i/g_i^n = a_j/g_j^n$ in $R_{g_i g_j}$. This means
that $(g_i g_j)^N (a_i g_j^n - a_j g_i^n) = 0$, i.e.,


\[ a_i g_i^{N} g_j^{n+N} = a_j g_i^{n+N} g_j^{N} \]


in $R$ for some $N \ge 0$, and we may assume $N$ sufficiently large that this holds for every
$i$ and $j$. Since $X_f$ is the union of the $X_{g_i} = X_{g_i^{n+N}}$, $f$ is contained in the radical of the
ideal generated by the $g_i^{n+N}$ by (3) of Proposition 56, say

\[ f^M = \sum_i b_i g_i^{n+N} \]

for some $M \ge 1$ and $b_i \in R$. Define $a = \sum_i b_i a_i g_i^N \in R$. Then

\[ g_j^{n+N} a = \sum_i b_i \bigl(a_i g_i^{N} g_j^{n+N}\bigr) = \sum_i b_i \bigl(a_j g_i^{n+N} g_j^{N}\bigr) = a_j g_j^{N} f^M. \]

It follows that $a/f^M = a_j/g_j^n$ in $R_{g_j}$, and so the element in $\mathcal{O}(X_f)$ defined by $a/f^M$
in $R_f$ agrees with $s$ on every $X_{g_j}$, and so on all of $X_f$ since these open sets cover $X_f$.
Hence the map $\psi$ gives an isomorphism $R_f \cong \mathcal{O}(X_f)$. Taking $f = 1$ gives $R \cong \mathcal{O}(X)$,
completing the proof.
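
The gluing step in this proof can be followed in a small case. The following is a hedged worked example, not part of the original text: it assumes $R = \mathbb{Z}$, $f = 1$, the cover of $X$ by $X_2$ and $X_3$, and exponents $n = 1$, $N = 0$ (which suffices because $\mathbb{Z}$ is an integral domain). The names $a_1, a_2, b_1, b_2$ follow the notation of the proof.

```latex
\begin{align*}
&\text{Data: } s = a_1/2 \text{ on } X_2, \quad s = a_2/3 \text{ on } X_3, \quad
  3a_1 = 2a_2 \ \text{(agreement on } X_6 = X_2 \cap X_3\text{)}.\\
&\text{Since } X = X_2 \cup X_3:\quad 1 = (-1)\cdot 2 + 1 \cdot 3,
  \quad\text{so } M = 1,\ b_1 = -1,\ b_2 = 1.\\
&a = b_1 a_1 + b_2 a_2 = a_2 - a_1.\\
&2a = 2a_2 - 2a_1 = 3a_1 - 2a_1 = a_1, \qquad
  3a = 3a_2 - 3a_1 = 3a_2 - 2a_2 = a_2.
\end{align*}
```

So the integer $a = a_2 - a_1$ satisfies $a/1 = a_1/2$ on $X_2$ and $a/1 = a_2/3$ on $X_3$, i.e., the section $s$ is the global section defined by $a \in \mathbb{Z}$, as the proposition predicts.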

In the case of affine varieties $V$ the local ring $\mathcal{O}_{v,V}$ at the point $v \in V$ is the
collection of all the rational functions in $k(V)$ that are defined at $v$. Put another way,
$\mathcal{O}_{v,V}$ is the union of the rings of regular functions on $U$ for the open sets $U$ containing
$v$, where this union takes place in the function field $k(V)$ of $V$. In the more general
case of $X$ = Spec $R$, the rings $\mathcal{O}(U)$ for the open sets containing $P \in$ Spec $R$ are not
contained in such an obvious common ring. In this case we proceed by considering the
collection of pairs $(s, U)$ with $U$ an open set of $X$ containing $P$ and $s \in \mathcal{O}(U)$. We
identify two pairs $(s, U)$ and $(s', U')$ if there is an open set $U'' \subseteq U \cap U'$ containing
$P$ on which $s$ and $s'$ restrict to the same element of $\mathcal{O}(U'')$. In the situation of affine
varieties, this says that two functions defined in Zariski neighborhoods of the point $v$
define the same regular function at $v$ if they agree in some common neighborhood of
$v$. The collection of equivalence classes of pairs $(s, U)$ defines the direct limit of the
rings $\mathcal{O}(U)$, and is denoted $\varinjlim \mathcal{O}(U)$ (cf. Exercise 8 in Section 7.6).

Definition. If $P \in X$ = Spec $R$, then the direct limit, $\varinjlim \mathcal{O}(U)$, of the rings $\mathcal{O}(U)$
for the open sets $U$ of $X$ containing $P$ is called the stalk of the structure sheaf at $P$, and
is denoted $\mathcal{O}_P$.

Proposition 58. Let $X$ = Spec $R$ and let $\mathcal{O} = \mathcal{O}_X$ be its structure sheaf. The stalk of
$\mathcal{O}$ at the point $P \in X$ is isomorphic to the localization $R_P$ of $R$ at $P$: $\mathcal{O}_P \cong R_P$. In
particular, the stalk $\mathcal{O}_P$ is a local ring.

Proof: If $(s, U)$ represents an element in the stalk $\mathcal{O}_P$, then $s(P)$ is an element of
the localization $R_P$. By the definition of the direct limit, this element does not depend
on the choice of representative $(s, U)$, and so gives a well defined ring homomorphism
$\varphi$ from $\mathcal{O}_P$ to $R_P$. If $a, f \in R$ with $f \notin P$, then the map $s(Q) = a/f \in R_Q$ defines
an element in $\mathcal{O}(X_f)$. Then the class of $(s, X_f)$ in the stalk $\mathcal{O}_P$ is mapped to $a/f$
in $R_P$ by $\varphi$, so $\varphi$ is a surjective map. To see that $\varphi$ is also injective, suppose that the
classes of $(s, U)$ and $(s', U')$ in $\mathcal{O}_P$ satisfy $s(P) = s'(P)$ in $R_P$. By definition of $\mathcal{O}(U)$,
$s = a/g^n$ on $X_g$ for some $g \notin P$. Similarly, $s' = b/(g')^m$ on $X_{g'}$ for some $g' \notin P$.
Since $a/g^n = b/(g')^m$ in $R_P$, there is some $h \notin P$ with $h(a(g')^m - bg^n) = 0$ in $R$. If
$Q \in X_{gg'h} = X_g \cap X_{g'} \cap X_h$, this last equality shows that $a/g^n = b/(g')^m$ in $R_Q$, so
that $s$ and $s'$ agree when restricted to $X_{gg'h}$. By definition of the direct limit, $(s, U)$ and
$(s', U')$ define the same element in the stalk $\mathcal{O}_P$, which proves that $\varphi$ is injective and
establishes the proposition.

Proposition 58 shows that the algebraically defined localization $R_P$ for $P \in$ Spec $R$
plays the role of the local ring $\mathcal{O}_{v,V}$ of regular functions at $v$ for the affine variety
$V$. If $\mathfrak{m}_P$ denotes the maximal ideal $PR_P$ in $R_P$ and $k(P) = R_P/\mathfrak{m}_P$ denotes the
corresponding quotient field (which by Proposition 46(1) is also the fraction field of
$R/P$), then the tangent space at $P$ is defined to be the $k(P)$-vector space dual of $\mathfrak{m}_P/\mathfrak{m}_P^2$.
This is an algebraic definition that generalizes the definition of the tangent space $\mathbb{T}_{v,V}$
to a variety $V$ at a point $v$ (by Proposition 52). This can now be used to define what it
means for a point in Spec $R$ to be nonsingular: the point $P \in$ Spec $R$ is nonsingular or
smooth if the local ring $R_P$ is what is called a "regular local ring" (cf. Section 16.2).
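
As a hedged illustration of this definition (the example is not from the original text), consider the cuspidal cubic over a field $k$. The sketch assumes the standard characterization that a Noetherian local ring is regular exactly when the dimension of $\mathfrak{m}/\mathfrak{m}^2$ over the residue field equals its Krull dimension.

```latex
\begin{align*}
&R = k[x,y]/(y^2 - x^3), \qquad P = (x, y), \qquad k(P) = k.\\
&y^2 - x^3 \in (x,y)^2 \ \Longrightarrow\
  \mathfrak{m}_P/\mathfrak{m}_P^2 \cong (x,y)/(x,y)^2 = k\,\bar{x} \oplus k\,\bar{y},
  \quad \dim_k \mathfrak{m}_P/\mathfrak{m}_P^2 = 2.\\
&\text{Krull dimension of } R_P = 1 < 2
  \ \Longrightarrow\ P \text{ is a singular point.}\\[4pt]
&P' = (x - 1,\, y - 1):\quad
  y^2 - x^3 = 2(y-1) - 3(x-1) + (\text{terms in } (x-1,\,y-1)^2),\\
&\text{and } 2,\ 3 \text{ are not both zero in } k
  \ \Longrightarrow\ \dim_k \mathfrak{m}_{P'}/\mathfrak{m}_{P'}^2 = 1
  \ \Longrightarrow\ P' \text{ is nonsingular.}
\end{align*}
```

The tangent space at the cusp is therefore 2-dimensional even though the curve is 1-dimensional, while at the point $(1,1)$ the tangent space is a line.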

Proposition 58 also suggests a nice geometric view of the structure sheaf on Spec $R$.
If we view each point $P \in$ Spec $R$ as having the local ring $R_P$ above it, then above the
open set $U$ in $X$ = Spec $R$ is a "sheaf" (in the sense of a "bundle") of these "stalks"
(in the sense of a "stalk of wheat"), which helps explain some of the terminology. A
section $s$ in the structure sheaf $\mathcal{O}(U)$ is a map from $U$ to this bundle of stalks. The
image of $U$ under such a section $s$ is indicated by the shaded region in the following
figure.



Definition. Let $R$ be a commutative ring with 1. The pair (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$), consisting
of the space Spec $R$ with the Zariski topology together with the structure sheaf $\mathcal{O}_{\mathrm{Spec}\,R}$,
is called an affine scheme.

The notion of an affine scheme gives a completely algebraic generalization of the
geometry of affine algebraic sets valid for arbitrary commutative rings, and is the starting
point for modern algebraic geometry.

Examples 

(1) If $F$ is any field then $X$ = Spec $F$ = $\{(0)\}$. In this case there are only two open sets
$X$ and $\emptyset$, both of which are principal open sets: $X = X_1$ and $\emptyset = X_0$. The global
sections are $\mathcal{O}(X) = F$. There is only one stalk: $\mathcal{O}_{(0)} = F_{(0)} = F$.

(2) If $R = \mathbb{Z}$ then because $R$ is a P.I.D. every open set in $X$ = Spec $\mathbb{Z}$ is principal open:

$X_n = \{(p) \mid p \nmid n\}$ and

$\mathcal{O}(X_n) = \mathbb{Z}_n = \mathbb{Z}[1/n] = \{a/b \in \mathbb{Q} \mid \text{if the prime } p \mid b \text{ then } p \mid n\}$.
For nonzero $p$ the stalk at $(p)$ is the local ring $\mathbb{Z}_{(p)}$, and the stalk at $(0)$ is $\mathbb{Q}$. All the
restriction maps as well as the maps from sections to stalks are the natural inclusions.

(3) For a general integral domain $R$ with quotient field $F$ the stalks and sections are

$\mathcal{O}(U) = \{a/b \in F \mid b \notin P \text{ for all } P \in U\}$

$\mathcal{O}_P = R_P = \{a/b \in F \mid b \notin P\}$

where the stalk at $(0)$ is $F$, i.e., $\mathcal{O}_{(0)} = F$. Again, the restriction maps and the maps
to the stalks are all inclusions.

(4) For the local ring $R = \mathbb{Z}_{(2)} = \{a/b \in \mathbb{Q} \mid b \text{ odd}\}$ we have Spec $R$ = $\{(0), (2)\}$ with
$(2)$ the only closed point and $\{(0)\} = X_2$ a principal open set. The sections $\mathcal{O}(\{(0)\})$
are $R_2 = \mathbb{Q}$, and the stalks are $\mathcal{O}_{(0)} = R_{(0)} = \mathbb{Q}$ and $\mathcal{O}_{(2)} = R_{(2)} = R$.
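
The description of sections and stalks for Spec $\mathbb{Z}$ in Example 2 can be checked numerically. The following is a minimal sketch, not part of the original text; the function names `in_sections_over_X_n` and `in_stalk_at` are hypothetical and chosen only for this illustration. It assumes $n \neq 0$ and that $p$ is either a prime or 0.

```python
from fractions import Fraction
from math import gcd


def in_sections_over_X_n(q: Fraction, n: int) -> bool:
    """Return True if q lies in Z[1/n] = O(X_n), i.e. every prime dividing
    the reduced denominator of q also divides n. Assumes n != 0."""
    if n == 0:
        raise ValueError("X_0 is empty, so O(X_0) is the zero ring")
    d = q.denominator  # Fraction keeps q in lowest terms
    g = gcd(d, n)
    # Strip from d every prime factor it shares with n.
    while g > 1:
        d //= g
        g = gcd(d, n)
    return d == 1


def in_stalk_at(q: Fraction, p: int) -> bool:
    """Return True if q lies in the stalk at (p): Z_(p) for a prime p,
    or Q when p == 0."""
    return p == 0 or q.denominator % p != 0


if __name__ == "__main__":
    print(in_sections_over_X_n(Fraction(5, 12), 6))  # 12 = 2^2 * 3, both divide 6
    print(in_sections_over_X_n(Fraction(1, 5), 6))   # 5 does not divide 6
    print(in_stalk_at(Fraction(5, 12), 5))           # section over X_6 maps into Z_(5)
```

The last call reflects the statement that the map from sections over $X_6$ to the stalk at $(5) \in X_6$ is an inclusion: a fraction whose denominator involves only 2 and 3 has denominator prime to 5.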

We next consider the relationship of the affine schemes corresponding to rings $R$
and $S$ with respect to a ring homomorphism from $R$ to $S$.

Suppose that $\varphi : R \to S$ is a ring homomorphism. We have already seen in
Proposition 56(7) that there is an induced continuous map $\varphi^*$ from $Y$ = Spec $S$ to
$X$ = Spec $R$ and that under this map the full preimage of the principal open set $X_g$
for $g \in R$ is the principal open set $Y_{\varphi(g)}$. It follows that $\varphi$ also induces a map on
corresponding sections, as follows. Let $Q' \in Y$ be any element in Spec $S$ and let
$Q = \varphi^*(Q') = \varphi^{-1}(Q') \in X$ be the corresponding element in Spec $R$. If $U$ is a Zariski
open set in $X$ containing $Q$, then $U' = (\varphi^*)^{-1}(U)$ is a Zariski open set in $Y$ containing
$Q'$. Note that $\varphi$ induces a natural ring homomorphism, $\varphi_Q$ say, from the localization
$R_Q$ to the localization $S_{Q'}$ defined by $\varphi_Q(a/f) = \varphi(a)/\varphi(f) \in S_{Q'}$ for $f \notin Q$. Let
$s \in \mathcal{O}_X(U)$ be a section of the structure sheaf of $X$ given locally in the neighborhood
$X_g$ of $P \in X$ by $a/g^n$. It is easy to check that the composite

\[ s' : U' \xrightarrow{\ \varphi^*\ } U \xrightarrow{\ s\ } \bigsqcup_{Q \in U} R_Q \xrightarrow{\ \varphi_Q\ } \bigsqcup_{Q' \in U'} S_{Q'} \]

defines a map given locally in the neighborhood $Y_{\varphi(g)}$ by the element $\varphi(a)/\varphi(g)^n$, so
that $s' \in \mathcal{O}_Y(U')$ is a section of the structure sheaf of $Y$. It is then straightforward to
check that the resulting map $\varphi^\# : \mathcal{O}_X(U) \to \mathcal{O}_Y(U')$ is a ring homomorphism (mapping
$1 \in \mathcal{O}_X(U)$ to $1 \in \mathcal{O}_Y(U')$) that is compatible with the restriction maps on $\mathcal{O}_X$ and
$\mathcal{O}_Y$ (cf. Exercise 20). It also follows that there is an induced ring homomorphism on
the stalks: $\varphi^\# : \mathcal{O}_{X,P} \to \mathcal{O}_{Y,P'}$ for any point $P' \in$ Spec $S$ and corresponding point
$P = \varphi^*(P') \in$ Spec $R$. Under the isomorphism in Proposition 58, the homomorphism
$\varphi^\#$ from $R_P = \mathcal{O}_{X,P}$ to $S_{P'} = \mathcal{O}_{Y,P'}$ is just the natural ring homomorphism $\varphi_P$ on the
localizations induced by the homomorphism $\varphi$. In particular, the inverse image under
$\varphi^\#$ of the maximal ideal in the local ring $\mathcal{O}_{Y,P'}$ is the maximal ideal in the local ring
$\mathcal{O}_{X,P}$.
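
The induced map on spectra can be explored by brute force for finite rings. The sketch below is an illustration added under stated assumptions, not the text's construction: it takes $R = \mathbb{Z}/n\mathbb{Z}$, $S = \mathbb{Z}/m\mathbb{Z}$ with $m \mid n$, and $\varphi$ the reduction map. All function names (`ideals`, `spec`, `pullback`, `principal_open`) are hypothetical helpers introduced here.

```python
def ideals(n):
    """All ideals of Z/nZ as frozensets, keyed by a generator d dividing n
    (the key d = n corresponds to the zero ideal)."""
    return {d: frozenset(range(0, n, d)) for d in range(1, n + 1) if n % d == 0}


def is_prime_ideal(I, n):
    """Check the definition directly: I is proper, and ab in I forces a in I or b in I."""
    if len(I) == n:
        return False
    return all((a * b) % n not in I or a in I or b in I
               for a in range(n) for b in range(n))


def spec(n):
    """Prime ideals of Z/nZ, keyed by generator."""
    return {d: I for d, I in ideals(n).items() if is_prime_ideal(I, n)}


def pullback(I_prime, n, m):
    """Preimage of an ideal of Z/mZ under reduction Z/nZ -> Z/mZ (requires m | n)."""
    return frozenset(x for x in range(n) if x % m in I_prime)


def principal_open(g, n):
    """X_g in Spec Z/nZ: generators of the primes that do not contain g."""
    return {d for d, I in spec(n).items() if g % n not in I}


if __name__ == "__main__":
    n, m, g = 12, 6, 2
    print(sorted(spec(n)), sorted(spec(m)))
    # Preimage of X_g under the induced map Spec Z/6Z -> Spec Z/12Z.
    targets = [spec(n)[d] for d in principal_open(g, n)]
    preimage = {d for d, Q in spec(m).items() if pullback(Q, n, m) in targets}
    print(preimage == principal_open(g % m, m))
```

In this small case the preimage of the principal open set $X_g$ coincides with $Y_{\varphi(g)}$, matching the statement quoted from Proposition 56(7).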

Definition. Suppose (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$) and (Spec $S$, $\mathcal{O}_{\mathrm{Spec}\,S}$) are two affine schemes.
A morphism of affine schemes from (Spec $S$, $\mathcal{O}_{\mathrm{Spec}\,S}$) to (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$) is a pair
$(\varphi^*, \varphi^\#)$ such that

(1) $\varphi^*$ : Spec $S$ $\to$ Spec $R$ is Zariski continuous,

(2) there are ring homomorphisms $\varphi^\# : \mathcal{O}(U) \to \mathcal{O}((\varphi^*)^{-1}(U))$ for every Zariski
open subset $U$ in Spec $R$ that commute with the restriction maps, and

(3) if $P' \in$ Spec $S$ with corresponding point $P = \varphi^*(P') \in$ Spec $R$, then under the
induced homomorphism on stalks $\varphi^\# : \mathcal{O}_{\mathrm{Spec}\,R,P} \to \mathcal{O}_{\mathrm{Spec}\,S,P'}$ the preimage of
the maximal ideal of $\mathcal{O}_{\mathrm{Spec}\,S,P'}$ is the maximal ideal of $\mathcal{O}_{\mathrm{Spec}\,R,P}$.

A homomorphism $\psi : A \to B$ from the local ring $A$ to the local ring $B$ with
the property that the preimage of the maximal ideal of $B$ is the maximal ideal of $A$ is
called a local homomorphism of local rings. The third condition in the definition is
then the statement that the induced homomorphism on stalks is required to be a local
homomorphism.

With this terminology, the discussion preceding the definition shows that a ring
homomorphism $\varphi : R \to S$ induces a morphism of affine schemes from (Spec $S$, $\mathcal{O}_{\mathrm{Spec}\,S}$)
to (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$).

Conversely, suppose $(\varphi^*, \varphi^\#)$ is a morphism of affine schemes from (Spec $S$, $\mathcal{O}_{\mathrm{Spec}\,S}$)
to (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$). Then in particular, for $U$ = Spec $R$, $(\varphi^*)^{-1}(U)$ = Spec $S$, so by
assumption there is a ring homomorphism $\varphi^\# : \mathcal{O}_{\mathrm{Spec}\,R}(\mathrm{Spec}\,R) \to \mathcal{O}_{\mathrm{Spec}\,S}(\mathrm{Spec}\,S)$
defined on the global sections. By Proposition 57, we have $\mathcal{O}_{\mathrm{Spec}\,R}(\mathrm{Spec}\,R) \cong R$ and
$\mathcal{O}_{\mathrm{Spec}\,S}(\mathrm{Spec}\,S) \cong S$ as rings. Composing with these isomorphisms shows that $\varphi^\#$ gives
a ring homomorphism $\varphi : R \to S$. By Proposition 58 we have a local homomorphism
$\varphi^\# : R_P \to S_{P'}$, and by the compatibility with the restriction homomorphisms it follows
that the diagram

\[
\begin{array}{ccc}
R & \xrightarrow{\ \varphi\ } & S \\
\downarrow & & \downarrow \\
R_P & \xrightarrow{\ \varphi^{\#}\ } & S_{P'}
\end{array}
\]

commutes, where the two vertical maps are the natural localization homomorphisms.
Since $\varphi^\#$ is assumed to be a local homomorphism, $(\varphi^\#)^{-1}(P'S_{P'}) = PR_P$, from which
it follows that $\varphi^{-1}(P') = P$. Hence the continuous map from Spec $S$ to Spec $R$ induced
by $\varphi$ is the same as $\varphi^*$, and it follows easily that $\varphi$ also induces the homomorphism $\varphi^\#$.
This shows that there is a ring homomorphism $\varphi : R \to S$ inducing both $\varphi^*$ and $\varphi^\#$ as
before.

We summarize this in the following proposition: 

Theorem 59. Every ring homomorphism $\varphi : R \to S$ induces a morphism

$(\varphi^*, \varphi^\#)$ : (Spec $S$, $\mathcal{O}_{\mathrm{Spec}\,S}$) $\to$ (Spec $R$, $\mathcal{O}_{\mathrm{Spec}\,R}$)

of affine schemes. Conversely, every morphism of affine schemes arises from such a
ring homomorphism $\varphi$.

Theorem 59 is the analogue for Spec R of Theorem 6, which converted geometric 
questions relating to affine algebraic sets to algebraic questions for their coordinate 
rings. 

The condition that the homomorphism on stalks be a local homomorphism in the
definition of a morphism of affine schemes is necessary: a continuous map on the
spectra together with a set of compatible ring homomorphisms on sections (hence also
on stalks) is not sufficient to force these maps to come from a ring homomorphism.

Example

Let $R = \mathbb{Z}_{(2)}$ and $S = \mathbb{Q}$ as in the preceding set of examples. Define $\varphi^*$ : Spec $\mathbb{Q}$ $\to$
Spec $\mathbb{Z}_{(2)}$ by $\varphi^*((0)) = (2)$ (which is Zariski continuous). Define $\varphi^\# : \mathcal{O}(\mathrm{Spec}\,R) \to
\mathcal{O}(\mathrm{Spec}\,S)$ to be the inclusion map $\mathbb{Z}_{(2)} \to \mathbb{Q}$, and for all other open sets $U \subseteq$ Spec $R$
define $\varphi^\#$ on $\mathcal{O}(U)$ simply to be the zero map. It is straightforward to check that these
homomorphisms commute with the restriction maps. This family of maps does not arise
from a ring homomorphism, however, because on the stalks for $(0) \in$ Spec $S$ and
$\varphi^*((0)) = (2) \in$ Spec $R$ the induced homomorphism

$\varphi^\# : \mathcal{O}_{\mathrm{Spec}\,R,(2)} \to \mathcal{O}_{\mathrm{Spec}\,S,(0)}$

is the injection $\mathbb{Z}_{(2)} \to \mathbb{Q}$, which is not a local homomorphism (the inverse image of $(0)$
is $(0)$ and not the maximal ideal $2\mathbb{Z}_{(2)}$).

The proof of Theorem 59 shows that a morphism $(\varphi^*, \varphi^\#)$ of affine schemes necessarily
comes from the ring homomorphism defined by $\varphi^\#$ on global sections. In this example,
the homomorphism on global sections is the inclusion map of $R$ into $S$. The inclusion map
from $R$ to $S$ defines a map from Spec $S$ to Spec $R$ that maps $(0) \in$ Spec $S$ to $(0) \in$ Spec $R$
and not to $(2) \in$ Spec $R$, so this map does not agree with the original map $\varphi^*$.

The previous example shows that the converse in Theorem 59 would not be true
without the third (local homomorphism) condition in the definition of a morphism of
affine schemes. As a result, Theorem 59 shows that the appropriate place to view affine
schemes is in the category of locally ringed spaces. Roughly speaking, a locally ringed
space is a topological space $X$ together with a collection of rings $\mathcal{O}(U)$ for each open
subset $U$ of $X$ (with a compatible set of homomorphisms from $\mathcal{O}(U)$ to $\mathcal{O}(U')$ if $U' \subseteq U$
and with some local conditions on the sections) such that the stalks $\mathcal{O}_P = \varinjlim \mathcal{O}(U)$ for
$P \in U$ are local rings. The morphisms in this category are continuous maps between the
topological spaces together with ring homomorphisms between corresponding $\mathcal{O}(U)$
with precisely the same conditions as imposed in the definition of a morphism of affine
schemes.

A scheme is a locally ringed space in which each point lies in a neighborhood
isomorphic to an affine scheme (with some compatibility conditions between such
neighborhoods), and is a fundamental object of study in modern algebraic geometry.
The affine schemes considered here form the building blocks that are "glued together"
to define general schemes in the same way that ordinary Euclidean spaces form the
building blocks that are "glued together" to define manifolds in analysis.

EXERCISES 

All rings are assumed commutative with identity, and all ring homomorphisms are assumed to 
map identities to identities. 

1. If $N$ is the nilradical of $R$, prove that Spec $R$ and Spec $R/N$ are homeomorphic. [Show
that the natural homomorphism from $R$ to $R/N$ induces a Zariski continuous isomorphism
from Spec $R/N$ to Spec $R$.]

2. Let $I$ be an ideal in the ring $R$. Prove that the continuous map from Spec $R/I$ to Spec $R$
induced by the canonical projection homomorphism $R \to R/I$ maps Spec $R/I$
homeomorphically onto the closed set $\mathcal{Z}(I)$ in Spec $R$.

3. Prove that two elements $f, g \in R$ have the same values at all elements $P$ in Spec $R$ if and
only if $f - g$ is contained in the nilradical of $R$. In particular, prove that an element in an
affine $k$-algebra is uniquely determined by its values.

4. Let $k$ be an arbitrary field, not necessarily algebraically closed. Prove that the prime ideals
in $k[x, y]$ (i.e., the elements of Spec $k[x, y]$) are

(i) $(0)$,

(ii) $(f)$ where $f$ is an irreducible polynomial in $k[x, y]$, and

(iii) $(p(x), g(x, y))$ where $p(x)$ is an irreducible polynomial in $k[x]$ and $g(x, y)$ is an
irreducible polynomial in $k[x, y]$ that is irreducible modulo $p(x)$, i.e., $g(x, y)$ remains
irreducible in the quotient $k[x, y]/(p(x))$.

Prove that mSpec $k[x, y]$ consists of the primes in (iii). [Use Exercise 20 in Section 1.]

5. Let $\mathfrak{m} = (p(x), g(x, y))$ be a maximal ideal in $k[x, y]$ as in the previous exercise. Show
that $K = k[x, y]/\mathfrak{m}$ is an algebraic field extension of $k$, so that $k[x, y]$ can also be viewed
as a subring of $K[x, y]$. If $x, y$ are mapped to $\alpha, \beta \in K$, respectively, under the canonical
homomorphism $k[x, y] \to k[x, y]/\mathfrak{m}$, prove that $\mathfrak{m} = k[x, y] \cap (x - \alpha, y - \beta) \subseteq K[x, y]$.

6. Describe the elements in Spec $\mathbb{R}[x]$ and Spec $\mathbb{C}[x]$. Describe the elements in Spec $\mathbb{Z}_{(2)}[x]$
where $\mathbb{Z}_{(2)} = \{a/b \in \mathbb{Q} \mid b \text{ is odd}\}$ is the localization of $\mathbb{Z}$ at the prime $(2)$.

7. Let $(f) = (x^5 + x + 1)$ in Spec $\mathbb{Z}[x]$ viewed as fibered over Spec $\mathbb{Z}$ as in Example 3
following Proposition 55. Show that there are two closed points in the fiber over $(2)$, three
closed points in the fiber over $(5)$, four closed points in the fiber over $(19)$, and five closed
points in the fiber over $(211)$.

8. Let $(f) = (x^4 + 1)$ in Spec $\mathbb{Z}[x]$ viewed as fibered over Spec $\mathbb{Z}$ as in Example 3 following
Proposition 55. Prove that there is one closed point in the fiber over $(2)$, four closed points
in the fiber over $(p)$ for $p$ odd, $p \equiv 1 \bmod 8$, and two closed points in the fiber over $(p)$ for
all other odd primes $p$ (cf. Corollary 16 in Section 3 of Chapter 14).

9. Prove that the elements in the fiber over $(p)$ of the Zariski continuous map from Spec $\mathbb{Z}[x]$
to Spec $\mathbb{Z}$ are homeomorphic with the elements in Spec$(\mathbb{Z}[x] \otimes_{\mathbb{Z}} \mathbb{F}_p)$.

10. Let $X$ = Spec $R$ and let $X_f$ be the principal open set corresponding to $f \in R$. Prove that
$X_f \cap X_g = X_{fg}$. Prove that $X_f = X$ if and only if $f$ is a unit in $R$, and that $X_f = \emptyset$ if
and only if $f$ is nilpotent.

11. If $X_f$ and $X_g$ are principal open sets in $X$ = Spec $R$, prove that the open set $X_f \cup X_g$ is
the complement of the closed set $\mathcal{Z}(I)$ where $I = (f, g)$ is the ideal in $R$ generated by $f$
and $g$.

12. Prove that a Zariski open subset $U$ of $X$ = Spec $R$ is quasicompact if and only if $U$ is
a finite union of principal open subsets. Give an example of a ring $R$, a Zariski open
subset $U$ of Spec $R$, and a Zariski open covering of $U$ that cannot be reduced to a finite
subcovering.

13. Let $\varphi : R \to S$ be a homomorphism of rings. Prove that under the induced map $\varphi^*$ from
$Y$ = Spec $S$ to $X$ = Spec $R$ the full preimage of the principal open set $X_f$ in $X$ is the
principal open set $Y_{\varphi(f)}$ in $Y$.

14. Suppose that $R = R_1 \times R_2$ is the direct product of the rings $R_1$ and $R_2$. Prove that
$X$ = Spec $R$ is the disjoint union of open subspaces $X_1$, $X_2$ (which are therefore also
closed), where $X_1$ is homeomorphic to Spec $R_1$ and $X_2$ is homeomorphic to Spec $R_2$.

15. Prove that $X$ = Spec $R$ is not connected if and only if $R$ is the direct product of two
nonzero rings if and only if $R$ contains an idempotent $e$ with $e \neq 0, 1$ (cf. the previous
exercise).

16. Prove that $X$ = Spec $R$ is irreducible (i.e., any two nonempty open subsets have a nontrivial
intersection) if and only if $X_f \cap X_g \neq \emptyset$ for any two nonempty principal open sets $X_f$ and
$X_g$. Deduce that $X$ = Spec $R$ is irreducible if and only if the nilradical of $R$ is a prime
ideal. [Use Exercise 10.]

17. Let $G = \langle \sigma \rangle$ be a group of order 2, let $R = \mathbb{Z}[G] = \{a + b\sigma \mid a, b \in \mathbb{Z}\}$ be the
corresponding group ring, and let $X$ = Spec $R$.

(a) Prove that the nilradical of $R$ is $(0)$ but is not a prime ideal. Prove that $X = X^+ \cup X^-$
where $X^+ = \mathcal{Z}(1 - \sigma)$ and $X^- = \mathcal{Z}(1 + \sigma)$. [Use $(1 + \sigma)(1 - \sigma) = 0$.]

(b) Prove that the homomorphism $\mathbb{Z}[G] \to \mathbb{Z}$ defined by mapping $\sigma$ to 1 induces a
homeomorphism of $X^+$ with Spec $\mathbb{Z}$, and the homomorphism mapping $\sigma$ to $-1$ induces a
homeomorphism of $X^-$ with Spec $\mathbb{Z}$.

(c) Prove that $X^+ \cap X^-$ consists of the single element $\mathfrak{m} = (1 + \sigma, 1 - \sigma) = (2, 1 - \sigma)$
and that this is a closed point in $X$.

(d) Show that $(1 - \sigma)$ and $(1 + \sigma)$ are the unique non-closed points in $X$, with closures
$X^+$ and $X^-$, respectively. Describe the closed points, mSpec $R$, in $X$ and prove that
Spec $\mathbb{Z}[\langle \sigma \rangle]$ can be pictured as follows:

[Figure: Spec $\mathbb{Z}[\langle \sigma \rangle]$ lying over Spec $\mathbb{Z}$ via the map $\varphi^*$.]


18. Let $\mathcal{O}$ be the structure sheaf on $X$ = Spec $R$, let $U$ be an open set in $X$, and suppose
$s, t \in \mathcal{O}(U)$. If $s = a/f_1^n$ on $X_{f_1}$ and $t = b/f_2^m$ on $X_{f_2}$, show that

$st = (ab f_1^m f_2^n)/(f_1 f_2)^{n+m}$ and $s + t = (a f_1^m f_2^{n+m} + b f_1^{n+m} f_2^n)/(f_1 f_2)^{n+m}$

on $X_{f_1 f_2}$. Deduce that $\mathcal{O}(U)$ is a commutative ring with identity.

19. Let $\mathcal{O}$ be the structure sheaf on $X$ = Spec $R$, let $V \subseteq U$ be open sets in $X$ and let
$s \in \mathcal{O}(U)$. Suppose $P \in V$ and that $s = a/f^n$ on $X_f \subseteq U$.

(a) Show that there is a principal open set $X_{f'} \subseteq V \cap X_f$ containing $P$.

(b) Show that $(f')^m = bf$ for some $b \in R$.

(c) Show that $s = (ab^n)/(f')^{mn}$ on $X_{f'}$ and conclude that restricting $s$ to $V$ gives a well
defined ring homomorphism from $\mathcal{O}(U)$ to $\mathcal{O}(V)$.

20. Let $\varphi : R \to S$ be a homomorphism of rings, let $X$ = Spec $R$, $Y$ = Spec $S$, and let
$V \subseteq U$ be Zariski open subsets of $X$. Set $U' = (\varphi^*)^{-1}(U)$ and $V' = (\varphi^*)^{-1}(V)$, the
corresponding Zariski open subsets of $Y$ with respect to the continuous map $\varphi^* : Y \to X$
induced by $\varphi$. Prove that the induced map $\varphi^\# : \mathcal{O}_X(U) \to \mathcal{O}_Y(U')$ on sections is a ring
homomorphism. Prove that $V' \subseteq U'$ and that $\varphi^\#$ is compatible with restriction, i.e., that
the diagram

\[
\begin{array}{ccc}
\mathcal{O}_X(U) & \xrightarrow{\ \varphi^{\#}\ } & \mathcal{O}_Y(U') \\
\downarrow & & \downarrow \\
\mathcal{O}_X(V) & \xrightarrow{\ \varphi^{\#}\ } & \mathcal{O}_Y(V')
\end{array}
\]

is commutative, where the vertical maps are the restriction homomorphisms.

21. Suppose $D$ is a multiplicatively closed subset of $R$. Show that the localization
homomorphism $R \to D^{-1}R$ induces a homeomorphism from Spec$(D^{-1}R)$ to the collection of
prime ideals $P$ of $R$ with $P \cap D = \emptyset$.

22. Show that Spec $k[x, y]/(xy)$ is connected but is the union of two proper closed subsets
each homeomorphic to Spec $k[x]$, hence is not irreducible (cf. Exercise 16).

23. For each of the following rings R exhibit the elements of Spec R, the open sets U in Spec R, 
the sections 0(U) of the structure sheaf for Spec R for each open U , and the stalks Op at 
each point P e Spec R: 

(a) Z/4Z (b) Z/6Z (c) Z/2Z x Z/3Z (d) Z/2Z x Z/2Z x Z/2Z. 

24. (a) If every ideal of $R$ is principal, show every open set in Spec $R$ is a principal open set.

(b) Show that if $R = \mathbb{Z}[x]/(4, x^2)$ then $R$ contains a nonprincipal ideal, but every open
set in Spec $R$ is a principal open set.

25. (a) If $M$ is an $R$-module prove that Supp$(M)$ is a Zariski closed subset of Spec $R$. [Use
Exercise 33 of Section 4.]

(b) If $M$ is a finitely generated $R$-module prove that Supp$(M) = \mathcal{Z}(\mathrm{Ann}(M)) \subseteq$ Spec $R$.
[Use Exercise 34 of Section 4.]

26. Suppose $M$ is a finitely generated module over the Noetherian ring $R$.

(a) Prove that there are finitely many minimal primes $P_1, \ldots, P_n$ containing Ann$(M)$.
[Use Corollary 22.]

(b) Prove that $\{P_1, \ldots, P_n\}$ is also the set of minimal primes in $\mathrm{Ass}_R(M)$ and that Supp$(M)$
is the union of the Zariski closed sets $\mathcal{Z}(P_1), \ldots, \mathcal{Z}(P_n)$ in Spec $R$. [Use the previous
exercise and Exercise 40 in Section 4.]

The previous exercise gives a geometric view of a finitely generated module $M$ over a Noetherian
ring $R$: over each point $P$ in Spec $R$ is the localization $M_P$ (the stalk over $P$). The stalk is
nonzero precisely over the points in the Zariski closed subsets $\mathcal{Z}(P_1), \ldots, \mathcal{Z}(P_n)$ where the $P_i$
are the minimal primes in $\mathrm{Ass}_R(M)$. These ideas lead to the notion of the (coherent) module
sheaf on Spec $R$ associated to $M$ (with a picture similar to that of the structure sheaf following
Proposition 58), which is a powerful tool in modern algebraic geometry.

27. Let $R = k[x, y]$ and let $M$ be the ideal $(x, y)$ in $R$. Prove that Supp$(M)$ = Spec $R$ and
$\mathrm{Ass}_R(M) = \{0\}$.

The next two exercises show that the associated primes for an ideal $I$ in a Noetherian ring $R$ in
the sense of primary decomposition are the associated primes for $I$ in the sense of $\mathrm{Ass}_R(R/I)$.

28. This exercise proves that the ideal $Q$ in a Noetherian ring $R$ is $P$-primary if and only if
$\mathrm{Ass}_R(R/Q) = \{P\}$.

(a) Suppose $Q$ is a $P$-primary ideal and let $M$ be the $R$-module $R/Q$. If $0 \neq m \in M$,
show that $Q \subseteq \mathrm{Ann}(m) \subseteq P$ and that rad Ann$(m) = P$. Deduce that if Ann$(m)$ is a
prime ideal then it is equal to $P$ and hence that $\mathrm{Ass}_R(R/Q) = \{P\}$. [Use Exercise 33
in Section 1.]



(b) For any ideal $Q$ of $R$, let $0 \neq M \subseteq R/Q$. Prove that the radical of Ann$(M)$ is the
intersection of the prime ideals in Supp$(M)$. [Use Proposition 12 and Exercise 25.]

(c) For $M$ as in (b), prove that the radical of Ann $M$ is also the intersection of the prime
ideals in $\mathrm{Ass}_R(M)$. [Use Exercise 26(b).]

(d) If $Q$ is an ideal of $R$ with $\mathrm{Ass}_R(R/Q) = \{P\}$ prove that rad $Q = P$. [Use the fact
that $Q = \mathrm{Ann}(R/Q)$ and (c).]

(e) If $Q$ is an ideal of $R$ with $\mathrm{Ass}_R(R/Q) = \{P\}$ prove that $Q$ is $P$-primary. [If $ab \in Q$
with $a \notin Q$ consider $0 \neq M = (Ra + Q)/Q \subseteq R/Q$ and show that $b$ is contained
in Ann $M \subseteq$ rad Ann$(M)$. Use Exercises 33-34 in Section 1 to show that
$\mathrm{Ass}_R(M) = \{P\}$, then use (c) to show that rad Ann$(M) = P$, and conclude finally
that $b \in P$.]

29. Suppose $I = Q_1 \cap \cdots \cap Q_n$ is a minimal primary decomposition of the ideal $I$ in
the Noetherian ring $R$ with $P_i = \operatorname{rad} Q_i$, $i = 1, \ldots, n$. This exercise proves that
$\mathrm{Ass}_R(R/I) = \{P_1, \ldots, P_n\}$.

(a) Prove that the natural projection homomorphisms induce an injection of $R/I$ into
$R/Q_1 \oplus \cdots \oplus R/Q_n$ and deduce that $\mathrm{Ass}_R(R/I) \subseteq \{P_1, \ldots, P_n\}$. [Use Exercise 34
in Section 1 and the previous exercise.]

(b) Let $Q'_i = \bigcap_{j \neq i} Q_j$. Show that the minimality of the decomposition implies that
$0 \neq Q'_i/I \cong (Q'_i + Q_i)/Q_i \subseteq R/Q_i$. Deduce that $\mathrm{Ass}_R(Q'_i/I) = \{P_i\}$. [Use
Exercises 33-34 in Section 1 and the previous exercise.] Deduce that $P_i \in \mathrm{Ass}_R(R/I)$,
so that $\mathrm{Ass}_R(R/I) = \{P_1, \ldots, P_n\}$. [Use $Q'_i/I \subseteq R/I$ and Exercise 34 in Section 1.]

30. Let $I$ be the ideal $(x^2, xy, xz, yz)$ in $R = k[x, y, z]$. Prove that $\mathrm{Ass}_R(R/I)$ consists of the
primes $\{(x, y), (x, z), (x, y, z)\}$.

31. (Spec for Quadratic Integer Rings) Let $R$ be the ring of integers in the quadratic field
$K = \mathbb{Q}(\sqrt{D})$ where $D$ is a squarefree integer and let $P$ be a nonzero prime ideal in $R$.
This exercise shows how the prime ideals in $R$ are determined explicitly from the primes
$(p)$ in $\mathbb{Z}$, giving in particular a description of Spec $R$ fibered over Spec $\mathbb{Z}$.

As in the discussion and example following Theorem 29, we have $R = \mathbb{Z}[\omega]$ where
$\omega = \sqrt{D}$ if $D \equiv 2, 3 \bmod 4$ (respectively, $\omega = (1 + \sqrt{D})/2$ if $D \equiv 1 \bmod 4$), with minimal
polynomial $m_\omega(x) = x^2 - D$ (respectively, $m_\omega(x) = x^2 - x + (1 - D)/4$), and $P \cap \mathbb{Z} = p\mathbb{Z}$
is a nonzero prime ideal of $\mathbb{Z}$.

(a) For any prime $p$ in $\mathbb{Z}$ show that $R/pR \cong \mathbb{Z}[x]/(p, m_\omega(x)) \cong \mathbb{F}_p[x]/(\overline{m}_\omega(x))$ as rings,
where $\overline{m}_\omega(x)$ is the reduction of $m_\omega(x)$ modulo $p$. Deduce that there is a prime ideal
$P$ in $R$ with $P \cap \mathbb{Z} = (p)$ (this gives an alternate proof of Theorem 26(2) in this case).

(b) Use the isomorphism in (a) to prove that $P$ is determined explicitly by the factorization
of $m_\omega(x)$ modulo $p$:

(i) If $m_\omega(x) \equiv (x - a)^2 \bmod p$ where $a \in \mathbb{Z}$ then $P = (p, \omega - a)$ and $pR = P^2$.
Show that this case occurs only for the finitely many primes $p$ dividing the
discriminant of $m_\omega(x)$.

(ii) If $m_\omega(x) \equiv (x - a)(x - b) \bmod p$ with integers $a, b \in \mathbb{Z}$ that are distinct modulo
$p$ then $P$ is either $P_1 = (p, \omega - a)$ or $P_2 = (p, \omega - b)$ and $P_1$, $P_2$ are distinct
prime ideals in $R$ with $pR = P_1 P_2$.

(iii) If $\overline{m}_\omega(x)$ is irreducible modulo $p$ then $P = pR$.

(c) Show that the picture for Spec $R$ over Spec $\mathbb{Z}$ for any $D$ is similar to that for the case
$R = \mathbb{Z}[i]$ when $D = -1$: there is precisely one nonclosed point $(0) \in$ Spec $R$ over
$(0) \in$ Spec $\mathbb{Z}$, precisely one closed point $P \in$ Spec $R$ over each of the primes $(p)$ in
Spec $\mathbb{Z}$ in (i) (called ramified primes) and over the primes in (iii) (called inert primes),
and precisely two closed points over the primes in (ii) (called split primes).
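
The trichotomy in Exercise 31 (ramified, split, inert) can be explored numerically by counting roots of the minimal polynomial modulo $p$. The following is a minimal sketch under stated assumptions, not a solution from the text: it assumes $D$ is a squarefree integer different from 1 and that $p$ is prime, and the function names `min_poly` and `classify` are hypothetical.

```python
def min_poly(D):
    """Coefficients (b, c) of the minimal polynomial x^2 + b*x + c of omega,
    following the two cases D = 1 mod 4 and D = 2, 3 mod 4."""
    if D % 4 == 1:
        return (-1, (1 - D) // 4)
    return (0, -D)


def classify(D, p):
    """Classify the prime p in the ring of integers of Q(sqrt(D)) by the number
    of distinct roots of the minimal polynomial modulo p.
    Assumes D is squarefree, D != 1, and p is prime."""
    b, c = min_poly(D)
    roots = [x for x in range(p) if (x * x + b * x + c) % p == 0]
    # A monic quadratic over F_p with a root splits into linear factors, so
    # one distinct root means a repeated root, and no root means irreducible.
    return {0: "inert", 1: "ramified", 2: "split"}[len(roots)]


if __name__ == "__main__":
    for p in (2, 3, 5, 7, 13):
        print(p, classify(-1, p))  # behaviour of p in Z[i]
```

For $D = -1$ this recovers the familiar picture for $\mathbb{Z}[i]$: the prime 2 is ramified, primes congruent to 1 mod 4 split, and primes congruent to 3 mod 4 are inert.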


CHAPTER 16


Artinian Rings, 
Discrete Valuation Rings, 
and Dedekind Domains 


Throughout this chapter $R$ will denote a commutative ring with $1 \neq 0$.

16.1 ARTINIAN RINGS 

In this section we shall study the basic theory of commutative rings that satisfy the 
descending chain condition (D.C.C.) on ideals, the Artinian rings (named after E. Artin). 
While one might at first expect that these rings have properties analogous to those for 
the commutative rings satisfying the ascending chain condition (the Noetherian rings), 
in fact this is not the case. The structure of Artinian rings is very restricted; for example 
an Artinian ring is necessarily also Noetherian (Theorem 3). Noncommutative Artinian 
rings play a central role in Representation Theory (cf. Chapters 18 and 19). 

Definition. For any commutative ring $R$ the Krull dimension (or simply the dimension)
of $R$ is the maximum possible length of a chain $P_0 \subset P_1 \subset P_2 \subset \cdots \subset P_n$ of distinct
prime ideals in $R$. The dimension of $R$ is said to be infinite if $R$ has arbitrarily long
chains of distinct prime ideals.

A ring with finite dimension must satisfy both the ascending and descending chain
conditions on prime ideals (although not necessarily on all ideals). A field has dimension
0 and a Principal Ideal Domain that is not a field has dimension 1.

We shall see shortly that rings with D.C.C. on ideals always have dimension 0
(i.e., primes are maximal). If $R$ is an integral domain that is also a finitely generated
$k$-algebra over a field $k$, then the dimension of $R$ is equal to the transcendence degree
over $k$ of the field of fractions of $R$ (cf. Exercise 11). In particular, the Krull dimension
agrees with the definition introduced earlier for the dimension of an affine variety. The
advantage of the definition above is that it does not refer to any $k$-algebra structure and
applies to arbitrary commutative rings $R$.

Definition. The Jacobson radical of $R$ is the intersection of all maximal ideals of $R$
and is denoted by Jac $R$.

The Jacobson radical is analogous to the Frattini subgroup of a group, and it enjoys
some corresponding properties (cf. Exercise 24 in Section 6.1):


Proposition 1. Let $\mathcal{J}$ be the Jacobson radical of the commutative ring $R$.

(1) If $I$ is a proper ideal of $R$, then so is $(I, \mathcal{J})$, the ideal generated by $I$ and $\mathcal{J}$.

(2) The Jacobson radical contains the nilradical of $R$: rad $0 \subseteq$ Jac $R$.

(3) An element $x$ belongs to $\mathcal{J}$ if and only if $1 - rx$ is a unit for all $r \in R$.

(4) (Nakayama's Lemma) If $M$ is any finitely generated $R$-module and $\mathcal{J}M = M$,
then $M = 0$.


Proof: If $I$ is a proper ideal in $R$, then $I \subseteq \mathfrak{M}$ for some maximal ideal $\mathfrak{M}$. Since
$\mathcal{J} \subseteq \mathfrak{M}$, also $(I, \mathcal{J}) \subseteq \mathfrak{M}$, which proves (1).

Part (2) follows from the definitions of the two radicals and Proposition 12 in
Section 15.2 since maximal ideals are prime.

Suppose $1 - rx$ is not a unit and let $\mathfrak{M}$ be a maximal ideal containing $1 - rx$. Since
$1 \notin \mathfrak{M}$, $rx \notin \mathfrak{M}$, so $x$ cannot belong to $\mathcal{J}$ because $\mathcal{J} \subseteq \mathfrak{M}$. Conversely, suppose $x \notin \mathcal{J}$,
i.e., there is a maximal ideal $\mathfrak{M}$ with $x \notin \mathfrak{M}$. Then $R = (x, \mathfrak{M})$, hence $1 = rx + y$ for
some $r \in R$ and $y \in \mathfrak{M}$. Thus $1 - rx = y \in \mathfrak{M}$ and so $1 - rx$ is not a unit, which proves (3).

To prove (4), assume $M \neq 0$ and let $n$ be the smallest integer such that $M$ is
generated by $n$ elements, say $m_1, \ldots, m_n$. Since $M = \mathcal{J}M$ we have

$m_n = r_1 m_1 + r_2 m_2 + \cdots + r_n m_n$ for some $r_1, r_2, \ldots, r_n \in \mathcal{J}$.

Thus $(1 - r_n) m_n = r_1 m_1 + \cdots + r_{n-1} m_{n-1}$. By (3), $1 - r_n$ is a unit, so $m_n$ lies in the
module generated by $m_1, \ldots, m_{n-1}$, contradicting the minimality of $n$. Hence $M = 0$,
completing the proof.
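
Part (3) of Proposition 1 can be tested by brute force in the finite rings $\mathbb{Z}/n\mathbb{Z}$. The sketch below is an added illustration, not part of the original text; it assumes $n \ge 2$, and the helper names are hypothetical. It computes the Jacobson radical in two ways: through the unit criterion of part (3), and directly as the intersection of the maximal ideals $p\mathbb{Z}/n\mathbb{Z}$ for the primes $p$ dividing $n$.

```python
from math import gcd


def units(n):
    """Units of Z/nZ: the residues coprime to n."""
    return {u for u in range(n) if gcd(u, n) == 1}


def jacobson_by_units(n):
    """Elements x with 1 - r*x a unit for every r, as in Proposition 1(3)."""
    U = units(n)
    return {x for x in range(n) if all((1 - r * x) % n in U for r in range(n))}


def maximal_ideals(n):
    """Maximal ideals of Z/nZ (n >= 2): multiples of each prime divisor of n."""
    primes = [p for p in range(2, n + 1)
              if n % p == 0 and all(p % q for q in range(2, p))]
    return [set(range(0, n, p)) for p in primes]


def jacobson_by_intersection(n):
    """Intersection of all maximal ideals of Z/nZ (n >= 2)."""
    return set.intersection(*maximal_ideals(n))


if __name__ == "__main__":
    for n in (8, 12, 30):
        print(n, sorted(jacobson_by_units(n)), sorted(jacobson_by_intersection(n)))
```

For $n = 12$ both computations give the ideal generated by 6, which is also the set of nilpotent elements of $\mathbb{Z}/12\mathbb{Z}$.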


Definition. A commutative ring R is said to be Artinian or to satisfy the descending 
chain condition on ideals (or D.C.C. on ideals) if there is no infinite decreasing chain of 
ideals in R, i.e., whenever $I_1 \supseteq I_2 \supseteq I_3 \supseteq \cdots$ is a decreasing chain of ideals of R, then 
there is a positive integer m such that $I_k = I_m$ for all $k \ge m$. Similarly, an R-module 
M is said to be Artinian if it satisfies D.C.C. on submodules. 


It is immediate from the Lattice Isomorphism Theorem that every quotient R/I of 
an Artinian ring R by an ideal I is again an Artinian ring. 

The following result for Artinian rings is parallel to results in Theorem 15.2. The 
proof is completely analogous, and so is left as an exercise. 


Proposition 2. The following are equivalent: 

(1) R is an Artinian ring. 

(2) Every nonempty set of ideals of R contains a minimal element under inclusion. 


The next result gives the main structure theorem for Artinian rings. 


Theorem 3. Let R be an Artinian ring. 

(1) There are only finitely many maximal ideals in R. 

(2) The quotient R/(Jac R) is a direct product of a finite number of fields. More 
precisely, if $M_1, \dots, M_n$ are the finitely many maximal ideals in R then 

\[ R/(\operatorname{Jac} R) \cong k_1 \times \cdots \times k_n, \]

where $k_i$ is the field $R/M_i$ for $1 \le i \le n$. 

(3) Every prime ideal of R is maximal, i.e., R has Krull dimension 0. The Jacobson 
radical of R equals the nilradical of R and is a nilpotent ideal: $(\operatorname{Jac} R)^m = 0$ 
for some $m \ge 1$. 

(4) The ring R is isomorphic to the direct product of a finite number of Artinian 
local rings. 

(5) Every Artinian ring is Noetherian. 


Proof: To prove (1), let $\mathcal{S}$ be the set of all ideals of R that are the intersection of 
a finite number of maximal ideals. By Proposition 2, $\mathcal{S}$ has a minimal element, say 
$M_1 \cap M_2 \cap \cdots \cap M_n$. Then for any maximal ideal M we have 

\[ M \cap M_1 \cap M_2 \cap \cdots \cap M_n = M_1 \cap M_2 \cap \cdots \cap M_n, \]

so $M \supseteq M_1 \cap M_2 \cap \cdots \cap M_n$. By Exercise 11 in Section 7.4, $M \supseteq M_i$ for some i. 
Thus $M = M_i$ and so $M_1, \dots, M_n$ are all the maximal ideals of R. 

The proof of (2) is immediate from the Chinese Remainder Theorem (Section 7.6) 
applied to $M_1, \dots, M_n$, since these maximal ideals are clearly pairwise comaximal and 
their intersection is Jac R. 

For (3), we first prove J = Jac R is nilpotent. By D.C.C. there is some m > 0 
such that $J^m = J^{m+i}$ for all positive i. By way of contradiction assume $J^m \ne 0$. Let 
$\mathcal{S}$ be the set of proper ideals I such that $IJ^m \ne 0$, so $J \in \mathcal{S}$. Let $I_0$ be a minimal 
element of $\mathcal{S}$. There is some $x \in I_0$ such that $xJ^m \ne 0$, so by minimality we must 
have $I_0 = (x)$. But now $((x)J)J^m = xJ^{m+1} = xJ^m \ne 0$, so it follows by minimality of 
(x) that (x) = (x)J. By Nakayama’s Lemma above, (x) = 0, a contradiction. This 
proves Jac R is nilpotent. 

Since Jac R is nilpotent, in particular $\operatorname{Jac} R \subseteq \operatorname{rad} 0$, so these two ideals are equal 
by the second statement in Proposition 1. 

Every prime ideal P in R contains the nilradical of R, hence contains Jac R by 
what has already been proved. The image of P is a prime ideal in the quotient ring 
$R/(\operatorname{Jac} R) \cong k_1 \times \cdots \times k_n$. But in a direct product of rings $R_1 \times R_2$ (where each $R_i$ 
has a 1) every ideal is of the form $I_1 \times I_2$, where $I_j$ is an ideal of $R_j$ for j = 1, 2 (cf. 
Exercise 3 in Section 7.6). It follows that a prime ideal in $k_1 \times \cdots \times k_n$ consists of the 
elements that are 0 in one of the components. In particular, such a prime ideal is also a 
maximal ideal in $k_1 \times \cdots \times k_n$ and it follows that P was a maximal ideal in R, which 
finishes the proof of (3). 

Let $M_1, \dots, M_n$ be all the distinct maximal ideals of R and let $(\operatorname{Jac} R)^m = 0$ as in 
(3). Then 

\[ \prod_{i=1}^{n} M_i^m = \Bigl( \prod_{i=1}^{n} M_i \Bigr)^m \subseteq (\operatorname{Jac} R)^m = 0. \]

By the Chinese Remainder Theorem it follows that 

\[ R \cong (R/M_1^m) \times (R/M_2^m) \times \cdots \times (R/M_n^m), \]

and each $R/M_i^m$ is an Artinian ring with unique maximal ideal $M_i/M_i^m$, proving (4). 

To prove (5), it suffices by (4) to prove that an Artinian local ring is Noetherian, so 
assume R is Artinian with unique maximal ideal M. In this case we have M = Jac R, 
so $M^m = (\operatorname{Jac} R)^m = 0$ for some positive m. Then $R = R/M^m$, and in this case it is 
an exercise to see that $R/M^m$ is Noetherian if and only if it is Artinian (cf. Exercise 8). 

Corollary 4. The ring R is Artinian if and only if R is Noetherian and has Krull 
dimension 0. 


Proof: The forward implication was proved in Theorem 3. Suppose now that R is 
Noetherian and that R has Krull dimension 0, i.e., that prime ideals of R are maximal. 
Since R is Noetherian, by Corollary 22(3) in Section 15.2, the ideal $(0) = P_1 \cdots P_n$ 
is the product of (not necessarily distinct) prime ideals, and these prime ideals are 
then maximal since R has dimension 0. By the Chinese Remainder Theorem, R is 
isomorphic to the direct product of a finite number of Noetherian rings of the form 
$R/M^m$ where M is a maximal ideal in R. As in the proof of (5) of the theorem, $R/M^m$ 
is Artinian, and it follows that R is Artinian. 

Examples 

(1) Let n > 1 be an integer. Since the ring $R = \mathbb{Z}/n\mathbb{Z}$ is finite, it is Artinian. If 
$n = p_1^{a_1} \cdots p_s^{a_s}$ is the unique factorization of n into distinct prime powers, then 

\[ \mathbb{Z}/n\mathbb{Z} \cong (\mathbb{Z}/p_1^{a_1}\mathbb{Z}) \times (\mathbb{Z}/p_2^{a_2}\mathbb{Z}) \times \cdots \times (\mathbb{Z}/p_s^{a_s}\mathbb{Z}). \]

Each $\mathbb{Z}/p_i^{a_i}\mathbb{Z}$ is an Artinian local ring with unique maximal ideal $(p_i)/(p_i^{a_i})$, so this 
is the decomposition of $\mathbb{Z}/n\mathbb{Z}$ given by Theorem 3(4). The Jacobson radical of R 
is the ideal generated by $p_1 p_2 \cdots p_s$, the squarefree part of n, and $R/(\operatorname{Jac} R) \cong 
(\mathbb{Z}/p_1\mathbb{Z}) \times \cdots \times (\mathbb{Z}/p_s\mathbb{Z})$ is a direct product of fields. The ideals generated by $p_i$ for 
$i = 1, \dots, s$ are the maximal ideals of R. 
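
The decomposition in Example 1 can be checked numerically. The following is a minimal sketch with helper names invented for this illustration: it factors n, verifies that reduction modulo the prime powers is a bijection from $\mathbb{Z}/n\mathbb{Z}$ onto the product of the rings $\mathbb{Z}/p_i^{a_i}\mathbb{Z}$, and finds the smallest m with $(\operatorname{Jac} R)^m = 0$, using the fact that the m-th power of the principal ideal (r) is $(r^m)$.

```python
def prime_power_factors(n):
    """Return {p: a} with n equal to the product of the p**a."""
    factors, d = {}, 2
    while d * d <= n:
        while n % d == 0:
            factors[d] = factors.get(d, 0) + 1
            n //= d
        d += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def to_components(x, factors):
    """Image of x under Z/nZ -> product of the rings Z/p^a Z."""
    return tuple(x % p**a for p, a in factors.items())


def nilpotency_index(n):
    """Smallest m with (Jac R)^m = 0 in R = Z/nZ, where Jac R = (p_1 ... p_s)."""
    rad = 1
    for p in prime_power_factors(n):
        rad *= p
    m = 1
    while pow(rad, m, n) != 0:
        m += 1
    return m


n = 360  # 360 = 2^3 * 3^2 * 5
factors = prime_power_factors(n)
# n distinct images in a target with n elements: the map is a bijection.
assert len({to_components(x, factors) for x in range(n)}) == n
# Jac R = (30), and (30)^m = 0 in Z/360Z exactly when m >= 3, the largest exponent.
assert nilpotency_index(n) == max(factors.values())
```

The nilpotency index agreeing with the largest exponent $a_i$ is specific to this example; Theorem 3(3) only asserts that some such m exists.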

(2) For any field k, a k-algebra R that is finite dimensional as a vector space over k is 
Artinian because ideals in R are in particular k-subspaces of R, hence the length of 
any chain of ideals in R is bounded by $\dim_k R$. 

(3) Suppose f is a nonzero polynomial in k[x] where k is a field. Then the quotient ring 
R = k[x]/(f(x)) is Artinian by the previous example. The decomposition of R as a 
direct product of Artinian local rings is given by 

\[ k[x]/(f(x)) \cong k[x]/(f_1(x)^{a_1}) \times \cdots \times k[x]/(f_s(x)^{a_s}) \]

where $f(x) = f_1(x)^{a_1} \cdots f_s(x)^{a_s}$ is the factorization of f(x) into powers of distinct 
irreducibles in k[x] (cf. Proposition 16 in Section 9.5). The Jacobson radical of R is 
the ideal generated by the squarefree part of f(x) and the maximal ideals of R are the 
ideals generated by the irreducible factors $f_i(x)$ for $i = 1, \dots, s$, similar to Example 1. 
EXERCISES 


Let R be a commutative ring with 1 and let J be its Jacobson radical. 

1. Suppose R is an Artinian ring and I is an ideal in R. Prove that R/I is also Artinian. 

2. Show that every finite commutative ring with 1 is Artinian. 

3. Prove that an integral domain of Krull dimension 0 is a field. 

4. Prove that an Artinian integral domain is a field. 

5. Suppose I is a nilpotent ideal in R and M = IM for some R-module M. Prove that 
M = 0. 

6. Suppose that $0 \to M' \to M \to M'' \to 0$ is an exact sequence of R-modules. Prove 
that M is an Artinian R-module if and only if M' and M'' are Artinian R-modules. 

7. Suppose R = F is a field. Prove that an R-module M is Artinian if and only if it is 
Noetherian if and only if M is a finite dimensional vector space over F. 

8. Let M be a maximal ideal of the ring R and suppose that $M^n = 0$ for some $n \ge 1$. Prove 
that R is Noetherian if and only if R is Artinian. [Observe that each successive quotient 
$M^i/M^{i+1}$, $i = 0, \dots, n-1$, in the filtration $R \supseteq M \supseteq \cdots \supseteq M^{n-1} \supseteq M^n = 0$ is a module 
over the field F = R/M. Then use the previous two exercises and Exercise 6 of Section 
15.1.] 

9. Let M be a finitely generated R-module. Prove that if $x_1, \dots, x_n$ are elements of M whose 
images in M/JM generate M/JM, then they generate M. Deduce that if R is Noetherian 
and the images of $a_1, \dots, a_n$ in $J/J^2$ generate $J/J^2$, then $J = (a_1, \dots, a_n)$. [Let N 
be the submodule generated by $x_1, \dots, x_n$ and apply Nakayama's Lemma to the module 
A = M/N.] 

10. Let $R = \mathbb{Z}_{(2)}$ be the localization of $\mathbb{Z}$ at the prime ideal (2). Prove that Jac R = (2) is the 
ideal generated by 2. If $M = \mathbb{Q}$, prove that M/2M is a finitely generated R-module but 
that M is not finitely generated over R. Why doesn’t this contradict the previous exercise? 
[Note the hypotheses in Nakayama's Lemma.] 

11. Let V be an affine variety over a field k and let R = k[V] be its coordinate ring. Let 
$d_t(R)$ denote the transcendence degree of the field of fractions k(V) over k, and let $d_p(R)$ 
be the Krull dimension of R defined in terms of chains of prime ideals. This exercise 
shows $d_t(R) = d_p(R)$. By Noether’s Normalization Lemma there is a polynomial subring 
$R_1 = k[y_1, \dots, y_m]$ of R such that R is integral over $R_1$. 

(a) Show that $d_t(R_1) = d_t(R) = m$ and that $d_p(R_1) = d_p(R)$. Deduce that we may 
assume $R = R_1$. [Use the Going-up and Going-down Theorems (cf. Theorem 26, 
Section 15.3) to prove the second equality.] 

(b) When $R = R_1$ show that $d_p(R) \ge d_t(R)$ by exhibiting an explicit chain of prime 
ideals of length m. 

(c) When $R = R_1$ show that any nonzero prime ideal of R contains an element f such 
that R(f) is transcendental over R of transcendence degree 1. Use induction to show 
that $d_p(R) \le d_t(R)$, and deduce that $d_p(R) = d_t(R)$. 

12. Let R be a Noetherian local ring with maximal ideal M. 

(a) The quotient $M/M^2$ is a module (i.e., vector space) over the field R/M. Prove that 
$d = \dim_{R/M}(M/M^2)$ is finite. 

(b) Prove that M can be generated as an ideal in R by d elements and by no fewer. [Use 
Exercise 9.] 

(c) Let $R = k[x_1, \dots, x_n]_{(x_1, \dots, x_n)}$ be the localization of the polynomial ring $k[x_1, \dots, x_n]$ 
over the field k at the maximal ideal $(x_1, \dots, x_n)$, and let M be the maximal ideal in 
R. Prove that $\dim_{R/M}(M/M^2) = n = \dim R$. [Cf. the previous exercise.] 

It can be shown that $\dim_{R/M}(M/M^2) \ge \dim R$ for any Noetherian local ring R with maximal 
ideal M. A Noetherian local ring R is called a regular local ring if $\dim_{R/M}(M/M^2) = \dim R$. 
It is a fact that a regular local ring is necessarily an integral domain and is also integrally closed. 

13. If R is a Noetherian ring, prove that the Zariski topology on Spec R is discrete (i.e., every 
subset is Zariski open and also Zariski closed) if and only if R is Artinian. 

14. Suppose I is the ideal $(x_1, x_2^2, x_3^3, \dots)$ in the polynomial ring $k[x_1, x_2, x_3, \dots]$ where k is 
a field and let R be the quotient ring $k[x_1, x_2, x_3, \dots]/I$. Prove that the image of the ideal 
$(x_1, x_2, x_3, \dots)$ in R is the unique prime ideal in R but is not finitely generated. Deduce 
that R is a local ring of Krull dimension 0 but is not Artinian. 


16.2 DISCRETE VALUATION RINGS 

In the previous section we showed that the Artinian rings are the Noetherian rings having 
Krull dimension 0. We now consider the easiest Noetherian rings of dimension 1, the 
Discrete Valuation Rings first introduced in Section 8.1: 

Definition. 

(1) A discrete valuation on a field K is a function $v : K^\times \to \mathbb{Z}$ satisfying 

(i) v is surjective, 

(ii) $v(xy) = v(x) + v(y)$ for all $x, y \in K^\times$, 

(iii) $v(x + y) \ge \min\{v(x), v(y)\}$ for all $x, y \in K^\times$ with $x + y \ne 0$. 

The subring $\{x \in K^\times \mid v(x) \ge 0\} \cup \{0\}$ is called the valuation ring of v. 

(2) An integral domain R is called a Discrete Valuation Ring (D.V.R.) if R is the 
valuation ring of a discrete valuation v on the field of fractions of R. 

The valuation v is often extended to all of K by defining $v(0) = +\infty$, in which case 
(ii) and (iii) hold for all $a, b \in K$. 

Examples 

(1) The localization $\mathbb{Z}_{(p)}$ of $\mathbb{Z}$ at any nonzero prime ideal (p) is a D.V.R. with respect 
to the discrete valuation $v_p$ on $\mathbb{Q}$ defined as follows (cf. Exercise 27, Section 7.1). 
Every element $a/b \in \mathbb{Q}^\times$ can be written uniquely in the form $p^n (a_1/b_1)$ where $n \in \mathbb{Z}$, 
$a_1/b_1 \in \mathbb{Q}^\times$ and both $a_1$ and $b_1$ are relatively prime to p. Define 

\[ v_p\Bigl(\frac{a}{b}\Bigr) = v_p\Bigl(p^n \frac{a_1}{b_1}\Bigr) = n. \]

One easily checks that the axioms for a discrete valuation are satisfied. We call $v_p$ the p-adic 
valuation on $\mathbb{Q}$. The corresponding valuation ring is the set of rational numbers with 
$n \ge 0$ together with 0, i.e., the rational numbers a/b where b is not divisible by p, 
which is $\mathbb{Z}_{(p)}$. 
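
The p-adic valuation on $\mathbb{Q}$ is easy to compute, and axioms (ii) and (iii) can be spot-checked on a finite sample of rationals. The sketch below is illustrative only: the function name `vp` and the choice of sample are assumptions made for this example, and a finite check is of course not a proof. The last loop checks the description of the valuation ring: $v_p(x) \ge 0$ exactly when the denominator of x in lowest terms is not divisible by p.

```python
from fractions import Fraction
from itertools import product


def vp(x, p):
    """p-adic valuation of a nonzero rational number x."""
    x = Fraction(x)
    if x == 0:
        raise ValueError("v_p(0) is not defined here (often taken to be +infinity)")
    n, num, den = 0, x.numerator, x.denominator
    while num % p == 0:   # powers of p in the numerator count positively
        num //= p
        n += 1
    while den % p == 0:   # powers of p in the denominator count negatively
        den //= p
        n -= 1
    return n


p = 3
samples = [Fraction(a, b) for a, b in product(range(-6, 7), range(1, 7)) if a != 0]
for x, y in product(samples, repeat=2):
    assert vp(x * y, p) == vp(x, p) + vp(y, p)            # axiom (ii)
    if x + y != 0:
        assert vp(x + y, p) >= min(vp(x, p), vp(y, p))    # axiom (iii)

for x in samples:
    # Fraction stores x in lowest terms, so this tests membership in Z_(p).
    assert (vp(x, p) >= 0) == (x.denominator % p != 0)
```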

(2) For any field F, let f be an irreducible polynomial in F[x]. Every nonzero element in 
the field F(x) can be written uniquely in the form $f^n (a/b)$ where $n \in \mathbb{Z}$, $a/b \in F(x)^\times$ 
and both a and b are relatively prime to f. Then 

\[ v_f\Bigl(f^n \frac{a}{b}\Bigr) = n \]

defines a valuation on F(x) and the corresponding valuation ring is the localization 
$F[x]_{(f)}$ of F[x] at f consisting of the rational functions in F(x) whose denominator is 
not divisible by f. When $f = x - a$ is a polynomial of degree 1 in F[x], the valuation 
$v_f$ gives the order of the zero (if n > 0) or pole (if n < 0) of the element in F(x) at 
x = a. 


(3) The ring of formal Laurent series F((x)) with coefficients in the field F has a discrete 
valuation v defined by 

\[ v\Bigl( \sum_{n \ge N} a_n x^n \Bigr) = N \quad \text{where } a_N \ne 0 \]

(cf. Exercise 5, Section 7.2). The corresponding D.V.R. is the ring F[[x]] of power 
series in x with coefficients in F. 


Note that v(1) = v(1) + v(1) implies that v(1) = 0, so every Discrete Valuation 
Ring R is a ring with identity $1 \ne 0$. Since R is a subring of a field by definition, R is 
in particular an integral domain. It is easy to see that a D.V.R. is a Euclidean Domain 
(cf. Example 4 in Section 8.1), so in particular is also a P.I.D. and a U.F.D. In fact 
the factorization and ideal structure of a D.V.R. is very simple, as the next proposition 
shows. 


Proposition 5. Suppose R is a Discrete Valuation Ring with respect to the valuation v, 
and let t be any element of R with v(t) = 1. Then 

(1) A nonzero element $u \in R$ is a unit if and only if v(u) = 0. 

(2) Every nonzero element $r \in R$ can be written in the form $r = ut^n$ for some unit 
$u \in R$ and some $n \ge 0$. Every nonzero element x in the field of fractions of R 
can be written in the form $x = ut^n$ for some unit $u \in R$ and some $n \in \mathbb{Z}$. 

(3) Every nonzero ideal of R is a principal ideal of the form $(t^n)$ for some $n \ge 0$. 
In particular, R is a Noetherian ring. 

Proof: If u is a unit, then uw = 1 for some $w \in R$ and then $v(u) + v(w) = v(uw) = v(1) = 0$ 
with $v(u) \ge 0$ and $v(w) \ge 0$ shows that v(u) = 0. Conversely, if u is nonzero and 
v(u) = 0 then $u^{-1} \in K$ satisfies $v(u^{-1}) + v(u) = v(1) = 0$. Hence $v(u^{-1}) = 0$ and 
$u^{-1} \in R$, so u is a unit. This proves (1). 

For (2), note that if v(x) = n then $v(xt^{-n}) = 0$, so $xt^{-n} = u$ is a unit in R by (1). 
Hence $x = ut^n$, where $x \in R$ if and only if $n = v(x) \ge 0$. 

If I is a nonzero ideal in R, let $r \in I$ be an element with v(r) minimal. If v(r) = n, 
then r differs from $t^n$ by a unit by (2), so $t^n \in I$ and $(t^n) \subseteq I$. If now a is any nonzero 
element of I, then $v(a) \ge n$ by choice of n. Then $v(at^{-n}) \ge 0$ and so $at^{-n} \in R$, 
which shows that $a \in (t^n)$. Hence $I = (t^n)$, proving the first statement in (3). It is then 
clear that ascending chains of ideals in R are finite, proving that R is Noetherian and 
completing the proof. 

Definition. If R is a D.V.R. with valuation v, then an element t of R with v(t) = 1 is 
called a uniformizing (or local) parameter for R. 
Corollary 6. Let R be a Discrete Valuation Ring. 

(1) The ring R is an integrally closed local ring with unique maximal ideal given by 
the elements with strictly positive valuation: $M = \{r \in R \mid v(r) > 0\}$. Every 
nonzero ideal in R is of the form $M^n$ for some integer $n \ge 0$. 

(2) The only prime ideals of R are M and 0, i.e., Spec R = {0, M}. In particular, 
a D.V.R. has Krull dimension 1. 

Proof: Any U.F.D. is integrally closed in its fraction field (Example 3 in Section 
15.3), so R is integrally closed. The remainder of the statements follow immediately 
from the description of the ideals of R in Proposition 5. 

The definition of a Discrete Valuation Ring is extremely explicit in terms of a 
valuation on the fraction field, and as a result it appears that it might be difficult to 
recognize whether a given ring R is a D.V.R. from purely “internal” algebraic properties 
of R. In fact, the ring-theoretic properties in Proposition 5 and Corollary 6 characterize 
Discrete Valuation Rings. The following theorem gives several alternate algebraic 
descriptions of Discrete Valuation Rings in which there is no explicit mention of the 
valuation. 

Theorem 7. The following properties of a ring R are equivalent: 

(1) R is a Discrete Valuation Ring, 

(2) R is a P.I.D. with a unique maximal ideal $P \ne 0$, 

(3) R is a U.F.D. with a unique (up to associates) irreducible element t, 

(4) R is a Noetherian integral domain that is also a local ring whose unique maximal 
ideal is nonzero and principal, 

(5) R is a Noetherian, integrally closed, integral domain that is also a local ring of 
Krull dimension 1, i.e., R has a unique nonzero prime ideal: Spec R = {0, M}. 

Proof: That (1) implies each of the other properties was proved above. 

If (2) holds then (3) is immediate since irreducible elements generate prime ideals 
in a U.F.D. (Proposition 12, Section 8.3). 

If (3) holds, then every nonzero element in R can be written uniquely in the form 
$ut^n$ for some unit u and some $n \ge 0$. Then every nonzero element in the fraction field 
of R can be written uniquely in the form $ut^n$ for some unit u and some $n \in \mathbb{Z}$. It is now 
straightforward to check that the map $v(ut^n) = n$ is a discrete valuation on the field of 
fractions of R, and R is the valuation ring of v, and (1) holds. 

Suppose (4) holds, let M = (t) be the unique maximal ideal of R, and let $M_0 = 
\bigcap_{i=1}^{\infty} M^i$. Then $M_0 = MM_0$, and since R is Noetherian $M_0$ is finitely generated. By 
hypothesis M = Jac R, so by Nakayama’s Lemma $M_0 = 0$. If I is any proper, nonzero 
ideal of R then there is some n > 0 such that $I \subseteq M^n$ but $I \not\subseteq M^{n+1}$. Let $a \in I - M^{n+1}$ 
and write $a = t^n u$ for some $u \in R$. Then $u \notin M$, and so u is a unit in the local ring 
R. Thus $(a) = (t^n) = M^n$ for every $a \in I - M^{n+1}$. This shows that $I = (t^n)$, and so 
every ideal of R is principal, which shows that (2) holds. 

We have shown that (1), (2), (3) and (4) are equivalent, and that each of these 
implies (5). To complete the proof we show that (5) implies (4), which amounts to 
showing that the ideal M in (5) is a principal ideal. Since $0 \ne M = \operatorname{Jac} R$ and M is 
finitely generated because R is Noetherian, by Nakayama’s Lemma (Proposition 1(4)), 
$M \ne M^2$. Let $t \in M - M^2$. We argue that M = (t). By Proposition 12 in Section 15.2, 
the assumption that M is the unique nonzero prime ideal in R implies that M = rad (t), 
and then Proposition 14 in Section 15.2 implies that some power of M is contained 
in (t). Proceeding by way of contradiction, assume $(t) \ne M$, so that $M^n \subseteq (t)$ but 
$M^{n-1} \not\subseteq (t)$ for some $n \ge 2$. Then there is an element $x \in M^{n-1} - (t)$ such that 
$xM \subseteq (t)$. Note that $t \ne 0$ so y = x/t belongs to the field of fractions of R. Also, 
$y \notin R$ because $x = ty \notin (t)$. However, by choice of x we have $yM \subseteq R$, and then 
one checks that yM is an ideal in R. If yM = R then 1 = ym for some $m \in M$. This 
leads to a contradiction because we would then have $t = xm \in M^2$, contrary to the 
choice of t. Thus yM is a proper ideal, hence is contained in the unique maximal ideal 
of R, namely $yM \subseteq M$. Now M is a finitely generated R-module on which y acts by 
left multiplication as an R-module homomorphism. By the same (determinant) method 
as in the proof of Proposition 23 in Section 15.3 there is a monic polynomial p with 
coefficients in R such that p(y)m = 0 for all $m \in M$. Since p(y) is an element of a 
field containing R and M, we must have p(y) = 0. Hence y is integral over R. Since 
R is integrally closed by assumption, it follows that $y \in R$, a contradiction. Hence 
M = (t) is principal, so (5) implies (4), completing the proof of the theorem. 

Corollary 8. If R is any Noetherian, integrally closed, integral domain and P is a 
minimal nonzero prime ideal of R, then the localization $R_P$ of R at P is a Discrete 
Valuation Ring. 

Proof: By results in Section 15.4, the localization $R_P$ is a Noetherian (Proposition 
38(4)), integrally closed (Proposition 49), integral domain (Proposition 46(2)), that is 
a local ring with unique nonzero prime ideal (Proposition 46(4)), so $R_P$ satisfies (5) in 
the theorem. 

Examples 

(1) If R is any Principal Ideal Domain then every localization $R_P$ of R at a nonzero prime 
ideal P = (p) is a Discrete Valuation Ring. This follows immediately from Corollary 
8 since R is integrally closed (being a U.F.D., cf. Example 3 in Section 15.3) and 
nonzero prime ideals in a P.I.D. are maximal (Proposition 8.7). Note that the quotient 
field K of $R_P$ is the same as the quotient field of R, so each nonzero prime p in R 
produces a valuation $v_p$ on K, given by the formula 

\[ v_p\Bigl(p^n \frac{a}{b}\Bigr) = n \]

where a and b are elements of R not divisible by p. This generalizes both Examples 
1 and 2 above. 

(2) The ring $\mathbb{Z}_p$ of p-adic integers is a Discrete Valuation Ring since it is a P.I.D. with 
unique maximal ideal $p\mathbb{Z}_p$ (cf. Exercise 11, Section 7.6). The fraction field of $\mathbb{Z}_p$ is 
called the field of p-adic numbers and is denoted $\mathbb{Q}_p$. The element p is a uniformizing 
parameter for $\mathbb{Z}_p$, so every nonzero element in $\mathbb{Q}_p$ can be written uniquely in the form 
$p^n u$ for some $n \in \mathbb{Z}$ and unit $u \in \mathbb{Z}_p^\times$ (where $u = a_0 + a_1 p + a_2 p^2 + \cdots$ with 
$0 < a_0 < p$ as in Exercise 11(c), Section 7.6). The corresponding p-adic valuation 
$v_p$ on $\mathbb{Q}_p$ is then given by $v_p(p^n u) = n$. 
A discrete valuation v on a field K defines an associated metric (or “distance 
function”), $d_v$, on K as follows: fix any real number $\beta > 1$ (the actual value of $\beta$ does 
not matter for verifying the axioms of a metric), and for all $a, b \in K$ define 

\[ d_v(a, b) = \|a - b\|_v \quad \text{where} \quad \|a\|_v = \beta^{-v(a)} \]

and where we set $d_v(a, a) = 0$. It is easy to check that $d_v$ satisfies the three axioms for 
a metric: 

(i) $d_v(a, b) \ge 0$, with equality holding if and only if a = b, 

(ii) $d_v(a, b) = d_v(b, a)$, i.e., $d_v$ is symmetric, 

(iii) $d_v(a, b) \le d_v(a, c) + d_v(c, b)$, for all $a, b, c \in K$, i.e., $d_v$ satisfies the “triangle 
inequality.” 

The triangle inequality is a consequence of axiom (iii) of the discrete valuation. Indeed, 
a stronger version of the triangle inequality holds: 

(iii)′ $d_v(a, b) \le \max\{d_v(a, c), d_v(c, b)\}$, for all $a, b, c \in K$. 

For this reason $d_v$ is sometimes called an ultrametric. One may now use Cauchy 
sequences to form the completion of K with respect to $d_v$, denoted by $K_v$, in the same 
way that the real numbers $\mathbb{R}$ are constructed from the rational numbers $\mathbb{Q}$. It is not 
difficult to show that $K_v$ is also a field with a discrete valuation that agrees with v on 
the dense subset K of $K_v$. 

Examples 

(1) Consider the p-adic valuation $v_p$ on $\mathbb{Q}$ and take $\beta = p$. Write $\|a\|_p$ for $\|a\|_{v_p}$, so 
that 

\[ \Bigl\| p^n \frac{a}{b} \Bigr\|_p = \frac{1}{p^n} \quad \text{for } a, b \text{ relatively prime to } p. \]

Note that integers (or rational numbers) have small p-adic absolute value if they are 
divisible by a large power of p. For example, the sequence $1, p, p^2, p^3, \dots$ converges 
to zero in the p-adic metric. 
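
A short computation makes the p-adic metric concrete. The sketch below uses the choice $\beta = p$ from this example; the names `vp`, `abs_p` and `dist_p` are hypothetical helpers introduced only for this illustration. It confirms that $\|p^n\|_p = p^{-n}$, so the powers of p shrink p-adically, and spot-checks the strong triangle inequality on a finite set of rationals (again a sanity check, not a proof).

```python
from fractions import Fraction
from itertools import product


def vp(x, p):
    """p-adic valuation of a nonzero rational number x."""
    x = Fraction(x)
    n, num, den = 0, x.numerator, x.denominator
    while num % p == 0:
        num //= p
        n += 1
    while den % p == 0:
        den //= p
        n -= 1
    return n


def abs_p(x, p):
    """p-adic absolute value with beta = p, extended by ||0||_p = 0."""
    x = Fraction(x)
    return Fraction(0) if x == 0 else Fraction(p) ** (-vp(x, p))


def dist_p(a, b, p):
    """p-adic distance d(a, b) = ||a - b||_p."""
    return abs_p(Fraction(a) - Fraction(b), p)


p = 5
# The powers of p have p-adic absolute value 1, 1/p, 1/p^2, ...
assert [abs_p(p**n, p) for n in range(4)] == [Fraction(1, p**n) for n in range(4)]

# Strong triangle inequality on a finite sample of rational points.
points = [Fraction(a, b) for a in range(-4, 5) for b in (1, 2, 5)]
for a, b, c in product(points, repeat=3):
    assert dist_p(a, b, p) <= max(dist_p(a, c, p), dist_p(c, b, p))
```

Exact `Fraction` arithmetic is used rather than floating point so that the comparisons in the inequality are not affected by rounding.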

It is not too difficult to see that the completion of $\mathbb{Q}$ with respect to the p-adic 
metric is the field $\mathbb{Q}_p$ of p-adic numbers, and the completion of $\mathbb{Z}$ is the ring $\mathbb{Z}_p$ of 
p-adic integers. One way to see this is to check that each element a of the completion 
may be represented as a p-adic Laurent series: 

\[ a = \sum_{i = n_0}^{\infty} a_i p^i \quad \text{where } n_0 \in \mathbb{Z} \text{ and } a_i \in \{0, 1, \dots, p - 1\} \text{ for all } i, \]

and then use Example 2 previously. In terms of this expansion, the p-adic valuation 
is given by $v_p(a) = n_0$ (when $a_{n_0} \ne 0$). 

(2) In a similar way, the completion of F(x) with respect to the valuation $v_x$ in Example 
2 at the beginning of this section gives the field F((x)) with corresponding valuation 
ring F[[x]] in Example 3 in the same set of examples. 

The completion of a field K with respect to a discrete valuation v is a field $K_v$ 
in which the elements can be easily described in terms of a uniformizing parameter. 
In addition, $K_v$ is a topological space where the topology is defined by the metric $d_v$. 
Furthermore, Cauchy sequences of elements in $K_v$ converge to elements of $K_v$ (i.e., $K_v$ 
is complete in the v-adic topology). This is similar to the situation of the completion 
$\mathbb{R}$ of $\mathbb{Q}$ with respect to the usual Euclidean metric. This allows the application of ideas 
from analysis to the study of such rings, and is an important tool in the study of algebraic 
number fields and in algebraic geometry. 

Fractional Ideals 

We complete our discussion of Discrete Valuation Rings by giving another characteri- 
zation of D.V.R.s in terms of “fractional ideals,” which can be defined for any integral 
domain: 

Definition. For any integral domain R with fraction field K, a fractional ideal of R 
is an R-submodule A of K such that $dA \subseteq R$ for some nonzero $d \in R$ (equivalently, a 
submodule of the form $d^{-1}I$ for some nonzero $d \in R$ and ideal I of R). 

The equivalence of these two definitions follows from the observation that dA is 
an R -submodule (i.e., an ideal) of R. 

The notion of a fractional ideal in K depends on the ring R. Loosely speaking, 
a fractional ideal is an ideal of R up to a fixed “denominator” d. The ideals of R are 
also fractional ideals of R (with denominator d = 1) and are the fractional ideals that 
are contained in R. For clarity these are occasionally called the integral ideals of R. 
When R is a Noetherian integral domain, a fractional ideal of R is the same as a finitely 
generated R-submodule of K (cf. Exercise 6). 

For any $x \in K$ the (cyclic) R-module $Rx = \{rx \mid r \in R\}$ is called the principal 
fractional ideal generated by x. 

If A and B are fractional ideals, their product, AB, is defined to be the set of all 
finite sums of elements of the form ab where $a \in A$ and $b \in B$. If $A = d^{-1}I$ and 
$B = (d')^{-1}J$ for ideals I, J in R and nonzero $d, d' \in R$, then $AB = (dd')^{-1}IJ$ where 
IJ is the usual product ideal. In particular, this shows that the product of two fractional 
ideals is a fractional ideal. 

Definition. The fractional ideal A is said to be invertible if there exists a fractional 
ideal B with AB = R, in which case B is called the inverse of A and denoted $A^{-1}$. 

If A is an invertible fractional ideal, the fractional ideal B with AB = R is unique: 
AB = AC = R implies B = B(AC) = (BA)C = C. 

Proposition 9. Let R be an integral domain and let A be a fractional ideal of R. 

(1) If A is a nonzero principal fractional ideal then A is invertible. 

(2) If A is nonzero then the set $A' = \{x \in K \mid xA \subseteq R\}$ is a fractional ideal of 
R. In general we have $AA' \subseteq R$ and $AA' = R$ if and only if A is invertible, in 
which case $A^{-1} = A'$. 

(3) If A is an invertible fractional ideal of R then A is finitely generated. 

(4) The set of invertible fractional ideals is an abelian group under multiplication 
with identity R. The set of nonzero principal fractional ideals is a subgroup of 
the invertible fractional ideals. 
Proof: If A = xR is a nonzero principal fractional ideal, then taking $B = x^{-1}R$ 
shows that A is invertible, proving (1). 

One easily sees that A' is an R-submodule of K. If A is a nonzero fractional 
ideal there is some nonzero element $d \in R$ such that $dA \subseteq R$, so A contains nonzero 
elements of R. Let a be any nonzero element of A contained in R. Then by definition 
of A' we have $aA' \subseteq R$, so A' is a fractional ideal. Also by definition, $AA' \subseteq R$. If 
AA' = R then A is invertible with inverse $A^{-1} = A'$. Conversely, if AB = R, then 
$B \subseteq A'$ by definition of A'. Then $R = AB \subseteq AA' \subseteq R$, showing that AA' = R, 
proving (2). 

If A is invertible, then AA' = R by (2) and so $1 = a_1 a_1' + \cdots + a_n a_n'$ for some 
$a_1, \dots, a_n \in A$ and $a_1', \dots, a_n' \in A'$. If $a \in A$, then $a = (aa_1')a_1 + \cdots + (aa_n')a_n$, where 
each $aa_i' \in R$ by definition of A'. It follows that A is generated over R by $a_1, \dots, a_n$ 
and so A is finitely generated, proving (3). 

Finally, it is clear that the product of two invertible fractional ideals is again invertible. 
This product is commutative, associative, and RA = A for any fractional ideal. 
The inverse of an invertible fractional ideal is an invertible fractional ideal by definition, 
proving the first statement in (4). The second statement in (4) is immediate since the 
product of xR and yR is (xy)R and the inverse of xR is $x^{-1}R$. 
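
For $R = \mathbb{Z}$ these constructions can be carried out explicitly, since every finitely generated $\mathbb{Z}$-submodule of $\mathbb{Q}$ is a principal fractional ideal $g\mathbb{Z}$. The sketch below is an illustration under that assumption, with hypothetical helper names: it finds the positive generator g of $\mathbb{Z}q_1 + \cdots + \mathbb{Z}q_k$ by clearing a common denominator and taking a gcd, and then checks that $A' = g^{-1}\mathbb{Z}$ satisfies $AA' = \mathbb{Z}$.

```python
from fractions import Fraction
from functools import reduce
from math import gcd


def lcm(a, b):
    """Least common multiple of two positive integers."""
    return a * b // gcd(a, b)


def generator(gens):
    """Positive generator g of Z*q_1 + ... + Z*q_k (the q_i not all zero)."""
    gens = [Fraction(q) for q in gens]
    d = reduce(lcm, (q.denominator for q in gens), 1)   # d * A is an ideal of Z
    numerators = [int(q * d) for q in gens]
    return Fraction(reduce(gcd, numerators, 0), d)


def ideal_product(gens_a, gens_b):
    """Generators a*b of the product AB of two fractional ideals."""
    return [Fraction(a) * Fraction(b) for a in gens_a for b in gens_b]


A = [Fraction(3, 4), Fraction(5, 6)]
g = generator(A)            # A = g Z; here g = 1/12
A_inverse = [1 / g]         # A' = (1/g) Z
assert generator(ideal_product(A, A_inverse)) == 1   # A A' = Z
```

Here d plays the role of the "denominator" in the definition of a fractional ideal: dA is an ordinary ideal of $\mathbb{Z}$.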

Definition. If R is an integral domain, then the quotient of the group of invertible 
fractional ideals of R by the subgroup of nonzero principal fractional ideals of R is 
called the class group of R. The order of the class group of R is called the class number 
of R. 


The class group of R is the trivial group and the class number of R is 1 if and only 
if every invertible fractional ideal of R is principal; in particular this holds when R is 
a P.I.D. The class group of R measures how close the invertible ideals of R are to being 
principal. 
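
A standard example of a nontrivial class, not taken from the text above, is the ideal $I = (2, 1 + \sqrt{-5})$ in the ring $R = \mathbb{Z}[\sqrt{-5}]$. The computation below sketches why I is invertible but not principal; the norm $N(a + b\sqrt{-5}) = a^2 + 5b^2$ is an auxiliary tool assumed for this illustration.

```latex
\begin{align*}
I^2 &= \bigl(4,\ 2(1+\sqrt{-5}),\ (1+\sqrt{-5})^2\bigr)
     = \bigl(4,\ 2+2\sqrt{-5},\ -4+2\sqrt{-5}\bigr) \subseteq (2), \\
2 &= (2+2\sqrt{-5}) - (-4+2\sqrt{-5}) - 4 \in I^2
   \quad\Longrightarrow\quad I^2 = (2)
   \quad\Longrightarrow\quad I \cdot \tfrac{1}{2} I = R .
\end{align*}
% Non-principality: if I = (\alpha), then N(\alpha) divides N(2) = 4 and
% N(1+\sqrt{-5}) = 6, so N(\alpha) is 1 or 2.
\begin{align*}
N(\alpha) = 2 &: \quad a^2 + 5b^2 = 2 \text{ has no integer solutions;} \\
N(\alpha) = 1 &: \quad \alpha \text{ is a unit, so } I = R, \text{ but } 1 \notin I
   \text{ since every } a + b\sqrt{-5} \in I \text{ has } a \equiv b \pmod 2 .
\end{align*}
```

So I is an invertible fractional ideal with inverse $\tfrac{1}{2}I$ whose class in the class group of R is nontrivial, and the square of that class is trivial.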

Whether a fractional ideal A of R is invertible is also related to whether A is 
projective as an R-module. Recall that an R-module M is projective over R if and only 
if M is a direct summand of a free module (Proposition 30, Section 10.5). Equivalently, 
M is projective if and only if there is a free R-module F and R-module homomorphisms 
$f : F \to M$ and $g : M \to F$ with $f \circ g = 1$ (Proposition 25, Section 10.5). 


Proposition 10. Let R be an integral domain with fraction field K and let A be a nonzero 
fractional ideal of R. Then A is invertible if and only if A is a projective R-module. 


Proof: Assume first that A is invertible, so $\sum_{i=1}^{n} a_i a_i' = 1$ for some $a_i \in A$ and 
$a_i' \in A'$ as in (2) of Proposition 9. Let F be the free R-module on $y_1, \dots, y_n$. Define 
$f : F \to A$ by $f\bigl(\sum_{i=1}^{n} r_i y_i\bigr) = \sum_{i=1}^{n} r_i a_i$ and $g : A \to F$ by $g(c) = \sum_{i=1}^{n} (ca_i') y_i$. It 
is immediate that both f and g are R-module homomorphisms (note that $ca_i' \in R$ by 
definition of A'). Then 

\[ (f \circ g)(c) = f\Bigl( \sum_{i=1}^{n} (ca_i') y_i \Bigr) = \sum_{i=1}^{n} (ca_i') a_i = c \sum_{i=1}^{n} a_i a_i' = c, \]

so $f \circ g = 1$ and A is a direct summand of F, hence is projective. 
Conversely, suppose that A is nonzero and projective, so there is a free R-module 
F and R-homomorphisms $f : F \to A$ and $g : A \to F$ with $f \circ g = 1$. Fix any 
$0 \ne a \in A$ and suppose $g(a) = \sum_{i=1}^{n} c_i y_i$ where $c_i \in R$ and $y_1, \dots, y_n$ is part of a set 
of free generators for F. Define $a_i = f(y_i)$ and $a_i' = c_i/a \in K$ for $i = 1, \dots, n$. For 
any $b \in A$ we have $bg(a) = ag(b) = g(ab)$ since g is an R-module homomorphism. 
Write $g(b) = \sum_{i=1}^{n} b_i y_i + \sum_{j \in J} b_j y_j$ where $\{y_j\}$ for $j \in J$ are the remaining elements 
in the set of free generators for F. Then 

\[ \sum_{i=1}^{n} (bc_i) y_i = \sum_{i=1}^{n} (ab_i) y_i + \sum_{j \in J} (ab_j) y_j . \]

We may equate coefficients of the elements in the free R-module basis for F in this 
equation and it follows that $g(b) = \sum_{i=1}^{n} b_i y_i$ where $b_i \in R$ and that $bc_i = ab_i$ for 
$i = 1, \dots, n$. In particular, it follows from the definition of $a_i'$ that $ba_i' = b(c_i/a) = b_i$ 
is an element of R for every element b of A. This shows that $a_i' \in A'$ for $i = 1, \dots, n$. 
Since $f \circ g = 1$, we have 

\[ a = (f \circ g)(a) = f\Bigl( \sum_{i=1}^{n} c_i y_i \Bigr) = \sum_{i=1}^{n} c_i a_i = \sum_{i=1}^{n} (aa_i') a_i = a \Bigl( \sum_{i=1}^{n} a_i a_i' \Bigr) \]

and so $\sum_{i=1}^{n} a_i a_i' = 1$. It follows that AA' = R and so A is invertible by Proposition 
9, completing the proof. 

The next result shows that if the integral domain R is also a local ring, then whether 
fractional ideals are invertible determines whether R is a D.V.R. 

Proposition 11. Suppose the integral domain R is a local ring that is not a field. Then 
R is a Discrete Valuation Ring if and only if every nonzero fractional ideal of R is 
invertible. 

Proof: If R is a D.V.R. with uniformizing parameter t, then by Proposition 5 every 
nonzero ideal of R is of the form $(t^n)$ for some $n \ge 0$ and every nonzero element d in R can 
be written in the form $ut^m$ for some unit $u \in R$ and some $m \ge 0$. It follows that every 
nonzero fractional ideal of R is of the form $t^N R$ for some $N \in \mathbb{Z}$, so is a principal 
fractional ideal and hence invertible by Proposition 9(1). 

Conversely, suppose that every nonzero fractional ideal of R is invertible. Then
every nonzero ideal of R is finitely generated by (3) of Proposition 9, so R is Noetherian.
Let M be the unique maximal ideal of R. If $M = M^2$ then $M = 0$ by Nakayama's
Lemma, and then R would be a field, contrary to hypothesis. Hence there is an element
t with $t \in M - M^2$. By assumption M is invertible, and since $t \in M$, the fractional
ideal $tM^{-1}$ is a nonzero ideal in R. If $tM^{-1} \subseteq M$, then $t \in M^2$, contrary to the choice
of t. Hence $tM^{-1} = R$, so $(t) = M$, and M is a nonzero principal ideal. It follows by
the equivalent condition 4 of Theorem 7 that R is a D.V.R., completing the proof.
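As a concrete illustration of the first half of this proof, consider the localization of $\mathbb{Z}$ at a prime p, a D.V.R. with uniformizing parameter p and fraction field $\mathbb{Q}$. The following Python sketch (the function names `padic_valuation` and `unit_part` are ours, chosen only for illustration) computes the exponent N for which the principal fractional ideal generated by a nonzero rational x equals $p^N R$, together with the unit u in $x = u p^N$.

```python
from fractions import Fraction

def padic_valuation(x, p):
    """Return the exponent of the prime p in the nonzero rational x."""
    if x == 0:
        raise ValueError("the valuation of 0 is not an integer")
    x = Fraction(x)
    num, den, v = x.numerator, x.denominator, 0
    while num % p == 0:      # factors of p in the numerator raise the valuation
        num //= p
        v += 1
    while den % p == 0:      # factors of p in the denominator lower it
        den //= p
        v -= 1
    return v

def unit_part(x, p):
    """Return u with x = u * p**v, where v is the p-adic valuation of x."""
    return Fraction(x) / Fraction(p) ** padic_valuation(x, p)

x = Fraction(45, 28)
print(padic_valuation(x, 3), unit_part(x, 3))   # exponent of 3 and the remaining unit
```

For x = 45/28 and p = 3 the exponent is 2 and the unit part is 5/28, whose numerator and denominator are both prime to 3, so x generates the fractional ideal $3^2 R$ in this local ring.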

We end this section with an application to algebraic geometry.

Nonsingularity and Local Rings of Affine Plane Curves

Let k be an algebraically closed field and let C be an irreducible affine curve over k.
In other words, C is an affine algebraic set whose coordinate ring k[C] is an integral
domain and whose field of rational functions k(C) has transcendence degree 1 over k
(cf. Section 15.4).

Recall that, by definition, the point v on C is nonsingular if $\mathfrak{m}_{v,C}/\mathfrak{m}_{v,C}^2$ is a 1-dimensional
vector space over k, where $\mathfrak{m}_{v,C}$ is the unique maximal ideal in the local
ring $\mathcal{O}_{v,C}$ of rational functions on C defined at v.

Proposition 12. Let v be a point on the irreducible affine curve C over k. Then C is
nonsingular at v if and only if the local ring $\mathcal{O}_{v,C}$ is a Discrete Valuation Ring.

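Proof: Suppose first that v is nonsingular. Then $\dim_k(\mathfrak{m}_{v,C}/\mathfrak{m}_{v,C}^2) = 1$, and since
$\mathcal{O}_{v,C}$ is Noetherian, it follows from Exercise 12 in Section 1 that $\mathfrak{m}_{v,C}$ is principal.
Hence $\mathcal{O}_{v,C}$ is a D.V.R. by Theorem 7(4). Conversely, suppose $\mathcal{O}_{v,C}$ is a D.V.R. and t is
a uniformizing element for $\mathcal{O}_{v,C}$. Then every element in $\mathfrak{m}_{v,C}$ can be written uniquely
in the form at for some a in $\mathcal{O}_{v,C}$. The map from $\mathfrak{m}_{v,C}$ to $\mathcal{O}_{v,C}/\mathfrak{m}_{v,C}$ defined by
mapping at to $a \bmod \mathfrak{m}_{v,C}$ is easily checked to be a surjective $\mathcal{O}_{v,C}$-module homomorphism
with kernel $\mathfrak{m}_{v,C}^2$. Hence $\mathfrak{m}_{v,C}/\mathfrak{m}_{v,C}^2$ is isomorphic as an $\mathcal{O}_{v,C}/\mathfrak{m}_{v,C}$-module to
$\mathcal{O}_{v,C}/\mathfrak{m}_{v,C}$. Since $\mathcal{O}_{v,C}/\mathfrak{m}_{v,C} \cong k$ (Proposition 46(5) in Section 15.4), it follows that
$\dim_k(\mathfrak{m}_{v,C}/\mathfrak{m}_{v,C}^2) = 1$, and so v is a nonsingular point on C.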

Definition. If v is a nonsingular point on C with corresponding discrete valuation
$\nu_v$ defined on k(C), then $\nu_v(f) = n$ for $f \in k(C)$ is the order of zero of f at v (if $n \ge 0$)
or the order of the pole of f at v (if $n < 0$).

Using the criterion for nonsingularity for points on curves in Proposition 12 we can 
prove a result first mentioned in Section 15.4: 

Corollary 13. An irreducible affine curve C over an algebraically closed field k is 
smooth if and only if its coordinate ring k[C] is integrally closed. 

Proof: The curve C is smooth if and only if every localization $\mathcal{O}_{v,C}$ is a D.V.R.
Since k[C] has Krull dimension 1 (Exercise 11 in Section 1), the same is true for each
$\mathcal{O}_{v,C}$. It then follows by Theorem 7(5) that every localization $\mathcal{O}_{v,C}$ is a D.V.R. if and
only if each $\mathcal{O}_{v,C}$ is integrally closed. By Proposition 49 in Section 15.4, this in turn is
equivalent to the statement that k[C] is integrally closed, which proves the corollary.

EXERCISES 

1. Suppose R is a Discrete Valuation Ring with respect to the valuation $\nu$ on the fraction field
K of R. If $x, y \in K$ with $\nu(x) < \nu(y)$ prove that $\nu(x + y) = \min(\nu(x), \nu(y))$. [Note that
$x + y = x(1 + y/x)$.]

2. Suppose R is a Discrete Valuation Ring with unique maximal ideal M and quotient
$F = R/M$. For any $n \ge 0$ show that $M^n/M^{n+1}$ is a vector space over F and that
$\dim_F(M^n/M^{n+1}) = 1$.


3. Suppose R is an integral domain that is also a local ring whose unique maximal ideal
$M = (t)$ is nonzero and principal, and suppose that $\bigcap_{n \ge 1} (t^n) = 0$. Prove that R is a
Discrete Valuation Ring. [Show that every nonzero element in R can be written in the
form $ut^n$ for some unit $u \in R$ and some $n \ge 0$.]

4. Suppose R is a Noetherian local ring whose unique maximal ideal $M = (t)$ is principal.
Prove that either R is a Discrete Valuation Ring or $t^n = 0$ for some $n \ge 0$. In the latter
case show that R is Artinian.

5. Suppose that R is a Noetherian integral domain that is also a local ring of Krull dimension
1. Let M be the unique maximal ideal of R and let $F = R/M$, so that $M/M^2$ is a vector
space over F.

(a) Prove that if $\dim_F(M/M^2) = 1$ then R is a Discrete Valuation Ring.

(b) If every nonzero ideal of R is a power of M prove that R is a Discrete Valuation Ring. 

6. Let R be an integral domain with fraction field K. Prove that every finitely generated
R-submodule of K is a fractional ideal of R. If R is Noetherian, prove that A is a fractional
ideal of R if and only if A is a finitely generated R-submodule of K.

7. If R is an integral domain and A is a fractional ideal of R, prove that if A is projective 
then A is finitely generated. Conclude that every integral domain that is not Noetherian 
contains an ideal that is not projective. 

8. Suppose R is a Noetherian integral domain that is also a local ring with nonzero maximal 
ideal M. Prove that R is a D.V.R. if and only if the only M-primary ideals in R are the 
powers of M. 

9. Let $C = Z(xz - y^2,\ yz - x^3,\ z^2 - x^2 y) \subset \mathbb{A}^3$ over the algebraically closed field k. If
$v = (0, 0, 0) \in C$, prove that $\dim_k(\mathfrak{m}_{v,C}/\mathfrak{m}_{v,C}^2) = 3$ so that v is singular on C. Conclude
that k[C] is not integrally closed in k(C) and determine its integral closure. [cf. Exercise
27, Section 15.4.]


16.3 DEDEKIND DOMAINS 

In the previous section we showed that Discrete Valuation Rings are the local rings that 
are integrally closed Noetherian integral domains of Krull dimension 1. In this section 
we consider the effect of relaxing the condition that the ring be a local ring:

Definition. A Dedekind Domain is a Noetherian, integrally closed, integral domain 
of Krull dimension 1. 

Equivalently, R is a Dedekind Domain if R is a Noetherian, integrally closed, 
integral domain that is not a field in which every nonzero prime ideal is maximal. 

The first result shows that Dedekind Domains are a generalization of the class of 
Principal Ideal Domains. We shall see later (Theorem 22) that there is a structure 
theorem for finitely generated modules over a Dedekind Domain extending the corre- 
sponding result for P.I.D.s proved in Section 12.1. 

Proposition 14. 

(1) Every Principal Ideal Domain is a Dedekind Domain. 

(2) The ring of integers in an algebraic number field is a Dedekind Domain.

Proof: A P.I.D. is clearly Noetherian, is integrally closed since it is a U.F.D.
(Example 3, Section 15.3), and nonzero prime ideals are maximal (Proposition 7 in Section
8.2), which proves (1). Let $\mathcal{O}_K$ be the ring of integers in the number field K, i.e.,
the integral closure of $\mathbb{Z}$ in K. Then Corollary 25 in Section 15.3 shows that $\mathcal{O}_K$ is
integrally closed. $\mathcal{O}_K$ is Noetherian by Theorem 29 in Section 15.3, and the fact that
nonzero prime ideals in $\mathcal{O}_K$ are maximal was proved in the discussion following the
same theorem. This proves (2).

The following theorem gives a number of important equivalent characterizations of 
Dedekind Domains. Recall that the basic properties of fractional ideals were developed 
in the previous section. 

Theorem 15. Suppose R is an integral domain with fraction field $K \ne R$. The following
are equivalent conditions for R to be a Dedekind Domain:

(1) The ring R is Noetherian, integrally closed, and every nonzero prime ideal is
maximal.

(2) The ring R is Noetherian and for each nonzero prime P of R the localization
$R_P$ is a Discrete Valuation Ring.

(3) Every nonzero fractional ideal of R in K is invertible.

(4) Every nonzero fractional ideal of R in K is a projective R-module.

(5) Every nonzero proper ideal I of R can be written as a finite product of prime
ideals: $I = P_1 P_2 \cdots P_n$ (not necessarily distinct).

When the condition in (5) holds, the set of primes $\{P_1, \dots, P_n\}$ is uniquely
determined and so every nonzero proper ideal I of R can be written uniquely
(up to order) as a product of powers of prime ideals.

Proof: If R satisfies (1), then Rp is a D.V.R. by Corollary 8, so (1) implies (2). 
Conversely, assume each Rp is a D.V.R. Then R is integrally closed by Proposition 
49 in Section 15.4 and every nonzero prime ideal is maximal by Proposition 46(3) in 
Section 15.4, so (2) implies (1). 

Suppose now that (1) is satisfied and that A is a nonzero fractional ideal of R.
Let $A' = \{x \in K \mid xA \subseteq R\}$ as in Proposition 9. For any prime ideal P of R the
behavior of R-modules under localization shows that $(AA')_P = A_P (A')_P = A_P (A_P)'$
(cf. Exercise 4). Since $R_P$ is a D.V.R. by what has already been shown, $A_P (A_P)' = R_P$
by Proposition 11. Hence $(AA')_P = R_P$ for all nonzero primes P of R, so $AA' = R$
(Exercise 13 in Section 15.4), and A is invertible, showing (1) implies (3). Conversely,
suppose every nonzero fractional ideal of R is invertible. Then every ideal in R is
finitely generated by Proposition 9(3), so R is Noetherian. Every localization $R_P$ of R
at a nonzero prime P is a local ring in which the nonzero fractional ideals are invertible
(cf. Exercise 4), hence is a D.V.R. by Proposition 11. Hence (3) implies (2) and so (1),
(2) and (3) are equivalent. The equivalence of these with (4) is given by Proposition 10.

Suppose now that (1) is satisfied, and let I be any nonzero proper ideal in R. Since
R is Noetherian, I has a minimal primary decomposition $I = Q_1 \cap \cdots \cap Q_n$ as in
Theorem 21 of Section 15.2. The associated primes $P_i = \operatorname{rad} Q_i$ for $i = 1, \dots, n$ are
all distinct, and since primes are maximal in R by hypothesis, the associated primes are
all pairwise comaximal, and it follows easily that the same is true for the $Q_i$ (Exercise
5). It follows that $Q_1 \cap \cdots \cap Q_n = Q_1 \cdots Q_n$ (Theorem 17 in Section 7.6) so that I is
the product of primary ideals. The P-primary ideals of R correspond bijectively with
the $PR_P$-primary ideals in the localization $R_P$ (Proposition 42(3) in Section 15.4), and
since $R_P$ is a D.V.R. (because (1) implies (2)), it follows from Corollary 6 that if Q is
a P-primary ideal in R then $Q = P^m$ for some integer $m \ge 1$. Applying this to $Q_i$,
$i = 1, \dots, n$ shows that I is the product of powers of prime ideals, which gives the first
implication in (5).

Conversely, suppose that all the nonzero proper ideals of R can be written as a
product of prime ideals. We first show for any integral domain that a factorization of
an ideal into invertible prime ideals is unique, i.e., if $P_1 \cdots P_n = \tilde{P}_1 \cdots \tilde{P}_m$ are two
factorizations of I into invertible prime ideals then $n = m$ and the two sets of primes
$\{P_1, \dots, P_n\}$ and $\{\tilde{P}_1, \dots, \tilde{P}_m\}$ are equal. Suppose $\tilde{P}_1$ is a minimal element in the set
$\{\tilde{P}_1, \dots, \tilde{P}_m\}$. Since $P_1 \cdots P_n \subseteq \tilde{P}_1$, the prime ideal $\tilde{P}_1$ contains one of the primes
$P_1, \dots, P_n$, say $P_1 \subseteq \tilde{P}_1$. Similarly $P_1$ contains $\tilde{P}_i$ for some $i = 1, \dots, m$. Then
$\tilde{P}_i \subseteq P_1 \subseteq \tilde{P}_1$ and by the minimality of $\tilde{P}_1$ it follows that $\tilde{P}_i = P_1 = \tilde{P}_1$, so the
factorization becomes $P_1 P_2 \cdots P_n = P_1 \tilde{P}_2 \cdots \tilde{P}_m$. Since $P_1$ is invertible, multiplying
by the inverse ideal shows that $P_2 \cdots P_n = \tilde{P}_2 \cdots \tilde{P}_m$ and an easy induction finishes the
proof. In particular, the uniqueness statement in (5) now follows from the first statement
in (5) since in a Dedekind domain every fractional ideal, in particular every prime ideal
of R, is invertible.

We next show that invertible primes in R are maximal. Suppose then that P is an
invertible prime ideal in R and take $a \in R$, $a \notin P$. We want to show that $P + aR = R$.
By assumption, the two ideals $P + aR$ and $P + a^2R$ can be written as a product of
prime ideals, say $P + aR = P_1 \cdots P_n$ and $P + a^2R = \tilde{P}_1 \cdots \tilde{P}_m$. Note that $P \subseteq P_i$
for $i = 1, \dots, n$ and also $P \subseteq \tilde{P}_j$ for $j = 1, \dots, m$. In the quotient $R/P$, which
is an integral domain, we have the factorization $(\bar{a}) = (P_1/P) \cdots (P_n/P)$, where $\bar{a}$ is the
image of a, and each $P_i/P$ is a prime ideal in $R/P$. Since the product is a principal ideal, each $P_i/P$ is
also an invertible $R/P$-ideal (cf. Exercise 2). Similarly, $(\bar{a}^2) = (\tilde{P}_1/P) \cdots (\tilde{P}_m/P)$
is a factorization into a product of invertible prime ideals. Then
$(\bar{a})^2 = (P_1/P)^2 \cdots (P_n/P)^2 = (\tilde{P}_1/P) \cdots (\tilde{P}_m/P)$ give two factorizations into a product of invertible
prime ideals in the integral domain $R/P$, so by the uniqueness result in the previous
paragraph, $m = 2n$ and $\{P_1/P, P_1/P, \dots, P_n/P, P_n/P\} = \{\tilde{P}_1/P, \dots, \tilde{P}_m/P\}$. It
follows that the set of primes $\tilde{P}_1, \dots, \tilde{P}_m$ in R consists of the primes $P_1, \dots, P_n$, each
repeated twice. This shows that $P + a^2R = (P + aR)^2$. Since $P \subseteq P + a^2R$ and
$(P + aR)^2 \subseteq P^2 + aR$, we have $P \subseteq P^2 + aR$, so every element x in P can be written
in the form $x = y + az$ where $y \in P^2$ and $z \in R$. Then $az = x - y \in P$ and since
$a \notin P$, we have $z \in P$, which shows that $P \subseteq P^2 + aP$. Clearly $P^2 + aP \subseteq P$
and so $P = P^2 + aP = P(P + aR)$. Since P is assumed invertible, it follows that
$R = P + aR$ for any $a \in R - P$, which proves that P is a maximal ideal.

We now show that every nonzero prime ideal is invertible. If P is a nonzero prime
ideal, let a be any nonzero element in P. By assumption, $Ra = P_1 \cdots P_n$ can be
written as a product of prime ideals, and $P_1, \dots, P_n$ are invertible since their product is
principal (by Exercise 2 again). Since $P_1 \cdots P_n = Ra \subseteq P$, the prime ideal P contains
$P_i$ for some $1 \le i \le n$. Since $P_i$ is maximal by the previous paragraph, it follows that
$P = P_i$ is invertible.

Finally, since every nonzero proper ideal of R is a product of prime ideals, it follows
that every nonzero ideal of R is invertible, and since every fractional ideal of R is of
the form $(d^{-1})I$ for some ideal I in R, also every fractional ideal of R is invertible. This
proves that (5) implies (3), and completes the proof of the theorem.

The following corollary follows immediately from Proposition 14: 

Corollary 16. If $\mathcal{O}_K$ is the ring of integers in an algebraic number field K then every
nonzero ideal I in $\mathcal{O}_K$ can be written uniquely as the product of powers of distinct
prime ideals:

$$I = P_1^{e_1} P_2^{e_2} \cdots P_n^{e_n},$$

where $P_1, \dots, P_n$ are distinct prime ideals and $e_i \ge 1$ for $i = 1, \dots, n$.

Remark: The development of Dedekind Domains given here reverses the historical
development. As mentioned in Section 9.3, the unique factorization of nonzero ideals
into a product of prime ideals replaces the failure of unique factorization of nonzero
elements into products of prime elements in rings of integers of number fields. This
property of rings of integers in Corollary 16 is what led originally to the definition of an
ideal, and Dedekind originally defined what we now call Dedekind Domains by property
(5) in Theorem 15. It was Noether who observed that they can also be characterized by
property (1), which we have taken as the initial definition of a Dedekind Domain.
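The failure of unique factorization of elements mentioned in the Remark can be checked by hand or by machine in the standard example $\mathbb{Z}[\sqrt{-5}]$, the ring of integers of $\mathbb{Q}(\sqrt{-5})$. The following Python sketch is only an illustration: it stores $a + b\sqrt{-5}$ as the integer pair (a, b), and the helper names `norm`, `mul` and `elements_of_norm` are our own.

```python
def norm(z):
    """Norm of z = (a, b), standing for a + b*sqrt(-5): N(z) = a^2 + 5 b^2."""
    a, b = z
    return a * a + 5 * b * b

def mul(z, w):
    """Product of two elements of Z[sqrt(-5)] stored as integer pairs."""
    (a, b), (c, d) = z, w
    return (a * c - 5 * b * d, a * d + b * c)

def elements_of_norm(n):
    """All (a, b) with a^2 + 5 b^2 = n; a finite search since |a|, |b| <= sqrt(n)."""
    bound = int(n ** 0.5) + 1
    return [(a, b) for a in range(-bound, bound + 1)
                   for b in range(-bound, bound + 1) if norm((a, b)) == n]

two, three = (2, 0), (3, 0)
alpha, beta = (1, 1), (1, -1)          # 1 + sqrt(-5) and 1 - sqrt(-5)

# Two factorizations of 6 into elements of norms 4, 9 and 6, 6.
assert mul(two, three) == mul(alpha, beta) == (6, 0)
# A proper factor of an element of norm 4, 6 or 9 would need norm 2 or 3.
assert elements_of_norm(2) == [] and elements_of_norm(3) == []
```

Since the norm is multiplicative and no element has norm 2 or 3, all four factors are irreducible, and they are not associates because the only units are $\pm 1$. At the level of ideals the two factorizations agree: with $P = (2, 1 + \sqrt{-5})$, $Q = (3, 1 + \sqrt{-5})$ and $Q' = (3, 1 - \sqrt{-5})$ one has $(2) = P^2$, $(3) = QQ'$, $(1 + \sqrt{-5}) = PQ$ and $(1 - \sqrt{-5}) = PQ'$, so both sides give $(6) = P^2 Q Q'$.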

The unique factorization into prime ideals in Dedekind Domains can be used to
explicitly define the valuations $\nu_P$ on R with respect to which the valuation rings are
the localizations $R_P$ in Theorem 15(2) (cf. Exercise 6). We now indicate how unique
factorization for ideals can be used to define a divisibility theory for ideals similar to
the divisibility of integers in $\mathbb{Z}$.

Definition. If A and B are ideals in the integral domain R then B is said to divide A
(and A is divisible by B) if there is an ideal C in R with $A = BC$.

If B divides A then certainly $A \subseteq B$. If R is a Dedekind Domain, the converse is
true: $A \subseteq B$ implies $C = AB^{-1} \subseteq BB^{-1} = R$ so C is an ideal in R with $BC = A$.

We can also define the notion of the greatest common divisor (A, B) of two ideals
A and B: (A, B) divides both A and B and any ideal dividing both A and B divides
(A, B). The second statement in the next proposition shows that this greatest common
divisor always exists for integral ideals in a Dedekind Domain and gives a formula for
it similar to the formula for the greatest common divisor of two integers.

Proposition 17. Suppose R is a Dedekind Domain and A, B are two nonzero ideals
in R, with prime ideal factorizations $A = P_1^{e_1} \cdots P_n^{e_n}$ and $B = P_1^{f_1} \cdots P_n^{f_n}$ (where
$e_i, f_i \ge 0$ for $i = 1, \dots, n$). Then

(1) $A \subseteq B$ if and only if B divides A (i.e., "to contain is to divide") if and only if
$f_i \le e_i$ for $i = 1, \dots, n$.

(2) $A + B = (A, B) = P_1^{\min(e_1, f_1)} \cdots P_n^{\min(e_n, f_n)}$, so in particular A and B are
relatively prime, $A + B = R$, if and only if they have no prime ideal factors in
common.

Proof: We proved the first statement in (1) above. If each $f_i \le e_i$, then taking
$C = P_1^{e_1 - f_1} \cdots P_n^{e_n - f_n} \subseteq R$ shows that B divides A. Conversely, if B divides A, then
writing C as a product of prime ideals in $A = BC$ shows that $f_i \le e_i$ for all i, which
proves all of (1). Since $A + B$ is the smallest ideal containing both A and B, (2) now
follows from (1).
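In the Dedekind Domain $\mathbb{Z}$ every ideal is principal and Proposition 17 reduces to the familiar description of divisibility and greatest common divisors in terms of prime exponents. The following Python sketch (helper names are ours, and trial division is used only for simplicity) computes a generator of $(a) + (b)$ from the exponents $\min(e_i, f_i)$ and tests divisibility of ideals by comparing exponents.

```python
from math import gcd

def factor(n):
    """Trial-division factorization of an integer n >= 1 as {prime: exponent}."""
    exps, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            exps[p] = exps.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        exps[n] = exps.get(n, 0) + 1
    return exps

def ideal_sum_generator(a, b):
    """Generator of (a) + (b) in Z, built from min(e_i, f_i) as in Proposition 17(2)."""
    fa, fb = factor(a), factor(b)
    g = 1
    for p in set(fa) & set(fb):        # primes missing from one side contribute exponent 0
        g *= p ** min(fa[p], fb[p])
    return g

def divides_as_ideals(b, a):
    """(b) divides (a), i.e. (a) is contained in (b), iff f_i <= e_i for every prime."""
    fa, fb = factor(a), factor(b)
    return all(f <= fa.get(p, 0) for p, f in fb.items())

# 360 = 2^3 * 3^2 * 5 and 84 = 2^2 * 3 * 7, so (360) + (84) = (2^2 * 3) = (12).
assert ideal_sum_generator(360, 84) == gcd(360, 84) == 12
```

Here `divides_as_ideals(12, 360)` holds because the exponents of 12 are bounded by those of 360, which matches the containment $(360) \subseteq (12)$.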

Proposition 18. (Chinese Remainder Theorem) Suppose R is a Dedekind Domain,
$P_1, P_2, \dots, P_n$ are distinct prime ideals in R and $a_i \ge 0$ are integers, $i = 1, \dots, n$.
Then

$$R/P_1^{a_1} \cdots P_n^{a_n} \cong R/P_1^{a_1} \times R/P_2^{a_2} \times \cdots \times R/P_n^{a_n}.$$

Equivalently, for any elements $r_1, r_2, \dots, r_n \in R$ there exists an element $r \in R$, unique
up to an element in $P_1^{a_1} \cdots P_n^{a_n}$, with

$$r \equiv r_1 \bmod P_1^{a_1}, \quad r \equiv r_2 \bmod P_2^{a_2}, \quad \dots, \quad r \equiv r_n \bmod P_n^{a_n}.$$

Proof: This is immediate from Theorem 17 in Section 7.6 since the previous proposition
shows that the $P_i^{a_i}$ are pairwise comaximal ideals.
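For $R = \mathbb{Z}$ the element r in Proposition 18 can be computed explicitly. The sketch below is one standard construction, assuming Python 3.8 or later for `math.prod` and for modular inverses via `pow(x, -1, m)`; the function name `crt` is ours.

```python
from math import prod

def crt(residues, moduli):
    """Return the unique r in range(prod(moduli)) with r ≡ residues[i] mod moduli[i].

    Assumes the moduli are pairwise relatively prime positive integers,
    for example powers of distinct primes.
    """
    N = prod(moduli)
    r = 0
    for r_i, m_i in zip(residues, moduli):
        N_i = N // m_i                      # divisible by every modulus except m_i
        r += r_i * N_i * pow(N_i, -1, m_i)  # ≡ r_i mod m_i and ≡ 0 mod the others
    return r % N

r = crt([1, 2, 3], [2**2, 3**2, 5])
assert (r % 4, r % 9, r % 5) == (1, 2, 3)
```

The solution is unique modulo $4 \cdot 9 \cdot 5 = 180$, in agreement with the statement that r is unique up to an element of the product ideal.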

Corollary 19. Suppose I is an ideal in the Dedekind Domain R. Then

(1) there is an ideal J of R relatively prime to I such that the product $IJ = (a)$ is
a principal ideal,

(2) if I is nonzero then every ideal in the quotient $R/I$ is principal; equivalently, if
$I_1$ is an ideal of R containing I then $I_1 = I + Rb$ for some $b \in R$, and

(3) every ideal in R can be generated by two elements; in fact if I is nonzero and
$0 \ne a \in I$ then $I = Ra + Rb$ for some $b \in I$.


Proof: Suppose $I = P_1^{e_1} \cdots P_n^{e_n}$ is the prime ideal factorization of I in R. For
each $i = 1, \dots, n$, let $r_i$ be an element of $P_i^{e_i} - P_i^{e_i + 1}$. By the proposition, there is an
element $a \in R$ with $a \equiv r_i \bmod P_i^{e_i + 1}$ for all i. Hence $a \in P_i^{e_i} - P_i^{e_i + 1}$ for all i, so the
power of $P_i$ in the prime ideal factorization of (a) is precisely $e_i$ by (1) of Proposition 17:

$$(a) = P_1^{e_1} \cdots P_n^{e_n} P_{n+1}^{e_{n+1}} \cdots P_m^{e_m}$$

for some prime ideals $P_{n+1}, \dots, P_m$ distinct from $P_1, \dots, P_n$. Letting $J = P_{n+1}^{e_{n+1}} \cdots P_m^{e_m}$
gives (1). For (2), by the Chinese Remainder Theorem it suffices to prove that every
ideal in $R/P^m$ is principal in the case of a power of a prime ideal P, and this is immediate
since $R/P^m \cong R_P/P^m R_P$ and the localization $R_P$ is a P.I.D. Finally, (3) follows from
(2) by taking $I = Ra$.


The first statement in Corollary 19 shows that there is an integral ideal J relatively
prime to I lying in the inverse class of I in the class group of R. One can even impose
additional conditions on J, cf. Exercise 11.

Corollary 20. If R is a Dedekind Domain then R is a P.I.D. (i.e., R has class number
1) if and only if R is a U.F.D.

Proof: Every P.I.D. is a U.F.D., so suppose that R is a U.F.D. and let P be any
prime ideal in R. Then $P = Ra + Rb$ for some $a \ne 0$ and b in R by Corollary 19.
We have $(a') \subseteq P$ for one of the irreducible factors $a'$ of a since their product is an
element in the prime P, and then P divides $(a')$ in R by Proposition 17(1). It follows
that $P = (a')$ is principal since $(a')$ is a prime ideal (Proposition 12 in Section 8.3).
Since every ideal in R is a product of prime ideals, every ideal of R is principal, i.e., R
is a P.I.D.

Corollary 20 shows that the class number of a Dedekind domain R gives a measure 
of the failure of unique factorization of elements. It is a fundamental result in algebraic 
number theory that the class number of the ring of integers of an algebraic number field 
is finite. For general Dedekind Domains, however, the class number need not be finite. 
In fact, for any abelian group A (finite or infinite) there is a Dedekind Domain whose 
class group is isomorphic to A. 

Modules over Dedekind Domains and the Fundamental Theorem 
of Finitely Generated Modules 

We turn next to the consideration of modules over Dedekind Domains R. Every fractional
ideal of R is an R-module and the first statement in the following proposition
shows that two fractional ideals of R are isomorphic as R-modules if and only if they
represent the same element in the class group of R.

Proposition 21. Let R be a Dedekind Domain with fraction field K.

(1) Suppose I and J are two fractional ideals of R. Then $I \cong J$ as R-modules
if and only if I and J differ by a nonzero principal ideal: $I = (a)J$ for some
$0 \ne a \in K$.

(2) More generally, suppose $I_1, I_2, \dots, I_n$ and $J_1, J_2, \dots, J_m$ are nonzero fractional
ideals in the fraction field K of the Dedekind Domain R. Then

$$I_1 \oplus I_2 \oplus \cdots \oplus I_n \cong J_1 \oplus J_2 \oplus \cdots \oplus J_m$$

as R-modules if and only if $n = m$ and the product ideals $I_1 I_2 \cdots I_n$ and
$J_1 J_2 \cdots J_n$ differ by a principal ideal:

$$I_1 I_2 \cdots I_n = (a) J_1 J_2 \cdots J_n$$

for some $0 \ne a \in K$.

(3) In particular,

$$I_1 \oplus I_2 \oplus \cdots \oplus I_n \cong \underbrace{R \oplus \cdots \oplus R}_{n-1 \text{ factors}} \oplus\, (I_1 I_2 \cdots I_n)$$

and $R^n \oplus I \cong R^n \oplus J$ if and only if I and J differ by a principal ideal: $I = (a)J$,
$a \in K$.

Proof: Multiplication by $0 \ne a \in K$ gives an R-module isomorphism from J to
$(a)J$, so if $I = (a)J$ we have $I \cong J$ as R-modules. For the converse, observe that we
may assume $J \ne 0$ and then $I \cong J$ implies $R \cong J^{-1}I$. But this says that $J^{-1}I = aR$
is principal (with generator a given by the image of $1 \in R$), i.e., $I = (a)J$, proving (1).

We next show that for any nonzero fractional ideals I and J that $I \oplus J \cong R \oplus IJ$.
Replacing I and J by isomorphic R-modules aI and bJ, if necessary, we may assume
that I and J are integral ideals that are relatively prime (cf. Exercise 12), so that
$I + J = R$ and $I \cap J = IJ$. It is easy to see that the map from $I \oplus J$ to $I + J = R$
defined by mapping (x, y) to $x + y$ is a surjective R-module homomorphism with kernel
$I \cap J = IJ$, so we have an exact sequence

$$0 \longrightarrow IJ \longrightarrow I \oplus J \longrightarrow R \longrightarrow 0$$

of R-modules. This sequence splits since R is free, so $I \oplus J \cong R \oplus IJ$, as claimed.

The first statement in (3) now follows by induction, and combining this statement
with (1) shows that if $I_1 \cdots I_n = (a) J_1 \cdots J_n$ for some nonzero $a \in K$ then $I_1 \oplus \cdots \oplus I_n$
is isomorphic to $J_1 \oplus \cdots \oplus J_n$. This proves the "if" statement in (2). It remains to prove
the "only if" statement in (2) since the corresponding statement in (3) is a special case.
So suppose $I_1 \oplus I_2 \oplus \cdots \oplus I_n \cong J_1 \oplus J_2 \oplus \cdots \oplus J_m$ as R-modules.

Since $I \otimes_R K$ is the localization of the ideal I in K (cf. Proposition 41 in Section
15.4) it follows that $I \otimes_R K \cong K$ for any nonzero fractional ideal I of R. Since tensor
products commute with direct sums, $(I_1 \oplus \cdots \oplus I_n) \otimes_R K \cong K^n$ is an n-dimensional
vector space over K. Similarly, $(J_1 \oplus \cdots \oplus J_m) \otimes_R K \cong K^m$, from which it follows that
$n = m$.

Note that replacing $I_1$ by the isomorphic fractional ideal $a_1^{-1} I_1$ for any nonzero
element $a_1 \in I_1$ does not affect the validity of the statements in (2). Hence we may
assume $I_1$ contains R, and similarly we may assume that each of the fractional ideals
in (2) contains R. Let $\varphi$ denote the R-module isomorphism from $I_1 \oplus \cdots \oplus I_n$ to
$J_1 \oplus \cdots \oplus J_n$. For $i = 1, 2, \dots, n$ define

$$\varphi((0, \dots, 0, 1, 0, \dots, 0)) = (a_{1,i}, a_{2,i}, \dots, a_{n,i}) \in J_1 \oplus J_2 \oplus \cdots \oplus J_n$$

where $1 \in I_i$ on the left hand side occurs in position i. Since $\varphi$ is an R-module
homomorphism it follows that

$$J_j = a_{j,1} I_1 + \cdots + a_{j,i} I_i + \cdots + a_{j,n} I_n$$

for each $j = 1, 2, \dots, n$. Taking the product of these ideals for $j = 1, 2, \dots, n$ it
follows that

$$(a_{j_1,1}\, a_{j_2,2} \cdots a_{j_n,n})\, I_1 I_2 \cdots I_n \subseteq J_1 J_2 \cdots J_n$$

for any permutation $\{j_1, j_2, \dots, j_n\}$ of $\{1, 2, \dots, n\}$. Hence

$$d\, I_1 I_2 \cdots I_n \subseteq J_1 J_2 \cdots J_n$$

where d is the determinant of the matrix $(a_{i,j})$, since the determinant is the sum of
terms $\epsilon(\sigma)\, a_{\sigma(1),1} \cdots a_{\sigma(n),n}$ where $\epsilon(\sigma)$ is the sign of the permutation $\sigma$ of $\{1, 2, \dots, n\}$.

Similarly, for $j = 1, \dots, n$, define

$$\varphi^{-1}((0, \dots, 0, 1, 0, \dots, 0)) = (b_{1,j}, b_{2,j}, \dots, b_{n,j}) \in I_1 \oplus I_2 \oplus \cdots \oplus I_n$$

where $1 \in J_j$ on the left hand side occurs in position j. The product of the two matrices
$(a_{i,j})$ and $(b_{i,j})$ is just the identity matrix, so $d \ne 0$ and the determinant of the matrix
$(b_{i,j})$ is $d^{-1}$. As above we have

$$d^{-1} J_1 J_2 \cdots J_n \subseteq I_1 I_2 \cdots I_n,$$

which shows that $I_1 I_2 \cdots I_n = (a) J_1 J_2 \cdots J_n$, where $0 \ne a = d^{-1} \in K$, completing
the proof of the proposition.

We now consider finitely generated modules over Dedekind Domains and prove
a structure theorem for such modules extending the results in Chapter 12 for finitely
generated modules over P.I.D.s.

Recall that the rank of M is the maximal number of R-linearly independent elements
in M, or, equivalently, the dimension of $M \otimes_R K$ as a K-vector space, where K is the
fraction field of R (cf. Exercises 1-4, 20 in Section 12.1).

Theorem 22. Suppose M is a finitely generated module over the Dedekind Domain R.
Let $n \ge 0$ denote the rank of M and let Tor(M) be the torsion submodule of M. Then

$$M \cong \underbrace{R \oplus R \oplus \cdots \oplus R \oplus I}_{n \text{ factors}} \oplus \operatorname{Tor}(M)$$

for some ideal I of R, and

$$\operatorname{Tor}(M) \cong R/P_1^{e_1} \times R/P_2^{e_2} \times \cdots \times R/P_s^{e_s}$$

for some $s \ge 0$ and powers $P_i^{e_i}$, $e_i \ge 1$, of (not necessarily distinct) prime ideals. The
ideals $P_i^{e_i}$ for $i = 1, \dots, s$ are unique and the ideal I is unique up to multiplication by
a principal ideal.

Proof: Suppose first that M is a finitely generated torsion free module over R,
i.e., Tor(M) = 0. Then the natural R-module homomorphism from M to $M \otimes_R K$
is injective, so we may view M as an R-submodule of the vector space $M \otimes_R K$. If
M has rank n over R, then $M \otimes_R K$ is a vector space over K of dimension n. Let
$x_1, \dots, x_n$ be a basis for $M \otimes_R K$ over K and let $m_1, \dots, m_s$ be R-module generators
for M. Each $m_i$, $i = 1, \dots, s$ can be written as a K-linear combination of $x_1, \dots, x_n$.
Let $0 \ne d \in R$ be a common denominator for all the coefficients in K of these linear
combinations, and set $y_i = x_i/d$, $i = 1, \dots, n$. Then

$$M \subseteq Ry_1 + \cdots + Ry_n \subseteq Kx_1 + \cdots + Kx_n$$

which shows that M is contained in a free R-submodule of rank n and every element
m in M can be written uniquely in the form

$$m = a_1 y_1 + \cdots + a_n y_n$$

with $a_1, \dots, a_n \in R$. The map $\varphi : M \to R$ defined by $\varphi(a_1 y_1 + \cdots + a_n y_n) = a_n$ is
an R-module homomorphism, so we have an exact sequence

$$0 \longrightarrow \ker \varphi \longrightarrow M \xrightarrow{\ \varphi\ } I_1 \longrightarrow 0$$

where $I_1$ is the image of $\varphi$ in R, hence is an ideal in R. The submodule $\ker \varphi$ is
also a torsion free R-module whose rank is at most $n - 1$ (since it is contained in
$Ry_1 + \cdots + Ry_{n-1}$), and it follows by comparing ranks that $I_1$ is nonzero and that $\ker \varphi$
has rank precisely $n - 1$. By (4) of Theorem 15, $I_1$ is a projective R-module, so this
sequence splits:

$$M \cong I_1 \oplus (\ker \varphi).$$

By induction on the rank, we see that a finitely generated torsion free R-module is
isomorphic to the direct sum of n nonzero ideals of R:

$$M \cong I_1 \oplus I_2 \oplus \cdots \oplus I_n.$$

Since $I_1, \dots, I_n$ are each projective R-modules, it follows that any finitely generated
torsion free R-module is projective.

If now M is any finitely generated R-module, the quotient M/Tor(M) is finitely
generated and torsion free, hence projective by what was just proved. The exact
sequence

$$0 \longrightarrow \operatorname{Tor}(M) \longrightarrow M \longrightarrow M/\operatorname{Tor}(M) \longrightarrow 0$$

therefore splits, and so

$$M \cong \operatorname{Tor}(M) \oplus (M/\operatorname{Tor}(M)).$$

By the results in the previous paragraph M/Tor(M) is isomorphic to a direct sum of n
nonzero ideals of R, and by Proposition 21 we obtain

$$M \cong \underbrace{R \oplus R \oplus \cdots \oplus R \oplus I}_{n \text{ factors}} \oplus \operatorname{Tor}(M)$$

for some ideal I of R. The uniqueness statement regarding the ideal I is also immediate
from the uniqueness statement in Proposition 21(3).

It remains to prove the statements regarding the torsion submodule Tor(M). Suppose
then that N is a finitely generated torsion R-module. Let $I = \operatorname{Ann}(N)$ be the
annihilator of N in R and suppose $I = P_1^{e_1} \cdots P_t^{e_t}$ is the prime ideal factorization of I
in R, where $P_1, \dots, P_t$ are distinct prime ideals. Then N is a module over $R/I$, and

$$R/I \cong R/P_1^{e_1} \times R/P_2^{e_2} \times \cdots \times R/P_t^{e_t}.$$

It follows that

$$N \cong (N/P_1^{e_1} N) \times (N/P_2^{e_2} N) \times \cdots \times (N/P_t^{e_t} N)$$

as R-modules. Each $N/P^e N$ is a finitely generated module over $R/P^e \cong R_P/P^e R_P$
where $R_P$ is the localization of R at the prime P, i.e., is a finitely generated module over
$R_P$ that is annihilated by $P^e R_P$. Since R is a Dedekind Domain, each $R_P$ is a P.I.D.
(even a D.V.R.), so we may apply the Fundamental Theorem for Finitely Generated
Modules over a P.I.D. to see that each $N/P^e N$ is isomorphic as an $R_P$-module to a
direct sum of finitely many modules of the form $R_P/P^f R_P$ where $f \le e$. It follows
that each $N/P^e N$ is isomorphic as an R-module to a direct sum of finitely many modules
of the form $R/P^f$ where $f \le e$. This proves that N is isomorphic to the direct sum
of finitely many modules of the form $R/P_i^{f_i}$ for various prime ideals $P_i$. Hence Tor(M)
can be decomposed into a direct sum as in the statement in the theorem.

Finally, it remains to prove that the ideals $P_i^{e_i}$ for $i = 1, \dots, s$ in the decomposition
of Tor(M) are unique. This is similar to the uniqueness argument in the proof of Theorem
10 in Section 12.1 (cf. also Exercises 11-12 in Section 12.1): for any prime ideal P of
R, the quotient $P^{i-1}M/P^iM$ is a vector space over the field $R/P$ and the difference
$\dim_{R/P} P^{i-1}M/P^iM - \dim_{R/P} P^iM/P^{i+1}M$ is the number of direct summands of M
isomorphic to $R/P^i$, hence is uniquely determined by M. This concludes the proof of
the theorem.

If M is a finitely generated module over the Dedekind Domain R as in Theorem 22,
then the isomorphism type of M as an R-module is determined by the rank n, the prime
powers $P_i^{e_i}$ for $i = 1, \dots, s$ (called the elementary divisors of M), and the class of the
ideal I in the class group of R (called the Steinitz class of M). Note that a P.I.D. is the
same as a Dedekind Domain whose class number is 1, in which case every nonzero ideal
I of R is isomorphic as an R-module simply to R. In this case, Theorem 22 reduces
to the elementary divisor form of the structure theorem for finitely generated modules
over P.I.D.s in Chapter 12. There is also an invariant factor version of the description
of the torsion R-modules in Theorem 22 (cf. Exercise 14).
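For $R = \mathbb{Z}$ the elementary divisors of a finite abelian group are easy to compute, and by the uniqueness statement in Theorem 22 they decide whether two torsion modules are isomorphic. The following Python sketch (the function name `elementary_divisors` is ours, and trial division is used only for simplicity) lists the prime powers $p^e$ occurring when $\mathbb{Z}/n_1 \times \cdots \times \mathbb{Z}/n_k$ is decomposed via the Chinese Remainder Theorem.

```python
def elementary_divisors(orders):
    """Sorted prime powers p^e with Z/n_1 x ... x Z/n_k isomorphic to the product of the Z/p^e.

    Each n_i is assumed to be an integer >= 1.
    """
    divisors = []
    for n in orders:
        p = 2
        while p * p <= n:
            if n % p == 0:
                q = 1
                while n % p == 0:   # collect the full power of p dividing n
                    n //= p
                    q *= p
                divisors.append(q)
            p += 1
        if n > 1:                   # what remains is a single prime factor
            divisors.append(n)
    return sorted(divisors)

# Z/12 x Z/18 and Z/6 x Z/36 have the same elementary divisors 2, 3, 4, 9.
assert elementary_divisors([12, 18]) == elementary_divisors([6, 36]) == [2, 3, 4, 9]
```

The two groups in the example are therefore isomorphic, while $\mathbb{Z}/216$, whose elementary divisors are 8 and 27, is not isomorphic to either of them even though all three have order 216.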

The next result extends the characterization of finitely generated projective modules
over P.I.D.s (Exercise 21 in Section 12.1) to Dedekind Domains.

Corollary 23. A finitely generated module over a Dedekind Domain is projective if
and only if it is torsion free.

Proof: We showed that a finitely generated torsion free R-module is projective in
the proof of Theorem 22, so by the decomposition of M in Theorem 22, M is projective
if and only if Tor(M) is projective (cf. Exercise 3 in Section 10.5). To complete the
proof it suffices to show that no nonzero torsion R-module is projective, which is left
as an exercise (cf. Exercise 15).


EXERCISES 

1. If R is an integral domain, show that every fractional ideal of R is invertible if and only if
every integral ideal of R is invertible.

2. Suppose R is an integral domain with fraction field K and $A_1, A_2, \ldots, A_n$ are fractional
ideals of R whose product is a nonzero principal fractional ideal: $A_1 A_2 \cdots A_n = Rx$ for
some $0 \ne x \in K$. For each i = 1, ..., n prove that $A_i$ is an invertible fractional ideal with
inverse $(x^{-1}) A_1 \cdots A_{i-1} A_{i+1} \cdots A_n$.

3. Suppose R is an integral domain with fraction field K and P is a nonzero prime ideal in
R. Show that the fractional ideals of $R_P$ in K are the $R_P$-modules of the form $A R_P$ where
A is a fractional ideal of R.

4. Suppose R is an integral domain with fraction field K and A is a fractional ideal of R in
K. Let $A' = \{x \in K \mid xA \subseteq R\}$ as in Proposition 9.

(a) For any prime ideal P in R prove that the localization $(A')_P$ of A' at P is a fractional
ideal of $R_P$ in K.

(b) If A is a finitely generated R-module, prove that $(A')_P = (A_P)'$ where $(A_P)'$ is the
fractional $R_P$ ideal $\{x \in K \mid xA_P \subseteq R_P\}$ corresponding to the localization $A_P$.

5. If $Q_1$ is a $P_1$-primary ideal and $Q_2$ is a $P_2$-primary ideal where $P_1$ and $P_2$ are comaximal
ideals in a Noetherian ring R, prove that $Q_1$ and $Q_2$ are also comaximal. [Use Proposition
14 in Section 15.2.]

6. Suppose R is a Dedekind Domain with fraction field K.

(a) Prove that every nonzero fractional ideal of R in K can be written uniquely as the
product of distinct prime powers $P_1^{a_1} \cdots P_n^{a_n}$ where the $a_i$ are nonzero integers, possibly
negative.

(b) If $0 \ne x \in K$, let $P^{v_P(x)}$ be the power of the prime P in the factorization of the
principal ideal (x) as in (a) (where $v_P(x) = 0$ if P is not one of the primes occurring).
Prove $v_P$ is a valuation on K with valuation ring $R_P$, the localization of R at P.

7. Suppose R is a Noetherian integral domain that is not a field. Prove that R is a Dedekind
Domain if and only if for every maximal ideal M of R there are no ideals I of R with
$M^2 \subsetneq I \subsetneq M$. [Use Exercise 12 in Section 1 and Theorems 7 and 15.]

8. Suppose R is a Noetherian integral domain with Krull dimension 1. Prove that every
nonzero ideal I in R can be written uniquely as a product of primary ideals whose radicals
are all distinct. [Cf. the proof of Theorem 15. Use the uniqueness of the primary components
belonging to the isolated primes in a minimal primary decomposition (Theorem 21
in Section 15.2).]

9. Suppose R is an integral domain. Prove that $R_P$ is a D.V.R. for every nonzero prime ideal
P if and only if $R_M$ is a D.V.R. for every nonzero maximal ideal M.

10. Suppose R is a Noetherian integral domain that is not a field. Prove that R is a Dedekind 
Domain if and only if nonzero primes M are maximal and every M -primary ideal is a 
power of M. 

11. If I and J are nonzero ideals in the Dedekind Domain R show there exists an integral ideal
$I_1$ in R that is relatively prime to both I and J such that $I_1 I$ is a principal ideal in R.

12. If I and J are nonzero fractional ideals for the Dedekind Domain R prove there are elements
$\alpha, \beta \in K$ such that $\alpha I$ and $\beta J$ are nonzero integral ideals in R that are relatively prime.

13. Suppose I and J are nonzero ideals in the Dedekind Domain R. Prove that there is an ideal
$I_1 \cong I$ that is relatively prime to J. [Use Corollary 19 to find an ideal $I_2$ with $I_2 I = (a)$
and $(I_2, J) = R$. If $I_2 = P_1^{e_1} \cdots P_n^{e_n}$, choose $b \in R$ with $b \in P_i^{e_i} - P_i^{e_i+1}$ and $b \equiv 1 \bmod P$
for every prime P dividing J. Show that $(b) = I_2 I_1$ for some ideal $I_1$ and consider $(a) I_1$
to prove that $I_1 \cong I$.]

14. Prove that every finitely generated torsion module M over a Dedekind Domain R is isomorphic
to a direct sum $R/I_1 \oplus R/I_2 \oplus \cdots \oplus R/I_n$ with unique nonzero ideals $I_1, \ldots, I_n$ of R
satisfying $I_1 \subseteq I_2 \subseteq \cdots \subseteq I_n$ (called the invariant factors of M). [cf. Section 12.1.]

15. If P is a nonzero prime ideal in the Dedekind Domain R prove that $R/P^n$ is not a projective
R-module for any $n \ge 1$. [Consider the exact sequence
$0 \to P^n/P^{n+1} \to R/P^{n+1} \to R/P^n \to 0$.] Conclude that if $M \ne 0$ is a finitely generated
torsion R-module then M is not projective. [cf. Exercise 3, Section 10.5.]

16. Prove that the class number of the Dedekind Domain R is 1 if and only if every finitely
generated projective R-module is free.

17. Suppose R is a Dedekind Domain.

(a) Show that $I \sim J$ if and only if $I \cong J$ as R-modules defines an equivalence relation
on the set of nonzero fractional ideals of R. Let C(R) be the corresponding set
of R-module isomorphism classes and let $[I] \in C(R)$ denote the equivalence class
containing the fractional ideal I of R.

(b) Show that the multiplication $[I][J] = [I \otimes_R J]$ gives a well defined binary operation
with respect to which C(R) is an abelian group with identity 1 = [R].

(c) Prove that the abelian group C(R) in (b) is isomorphic to the class group of R.

18. If R is a Dedekind Domain and I is any nonzero ideal, prove that R/I contains only finitely
many ideals. In particular, show that R/I is an Artinian ring.

19. Suppose I is a nonzero fractional ideal in the Dedekind Domain R. Explicitly exhibit I
as a direct summand of a free R-module to show that I is projective. [Consider $I \oplus I^{-1}$
and use Proposition 21.]

20. Suppose I and J are two nonzero fractional ideals in the Dedekind Domain R and that
$I^n = J^n$ for some $n \ne 0$. Prove that I = J.

21. Suppose K is an algebraic number field and $\mathcal{O}_K$ is the ring of integers in K. If P is a
nonzero prime ideal in $\mathcal{O}_K$ prove that $P = (p, \pi)$ for some prime $p \in \mathbb{Z}$ and algebraic
integer $\pi \in \mathcal{O}_K$.

22. Suppose $K = \mathbb{Q}(\sqrt{D})$ is a quadratic extension of $\mathbb{Q}$ where D is a squarefree integer and
$\mathcal{O}_K$ is the ring of integers in K.

(a) Prove that $|\mathcal{O}_K/(p)| = p^2$. [Observe that $\mathcal{O}_K \cong \mathbb{Z}^2$ as an abelian group.]

(b) Use Corollary 16 to show that there are 3 possibilities for the prime ideal factorization
of (p) in $\mathcal{O}_K$:

(i) (p) = P is a prime ideal with $|\mathcal{O}_K/P| = p^2$,

(ii) $(p) = P_1 P_2$ with distinct prime ideals $P_1, P_2$ and $|\mathcal{O}_K/P_1| = |\mathcal{O}_K/P_2| = p$,

(iii) $(p) = P^2$ for some prime ideal P with $|\mathcal{O}_K/P| = p$.

(In cases (i), (ii), and (iii) the prime p is said to be inert, split, or ramified in $\mathcal{O}_K$, respectively.
The set of ramified primes is finite: the primes p dividing D if $D \equiv 1, 2 \bmod 4$;
p = 2 and the primes p dividing D if $D \equiv 3 \bmod 4$. Cf. Exercise 31 in Section 15.5.)

(c) Determine the prime ideal factorizations of the primes p = 2, 3, 5, 7, 11 in the ring
of integers $\mathcal{O}_K = \mathbb{Z}[\sqrt{-5}]$ of $K = \mathbb{Q}(\sqrt{-5})$.

23. Let $\mathcal{O}$ be the ring of integers in the algebraic closure $\overline{\mathbb{Q}}$ of $\mathbb{Q}$.

(a) Show that the infinite sequence of ideals
$(2) \subset (\sqrt{2}) \subset (\sqrt[4]{2}) \subset (\sqrt[8]{2}) \subset \cdots$ in $\mathcal{O}$ is
strictly increasing, and so $\mathcal{O}$ is not Noetherian.

(b) Show that $\mathcal{O}$ has Krull dimension 1. [Use Theorem 26 in Section 15.3.]

(c) Let K be a number field and let I be any ideal in $\mathcal{O}_K$. Show that there is some finite
extension L of K such that I becomes principal when extended to $\mathcal{O}_L$, i.e., the ideal
$I\mathcal{O}_L$ is principal (where L depends on I). You may use the theorem that the class
group of K is a finite group. [cf. Exercise 20.]

(d) Prove that $\mathcal{O}$ is a Bezout Domain (cf. Section 8.1).

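24. Suppose F and K are algebraic number fields with $\mathbb{Q} \subseteq F \subseteq K$, with rings of integers
$\mathcal{O}_F$ and $\mathcal{O}_K$, respectively. Since $\mathcal{O}_F \subseteq \mathcal{O}_K$, the ring $\mathcal{O}_K$ is naturally a module over $\mathcal{O}_F$.

(a) Prove $\mathcal{O}_K$ is a torsion free $\mathcal{O}_F$-module of rank n = [K : F]. [Compute ranks over
$\mathbb{Z}$.] If $\mathcal{O}_K$ is free over $\mathcal{O}_F$ then $\mathcal{O}_K$ is said to have a relative integral basis over $\mathcal{O}_F$.

(b) Prove that if F has class number 1 then $\mathcal{O}_K$ has a relative integral basis over $\mathcal{O}_F$.

If $K = \mathbb{Q}(\sqrt{-5}, \sqrt{2})$ then the ring of integers $\mathcal{O}_K$ is given by

$$\mathcal{O}_K = \mathbb{Z} + \mathbb{Z}\sqrt{-5} + \mathbb{Z}\sqrt{-10} + \mathbb{Z}\omega \quad \text{where } \omega = (\sqrt{-10} + \sqrt{2})/2.$$

(c) If $F_1 = \mathbb{Q}(\sqrt{2})$ prove that $\mathcal{O}_K$ has a relative integral basis over $\mathcal{O}_{F_1}$ and find an
explicit basis $\{\alpha, \beta\}$: $\mathcal{O}_K = \mathcal{O}_{F_1} \cdot \alpha + \mathcal{O}_{F_1} \cdot \beta$.

(d) If $F_2 = \mathbb{Q}(\sqrt{-5})$, show that $P_3 = (3, 1 + \sqrt{-5}) = (3, 5 - \sqrt{-5})$ is a prime ideal
of $\mathcal{O}_{F_2}$ that is not principal and that $\mathcal{O}_K = \mathcal{O}_{F_2} \cdot 1 + (1/3)P_3 \cdot \omega$. [Check that
$\sqrt{-10} = -(5 - \sqrt{-5})\omega/3$.] Conclude that the Steinitz class of $\mathcal{O}_K$ as a module over
$\mathcal{O}_{F_2}$ is the nontrivial class of $P_3$ in the class group of $\mathcal{O}_{F_2}$ and so there is no relative
integral basis of $\mathcal{O}_K$ over $\mathcal{O}_{F_2}$.

(e) Determine whether $\mathcal{O}_K$ has a relative integral basis over the ring of integers of the
remaining quadratic subfield $F_3 = \mathbb{Q}(\sqrt{-10})$ of K.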

25. Suppose C is a nonsingular irreducible affine curve over an algebraically closed field k. 
Prove that the coordinate ring k[C] is a Dedekind Domain.

CHAPTER 17


Introduction to Homological Algebra
and Group Cohomology


Let R be a ring with 1. In Section 10.5 we saw that a short exact sequence

$$0 \longrightarrow L \xrightarrow{\ \psi\ } M \xrightarrow{\ \varphi\ } N \longrightarrow 0 \tag{17.1}$$

of R-modules gives rise to an exact sequence of abelian groups

$$0 \longrightarrow \mathrm{Hom}_R(N, D) \xrightarrow{\ \varphi\ } \mathrm{Hom}_R(M, D) \xrightarrow{\ \psi\ } \mathrm{Hom}_R(L, D) \tag{17.2}$$

for any R-module D and that the homomorphism $\psi$ is in general not surjective so
this sequence cannot always be extended to a short exact sequence. Equivalently,
homomorphisms from L to D cannot in general be lifted to homomorphisms from M
into D. In this chapter we introduce some of the techniques of “homological algebra,”
which provide a method of extending some exact sequences in a natural way. For
the situation above one obtains an infinite exact sequence involving the “cohomology
groups” $\mathrm{Ext}^n_R(\_, D)$ (cf. Theorem 8), and these groups provide a measure of the set
of homomorphisms from L into D that cannot be extended to M. We then consider
the analogous questions for the other two functors considered in Section 10.5, namely
taking homomorphisms from D into the terms of the sequence (1) and tensoring the
sequence (1) with D.

In the subsequent sections we concentrate on an important special case of this
general type of homological construction: the “cohomology of finite groups.” We
make explicit the computations in this case and indicate some applications of these
techniques to establish some new results in group theory. In this sense, Sections 2-4
may be considered as an explicit “example” illustrating some uses of the general theory
in Section 1.

Cohomology and homology groups occur in many areas of mathematics. The for- 
mal notions of homology and cohomology groups and the general area of homological 
algebra arose from algebraic topology around the middle of the 20th century in the
study of the relation between the higher homotopy groups and the fundamental group 
of a topological space, although the study of certain specific cohomology groups, such 
as Schur’s work on group extensions (described in Section 4), predates this by half a 
century. As with much of algebra, the ideas common to a number of different areas were 
abstracted into general theories. Much of the language of homology and cohomology 
reflects its topological origins: homology groups, chains, cycles, boundaries, etc.

17.1 INTRODUCTION TO HOMOLOGICAL ALGEBRA—EXT AND TOR


In this section we describe some general terminology and results in homological al- 
gebra leading to the so called Long Exact Sequence in Cohomology. We then define 
certain (cohomology) groups associated to the sequence (2) and apply the general ho- 
mological results to obtain a long exact sequence extending this sequence at the right 
end. We then indicate the corresponding development for sequences obtained by taking 
homomorphisms from D to the terms in (1) or by tensoring the terms with D. 

We begin with a generalization of the notion of an exact sequence, namely a se- 
quence of abelian group homomorphisms where successive maps compose to zero (i.e., 
the image of one map is contained in the kernel of the next): 

Definition. Let C be a sequence of abelian group homomorphisms:

$$0 \longrightarrow C^0 \xrightarrow{\ d_1\ } C^1 \longrightarrow \cdots \longrightarrow C^{n-1} \xrightarrow{\ d_n\ } C^n \xrightarrow{\ d_{n+1}\ } \cdots \tag{17.3}$$

(1) The sequence C is called a cochain complex if the composition of any two
successive maps is zero: $d_{n+1} \circ d_n = 0$ for all n.

(2) If C is a cochain complex, its $n^{\text{th}}$ cohomology group is the quotient group
$\ker d_{n+1} / \operatorname{image} d_n$, and is denoted by $H^n(C)$.

There is a completely analogous “dual” version in which the homomorphisms are
between groups in decreasing order, in which case the sequence corresponding to (3) is
written $\cdots \longrightarrow C_n \xrightarrow{\ d_n\ } \cdots \xrightarrow{\ d_1\ } C_0 \longrightarrow 0$. Then if the composition of any two successive
homomorphisms is zero, the complex is called a chain complex, and its homology
groups are defined as $H_n(C) = \ker d_n / \operatorname{image} d_{n+1}$. For chain complexes the notation
is often chosen so that the indices appear as subscripts and are decreasing, whereas for
cochain complexes the indices are superscripts and are increasing. We shall instead use
a uniform notation for the maps on both, since it will be clear from the context whether
we are dealing with a chain or a cochain complex.

Chain complexes were the first to arise in topological settings, with cochain complexes
soon following. With our applications in Section 2 in mind, we shall concentrate
on cochains and cohomology, although all of the general results in this section have
similar statements for chains and homology. We shall also be interested in the situation
where each $C^n$ is an R-module and the homomorphisms $d_n$ are R-module homomorphisms
(referred to simply as a complex of R-modules), in which case the groups $H^n(C)$
are also R-modules.

Note that if C is a cochain (respectively, chain) complex then C is an exact sequence
if and only if all its cohomology (respectively, homology) groups are zero. Thus the
$n^{\text{th}}$ cohomology (respectively, homology) group measures the failure of exactness of a
complex at the $n^{\text{th}}$ stage.
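
To make the definition concrete, here is a small Python sketch (an illustration added here, not part of the text) that computes the orders of the cohomology groups of a finite cochain complex in which every term is Z/mZ and every map is multiplication by an integer. The function names and the encoding of the complex as a list of multipliers are assumptions chosen for this example; `maps[0]` plays the role of $d_1$.

```python
from typing import List


def is_cochain_complex(m: int, maps: List[int]) -> bool:
    """Check d_{n+1} o d_n = 0 on Z/m for consecutive multiplication maps."""
    return all((maps[i] * maps[i + 1]) % m == 0 for i in range(len(maps) - 1))


def kernel(m: int, c: int) -> set:
    """Kernel of multiplication by c on Z/m."""
    return {x for x in range(m) if (c * x) % m == 0}


def image(m: int, c: int) -> set:
    """Image of multiplication by c on Z/m."""
    return {(c * x) % m for x in range(m)}


def cohomology_orders(m: int, maps: List[int]) -> List[int]:
    """Orders of H^0, ..., H^k for 0 -> Z/m -> Z/m -> ... -> Z/m -> 0,
    where k = len(maps) and maps[n] is the map C^n -> C^(n+1)."""
    if not is_cochain_complex(m, maps):
        raise ValueError("successive maps do not compose to zero")
    orders = []
    for n in range(len(maps) + 1):
        # Outgoing map from C^n (the zero map after the last term).
        ker = kernel(m, maps[n]) if n < len(maps) else set(range(m))
        # Incoming map into C^n (the zero map into C^0).
        img = image(m, maps[n - 1]) if n > 0 else {0}
        orders.append(len(ker) // len(img))
    return orders


# 0 -> Z/4 --2--> Z/4 --2--> Z/4 -> 0
orders = cohomology_orders(4, [2, 2])
```

For this complex the middle cohomology group is trivial, so the complex is exact at the middle term, while the two outer groups have order 2, recording that multiplication by 2 on Z/4Z is neither injective nor surjective.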

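Definition. Let $A = \{A^n\}$ and $B = \{B^n\}$ be cochain complexes. A homomorphism
of complexes $\alpha : A \to B$ is a set of homomorphisms $\alpha_n : A^n \to B^n$ such that for every
n the following diagram commutes:

$$\begin{array}{ccccccc}
\cdots & \longrightarrow & A^n & \xrightarrow{\ d_{n+1}\ } & A^{n+1} & \longrightarrow & \cdots \\
 & & \downarrow{\scriptstyle \alpha_n} & & \downarrow{\scriptstyle \alpha_{n+1}} & & \\
\cdots & \longrightarrow & B^n & \xrightarrow{\ d_{n+1}\ } & B^{n+1} & \longrightarrow & \cdots
\end{array} \tag{17.4}$$

Proposition 1. A homomorphism $\alpha : A \to B$ of cochain complexes induces group
homomorphisms from $H^n(A)$ to $H^n(B)$ for $n \ge 0$ on their respective cohomology
groups.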

Proof: It is an easy exercise to show that the commutativity of (4) implies that 
the images and kernels at each stage of the maps in the first row are mapped to the 
corresponding images and kernels for the maps in the second row, thus giving a well 
defined map on the respective quotient (cohomology) groups. 

Definition. Let $A = \{A^n\}$, $B = \{B^n\}$ and $C = \{C^n\}$ be cochain complexes. A short
exact sequence of complexes $0 \to A \xrightarrow{\ \alpha\ } B \xrightarrow{\ \beta\ } C \to 0$ is a sequence of homomorphisms
of complexes such that $0 \to A^n \xrightarrow{\ \alpha_n\ } B^n \xrightarrow{\ \beta_n\ } C^n \to 0$ is short exact for every n.

One of the main features of cochain complexes is that they lead to long exact 
sequences in cohomology, which is our first main result: 


Theorem 2. (The Long Exact Sequence in Cohomology) Let $0 \to A \xrightarrow{\ \alpha\ } B \xrightarrow{\ \beta\ } C \to 0$
be a short exact sequence of cochain complexes. Then there is a long exact sequence
of cohomology groups:

$$\begin{aligned}
0 \to H^0(A) \to H^0(B) \to H^0(C) &\xrightarrow{\ \delta_0\ } H^1(A) \\
\to H^1(B) \to H^1(C) &\xrightarrow{\ \delta_1\ } H^2(A) \to \cdots
\end{aligned} \tag{17.5}$$

where the maps between cohomology groups at each level are those in Proposition 1.
The maps $\delta_n$ are called connecting homomorphisms.


Proof: The details of this proof are somewhat lengthy. For each n the verification
that the sequence $H^n(A) \to H^n(B) \to H^n(C)$ is exact is a straightforward check of
the definition of exactness of each map, similar to the proof of Theorem 33 in Section
10.5. The construction of a connecting homomorphism $\delta_n$ is outlined in Exercise 2.
Some work is then needed to show that $\delta_n$ is a homomorphism, and that the sequence
is exact at $\delta_n$.

One immediate consequence of the existence of the long exact sequence in Theorem
2 is the fact that if any two of the cochain complexes A, B, C are exact, then so is the
third (cf. Exercise 6).

Homomorphisms and the Groups $\mathrm{Ext}^n_R(A, B)$

To apply Theorem 2 to analyze the sequence (2), we try to produce a cochain complex
whose first few cohomology groups in the long exact sequence (5) agree with the terms
in (2). To do this we introduce the notion of a “resolution” of an R-module:

Definition. Let A be any R-module. A projective resolution of A is an exact sequence

$$\cdots \longrightarrow P_n \xrightarrow{\ d_n\ } P_{n-1} \longrightarrow \cdots \xrightarrow{\ d_1\ } P_0 \xrightarrow{\ \epsilon\ } A \longrightarrow 0 \tag{17.6}$$

such that each $P_i$ is a projective R-module.

Every R-module has a projective resolution: Let $P_0$ be any free (hence projective)
R-module on a set of generators of A and define an R-module homomorphism $\epsilon$ from
$P_0$ onto A by Theorem 6 in Chapter 10. This begins the resolution $\epsilon : P_0 \to A \to 0$.
The surjectivity of $\epsilon$ ensures that this sequence is exact. Next let $K_0 = \ker \epsilon$ and let $P_1$
be any free module mapping onto the submodule $K_0$ of $P_0$; this gives the second stage
$P_1 \to P_0 \to A$ which, by construction, is also exact. We can continue this way, taking
at the $n^{\text{th}}$ stage a free R-module $P_{n+1}$ that maps surjectively onto the submodule $\ker d_n$
of $P_n$, obtaining in fact a free resolution of A.

One of the reasons that projective modules are used in the resolution of A is that 
this makes it possible to lift various maps (cf. the proof of Proposition 4 following, for 
instance). 

In general a projective resolution is infinite in length, but if A is itself projective, then
it has a very simple projective resolution of finite length, namely $0 \to A \xrightarrow{\ 1\ } A \to 0$
given by the identity map from A to itself.

Given the projective resolution (6), we may form a related sequence by taking
homomorphisms of each of the terms into D, keeping in mind that this reverses the
direction of the homomorphisms. This yields the sequence

$$\begin{aligned}
0 \to \mathrm{Hom}_R(A, D) \xrightarrow{\ \epsilon\ } \mathrm{Hom}_R(P_0, D) &\xrightarrow{\ d_1\ } \mathrm{Hom}_R(P_1, D) \xrightarrow{\ d_2\ } \cdots \\
\cdots \xrightarrow{\ d_{n-1}\ } \mathrm{Hom}_R(P_{n-1}, D) &\xrightarrow{\ d_n\ } \mathrm{Hom}_R(P_n, D) \xrightarrow{\ d_{n+1}\ } \cdots
\end{aligned} \tag{17.7}$$

where to simplify notation we have denoted the induced maps from $\mathrm{Hom}_R(P_{n-1}, D)$ to
$\mathrm{Hom}_R(P_n, D)$ for $n \ge 1$ again by $d_n$ and similarly for the map induced by $\epsilon$ (cf. Section
10.5). This sequence is not necessarily exact, however it is a cochain complex (this
is part of the proof of Theorem 33 in Section 10.5). The corresponding cohomology
groups have a special name.

Definition. Let A and D be R-modules. For any projective resolution of A as in (6)
let $d_n : \mathrm{Hom}_R(P_{n-1}, D) \to \mathrm{Hom}_R(P_n, D)$ for all $n \ge 1$ as in (7). Define

$$\mathrm{Ext}^n_R(A, D) = \ker d_{n+1} / \operatorname{image} d_n$$

where $\mathrm{Ext}^0_R(A, D) = \ker d_1$. The group $\mathrm{Ext}^n_R(A, D)$ is called the $n^{\text{th}}$ cohomology group
derived from the functor $\mathrm{Hom}_R(\_, D)$. When $R = \mathbb{Z}$ the group $\mathrm{Ext}^n_{\mathbb{Z}}(A, D)$ is also
denoted simply $\mathrm{Ext}^n(A, D)$.

Note that the groups $\mathrm{Ext}^n_R(A, D)$ are also the cohomology groups of the cochain
complex obtained from (7) by replacing the term $\mathrm{Hom}_R(A, D)$ with zero (which does
not affect the cochain property), i.e., they are the cohomology groups of the cochain
complex $0 \to \mathrm{Hom}_R(P_0, D) \xrightarrow{\ d_1\ } \cdots$.

We shall show below that these cohomology groups do not depend on the choice
of projective resolution of A. Before doing so we identify the $0^{\text{th}}$ cohomology group
and give some examples.

Proposition 3. For any R-module A we have $\mathrm{Ext}^0_R(A, D) \cong \mathrm{Hom}_R(A, D)$.


Proof: Since the sequence $P_1 \xrightarrow{\ d_1\ } P_0 \xrightarrow{\ \epsilon\ } A \to 0$ is exact, it follows that the
corresponding sequence $0 \to \mathrm{Hom}_R(A, D) \xrightarrow{\ \epsilon\ } \mathrm{Hom}_R(P_0, D) \xrightarrow{\ d_1\ } \mathrm{Hom}_R(P_1, D)$ is
also exact by Theorem 33 in Section 10.5 (noting the first comment in the proof).
Hence $\mathrm{Ext}^0_R(A, D) = \ker d_1 = \operatorname{image} \epsilon \cong \mathrm{Hom}_R(A, D)$, as claimed.

Examples 

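(1) Let $R = \mathbb{Z}$ and let $A = \mathbb{Z}/m\mathbb{Z}$ for some $m \ge 2$. By the proposition we have
$\mathrm{Ext}^0_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) \cong \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D)$, and it follows that $\mathrm{Ext}^0_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) \cong {}_mD$,
where ${}_mD = \{d \in D \mid md = 0\}$ are the elements of D that have order dividing m.
For the higher cohomology groups, we use the simple projective resolution

$$0 \longrightarrow \mathbb{Z} \xrightarrow{\ m\ } \mathbb{Z} \longrightarrow \mathbb{Z}/m\mathbb{Z} \longrightarrow 0$$

for A given by multiplication by m on $\mathbb{Z}$. Taking homomorphisms into a fixed
$\mathbb{Z}$-module D gives the cochain complex

$$0 \longrightarrow \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) \longrightarrow \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}, D) \xrightarrow{\ m\ } \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}, D) \longrightarrow 0 \longrightarrow \cdots$$

We have $D \cong \mathrm{Hom}_{\mathbb{Z}}(\mathbb{Z}, D)$ (cf. Example 4 following Corollary 32 in Section 10.5)
and under this isomorphism we have $\mathrm{Ext}^1_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) \cong D/mD$ for any abelian group
D. It follows immediately from the definition and the cochain complex above that
$\mathrm{Ext}^n_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) = 0$ for all $n \ge 2$ and any abelian group D, which we summarize as

$$\begin{aligned}
\mathrm{Ext}^0_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) &\cong {}_mD \\
\mathrm{Ext}^1_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) &\cong D/mD \\
\mathrm{Ext}^n_{\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) &= 0, \quad \text{for all } n \ge 2.
\end{aligned}$$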

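(2) The same abelian groups may be modules over several different rings R and the $\mathrm{Ext}_R$
cohomology groups depend on R. For example, suppose $R = \mathbb{Z}/m\mathbb{Z}$ for some integer
m > 1. An R-module D is the same as an abelian group D with exponent dividing m,
i.e., mD = 0. In particular, for any divisor d of m, the group $\mathbb{Z}/d\mathbb{Z}$ is an R-module,
and

$$\cdots \xrightarrow{\ m/d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ m/d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ d\ } \mathbb{Z}/m\mathbb{Z} \longrightarrow \mathbb{Z}/d\mathbb{Z} \longrightarrow 0$$

is a projective (in fact, free) resolution of $\mathbb{Z}/d\mathbb{Z}$ as a $\mathbb{Z}/m\mathbb{Z}$-module, where the final
map is the natural projection mapping x mod m to x mod d. Taking homomorphisms
into the $\mathbb{Z}/m\mathbb{Z}$-module D, using the isomorphism $\mathrm{Hom}_{\mathbb{Z}/m\mathbb{Z}}(\mathbb{Z}/m\mathbb{Z}, D) \cong D$, and
removing the first term gives the cochain complex

$$0 \longrightarrow D \xrightarrow{\ d\ } D \xrightarrow{\ m/d\ } D \xrightarrow{\ d\ } D \xrightarrow{\ m/d\ } \cdots$$

Hence

$$\begin{aligned}
\mathrm{Ext}^0_{\mathbb{Z}/m\mathbb{Z}}(\mathbb{Z}/d\mathbb{Z}, D) &\cong {}_dD, \\
\mathrm{Ext}^n_{\mathbb{Z}/m\mathbb{Z}}(\mathbb{Z}/d\mathbb{Z}, D) &\cong {}_{(m/d)}D / dD, \quad n \text{ odd},\ n \ge 1, \\
\mathrm{Ext}^n_{\mathbb{Z}/m\mathbb{Z}}(\mathbb{Z}/d\mathbb{Z}, D) &\cong {}_dD / (m/d)D, \quad n \text{ even},\ n \ge 2,
\end{aligned}$$

where ${}_kD = \{d \in D \mid kd = 0\}$ denotes the set of elements of D killed by k. In
particular, $\mathrm{Ext}^n_{\mathbb{Z}/p^2\mathbb{Z}}(\mathbb{Z}/p\mathbb{Z}, \mathbb{Z}/p\mathbb{Z}) \cong \mathbb{Z}/p\mathbb{Z}$ for all $n \ge 0$, whereas, for example,
$\mathrm{Ext}^n_{\mathbb{Z}}(\mathbb{Z}/p\mathbb{Z}, \mathbb{Z}/p\mathbb{Z}) = 0$ for all $n \ge 2$.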
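
The two examples above can be checked numerically when D is a finite cyclic group. The following Python sketch is an illustration under that assumption only (D = Z/kZ, respectively D = Z/eZ with e dividing m); the function names are hypothetical and chosen for this example. It computes the orders of the Ext groups directly as kernel modulo image in the two cochain complexes described above.

```python
from math import gcd


def kernel_order(e, c):
    """Order of {x in Z/e : c x = 0}."""
    return sum(1 for x in range(e) if (c * x) % e == 0)


def image_order(e, c):
    """Order of the subgroup c(Z/e)."""
    return len({(c * x) % e for x in range(e)})


def ext_over_Z(m, k, n):
    """Order of Ext^n_Z(Z/m, Z/k), from 0 -> D --m--> D -> 0 -> ... with D = Z/k."""
    if n == 0:
        return kernel_order(k, m)        # the m-torsion of D
    if n == 1:
        return k // image_order(k, m)    # D / mD
    return 1                             # trivial for n >= 2


def ext_over_Zm(m, d, e, n):
    """Order of Ext^n_{Z/m}(Z/d, Z/e) for d | m and e | m, from the periodic
    complex 0 -> D --d--> D --m/d--> D --d--> ... with D = Z/e."""
    assert m % d == 0 and m % e == 0
    # Even n: kernel of d modulo image of m/d; odd n: the roles are swapped.
    outgoing, incoming = (d, m // d) if n % 2 == 0 else (m // d, d)
    ker = kernel_order(e, outgoing)
    img = 1 if n == 0 else image_order(e, incoming)
    return ker // img


p = 3
over_Zp2 = [ext_over_Zm(p * p, p, p, n) for n in range(6)]
over_Z = [ext_over_Z(p, p, n) for n in range(6)]
```

With p = 3 the list `over_Zp2` is constant equal to p, while `over_Z` equals p in degrees 0 and 1 and 1 afterwards, matching the contrast drawn at the end of Example 2. For cyclic D = Z/kZ both nonzero groups in Example 1 have order gcd(m, k).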


In order to show that the cohomology groups $\mathrm{Ext}^n_R(A, D)$ are independent of the
choice of projective resolution of A we shall need to be able to “compare” resolutions.
The next proposition shows that an R-module homomorphism from A to A' lifts to a
homomorphism from a projective resolution of A to a projective resolution of A'; this
lifting property is one instance where the projectivity of the modules in the resolution
is important.


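Proposition 4. Let $f : A \to A'$ be any homomorphism of R-modules and take
projective resolutions of A and A', respectively. Then for each $n \ge 0$ there is a lift $f_n$
of f such that the following diagram commutes:

$$\begin{array}{ccccccccccccc}
\cdots & \longrightarrow & P_n & \xrightarrow{\ d_n\ } & P_{n-1} & \longrightarrow & \cdots & \xrightarrow{\ d_1\ } & P_0 & \xrightarrow{\ \epsilon\ } & A & \longrightarrow & 0 \\
 & & \downarrow{\scriptstyle f_n} & & \downarrow{\scriptstyle f_{n-1}} & & & & \downarrow{\scriptstyle f_0} & & \downarrow{\scriptstyle f} & & \\
\cdots & \longrightarrow & P'_n & \xrightarrow{\ d'_n\ } & P'_{n-1} & \longrightarrow & \cdots & \xrightarrow{\ d'_1\ } & P'_0 & \xrightarrow{\ \epsilon'\ } & A' & \longrightarrow & 0
\end{array} \tag{17.8}$$

where the rows are the projective resolutions of A and A', respectively.


Proof: Given the two rows and map f in (8), then since $P_0$ is projective we may
lift the map $f\epsilon : P_0 \to A'$ to a map $f_0 : P_0 \to P'_0$ in such a way that $\epsilon' f_0 = f\epsilon$
(Proposition 30(2) in Section 10.5). This gives the first lift of f. Proceeding inductively
in this fashion, assume $f_n$ has been defined to make the diagram commutative to that
point. Thus $\operatorname{image} f_n d_{n+1} \subseteq \ker d'_n$. The projectivity of $P_{n+1}$ implies that we may lift
the map $f_n d_{n+1} : P_{n+1} \to P'_n$ to a map $f_{n+1} : P_{n+1} \to P'_{n+1}$ to make the diagram
commute at the next stage. This completes the proof.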

The commutative diagram in Proposition 4 implies that the induced diagram

$$\begin{array}{ccccccccc}
0 & \longrightarrow & \mathrm{Hom}_R(A, D) & \longrightarrow & \mathrm{Hom}_R(P_0, D) & \longrightarrow & \mathrm{Hom}_R(P_1, D) & \longrightarrow & \cdots \\
 & & \uparrow{\scriptstyle f} & & \uparrow{\scriptstyle f_0} & & \uparrow{\scriptstyle f_1} & & \\
0 & \longrightarrow & \mathrm{Hom}_R(A', D) & \longrightarrow & \mathrm{Hom}_R(P'_0, D) & \longrightarrow & \mathrm{Hom}_R(P'_1, D) & \longrightarrow & \cdots
\end{array} \tag{17.9}$$

is also commutative. The two rows of this diagram are cochain complexes, and this
commutative diagram depicts a homomorphism of these cochain complexes. By Proposition
1 we have an induced map on their cohomology groups:

Proposition 5. Let $f : A \to A'$ be a homomorphism of R-modules and take projective
resolutions of A and A' as in Proposition 4. Then for every n there is an induced group
homomorphism $\varphi_n : \mathrm{Ext}^n_R(A', D) \to \mathrm{Ext}^n_R(A, D)$ on the cohomology groups obtained
via these resolutions, and the maps $\varphi_n$ depend only on f, not on the choice of lifts $f_n$
in Proposition 4.

Proof: The existence of the map on the cohomology groups $\mathrm{Ext}^n_R$ follows from
Proposition 1 applied to the homomorphism of cochain complexes (9). The more
difficult part is showing these maps do not depend on the choice of lifts $f_n$ in Proposition
4. This is easily seen to be equivalent to showing that if f is the zero map, then the
induced maps on cohomology groups are also all zero. Assume then that f = 0. By the
projectivity of the modules $P_i$ one may inductively define R-module homomorphisms
$s_n : P_n \to P'_{n+1}$ with the property that for all n,

$$f_n = d'_{n+1} s_n + s_{n-1} d_n \tag{17.10}$$

so the maps $s_n$ give reverse downward diagonal arrows across the squares in (8). (The
collection of maps $\{s_n\}$ is called a chain homotopy between the chain homomorphism
given by the $f_n$ and the zero chain homomorphism, cf. Exercise 4.) Taking homomorphisms
into D gives diagram (9) with additional upward diagonal arrows from the
homomorphisms induced by the $s_n$, and these induced homomorphisms satisfy the relations
in (10) (i.e., they form a homotopy between cochain complex homomorphisms).
It is now an easy exercise using the diagonal maps added to (9) to see that any element
in $\mathrm{Hom}_R(P'_n, D)$ representing a coset in $\mathrm{Ext}^n_R(A', D)$ maps to the zero coset in
$\mathrm{Ext}^n_R(A, D)$ (cf. Exercise 4). This completes the argument.

One may also check that the homomorphism $\varphi_0 : \mathrm{Ext}^0_R(A', D) \to \mathrm{Ext}^0_R(A, D)$ in
Proposition 5 is the same as the map $f : \mathrm{Hom}_R(A', D) \to \mathrm{Hom}_R(A, D)$ defined in
Section 10.5 once the corresponding groups have been identified via the isomorphism
in Proposition 3.

Theorem 6. The groups $\mathrm{Ext}^n_R(A, D)$ depend only on A and D, i.e., they are independent
of the choice of projective resolution of A.

Proof: In the notation of Proposition 4 let A' = A, let $f : A \to A'$ be the
identity map and let the two rows of (8) be two projective resolutions of A. For any
choice of lifts of the identity map, the resulting homomorphisms on cohomology groups
$\varphi_n : \mathrm{Ext}^n_R(A', D) \to \mathrm{Ext}^n_R(A, D)$ are seen to be isomorphisms as follows. Add a third
row to the diagram (8) by copying the projective resolution in the top row below the
second row. Let g be the identity map from A' to A and lift g to maps $g_n : P'_n \to P_n$
by Proposition 4. Let $\psi_n : \mathrm{Ext}^n_R(A, D) \to \mathrm{Ext}^n_R(A', D)$ be the resulting map on
cohomology groups. The maps $g_n \circ f_n : P_n \to P_n$ are now a lift of the identity map
$g \circ f$, and they are seen to induce the homomorphisms $\varphi_n \circ \psi_n$ on the cohomology
groups. However, since the first and third rows are identical, taking the identity map
from $P_n$ to itself for all n is a particular lift of $g \circ f$, and this choice clearly induces the
identity map on cohomology groups. The last assertion of Proposition 5 then implies
that $\varphi_n \circ \psi_n$ is also the identity on $\mathrm{Ext}^n_R(A, D)$. By a symmetric argument $\psi_n \circ \varphi_n$ is the
identity on $\mathrm{Ext}^n_R(A', D)$. This shows the maps $\varphi_n$ and $\psi_n$ are isomorphisms, as needed
to complete the proof.

For a fixed R-module D and fixed integer $n \ge 0$, Proposition 5 and Theorem 6 show
that $\mathrm{Ext}^n_R(\_, D)$ defines a (contravariant) functor from the category of R-modules to
the category of abelian groups.

The next result shows that projective resolutions for a submodule and corresponding
quotient module of an R-module M can be fit together to give a projective resolution
of M.


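Proposition 7. (Simultaneous Resolution) Let $0 \to L \to M \to N \to 0$ be a short
exact sequence of R-modules, let L (in place of A) have a projective resolution as in (6) above,
and let N have a similar projective resolution where the projective modules are denoted
by $\overline{P}_n$. Then there is a resolution of M by the projective modules $P_n \oplus \overline{P}_n$ such that
the following diagram commutes:

$$\begin{array}{ccccccccc}
 & & \vdots & & \vdots & & \vdots & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & P_1 & \longrightarrow & P_1 \oplus \overline{P}_1 & \longrightarrow & \overline{P}_1 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & P_0 & \longrightarrow & P_0 \oplus \overline{P}_0 & \longrightarrow & \overline{P}_0 & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
0 & \longrightarrow & L & \longrightarrow & M & \longrightarrow & N & \longrightarrow & 0 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & 0 & & 0 & & 0 & &
\end{array} \tag{17.11}$$

Moreover, the rows and columns of this diagram are exact and the rows are split.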


Proof: The left and right nonzero columns of (11) are exact by hypothesis. The 
modules in the middle column are projective (cf. Exercise 3, Section 10.5) and the row 
maps are the obvious ones to make each row a split exact sequence. It remains then to 
define the vertical maps in the middle column in such a way as to make the diagram 
commute. This is accomplished in a straightforward manner, working inductively from 
the bottom upward — the first step in this process is outlined in Exercise 5. 

Theorem 2 and Proposition 7 now yield the long exact sequence for $\mathrm{Ext}_R$ that
extends the exact sequence (2).

Theorem 8. Let $0 \to L \to M \to N \to 0$ be a short exact sequence of R-modules.
Then there is a long exact sequence of abelian groups

$$\begin{aligned}
0 \to \mathrm{Hom}_R(N, D) \to \mathrm{Hom}_R(M, D) \to \mathrm{Hom}_R(L, D) &\xrightarrow{\ \delta_0\ } \mathrm{Ext}^1_R(N, D) \\
\to \mathrm{Ext}^1_R(M, D) \to \mathrm{Ext}^1_R(L, D) &\xrightarrow{\ \delta_1\ } \mathrm{Ext}^2_R(N, D) \to \cdots
\end{aligned} \tag{17.12}$$

where the maps between groups at the same level n are as in Proposition 5 and the
connecting homomorphisms $\delta_n$ are given by Theorem 2.

Proof: Take a simultaneous projective resolution of the short exact sequence as
in Proposition 7 and take homomorphisms into D. To obtain the cohomology groups
$\mathrm{Ext}^n_R$ from the resulting diagram, as noted in the discussion preceding Proposition 3 we
replace the lowest nonzero row in the transformed diagram with a row of zeros to get
the following commutative diagram:

$$\begin{array}{ccccccccc}
 & & \vdots & & \vdots & & \vdots & & \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \longrightarrow & \mathrm{Hom}_R(\overline{P}_1, D) & \longrightarrow & \mathrm{Hom}_R(P_1 \oplus \overline{P}_1, D) & \longrightarrow & \mathrm{Hom}_R(P_1, D) & \longrightarrow & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
0 & \longrightarrow & \mathrm{Hom}_R(\overline{P}_0, D) & \longrightarrow & \mathrm{Hom}_R(P_0 \oplus \overline{P}_0, D) & \longrightarrow & \mathrm{Hom}_R(P_0, D) & \longrightarrow & 0 \\
 & & \uparrow & & \uparrow & & \uparrow & & \\
 & & 0 & & 0 & & 0 & &
\end{array} \tag{17.13}$$

The columns of (13) are cochain complexes, and the rows are split by Proposition 29(2)
of Section 10.5 and the discussion following it. Thus (13) is a short exact sequence of
cochain complexes. Theorem 2 then gives a long exact sequence of cohomology groups
whose terms are, by definition, the groups $\mathrm{Ext}^n_R(\_, D)$, for $n \ge 0$. The $0^{\text{th}}$ order terms
are identified by Proposition 3, completing the proof.


Theorem 8 shows how the exact sequence (2) can be extended in a natural way and
shows that the group $\mathrm{Ext}^1_R(N, D)$ is the first measure of the failure of (2) to be exact on
the right. In fact (2) can be extended to a short exact sequence on the right if and only
if the connecting homomorphism $\delta_0$ in (12) is the zero homomorphism. In particular, if
$\mathrm{Ext}^1_R(N, D) = 0$ for all R-modules N, then (2) will be exact on the right for every exact
sequence (1). We have already seen (Corollary 35 in Section 10.5) that this implies the
R-module D is injective. Part of the next result shows that the converse is also true and
characterizes injective modules in terms of $\mathrm{Ext}_R$ groups.


Proposition 9. For an R-module Q the following are equivalent:

(1) Q is injective,

(2) $\mathrm{Ext}^1_R(A, Q) = 0$ for all R-modules A, and

(3) $\mathrm{Ext}^n_R(A, Q) = 0$ for all R-modules A and all $n \ge 1$.

Proof: We showed (2) implies (1) above, and (3) implies (2) is trivial, so it remains
to show that if Q is injective then $\mathrm{Ext}^n_R(A, Q) = 0$ for all R-modules A and all $n \ge 1$.
Take a projective resolution

$$\cdots \longrightarrow P_n \xrightarrow{\ d_n\ } P_{n-1} \longrightarrow \cdots \xrightarrow{\ d_1\ } P_0 \xrightarrow{\ \epsilon\ } A \longrightarrow 0$$

for A. Since Q is injective, the sequence

$$0 \to \mathrm{Hom}_R(A, Q) \to \mathrm{Hom}_R(P_0, Q) \to \cdots \to \mathrm{Hom}_R(P_{n-1}, Q) \to \mathrm{Hom}_R(P_n, Q) \to \cdots$$

is still exact (Corollary 35 in Section 10.5), so all of the cohomology groups for this
cochain complex are 0. In particular, the groups $\mathrm{Ext}^n_R(A, Q)$ for $n \ge 1$ are all trivial,
which is (3).

For a fixed R-module D, the result in Theorem 8 can be viewed as explaining what
happens to the short exact sequence $0 \to L \to M \to N \to 0$ on the right after
applying the left exact functor $\mathrm{Hom}_R(\_, D)$. This is why the (contravariant) functors
$\mathrm{Ext}^n_R(\_, D)$ are called the right derived functors for the functor $\mathrm{Hom}_R(\_, D)$.

One can also consider the effect of applying the left exact functor $\mathrm{Hom}_R(D, \_)$, i.e.,
by taking homomorphisms from D rather than into D. The next theorem shows that in
fact the same $\mathrm{Ext}_R$ groups define the (covariant) right derived functors for $\mathrm{Hom}_R(D, \_)$
as well.

Theorem 10. Let $0 \to L \to M \to N \to 0$ be a short exact sequence of R-modules.
Then there is a long exact sequence of abelian groups

$$\begin{aligned}
0 \to \mathrm{Hom}_R(D, L) \to \mathrm{Hom}_R(D, M) \to \mathrm{Hom}_R(D, N) &\xrightarrow{\ \gamma_0\ } \mathrm{Ext}^1_R(D, L) \\
\to \mathrm{Ext}^1_R(D, M) \to \mathrm{Ext}^1_R(D, N) &\xrightarrow{\ \gamma_1\ } \mathrm{Ext}^2_R(D, L) \to \cdots.
\end{aligned} \tag{17.14}$$


Proof: Let $0 \to L \to M \to N \to 0$ be a short exact sequence of R-modules.
By taking a projective resolution of D and then applying $\mathrm{Hom}_R(\_, L)$, $\mathrm{Hom}_R(\_, M)$
and $\mathrm{Hom}_R(\_, N)$ to this resolution one obtains the columns in a commutative diagram
similar to (13), but with L, M and N in the second positions rather than the first.
Applying the Long Exact Sequence Theorem to this array gives (14).

Theorem 10 shows that the group $\operatorname{Ext}^1_R(D, L)$ measures whether the exact sequence 

$$0 \longrightarrow \operatorname{Hom}_R(D, L) \longrightarrow \operatorname{Hom}_R(D, M) \longrightarrow \operatorname{Hom}_R(D, N)$$

can be extended to a short exact sequence: it can be extended if and only if $\gamma_0$ is 
the zero homomorphism. In particular, this will always be the case if the module $D$ 
has the property that $\operatorname{Ext}^1_R(D, B) = 0$ for all $R$-modules $B$; in this case it follows by 
Corollary 32 in Section 10.5 that $D$ is a projective $R$-module. As in the situation of 
injective $R$-modules in Proposition 9, the vanishing of these cohomology groups in fact 
characterizes projective $R$-modules: 

Proposition 11. For an $R$-module $P$ the following are equivalent: 

(1) $P$ is projective, 

(2) $\operatorname{Ext}^1_R(P, B) = 0$ for all $R$-modules $B$, and 

(3) $\operatorname{Ext}^n_R(P, B) = 0$ for all $R$-modules $B$ and all $n \ge 1$. 

Proof: We proved (2) implies (1) above, and (3) implies (2) is trivial, so it remains 
to prove that (1) implies (3). If $P$ is a projective $R$-module, then the simple exact 
sequence 

$$0 \longrightarrow P \xrightarrow{\ 1\ } P \longrightarrow 0$$

given by the identity map on $P$ is a projective resolution of $P$. Taking homomorphisms 
into $B$ gives the simple cochain complex 

$$0 \longrightarrow \operatorname{Hom}_R(P, B) \longrightarrow \operatorname{Hom}_R(P, B) \longrightarrow 0 \longrightarrow \cdots \longrightarrow 0 \longrightarrow \cdots$$

from which it follows by definition that $\operatorname{Ext}^n_R(P, B) = 0$ for all $n \ge 1$, which gives (3). 

Examples 

(1) Since $\mathbb{Z}^m$ is a free, hence projective, $\mathbb{Z}$-module, it follows from Proposition 11 that 

$$\operatorname{Ext}^n_{\mathbb{Z}}(\mathbb{Z}^m, B) = 0$$

for all abelian groups $B$, all $m \ge 1$, and all $n \ge 1$. 

(2) It is not difficult to show that $\operatorname{Ext}^n_R(A_1 \oplus A_2, B) \cong \operatorname{Ext}^n_R(A_1, B) \oplus \operatorname{Ext}^n_R(A_2, B)$ for all 
$n \ge 0$ (cf. Exercise 10), so the previous example together with the example following 
Proposition 3 determines $\operatorname{Ext}^n_{\mathbb{Z}}(A, B)$ for all finitely generated abelian groups $A$. In 
particular, $\operatorname{Ext}^n_{\mathbb{Z}}(A, B) = 0$ for all finitely generated groups $A$, all abelian groups $B$, 
and all $n \ge 2$. 


We have chosen to define the cohomology group $\operatorname{Ext}^n_R(A, B)$ using a projective 
resolution of $A$. There is a parallel development using an injective resolution of $B$: 

$$0 \longrightarrow B \longrightarrow Q_0 \longrightarrow Q_1 \longrightarrow \cdots$$

where each $Q_i$ is injective. In this situation one defines $\operatorname{Ext}^n_R(A, B)$ as the $n$th 
cohomology group of the cochain sequence obtained by applying $\operatorname{Hom}_R(A, \underline{\quad})$ to the 
resolution for $B$. The theory proceeds in a manner analogous to the development of this 
section. Ultimately one shows that there is a natural isomorphism between the groups 
$\operatorname{Ext}^n_R(A, B)$ constructed using both methods. 

Examples 

(1) Suppose $R = \mathbb{Z}$ and $A$ and $B$ are $\mathbb{Z}$-modules, i.e., are abelian groups. Recall that a 
$\mathbb{Z}$-module is injective if and only if it is divisible (Proposition 36 in Section 10.5). The 
group $B$ can be embedded in an injective $\mathbb{Z}$-module $Q_0$ (Corollary 37 in Section 10.5) 
and the quotient, $Q_1$, of $Q_0$ by the image of $B$ is again injective. Hence we have an 
injective resolution 

$$0 \longrightarrow B \longrightarrow Q_0 \longrightarrow Q_1 \longrightarrow 0$$

of $B$. Applying $\operatorname{Hom}_{\mathbb{Z}}(A, \underline{\quad})$ to this sequence gives the cochain complex 

$$0 \longrightarrow \operatorname{Hom}_{\mathbb{Z}}(A, B) \longrightarrow \operatorname{Hom}_{\mathbb{Z}}(A, Q_0) \longrightarrow \operatorname{Hom}_{\mathbb{Z}}(A, Q_1) \longrightarrow 0 \longrightarrow \cdots$$

from which it follows immediately that 

$$\operatorname{Ext}^n_{\mathbb{Z}}(A, B) = 0$$

for all abelian groups $A$ and $B$ and all $n \ge 2$, showing that the result of the previous 
example holds also when $A$ is not finitely generated. 

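(2) Suppose $A$ is a torsion abelian group. Then we have $\operatorname{Ext}^0(A, \mathbb{Z}) \cong \operatorname{Hom}(A, \mathbb{Z}) = 0$ 
since $\mathbb{Z}$ is torsion free. The sequence $0 \to \mathbb{Z} \to \mathbb{Q} \to \mathbb{Q}/\mathbb{Z} \to 0$ gives an injective 
resolution of $\mathbb{Z}$. Applying $\operatorname{Hom}(A, \underline{\quad})$ gives the cochain complex 

$$0 \longrightarrow \operatorname{Hom}(A, \mathbb{Z}) \longrightarrow \operatorname{Hom}(A, \mathbb{Q}) \longrightarrow \operatorname{Hom}(A, \mathbb{Q}/\mathbb{Z}) \longrightarrow 0 \longrightarrow \cdots$$

and since $\mathbb{Q}$ is also torsion free, this shows that 

$$\operatorname{Ext}^1(A, \mathbb{Z}) \cong \operatorname{Hom}(A, \mathbb{Q}/\mathbb{Z}).$$

The group $\operatorname{Hom}(A, \mathbb{Q}/\mathbb{Z})$ is called the Pontriagin dual group to $A$. If $A$ is a finite 
abelian group the Pontriagin dual of $A$ is isomorphic to $A$ (cf. Exercise 14, Section 
5.2). In particular, $\operatorname{Ext}^1(A, \mathbb{Z}) \cong A$ is nonzero for all nonzero finite abelian groups $A$. 
We have $\operatorname{Ext}^n(A, \mathbb{Z}) = 0$ for all $n \ge 2$ by the previous example. 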
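
The statement that a finite abelian group is isomorphic to its dual can be checked by hand in small cases. The following is a minimal sketch, assuming the group is given as a product of cyclic groups $\mathbb{Z}/n_1\mathbb{Z} \times \cdots \times \mathbb{Z}/n_k\mathbb{Z}$; the function names and the choice $A = \mathbb{Z}/2\mathbb{Z} \times \mathbb{Z}/4\mathbb{Z}$ are illustrative only. It lists the homomorphisms into $\mathbb{Q}/\mathbb{Z}$ by the images of the standard generators and compares the number of elements of each order in the dual with the corresponding counts in $A$.

```python
from collections import Counter
from fractions import Fraction
from itertools import product
from math import gcd, lcm


def dual_elements(orders):
    """Homomorphisms from Z/n1 x ... x Z/nk to Q/Z.

    A homomorphism is determined by the images q_i of the standard
    generators, and q_i must satisfy n_i * q_i = 0 in Q/Z, so q_i = k/n_i.
    """
    choices = [[Fraction(k, n) for k in range(n)] for n in orders]
    return list(product(*choices))


def order_in_dual(hom):
    """Order of a homomorphism: lcm of the orders of its generator images."""
    return lcm(*(q.denominator for q in hom))


def group_elements(orders):
    """All elements of Z/n1 x ... x Z/nk as tuples of residues."""
    return list(product(*(range(n) for n in orders)))


def order_in_group(element, orders):
    """Order of an element of Z/n1 x ... x Z/nk."""
    return lcm(*(n // gcd(x, n) for x, n in zip(element, orders)))


orders = [2, 4]  # A = Z/2Z x Z/4Z, an arbitrary small example
dual_profile = Counter(order_in_dual(h) for h in dual_elements(orders))
group_profile = Counter(order_in_group(x, orders) for x in group_elements(orders))
print(dual_profile == group_profile)
```

Equal order counts are consistent with the isomorphism for this one example; the sketch is a sanity check and not a proof of the general statement.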


We record an important property of $\operatorname{Ext}^1_R$, which helps to explain the name for these 
cohomology groups. Recall that equivalent extensions were defined at the beginning 
of Section 10.5. 


Theorem 12. For any $R$-modules $N$ and $L$ there is a bijection between $\operatorname{Ext}^1_R(N, L)$ 
and the set of equivalence classes of extensions of $N$ by $L$. 

Although we shall not prove this result, in Section 4 we establish a similar bijection 
between equivalence classes of group extensions of G by A and elements of a certain 
cohomology group, where G is any finite group and A is any ZG-module. 

Example 

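Suppose $R = \mathbb{Z}$ and $A = B = \mathbb{Z}/p\mathbb{Z}$. We showed above that $\operatorname{Ext}^1_{\mathbb{Z}}(\mathbb{Z}/p\mathbb{Z}, \mathbb{Z}/p\mathbb{Z}) \cong \mathbb{Z}/p\mathbb{Z}$, 
so by Theorem 12 there are precisely $p$ equivalence classes of extensions of $\mathbb{Z}/p\mathbb{Z}$ by $\mathbb{Z}/p\mathbb{Z}$. 
These are given by the direct sum $\mathbb{Z}/p\mathbb{Z} \oplus \mathbb{Z}/p\mathbb{Z}$ (which corresponds to the trivial class in 
$\operatorname{Ext}^1_{\mathbb{Z}}(\mathbb{Z}/p\mathbb{Z}, \mathbb{Z}/p\mathbb{Z})$) and the $p - 1$ extensions 

$$0 \longrightarrow \mathbb{Z}/p\mathbb{Z} \longrightarrow \mathbb{Z}/p^2\mathbb{Z} \xrightarrow{\ i\ } \mathbb{Z}/p\mathbb{Z} \longrightarrow 0$$

defined by the map $i(x) = ix \bmod p$ for $i = 1, 2, \dots, p - 1$. Note that while these are 
inequivalent as extensions, they all determine the same group $\mathbb{Z}/p^2\mathbb{Z}$. 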
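
As a small check on this example, the sketch below verifies by brute force that for each $i = 1, \dots, p-1$ the map $x \mapsto ix \bmod p$ on $\mathbb{Z}/p^2\mathbb{Z}$ is surjective onto $\mathbb{Z}/p\mathbb{Z}$ with kernel the multiples of $p$, a copy of $\mathbb{Z}/p\mathbb{Z}$. The function name and the choice $p = 5$ are illustrative assumptions. The code confirms only that each map gives a short exact sequence of the stated shape; it does not test inequivalence of the extensions.

```python
def kernel_and_image(p, i):
    """Kernel and image of the map x -> i*x mod p defined on Z/p^2 Z."""
    elements = range(p * p)
    kernel = [x for x in elements if (i * x) % p == 0]
    image = sorted({(i * x) % p for x in elements})
    return kernel, image


p = 5  # any prime works here; 5 is an arbitrary choice
for i in range(1, p):
    kernel, image = kernel_and_image(p, i)
    # The kernel is the subgroup of multiples of p, of order p.
    assert kernel == [p * k for k in range(p)]
    # The map is onto Z/pZ.
    assert image == list(range(p))
```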
Tensor Products and the Groups $\operatorname{Tor}_n^R(A, B)$ 

The cohomology groups $\operatorname{Ext}^n_R(A, B)$ determine what happens to short exact sequences 
on the right after applying the left exact functors $\operatorname{Hom}_R(D, \underline{\quad})$ and $\operatorname{Hom}_R(\underline{\quad}, D)$. One 
may similarly ask for the behavior of short exact sequences on the left after applying 
the right exact functor $D \otimes_R \underline{\quad}$ or the right exact functor $\underline{\quad} \otimes_R D$. This leads to the Tor 
(homology) groups (whose name derives from their relation to torsion submodules), 
and we now briefly outline the development of these left derived functors. In some 
respects this theory is “dual” to the theory for $\operatorname{Ext}_R$. We concentrate on the situation for 
$D \otimes_R \underline{\quad}$ when $D$ is a right $R$-module. When $D$ is a left $R$-module there is a completely 
symmetric theory for $\underline{\quad} \otimes_R D$; when $R$ is commutative and all $R$-modules have the 
same left and right $R$ action the homology groups resulting from both developments 
are isomorphic. 

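Suppose then that $D$ is a right $R$-module. Then for every left $R$-module $B$ the 
tensor product $D \otimes_R B$ is an abelian group and the functor $D \otimes \underline{\quad}$ is covariant and 
right exact, i.e., for any short exact sequence (1) of left $R$-modules, 

$$D \otimes L \longrightarrow D \otimes M \longrightarrow D \otimes N \longrightarrow 0$$

is an exact sequence of abelian groups. This sequence may be extended at the left end 
to a long exact sequence as follows. Let 

$$\cdots \longrightarrow P_n \xrightarrow{\ d_n\ } P_{n-1} \longrightarrow \cdots \xrightarrow{\ d_1\ } P_0 \xrightarrow{\ \epsilon\ } B \longrightarrow 0$$

be a projective resolution of $B$, and take tensor products with $D$ to obtain 

$$\cdots \longrightarrow D \otimes P_n \xrightarrow{\ 1 \otimes d_n\ } D \otimes P_{n-1} \longrightarrow \cdots \xrightarrow{\ 1 \otimes d_1\ } D \otimes P_0 \xrightarrow{\ 1 \otimes \epsilon\ } D \otimes B \longrightarrow 0. \tag{17.15}$$

It follows from the argument in Theorem 39 of Section 10.5 that (15) is a chain complex 
(the composition of any two successive maps is zero), so we may form its homology 
groups. 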

Definition. Let $D$ be a right $R$-module and let $B$ be a left $R$-module. For any projective 
resolution of $B$ by left $R$-modules as above let $1 \otimes d_n : D \otimes P_n \to D \otimes P_{n-1}$ for all 
$n \ge 1$ as in (15). Then 

$$\operatorname{Tor}_n^R(D, B) = \ker(1 \otimes d_n)/\operatorname{image}(1 \otimes d_{n+1})$$

where $\operatorname{Tor}_0^R(D, B) = (D \otimes P_0)/\operatorname{image}(1 \otimes d_1)$. The group $\operatorname{Tor}_n^R(D, B)$ is called the $n$th 
homology group derived from the functor $D \otimes \underline{\quad}$. When $R = \mathbb{Z}$ the group $\operatorname{Tor}_n^{\mathbb{Z}}(D, B)$ 
is also denoted simply $\operatorname{Tor}_n(D, B)$. 


Thus $\operatorname{Tor}_n^R(D, B)$ is the $n$th homology group of the chain complex obtained from 
(15) by removing the term $D \otimes B$. 

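A completely analogous proof to Proposition 3 (but relying on Theorem 39 in 
Section 10.5) implies the following: 

Proposition 13. For any left $R$-module $B$ we have $\operatorname{Tor}_0^R(D, B) \cong D \otimes_R B$. 

Example 

Let $R = \mathbb{Z}$ and let $B = \mathbb{Z}/m\mathbb{Z}$ for some $m \ge 2$. By the proposition, $\operatorname{Tor}_0^{\mathbb{Z}}(D, \mathbb{Z}/m\mathbb{Z})$ is 
isomorphic to $D \otimes \mathbb{Z}/m\mathbb{Z}$, so we have $\operatorname{Tor}_0^{\mathbb{Z}}(D, \mathbb{Z}/m\mathbb{Z}) \cong D/mD$ (Example 8 following 
Corollary 12 in Section 10.4). For the higher groups we apply $D \otimes \underline{\quad}$ to the projective 
resolution 

$$0 \longrightarrow \mathbb{Z} \xrightarrow{\ m\ } \mathbb{Z} \longrightarrow \mathbb{Z}/m\mathbb{Z} \longrightarrow 0$$

of $B$ and use the isomorphisms $D \otimes \mathbb{Z} \cong D$ and $D \otimes \mathbb{Z}/m\mathbb{Z} \cong D/mD$. This gives the 
chain complex 

$$\cdots \longrightarrow 0 \longrightarrow D \xrightarrow{\ m\ } D \longrightarrow D/mD \longrightarrow 0.$$

It follows that $\operatorname{Tor}_1^{\mathbb{Z}}(D, \mathbb{Z}/m\mathbb{Z}) \cong {}_mD$ is the subgroup of $D$ annihilated by $m$ and that 
$\operatorname{Tor}_n^{\mathbb{Z}}(D, \mathbb{Z}/m\mathbb{Z}) = 0$ for all $n \ge 2$, which we summarize as 

$$\operatorname{Tor}_0(D, \mathbb{Z}/m\mathbb{Z}) \cong D/mD,$$

$$\operatorname{Tor}_1(D, \mathbb{Z}/m\mathbb{Z}) \cong {}_mD,$$

$$\operatorname{Tor}_n(D, \mathbb{Z}/m\mathbb{Z}) = 0 \quad \text{for all } n \ge 2.$$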
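
For a concrete instance of this summary, take $D = \mathbb{Z}/k\mathbb{Z}$. The following minimal sketch computes the orders of the cokernel and the kernel of multiplication by $m$ on $\mathbb{Z}/k\mathbb{Z}$ by brute force; the function name is illustrative. In this special case both groups are cyclic of order $\gcd(k, m)$, which the final loop checks for small values.

```python
from math import gcd


def tor_cyclic_orders(k, m):
    """Orders of Tor_0 and Tor_1 of (Z/kZ, Z/mZ) over Z, by brute force.

    With D = Z/kZ, the complex 0 -> D --m--> D gives
    Tor_0 = D/mD (the cokernel) and Tor_1 = elements of D killed by m.
    """
    residues = range(k)
    image = {(m * x) % k for x in residues}               # the subgroup mD
    kernel = [x for x in residues if (m * x) % k == 0]    # the subgroup _mD
    tor0_order = k // len(image)
    tor1_order = len(kernel)
    return tor0_order, tor1_order


# Both orders agree with gcd(k, m) in a range of small cases.
for k in range(1, 25):
    for m in range(2, 25):
        assert tor_cyclic_orders(k, m) == (gcd(k, m), gcd(k, m))
```

For example, `tor_cyclic_orders(12, 18)` reports order 6 for both groups.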

As for Ext, the Tor groups depend on the ring R (cf. Exercise 20). 

Following a similar development to that for $\operatorname{Ext}_R$, one shows: 

Proposition 14. 

(1) The homology groups $\operatorname{Tor}_n^R(D, B)$ are independent of the choice of projective 
resolution of $B$, and 

(2) for every $R$-module homomorphism $f : B \to B'$ there are induced maps 
$\operatorname{Tor}_n^R(D, B) \to \operatorname{Tor}_n^R(D, B')$ on homology groups (depending only on $f$). 

There is a Long Exact Sequence in Homology analogous to Theorem 2, except that 
all the arrows are reversed, whose proof follows mutatis mutandis from the argument 
for cohomology. This together with Simultaneous Resolution gives: 

Theorem 15. Let $0 \to L \to M \to N \to 0$ be a short exact sequence of left $R$-modules. 
Then there is a long exact sequence of abelian groups 

$$
\begin{aligned}
\cdots &\longrightarrow \operatorname{Tor}_2^R(D, N) \xrightarrow{\ \delta_1\ } \operatorname{Tor}_1^R(D, L) \longrightarrow \operatorname{Tor}_1^R(D, M) \longrightarrow \\
&\operatorname{Tor}_1^R(D, N) \xrightarrow{\ \delta_0\ } D \otimes L \longrightarrow D \otimes M \longrightarrow D \otimes N \longrightarrow 0
\end{aligned}
$$

where the maps between groups at the same level $n$ are as in Proposition 14 (and the 
maps $\delta_n$ are called connecting homomorphisms). 

There is a characterization of flat modules corresponding to Propositions 9 and 11 
whose proof is very similar and is left as an exercise. 

Proposition 16. For a right $R$-module $D$ the following are equivalent: 

(1) $D$ is a flat $R$-module, 

(2) $\operatorname{Tor}_1^R(D, B) = 0$ for all left $R$-modules $B$, and 

(3) $\operatorname{Tor}_n^R(D, B) = 0$ for all left $R$-modules $B$ and all $n \ge 1$. 

We have defined $\operatorname{Tor}_n^R(A, B)$ as the homology of the chain complex obtained by 
tensoring a projective resolution of $B$ on the left with $A$. The same groups are obtained by 
taking the homology of the chain complex obtained by tensoring a projective resolution 
of $A$ on the right by $B$. Put another way, the $\operatorname{Tor}_n^R(A, B)$ groups define the (covariant) 
left derived functors for both of the right exact functors $A \otimes_R \underline{\quad}$ and $\underline{\quad} \otimes_R B$: if $D$ 
is a left $R$-module, then the short exact sequence $0 \to L \to M \to N \to 0$ of right 
$R$-modules gives rise to the long exact sequence 

$$
\begin{aligned}
\cdots &\longrightarrow \operatorname{Tor}_2^R(N, D) \longrightarrow \operatorname{Tor}_1^R(L, D) \longrightarrow \operatorname{Tor}_1^R(M, D) \longrightarrow \\
&\operatorname{Tor}_1^R(N, D) \longrightarrow L \otimes_R D \longrightarrow M \otimes_R D \longrightarrow N \otimes_R D \longrightarrow 0
\end{aligned}
$$

of abelian groups. In particular, the left $R$-module $D$ is flat if and only if $\operatorname{Tor}_1^R(A, D) = 0$ 
for all right $R$-modules $A$. 

When $R$ is commutative, $A \otimes_R B \cong B \otimes_R A$ (Proposition 20 in Section 10.4) for 
any two $R$-modules $A$ and $B$ with the standard $R$-module structures, and it follows that 
$\operatorname{Tor}_n^R(A, B) \cong \operatorname{Tor}_n^R(B, A)$ as $R$-modules. When $R$ is commutative the Tor long exact 
sequences are exact sequences of $R$-modules. 

Examples 

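(1) If $R = \mathbb{Z}$, then since $\mathbb{Z}^m$ is free, hence flat (Corollary 42, Section 10.5), we have 
$\operatorname{Tor}_n(A, \mathbb{Z}^m) = 0$ for all $n \ge 1$ and all abelian groups $A$. 

(2) Since $\operatorname{Tor}_n(A, B_1 \oplus B_2) \cong \operatorname{Tor}_n(A, B_1) \oplus \operatorname{Tor}_n(A, B_2)$ (cf. Exercise 10), the previous 
two examples together determine $\operatorname{Tor}_n(A, B)$ for all abelian groups $A$ and all finitely 
generated abelian groups $B$. 

(3) As a particular case of the previous example, $\operatorname{Tor}_1(A, B)$ is a torsion group and 
$\operatorname{Tor}_n(A, B) = 0$ for every abelian group $A$, every finitely generated abelian group 
$B$, and all $n \ge 2$. In fact these results hold without the condition that $B$ be finitely 
generated. 

(4) The exact sequence $0 \to \mathbb{Z} \to \mathbb{Q} \to \mathbb{Q}/\mathbb{Z} \to 0$ gives the long exact sequence 

$$\cdots \longrightarrow \operatorname{Tor}_1(D, \mathbb{Q}) \longrightarrow \operatorname{Tor}_1(D, \mathbb{Q}/\mathbb{Z}) \longrightarrow D \otimes \mathbb{Z} \longrightarrow D \otimes \mathbb{Q} \longrightarrow D \otimes \mathbb{Q}/\mathbb{Z} \longrightarrow 0.$$

Since $\mathbb{Q}$ is a flat $\mathbb{Z}$-module (Example 2 following Corollary 42 in Section 10.5), the 
proposition shows that we have an exact sequence 

$$0 \longrightarrow \operatorname{Tor}_1(D, \mathbb{Q}/\mathbb{Z}) \longrightarrow D \longrightarrow D \otimes \mathbb{Q}$$

and so $\operatorname{Tor}_1(D, \mathbb{Q}/\mathbb{Z})$ is isomorphic to the kernel of the natural map from $D$ into $D \otimes \mathbb{Q}$, 
which is the torsion subgroup of $D$ (cf. Exercise 9 in Section 10.4). 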
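
Examples (1) and (2), together with the computation $\operatorname{Tor}_1(D, \mathbb{Z}/m\mathbb{Z}) \cong {}_mD$ and the symmetry of Tor over a commutative ring, reduce $\operatorname{Tor}_1$ of two finitely generated abelian groups to a bookkeeping exercise over pairs of cyclic summands. The sketch below is one way to organize it. The encoding of a group as a list of integers, with $0$ standing for a free summand $\mathbb{Z}$, and the function name are assumptions made for illustration. It uses the fact that the subgroup of $\mathbb{Z}/a\mathbb{Z}$ annihilated by $b$ is cyclic of order $\gcd(a, b)$.

```python
from math import gcd


def tor1_summands(A, B):
    """Orders of the nontrivial cyclic summands of Tor_1(A, B) over Z.

    A and B are lists of nonnegative integers: n > 0 stands for a
    summand Z/nZ and 0 stands for a free summand Z (an encoding chosen
    here for illustration).
    """
    summands = []
    for a in A:
        for b in B:
            if a == 0 or b == 0:
                continue  # Tor_1 vanishes when either summand is free
            g = gcd(a, b)  # Tor_1(Z/aZ, Z/bZ) is cyclic of order gcd(a, b)
            if g > 1:
                summands.append(g)
    return summands


# A = Z/4Z + Z and B = Z/6Z + Z/2Z: only the pairs (4, 6) and (4, 2) contribute.
print(tor1_summands([4, 0], [6, 2]))
```

The output lists one cyclic order per contributing pair and is not reduced to invariant factors.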

The following results show that, for R = Z, the Tor groups are closely related to 
torsion subgroups. The Tor groups first arose in applications of torsion abelian groups 
in topological settings, which helps explain the terminology. 

Proposition 17. Let $A$ and $B$ be $\mathbb{Z}$-modules and let $t(A)$ and $t(B)$ denote their respective 
torsion submodules. Then $\operatorname{Tor}_1(A, B) \cong \operatorname{Tor}_1(t(A), t(B))$. 

Proof: In the case where $A$ and $B$ are finitely generated abelian groups this follows 
by Examples 3 and 4 above. For the general case, cf. Exercise 16. 

Corollary 18. If $A$ is an abelian group then $A$ is torsion free if and only if $\operatorname{Tor}_1(A, B) = 0$ 
for every abelian group $B$ (in which case $A$ is flat as a $\mathbb{Z}$-module). 

Proof: By the proposition, if $A$ has no elements of finite order then we have 
$\operatorname{Tor}_1(A, B) \cong \operatorname{Tor}_1(t(A), B) = \operatorname{Tor}_1(0, B) = 0$ for every abelian group $B$. Conversely, 
if $\operatorname{Tor}_1(A, B) = 0$ for all $B$, then in particular $\operatorname{Tor}_1(A, \mathbb{Q}/\mathbb{Z}) = 0$, and this group is 
isomorphic to the torsion subgroup of $A$ by the example above. 

The results of Proposition 17 and Corollary 18 hold for any P.I.D. $R$ in place of $\mathbb{Z}$ 
(cf. Exercise 26 in Section 10.5 and Exercise 16). 

Finally, we mention that the cohomology and homology theories we have described 
may be developed in a vastly more general setting by axiomatizing the essential 
properties of $R$-modules and the $\operatorname{Hom}_R$ and tensor product functors. This leads to the general 
notions of abelian categories and additive functors. In the case of the abelian category 
of $R$-modules, any additive functor $\mathcal{F}$ to the category of abelian groups gives rise to 
a set of derived functors, $\mathcal{F}_n$, also from $R$-modules to abelian groups, for all $n \ge 0$. 
Then for each short exact sequence $0 \to L \to M \to N \to 0$ of $R$-modules there is 
a long exact sequence of (cohomology or homology) groups whose terms are $\mathcal{F}_n(L)$, 
$\mathcal{F}_n(M)$ and $\mathcal{F}_n(N)$, and these long exact sequences reflect the exactness properties of 
the functor $\mathcal{F}$. If $\mathcal{F}$ is left or right exact then the $0$th derived functor $\mathcal{F}_0$ is naturally 
equivalent to $\mathcal{F}$ (hence the $0$th degree groups $\mathcal{F}_0(X)$ are isomorphic to $\mathcal{F}(X)$), and if $\mathcal{F}$ 
is an exact functor then $\mathcal{F}_n(X) = 0$ for all $n \ge 1$ and all $R$-modules $X$. 

EXERCISES 

1. Give the details of the proof of Proposition 1. 

2. This exercise defines the connecting map $\delta_n$ in the Long Exact Sequence of Theorem 2 and 
proves it is a homomorphism. In the notation of Theorem 2 let $0 \to \mathcal{A} \xrightarrow{\alpha} \mathcal{B} \xrightarrow{\beta} \mathcal{C} \to 0$ 
be a short exact sequence of cochain complexes, where for simplicity the cochain maps 
for $\mathcal{A}$, $\mathcal{B}$ and $\mathcal{C}$ are all denoted by the same $d$. 

(a) If $c \in C^n$ represents the class $x \in H^n(\mathcal{C})$ show that there is some $b \in B^n$ with 
$\beta_n(b) = c$. 

(b) Show that $d_{n+1}(b) \in \ker \beta_{n+1}$ and conclude that there is a unique $a \in A^{n+1}$ such that 
$\alpha_{n+1}(a) = d_{n+1}(b)$. [Use $c \in \ker d_{n+1}$ and the commutativity of the diagram.] 

(c) Show that $d_{n+2}(a) = 0$ and conclude that $a$ defines a class $\bar{a}$ in the quotient group 
$H^{n+1}(\mathcal{A})$. [Use the fact that $\alpha_{n+2}$ is injective.] 

(d) Prove that $\bar{a}$ is independent of the choice of $b$, i.e., if $b'$ is another choice and $a'$ is its 
unique preimage in $A^{n+1}$ then $\bar{a} = \bar{a}'$, and that $\bar{a}$ is also independent of the choice of 
$c$ representing the class $x$. 

(e) Define $\delta_n(x) = \bar{a}$ and prove that $\delta_n$ is a group homomorphism from $H^n(\mathcal{C})$ to 
$H^{n+1}(\mathcal{A})$. [Use the fact that $\delta_n(x)$ is independent of the choices of $c$ and $b$ to compute 
$\delta_n(x_1 + x_2)$.] 

3. Suppose 

$$
\begin{array}{ccccccccc}
 & & A & \xrightarrow{\ \alpha\ } & B & \xrightarrow{\ \beta\ } & C & \longrightarrow & 0 \\
 & & {\scriptstyle f}\downarrow & & {\scriptstyle g}\downarrow & & {\scriptstyle h}\downarrow & & \\
0 & \longrightarrow & A' & \xrightarrow{\ \alpha'\ } & B' & \xrightarrow{\ \beta'\ } & C' & &
\end{array}
$$

is a commutative diagram of $R$-modules with exact rows. 

(a) If $c \in \ker h$ and $\beta(b) = c$ prove that $g(b) \in \ker \beta'$ and conclude that $g(b) = \alpha'(a')$ 
for some $a' \in A'$. [Use the commutativity of the diagram.] 

(b) Show that $\delta(c) = a' \bmod \operatorname{image} f$ is a well defined $R$-module homomorphism from 
$\ker h$ to the quotient $A'/\operatorname{image} f$. 

(c) (The Snake Lemma) Prove there is an exact sequence 

$$\ker f \longrightarrow \ker g \longrightarrow \ker h \xrightarrow{\ \delta\ } \operatorname{coker} f \longrightarrow \operatorname{coker} g \longrightarrow \operatorname{coker} h$$

where $\operatorname{coker} f$ (the cokernel of $f$) is $A'/(\operatorname{image} f)$ and similarly for $\operatorname{coker} g$ and 
$\operatorname{coker} h$. 

(d) Show that if $\alpha$ is injective and $\beta'$ is surjective (i.e., the two rows in the commutative 
diagram above can be extended to short exact sequences) then the exact sequence in 
(c) can be extended to the exact sequence 

$$0 \longrightarrow \ker f \longrightarrow \ker g \longrightarrow \ker h \xrightarrow{\ \delta\ } \operatorname{coker} f \longrightarrow \operatorname{coker} g \longrightarrow \operatorname{coker} h \longrightarrow 0.$$


4. Let $\mathcal{A} = \{A^n\}$ and $\mathcal{B} = \{B^n\}$ be cochain complexes, where the maps $A^n \to A^{n+1}$ 
and $B^n \to B^{n+1}$ in both complexes are denoted by $d_{n+1}$ for all $n$. Cochain complex 
homomorphisms $\alpha$ and $\beta$ from $\mathcal{A}$ to $\mathcal{B}$ are said to be homotopic if for all $n$ there are 
module homomorphisms $s_n : A^{n+1} \to B^n$ such that the maps $\alpha_n - \beta_n$ from $A^n$ to $B^n$ 
satisfy 

$$\alpha_n - \beta_n = d_n s_{n-1} + s_n d_{n+1}.$$

The collection of maps $\{s_n\}$ is called a cochain homotopy from $\alpha$ to $\beta$. One may similarly 
define chain homotopies between chain complexes. 

(a) Prove that homotopic maps of cochain complexes induce the same maps on 
cohomology, i.e., if $\alpha$ and $\beta$ are homotopic homomorphisms of cochain complexes then the 
induced group homomorphisms from $H^n(\mathcal{A})$ to $H^n(\mathcal{B})$ are equal for every $n \ge 0$. 
(Thus “homotopy” gives a sufficient condition for two maps of complexes to induce 
the same maps on cohomology or homology; this condition is not in general 
necessary.) [Use the definition of homotopy to show $(\alpha_n - \beta_n)(z) \in \operatorname{image} d_n$ for every 
$z \in \ker d_{n+1}$.] 

(b) Prove that the relation $\alpha \sim \beta$ if $\alpha$ and $\beta$ are homotopic is an equivalence relation on 
any set of cochain complex homomorphisms. 

5. Establish the first step in the Simultaneous Resolution result of Proposition 7 as follows: 
assume the first two nonzero rows in diagram (11) are given, except for the map from 
$P_0 \oplus \overline{P}_0$ to $M$ (where the maps along the row of projective modules are the obvious 
injection and projection for this split exact sequence). Let $\mu : \overline{P}_0 \to M$ be a lifting to $\overline{P}_0$ 
of the map $\overline{P}_0 \to N$ (which exists because $\overline{P}_0$ is projective). Let $\lambda$ be the composition 
$P_0 \to L \to M$ in the diagram. Define 

$$\pi : P_0 \oplus \overline{P}_0 \to M \quad \text{by} \quad \pi(x, y) = \lambda(x) + \mu(y).$$

Show that with this definition the first two nonzero rows of (11) form a commutative 
diagram. 

6. Let $0 \to \mathcal{A} \xrightarrow{\alpha} \mathcal{B} \xrightarrow{\beta} \mathcal{C} \to 0$ be a short exact sequence of cochain complexes. Prove that 
if any two of $\mathcal{A}$, $\mathcal{B}$, $\mathcal{C}$ are exact, then so is the third. [Use Theorem 2.] 

7. Prove that a finitely generated abelian group $A$ is free if and only if $\operatorname{Ext}^1(A, \mathbb{Z}) = 0$. 


8. Prove that if $0 \to L \to M \to N \to 0$ is a split short exact sequence of $R$-modules, then 
for every $n \ge 0$ the sequence $0 \to \operatorname{Ext}^n_R(N, D) \to \operatorname{Ext}^n_R(M, D) \to \operatorname{Ext}^n_R(L, D) \to 0$ is 
also short exact and split. [Use a splitting homomorphism and Proposition 5.] 

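9. Show that 

$$0 \longrightarrow \mathbb{Z}/d\mathbb{Z} \xrightarrow{\ m/d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ m/d\ } \mathbb{Z}/m\mathbb{Z} \xrightarrow{\ d\ } \mathbb{Z}/m\mathbb{Z} \longrightarrow \cdots$$

is an injective resolution of $\mathbb{Z}/d\mathbb{Z}$ as a $\mathbb{Z}/m\mathbb{Z}$-module. [Use Proposition 36 in Section 
10.5.] Use this to compute the groups $\operatorname{Ext}^n_{\mathbb{Z}/m\mathbb{Z}}(A, \mathbb{Z}/d\mathbb{Z})$ in terms of the dual group 
$\operatorname{Hom}_{\mathbb{Z}/m\mathbb{Z}}(A, \mathbb{Z}/m\mathbb{Z})$. In particular, if $m = p^2$ and $d = p$, give another derivation of the 
result $\operatorname{Ext}^n_{\mathbb{Z}/p^2\mathbb{Z}}(\mathbb{Z}/p\mathbb{Z}, \mathbb{Z}/p\mathbb{Z}) \cong \mathbb{Z}/p\mathbb{Z}$. 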

10. (a) Prove that an arbitrary direct sum $\bigoplus_{i \in I} P_i$ of projective modules $P_i$ is projective and 
that an arbitrary direct product $\prod_{j \in J} Q_j$ of injective modules $Q_j$ is injective. 

(b) Prove that an arbitrary direct sum of projective resolutions is again projective and use 
this to show $\operatorname{Ext}^n_R(\bigoplus_{i \in I} A_i, B) \cong \prod_{i \in I} \operatorname{Ext}^n_R(A_i, B)$ for any collection of $R$-modules 
$A_i$ ($i \in I$). [cf. Exercise 12 in Section 10.5.] 

(c) Prove that an arbitrary direct product of injective resolutions is an injective resolution 
and use this to show $\operatorname{Ext}^n_R(A, \prod_{j \in J} B_j) \cong \prod_{j \in J} \operatorname{Ext}^n_R(A, B_j)$ for any collection of 
$R$-modules $B_j$ ($j \in J$). [cf. Exercise 12 in Section 10.5.] 

(d) Prove that $\operatorname{Tor}_n^R(A, \bigoplus_{j \in J} B_j) \cong \bigoplus_{j \in J} \operatorname{Tor}_n^R(A, B_j)$ for any collection of $R$-modules 
$B_j$ ($j \in J$). 

11. (Bass' Characterization of Noetherian Rings) Suppose $R$ is a commutative ring. 

(a) If $R$ is Noetherian, and $I$ is any nonzero ideal in $R$ show that the image of any 
$R$-module homomorphism $f : I \to \bigoplus_{j \in J} Q_j$ from $I$ into a direct sum of injective 
modules $Q_j$ ($j \in J$) is contained in some finite direct sum of the $Q_j$. 

(b) If $R$ is Noetherian, prove that an arbitrary direct sum $\bigoplus_{j \in J} Q_j$ of injective $R$-modules 
is again injective. [Use Baer’s Criterion (Proposition 36) and Exercise 4 in Section 
10.5 together with (a).] 

(c) Let $I_1 \subseteq I_2 \subseteq \cdots$ be an ascending chain of ideals of $R$ with union $I$ and let $I/I_i \to Q_i$ 
for $i = 1, 2, \dots$ be an injection of the quotient $I/I_i$ into an injective $R$-module $Q_i$ (by 
Theorem 38 in Section 10.5). Prove that the composition of these injections with the 
product of the canonical projection maps $I \to I/I_i$ gives an $R$-module homomorphism 
$f : I \to \bigoplus_{i=1,2,\dots} Q_i$. 

(d) Prove the converse of (b): if an arbitrary direct sum $\bigoplus_{j \in J} Q_j$ of injective $R$-modules 
is again injective then $R$ is Noetherian. [If the direct sum in (c) is injective, use Baer’s 
Criterion to lift $f$ to a homomorphism $F : R \to \bigoplus_{i=1,2,\dots} Q_i$. If the component of 
$F(1)$ in $Q_i$ is 0 for $i \ge n$ prove that $I = I_n$ and the ascending chain of ideals is finite.] 

12. Prove Proposition 13: $\operatorname{Tor}_0^R(D, A) \cong D \otimes_R A$. [Follow the proof of Proposition 3.] 

13. Prove Proposition 16 characterizing flat modules. 

14. Suppose $0 \to A \to B \to C \to 0$ is a short exact sequence of $R$-modules. Prove that if 
$C$ is a flat $R$-module, then $A$ is flat if and only if $B$ is also flat. [Use the Tor long exact 
sequence.] Give an example to show that if $A$ and $B$ are flat then $C$ need not be flat. 

15. (a) If $I$ is an ideal in $R$ and $M$ is an $R$-module, prove that $\operatorname{Tor}_1^R(M, R/I)$ is isomorphic 
to the kernel of the map $M \otimes_R I \to M$ that maps $m \otimes i$ to $mi$ for $i \in I$ and $m \in M$. 
[Use the Tor long exact sequence associated to $0 \to I \to R \to R/I \to 0$ noting that 
$R$ is flat.] 

(b) (A Flatness Criterion using Tor) Prove that the $R$-module $M$ is flat if and only if 
$\operatorname{Tor}_1^R(M, R/I) = 0$ for every finitely generated ideal $I$ of $R$. [Use Exercise 25 in 
Section 10.5.] 

16. Suppose $R$ is a P.I.D. and $A$ and $B$ are $R$-modules. If $t(B)$ denotes the torsion submodule 
of $B$ show that $\operatorname{Tor}_1^R(A, t(B)) \cong \operatorname{Tor}_1^R(A, B)$ and deduce that $\operatorname{Tor}_1^R(A, B)$ is isomorphic 
to $\operatorname{Tor}_1^R(t(A), t(B))$. [Use Exercise 26 in Section 10.5 to show that $B/t(B)$ is flat over 
$R$, then use the Tor long exact sequence with $D = A$ applied to the short exact sequence 
$0 \to t(B) \to B \to B/t(B) \to 0$ and the remarks following Proposition 16.] 

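17. Let $A = \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/3\mathbb{Z} \oplus \mathbb{Z}/4\mathbb{Z} \oplus \cdots$. Prove that $\operatorname{Ext}^1(A, B) \cong (B/2B) \times (B/3B) \times 
(B/4B) \times \cdots$ for any abelian group $B$. [Use Exercise 10.] Prove that $\operatorname{Ext}^1(A, B) = 0$ if 
and only if $B$ is divisible. 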

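18. Prove that $\mathbb{Z}/2\mathbb{Z}$ is a projective $\mathbb{Z}/6\mathbb{Z}$-module and deduce that $\operatorname{Tor}_1^{\mathbb{Z}/6\mathbb{Z}}(\mathbb{Z}/2\mathbb{Z}, \mathbb{Z}/2\mathbb{Z}) = 0$. 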

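19. Suppose $r \ne 0$ is not a zero divisor in the commutative ring $R$. 

(a) Prove that multiplication by $r$ gives a free resolution $0 \to R \xrightarrow{r} R \to R/rR \to 0$ of 
the quotient $R/rR$. 

(b) Prove that $\operatorname{Ext}^0_R(R/rR, B) \cong {}_rB$ is the set of elements $b \in B$ with $rb = 0$, that 
$\operatorname{Ext}^1_R(R/rR, B) \cong B/rB$, and that $\operatorname{Ext}^n_R(R/rR, B) = 0$ for $n \ge 2$ for every 
$R$-module $B$. 

(c) Prove that $\operatorname{Tor}_0^R(A, R/rR) \cong A/rA$, that $\operatorname{Tor}_1^R(A, R/rR) \cong {}_rA$ is the set of elements 
$a \in A$ with $ra = 0$, and that $\operatorname{Tor}_n^R(A, R/rR) = 0$ for $n \ge 2$ for every $R$-module $A$. 

20. Prove that $\operatorname{Tor}_0^{\mathbb{Z}/m\mathbb{Z}}(A, \mathbb{Z}/d\mathbb{Z}) \cong A/dA$, that $\operatorname{Tor}_n^{\mathbb{Z}/m\mathbb{Z}}(A, \mathbb{Z}/d\mathbb{Z}) \cong {}_dA/(m/d)A$ for $n$ odd, 
$n \ge 1$, and that $\operatorname{Tor}_n^{\mathbb{Z}/m\mathbb{Z}}(A, \mathbb{Z}/d\mathbb{Z}) \cong {}_{(m/d)}A/dA$ for $n$ even, $n \ge 2$. [Use the projective 
resolution in Example 2 following Proposition 3.] 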

21. Let $R = k[x, y]$ where $k$ is a field, and let $I$ be the ideal $(x, y)$ in $R$. 

(a) Let $\alpha : R \to R^2$ be the map $\alpha(r) = (yr, -xr)$ and let $\beta : R^2 \to R$ be the map 
$\beta((r_1, r_2)) = r_1 x + r_2 y$. Show that 

$$0 \longrightarrow R \xrightarrow{\ \alpha\ } R^2 \xrightarrow{\ \beta\ } R \longrightarrow k \longrightarrow 0$$

where the map $R \to R/I = k$ is the canonical projection, gives a free resolution of $k$ 
as an $R$-module. 

(b) Use the resolution in (a) to show that $\operatorname{Tor}_2^R(k, k) \cong k$. 

(c) Prove that $\operatorname{Tor}_1^R(k, I) \cong k$. [Use the long exact sequence corresponding to the short 
exact sequence $0 \to I \to R \to k \to 0$ and (b).] 

(d) Conclude from (c) that the torsion free $R$-module $I$ is not flat (compare to Exercise 
26 in Section 10.5). 

22. (Flat Base Change for Tor) Suppose $R$ and $S$ are commutative rings and $f : R \to S$ is a 
ring homomorphism making $S$ into an $R$-module as in Example 6 following Corollary 12 
in Section 10.4. Prove that if $S$ is flat as an $R$-module, then $\operatorname{Tor}_n^R(A, B) \cong \operatorname{Tor}_n^S(S \otimes_R A, B)$ 
for all $R$-modules $A$ and all $S$-modules $B$. [Show that since $S$ is flat, tensoring an $R$-module 
projective resolution for $A$ with $S$ gives an $S$-module projective resolution of $S \otimes_R A$.] 

23. (Localization and Tor) Let $D^{-1}R$ be the localization of the commutative ring $R$ with 
respect to the multiplicative subset $D$ of $R$. Prove that localization commutes with Tor, 
i.e., $D^{-1}\operatorname{Tor}_n^R(A, B) \cong \operatorname{Tor}_n^{D^{-1}R}(D^{-1}A, D^{-1}B)$ for all $R$-modules $A$ and $B$ and all $n \ge 0$. 
[Use the previous exercise and the fact that $D^{-1}R$ is flat over $R$, cf. Proposition 42(6) in 
Section 15.4.] 

24. (Flatness is local) Suppose $R$ is a commutative ring. Prove that an $R$-module $M$ is flat if 
and only if every localization $M_P$ is a flat $R_P$-module for every maximal (hence also for 
every prime) ideal $P$ in $R$. [Use the previous exercise together with the characterization of 
flatness in terms of Tor.] 

25. If $R$ is an integral domain with field of fractions $F$, prove that $\operatorname{Tor}_1^R(F/R, B) \cong t(B)$ for 
any $R$-module $B$, where $t(B)$ denotes the $R$-torsion submodule of $B$. 

An $R$-module $M$ is said to be finitely presented if there is an exact sequence 

$$R^s \longrightarrow R^t \longrightarrow M \longrightarrow 0$$

of $R$-modules for some integers $s$ and $t$. Equivalently, $M$ is finitely generated by $t$ elements 
and the kernel of the corresponding $R$-module homomorphism $R^t \to M$ can be generated by 
$s$ elements. 

26. (a) Prove that every finitely generated module over a Noetherian ring $R$ is finitely 
presented. [Use Exercise 8 in Section 15.1.] 

(b) Prove that an $R$-module $M$ is finitely presented and projective if and only if $M$ is a 
direct summand of $R^n$ for some integer $n \ge 1$. 

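27. Suppose that $M$ is a finitely presented $R$-module and that $0 \to A \xrightarrow{\alpha} B \xrightarrow{\beta} M \to 0$ is 
an exact sequence of $R$-modules. This exercise proves that if $B$ is a finitely generated 
$R$-module then $A$ is also a finitely generated $R$-module. 

(a) Suppose $R^s \xrightarrow{\psi} R^t \xrightarrow{\varphi} M \to 0$ and $e_1, \dots, e_t$ is an $R$-module basis for $R^t$. Show that 
there exist $b_1, \dots, b_t \in B$ so that $\beta(b_i) = \varphi(e_i)$ for $i = 1, \dots, t$. 

(b) If $f$ is the $R$-module homomorphism from $R^t$ to $B$ defined by $f(e_i) = b_i$ for 
$i = 1, \dots, t$, show that $f(\psi(R^s)) \subseteq \ker \beta$. [Use $\varphi \circ \psi = 0$.] Conclude that there is 
a commutative diagram 

$$
\begin{array}{ccccccccc}
 & & R^s & \xrightarrow{\ \psi\ } & R^t & \xrightarrow{\ \varphi\ } & M & \longrightarrow & 0 \\
 & & {\scriptstyle g}\downarrow & & {\scriptstyle f}\downarrow & & {\scriptstyle \mathrm{id}}\downarrow & & \\
0 & \longrightarrow & A & \xrightarrow{\ \alpha\ } & B & \xrightarrow{\ \beta\ } & M & \longrightarrow & 0
\end{array}
$$

of $R$-modules with exact rows. 

(c) Prove that $A/\operatorname{image} g \cong B/\operatorname{image} f$ and use this to prove that $A$ is finitely generated. 
[For the isomorphism, use the Snake Lemma in Exercise 3. Then show that $\operatorname{image} g$ 
and $A/\operatorname{image} g$ are both finitely generated and apply Exercise 7 of Section 10.3.] 

(d) If $I$ is an ideal of $R$ conclude that $R/I$ is a finitely presented $R$-module if and only if 
$I$ is a finitely generated ideal. 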

28. Suppose $R$ is a local ring with unique maximal ideal $\mathfrak{m}$ and $M$ is a finitely presented 
$R$-module. Suppose $m_1, \dots, m_s$ are elements in $M$ whose images in $M/\mathfrak{m}M$ form a basis 
for $M/\mathfrak{m}M$ as a vector space over the field $R/\mathfrak{m}$. 

(a) Prove that $m_1, \dots, m_s$ generate $M$ as an $R$-module. [Use Nakayama’s Lemma.] 

(b) Conclude from (a) that there is an exact sequence $0 \to \ker \varphi \to R^s \xrightarrow{\varphi} M \to 0$ that 
maps a set of free generators of $R^s$ to the elements $m_1, \dots, m_s$. Deduce that there is 
an exact sequence 

$$\operatorname{Tor}_1^R(M, R/\mathfrak{m}) \longrightarrow (\ker \varphi)/\mathfrak{m}(\ker \varphi) \longrightarrow 0.$$

[Use the Tor long exact sequence with respect to tensoring with $R/\mathfrak{m}$, using the fact 
that $N \otimes R/\mathfrak{m} \cong N/\mathfrak{m}N$ for any $R$-module $N$ (Example 8 following Corollary 12 in 
Section 10.4) and the fact that $\varphi : (R/\mathfrak{m})^s \cong M/\mathfrak{m}M$ is an isomorphism by the choice 
of $m_1, \dots, m_s$.] 

(c) Prove that if $\operatorname{Tor}_1^R(M, R/\mathfrak{m}) = 0$ then $m_1, \dots, m_s$ are a set of free $R$-module generators 
for $M$. [Use the previous exercise and Nakayama’s Lemma to show that $\ker \varphi = 0$.] 

29. Suppose $R$ is a local ring with unique maximal ideal $\mathfrak{m}$. This exercise proves that a finitely 
generated $R$-module is flat if and only if it is free. 

(a) Prove that $M = F/K$ is the quotient of a finitely generated free module $F$ by a 
submodule $K$ with $K \subseteq \mathfrak{m}F$. [Let $F$ be a free module with $F/\mathfrak{m}F \cong M/\mathfrak{m}M$.] 

(b) Suppose $x \in K$ and write $x = a_1 e_1 + \cdots + a_n e_n$ where $e_1, \dots, e_n$ are an $R$-basis 
for $F$. Let $I = (a_1, \dots, a_n)$ be the ideal of $R$ generated by $a_1, \dots, a_n$. Prove that 
if $M$ is flat, then $I = \mathfrak{m}I$ and deduce that $K = 0$, so $M$ is free. [Use Exercise 25(d) 
of Section 10.5 to see that $x \in IK \subseteq \mathfrak{m}IF$ and conclude that $I \subseteq \mathfrak{m}I$. Then apply 
Nakayama’s Lemma to the finitely generated ideal $I$.] 

30. Suppose $R$ is a local ring with unique maximal ideal $\mathfrak{m}$, $M$ is an $R$-module, and consider 
the following statements: 

(i) $M$ is a free $R$-module, 

(ii) $M$ is a projective $R$-module, 

(iii) $M$ is a flat $R$-module, and 

(iv) $\operatorname{Tor}_1^R(M, R/\mathfrak{m}) = 0$. 

(a) Prove that (i) implies (ii) implies (iii) implies (iv). 

(b) Prove that (i), (ii), and (iii) are equivalent if $M$ is finitely generated. (Exercise 34 
below shows (iii) need not imply (i) or (ii) if $M$ is finitely generated but $R$ is not local.) 
[Use the previous exercise.] 

(c) Prove that (i), (ii), (iii), and (iv) are equivalent if $M$ is finitely presented. (Exercise 
35 below shows that (iv) need not imply (i), (ii) or (iii) if $M$ is finitely generated but 
not finitely presented.) [Use Exercise 28.] 

Remark: It is a theorem of Kaplansky (cf. Projective Modules, Annals of Mathematics, 
68(1958), pp. 372-377) that (i) and (ii) are equivalent without the condition that $M$ be 
finitely generated. 

31. (Localization and Hom for Finitely Presented Modules) Suppose $D^{-1}R$ is the localization 
of the commutative ring $R$ with respect to the multiplicative subset $D$ of $R$, and let $M$ be 
a finitely presented $R$-module. 

(a) For any $R$-modules $A$ and $B$ prove there is a unique $D^{-1}R$-module homomorphism 
from $D^{-1}\operatorname{Hom}_R(A, B)$ to $\operatorname{Hom}_{D^{-1}R}(D^{-1}A, D^{-1}B)$ that maps $\varphi \in \operatorname{Hom}_R(A, B)$ to 
the homomorphism from $D^{-1}A$ to $D^{-1}B$ induced by $\varphi$. 

(b) For any $R$-module $N$ and any $m \ge 1$ show that $\operatorname{Hom}_R(R^m, N) \cong N^m$ as $R$-modules 
and deduce that $D^{-1}\operatorname{Hom}_R(R^m, N) \cong (D^{-1}N)^m$ as $D^{-1}R$-modules. 

(c) Suppose $R^s \to R^t \to M \to 0$ is exact. Prove there is a commutative diagram 

$$
\begin{array}{ccccccc}
0 & \to & D^{-1}\operatorname{Hom}_R(M, N) & \to & D^{-1}\operatorname{Hom}_R(R^t, N) & \to & D^{-1}\operatorname{Hom}_R(R^s, N) \\
 & & \downarrow & & \downarrow & & \downarrow \\
0 & \to & \operatorname{Hom}_{D^{-1}R}(D^{-1}M, D^{-1}N) & \to & \operatorname{Hom}_{D^{-1}R}((D^{-1}R)^t, D^{-1}N) & \to & \operatorname{Hom}_{D^{-1}R}((D^{-1}R)^s, D^{-1}N)
\end{array}
$$

of $D^{-1}R$-modules with exact rows. [For the first row first take $R$-module 
homomorphisms from the terms in the presentation for $M$ into $N$ using Theorem 33 of Section 
10.5 (noting the first comment in the proof) and then tensor with the flat $R$-module 
$D^{-1}R$, cf. Propositions 41 and 42(6) in Section 15.4. For the second row first 
tensor the presentation with $D^{-1}R$ and then take $D^{-1}R$-module homomorphisms into 
$D^{-1}N$.] 

(d) Use (b) to prove that localization commutes with taking homomorphisms when $M$ 
is finitely presented, i.e., $D^{-1}\operatorname{Hom}_R(M, N) \cong \operatorname{Hom}_{D^{-1}R}(D^{-1}M, D^{-1}N)$ as 
$D^{-1}R$-modules. [Show the second two vertical maps in the diagram above are isomorphisms 
and deduce that the left vertical map is also an isomorphism.] (This result is not true 
in general if $M$ is not finitely presented.) 

32. (Localization and Ext for Finitely Presented Modules) Suppose $D^{-1}R$ is the localization 
of the commutative ring $R$ with respect to the multiplicative subset $D$ of $R$. Prove that 
if $M$ is a finitely presented $R$-module then $D^{-1}\operatorname{Ext}^n_R(M, N) \cong \operatorname{Ext}^n_{D^{-1}R}(D^{-1}M, D^{-1}N)$ 
as $D^{-1}R$-modules for every $R$-module $N$ and every $n \ge 0$. [Use a projective resolution 
of $N$ and the previous exercise, noting that tensoring the resolution with $D^{-1}R$ gives a 
projective resolution for the $D^{-1}R$-module $D^{-1}N$.] 

33. Suppose $R$ is a commutative ring and $M$ is a finitely presented $R$-module (for example a
finitely generated module over a Noetherian ring, or a quotient, $R/I$, of $R$ by a finitely
generated ideal $I$, cf. Exercises 26 and 27). Prove that the following are equivalent:

(a) $M$ is a projective $R$-module,

(b) $M$ is a flat $R$-module,

(c) $M$ is locally free, i.e., each localization $M_P$ is a free $R_P$-module for every maximal
(hence also for every prime) ideal $P$ of $R$.

In particular show that finitely generated projective modules are the same as finitely pre- 
sented flat modules. [Exercises 24 and 30 show that (b) is equivalent to (c). Use the Ext 
criterion for projectivity and Exercises 30 and 32 to see that (a) is equivalent to (c).] 

34. (a) Prove that every $R$-module for the commutative ring $R$ is flat if and only if every
finitely generated ideal $I$ of $R$ is a direct summand of $R$, in which case every finitely
generated ideal of $R$ is principal and projective (such a ring is said to be absolutely flat).
[Use Exercise 15, the previous exercise applied to the finitely presented $R$-module
$R/I$, and the remarks following Proposition 16.]

(b) Prove that every Boolean ring is absolutely flat. [Use Exercise 24 in Section 7.4,
noting that if $I = Rx$ then $x$ is an idempotent so $R = Rx \oplus R(1 - x)$.]

(c) Let $R$ be the direct product and $I$ the direct sum of countably many copies of $\mathbb{Z}/2\mathbb{Z}$.
Prove that $I$ is an ideal of the Boolean ring $R$ that is not finitely generated and that
the cyclic $R$-module $M = R/I$ is flat but not projective (so finitely generated flat
modules need not be projective).

35. Let $R$ be the local ring obtained by localizing the ring of $C^\infty$ functions on the open interval
$(-1, 1)$ at the maximal ideal of functions that are 0 at $x = 0$ (cf. Exercise 45 of Section
15.2), let m = (x) be the unique maximal ideal of R and let P be the prime ideal 

Set M = R/P . 

(a) Prove that $\mathrm{Tor}_1^R(M, R/m) = 0$. [Use Exercise 19 applied with $r = x$, noting that
$R/P$ is an integral domain.]

(b) Prove that $M$ is not flat (hence not projective). [Let $F$ be as in Exercise 45 of Section
15.2. Show that the sequence $0 \to R \to R \to R/(F) \to 0$ induced by multiplication
by $F$ is exact, but is not exact after tensoring with $M$.]

17.2 THE COHOMOLOGY OF GROUPS


In this section we consider the application of the general techniques of the previous 
section in an important special case. 

Let G be a group. 

Definition. An abelian group A on which G acts (on the left) as automorphisms is 
called a G-module. 

Note that a $G$-module is the same as an abelian group $A$ and a homomorphism
$\varphi : G \to \mathrm{Aut}(A)$ of $G$ into the group of automorphisms of $A$. Since an abelian group
is the same as a module over $\mathbb{Z}$, it is also easy to see that a $G$-module $A$ is the same as
a module over the integral group ring, $\mathbb{Z}G$, of $G$ with coefficients in $\mathbb{Z}$. When $G$ is an
infinite group the ring $\mathbb{Z}G$ consists of all the finite formal sums of elements of $G$ with
coefficients in $\mathbb{Z}$.

As usual we shall often use multiplicative notation and write $ga$ in place of $g \cdot a$ for
the action of the element $g \in G$ on the element $a \in A$.

Definition. If $A$ is a $G$-module, let $A^G = \{a \in A \mid ga = a \text{ for all } g \in G\}$ be the
elements of $A$ fixed by all the elements of $G$.

Examples 

(1) If $ga = a$ for all $a \in A$ and $g \in G$ then $G$ is said to act trivially on $A$. In this case
$A^G = A$. The abelian group $\mathbb{Z}$ will always be assumed to have trivial $G$-action for
any group $G$ unless otherwise stated.

(2) For any $G$-module $A$ the fixed points $A^G$ of $A$ under the action of $G$ is clearly a
$\mathbb{Z}G$-submodule of $A$ on which $G$ acts trivially.

(3) If $V$ is a vector space over the field $F$ of dimension $n$ and $G = GL_n(F)$ then $V$ is
naturally a $G$-module. In this case $V^G = (0)$ since any nonzero element in $V$ can be
taken to any other nonzero element in $V$ by some linear transformation.

(4) A semidirect product $E = A \rtimes G$ as in Section 5.5 in the case where $A$ is an abelian
normal subgroup gives a $G$-module $A$ where the action of $G$ is given by the homomorphism
$\varphi : G \to \mathrm{Aut}(A)$. The subgroup $A^G$ consists of the elements of $A$ lying
in the center of $E$. More generally, if $A$ is any abelian normal subgroup of a group
$E$, then $E$ acts on $A$ by conjugation and this makes $A$ into an $E$-module and also an
$E/A$-module. In this case $A^E = A^{E/A}$ also consists of the elements of $A$ lying in the
center of $E$.

(5) If $K/F$ is an extension of fields that is Galois with Galois group $G$ then the additive
group $K$ is naturally a $G$-module, with $K^G = F$. Similarly, the multiplicative group
$K^\times$ of nonzero elements in $K$ is a $G$-module, with fixed points $(K^\times)^G = F^\times$.

The fixed point subgroups in this last example played a central role in Galois Theory
in Chapter 14. In general, it is easy to see that a short exact sequence

$$0 \to A \to B \to C \to 0$$

of $G$-modules induces an exact sequence

$$0 \to A^G \to B^G \to C^G \qquad (17.15)$$

that in general cannot be extended to a short exact sequence (in general a coset in the
quotient $C$ that is fixed by $G$ need not be represented by an element in $B$ fixed by $G$).
One way to see that (15) is exact is to observe that $A^G$ can be related to a Hom group:

Lemma 19. Suppose $A$ is a $G$-module and $\mathrm{Hom}_{\mathbb{Z}G}(\mathbb{Z}, A)$ is the group of all $\mathbb{Z}G$-module
homomorphisms from $\mathbb{Z}$ (with trivial $G$-action) to $A$. Then $A^G \cong \mathrm{Hom}_{\mathbb{Z}G}(\mathbb{Z}, A)$.

Proof: Any $G$-module homomorphism $\alpha$ from $\mathbb{Z}$ to $A$ is uniquely determined by
its value on 1. Let $\alpha_a$ denote the $G$-module homomorphism with $\alpha(1) = a$. Since $\alpha_a$ is
a $G$-module homomorphism, $a = \alpha_a(1) = \alpha_a(g \cdot 1) = g \cdot \alpha_a(1) = g \cdot a$ for all $g \in G$,
so that $a$ must lie in $A^G$. Likewise, for any $a \in A^G$ it is easy to check that the map
$\alpha_a \mapsto a$ gives an isomorphism from $\mathrm{Hom}_{\mathbb{Z}G}(\mathbb{Z}, A)$ to $A^G$.
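
The failure of right exactness noted before Lemma 19 can be checked by hand in a very small case. The following Python sketch is only an illustration under assumed data that does not appear in the text: $G$ of order 2 acts on $B = \mathbb{Z}/4\mathbb{Z}$ by negation, $A = \{0, 2\}$ and $C = B/A$. All function names are hypothetical helpers.

```python
# Assumed example (not from the text): G = {1, s} of order 2 acts on B = Z/4 by s.b = -b.
# A = {0, 2} is a G-submodule and C = B/A, so 0 -> A -> B -> C -> 0 is exact.

G = [0, 1]          # 0 stands for the identity, 1 for the generator s
B = [0, 1, 2, 3]
A = [0, 2]

def act(g, b):
    """Action of g on b in Z/4: the identity fixes b, s negates it."""
    return b % 4 if g == 0 else (-b) % 4

def fixed(elements):
    """Elements of the given list that are fixed by every g in G."""
    return [x for x in elements if all(act(g, x) == x for g in G)]

def coset(b):
    """The coset b + A, represented as a frozenset; these are the elements of C."""
    return frozenset((b + a) % 4 for a in A)

def act_on_coset(g, c):
    """Induced action of g on a coset."""
    return frozenset(act(g, b) for b in c)

C = sorted({coset(b) for b in B}, key=min)

A_fixed = fixed(A)
B_fixed = fixed(B)
C_fixed = [c for c in C if all(act_on_coset(g, c) == c for g in G)]
image = {coset(b) for b in B_fixed}   # image of B^G in C^G

print("A^G =", A_fixed, " B^G =", B_fixed)
print("|C^G| =", len(C_fixed), " |image of B^G| =", len(image))
```

In this example the coset $1 + A$ is fixed by $G$, but neither of its representatives 1, 3 is fixed, so $B^G \to C^G$ is not surjective.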

Combined with the results of the previous section, the lemma not only shows that
the sequence (15) is exact, it shows that any projective resolution of $\mathbb{Z}$ considered as
a $\mathbb{Z}G$-module will give a long exact sequence extending (15). One such projective
resolution is the standard resolution or bar resolution of $\mathbb{Z}$:

$$\cdots \to F_n \xrightarrow{d_n} F_{n-1} \to \cdots \xrightarrow{d_1} F_0 \xrightarrow{\mathrm{aug}} \mathbb{Z} \to 0. \qquad (17.16)$$

Here $F_n = \mathbb{Z}G \otimes_{\mathbb{Z}} \mathbb{Z}G \otimes_{\mathbb{Z}} \cdots \otimes_{\mathbb{Z}} \mathbb{Z}G$ (where there are $n + 1$ factors) for $n \ge 0$, which
is a $G$-module under the action defined on simple tensors by $g \cdot (g_0 \otimes g_1 \otimes \cdots \otimes g_n) =
(g g_0) \otimes g_1 \otimes \cdots \otimes g_n$. It is not difficult to see that $F_n$ is a free $\mathbb{Z}G$-module of rank $|G|^n$
with $\mathbb{Z}G$ basis given by the elements $1 \otimes g_1 \otimes g_2 \otimes \cdots \otimes g_n$, where $g_i \in G$. The map
$\mathrm{aug} : F_0 \to \mathbb{Z}$ is the augmentation map $\mathrm{aug}(\sum_{g \in G} \alpha_g g) = \sum_{g \in G} \alpha_g$, and the map $d_1$
is given by $d_1(1 \otimes g) = g - 1$. The maps $d_n$ for $n \ge 2$ are more complicated and their
definition, together with a proof that (16) is a projective (in fact, free) resolution can be
found in Exercises 1-3.

Applying ($\mathbb{Z}G$-module) homomorphisms from the terms in (16) to the $G$-module $A$
(replacing the first term by 0) as in the previous section, we obtain the cochain complex

$$0 \to \mathrm{Hom}_{\mathbb{Z}G}(F_0, A) \xrightarrow{d_0} \mathrm{Hom}_{\mathbb{Z}G}(F_1, A) \xrightarrow{d_1} \mathrm{Hom}_{\mathbb{Z}G}(F_2, A) \xrightarrow{d_2} \cdots, \qquad (17.17)$$

the cohomology groups of which are, by definition, the groups $\mathrm{Ext}^n_{\mathbb{Z}G}(\mathbb{Z}, A)$. Then, as
in Theorem 8, the short exact sequence $0 \to A \to B \to C \to 0$ of $G$-modules
gives rise to a long exact sequence whose first terms are given by (15) and whose higher
terms are the cohomology groups $\mathrm{Ext}^n_{\mathbb{Z}G}(\mathbb{Z}, A)$.

To make this more explicit, we can reinterpret the terms in this cochain complex
without explicit reference to the standard resolution of $\mathbb{Z}$, as follows. The elements
of $\mathrm{Hom}_{\mathbb{Z}G}(F_n, A)$ are uniquely determined by their values on the $\mathbb{Z}G$ basis elements
of $F_n$, which may be identified with the $n$-tuples $(g_1, g_2, \dots, g_n)$ of elements $g_i$ of $G$.
It follows for $n \ge 1$ that the group $\mathrm{Hom}_{\mathbb{Z}G}(F_n, A)$ may be identified with the set of
functions from $G \times \cdots \times G$ ($n$ copies) to $A$. For $n = 0$ we identify $\mathrm{Hom}_{\mathbb{Z}G}(\mathbb{Z}G, A)$
with $A$.

Definition. If $G$ is a finite group and $A$ is a $G$-module, define $C^0(G, A) = A$ and for
$n \ge 1$ define $C^n(G, A)$ to be the collection of all maps from $G^n = G \times \cdots \times G$ ($n$
copies) to $A$. The elements of $C^n(G, A)$ are called $n$-cochains (of $G$ with values in $A$).

Each $C^n(G, A)$ is an additive abelian group: for $C^0(G, A) = A$ given by the
group structure on $A$; for $n \ge 1$ given by the usual pointwise addition of functions:
$(f_1 + f_2)(g_1, g_2, \dots, g_n) = f_1(g_1, g_2, \dots, g_n) + f_2(g_1, g_2, \dots, g_n)$. Under the
identification of $\mathrm{Hom}_{\mathbb{Z}G}(F_n, A)$ with $C^n(G, A)$ the cochain maps $d_n$ in (17) can be given
very explicitly (cf. also Exercise 3 and the following comment):

Definition. For $n \ge 0$, define the $n$th coboundary homomorphism from $C^n(G, A)$ to
$C^{n+1}(G, A)$ by

$$
\begin{aligned}
d_n(f)(g_1, \dots, g_{n+1}) = {} & g_1 \cdot f(g_2, \dots, g_{n+1}) \\
& + \sum_{i=1}^{n} (-1)^i f(g_1, \dots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \dots, g_{n+1}) \\
& + (-1)^{n+1} f(g_1, \dots, g_n) \qquad (17.18)
\end{aligned}
$$

where the product $g_i g_{i+1}$ occupying the $i$th position of $f$ is taken in the group $G$.

It is immediate from the definition that the maps $d_n$ are group homomorphisms. It
follows from the fact that (17) is the cochain complex obtained from the projective
resolution (16) that $d_n \circ d_{n-1} = 0$ for $n \ge 1$ (a self contained direct proof just from
the definition of $d_n$ above can also be given, but is tedious).
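
The coboundary formula (17.18) is easy to experiment with on a small group. The Python sketch below is an illustration only: it assumes $G = \mathbb{Z}/2\mathbb{Z}$ acting on $A = \mathbb{Z}/4\mathbb{Z}$ by negation (a choice made here, not in the text), stores an $n$-cochain as a dictionary keyed by $n$-tuples, and checks by brute force that applying two consecutive coboundary maps gives zero in low degrees. The helper names are hypothetical.

```python
from itertools import product

# Assumed example: G = Z/2 = {0, 1} under addition mod 2, acting on A = Z/4
# with the nonzero element acting by negation.
M = 4
G = [0, 1]

def mul(g, h):
    """Group law of G."""
    return (g + h) % 2

def act(g, a):
    """Action of g in G on a in A."""
    return a % M if g == 0 else (-a) % M

def coboundary(f, n):
    """Apply d_n of (17.18) to the n-cochain f, returning an (n+1)-cochain."""
    df = {}
    for gs in product(G, repeat=n + 1):
        # first term: g_1 . f(g_2, ..., g_{n+1})
        total = act(gs[0], f[gs[1:]])
        # middle terms: (-1)^i f(g_1, ..., g_i g_{i+1}, ..., g_{n+1}) for i = 1..n
        for i in range(1, n + 1):
            merged = gs[:i - 1] + (mul(gs[i - 1], gs[i]),) + gs[i + 1:]
            total += (-1) ** i * f[merged]
        # last term: (-1)^{n+1} f(g_1, ..., g_n)
        total += (-1) ** (n + 1) * f[gs[:-1]]
        df[gs] = total % M
    return df

def all_cochains(n):
    """Enumerate every n-cochain; for n = 0 the only key is the empty tuple."""
    keys = list(product(G, repeat=n))
    for values in product(range(M), repeat=len(keys)):
        yield dict(zip(keys, values))

def d_squared_is_zero(n):
    """Check that d_{n+1}(d_n(f)) = 0 for every n-cochain f."""
    return all(
        all(v == 0 for v in coboundary(coboundary(f, n), n + 1).values())
        for f in all_cochains(n)
    )

print([d_squared_is_zero(n) for n in range(3)])
```

The exhaustive check is feasible only because $G$ and $A$ are tiny; it is a sanity check of the formula, not a substitute for the proof.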

Definition.

(1) Let $Z^n(G, A) = \ker d_n$ for $n \ge 0$. The elements of $Z^n(G, A)$ are called
$n$-cocycles.

(2) Let $B^n(G, A) = \mathrm{image}\, d_{n-1}$ for $n \ge 1$ and let $B^0(G, A) = 1$. The elements of
$B^n(G, A)$ are called $n$-coboundaries.

Since $d_n \circ d_{n-1} = 0$ for $n \ge 1$ we have $\mathrm{image}\, d_{n-1} \subseteq \ker d_n$, so that $B^n(G, A)$ is
always a subgroup of $Z^n(G, A)$.

Definition. For any $G$-module $A$ the quotient group $Z^n(G, A)/B^n(G, A)$ is called the
$n$th cohomology group of $G$ with coefficients in $A$ and is denoted by $H^n(G, A)$, $n \ge 0$.

The definition of the cohomology group $H^n(G, A)$ in terms of cochains will be
particularly useful in the following two sections when we examine the low dimensional
groups $H^1(G, A)$ and $H^2(G, A)$ and their application in a variety of settings. It should
be remembered, however, that $H^n(G, A) \cong \mathrm{Ext}^n_{\mathbb{Z}G}(\mathbb{Z}, A)$ for all $n \ge 0$. In particular,
these groups can be computed using any projective resolution of $\mathbb{Z}$.
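
For a finite group and a finite module the cochain description can be evaluated by direct enumeration. As a hedged illustration, the Python sketch below counts 1-cocycles and 1-coboundaries for the assumed example $G = \mathbb{Z}/2\mathbb{Z}$ acting on $A = \mathbb{Z}/4\mathbb{Z}$ by negation (data chosen here, not taken from the text), using the condition $d_1(f) = 0$, i.e. $f(gh) = g \cdot f(h) + f(g)$.

```python
from itertools import product

# Assumed example: G = Z/2 = {0, 1} (written additively) acting on A = Z/4 by negation.
M = 4
G = [0, 1]

def mul(g, h):
    """Group law of G."""
    return (g + h) % 2

def act(g, a):
    """Action of g in G on a in A."""
    return a % M if g == 0 else (-a) % M

def is_1_cocycle(f):
    """f lies in Z^1(G, A) exactly when f(gh) = g.f(h) + f(g) for all g, h."""
    return all(
        f[mul(g, h)] == (act(g, f[h]) + f[g]) % M
        for g in G for h in G
    )

# All 1-cochains: functions G -> A, stored as dictionaries.
cochains = [dict(zip(G, values)) for values in product(range(M), repeat=len(G))]
cocycles = [f for f in cochains if is_1_cocycle(f)]

# 1-coboundaries: the functions g -> g.a - a for a in A (the image of d_0).
coboundaries = {tuple((act(g, a) - a) % M for g in G) for a in range(M)}

# Both groups are finite, so |H^1| = |Z^1| / |B^1|.
order_H1 = len(cocycles) // len(coboundaries)
print(len(cocycles), len(coboundaries), order_H1)
```

Here there are 4 cocycles and 2 coboundaries, so $H^1(G, A)$ has order 2 in this example.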

Examples 

(1) For $f = a \in C^0(G, A)$ we have $d_0(f)(g) = g \cdot a - a$ and so $\ker d_0$ is the set
$\{a \in A \mid g \cdot a = a \text{ for all } g \in G\}$, i.e., $Z^0(G, A) = A^G$ and so

$$H^0(G, A) = A^G$$

for any group $G$ and $G$-module $A$.

(2) Suppose $G = 1$ is the trivial group. Then $G^n = \{(1, 1, \dots, 1)\}$ is also the trivial group,
so $f \in C^n(G, A)$ is completely determined by $f(1, 1, \dots, 1) = a \in A$. Identifying
$f = a$ we obtain $C^n(G, A) = A$ for all $n \ge 0$. Then, if $f = a \in A$,

$$d_n(f)(1, 1, \dots, 1) = a + \sum_{i=1}^{n} (-1)^i a + (-1)^{n+1} a =
\begin{cases} 0 & \text{if $n$ is even} \\ a & \text{if $n$ is odd} \end{cases}$$

so $d_n = 0$ if $n$ is even and $d_n = 1$ is the identity if $n$ is odd. Hence

$$H^0(1, A) = A^G = A, \qquad H^n(1, A) = 0 \text{ for all } n \ge 1.$$


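Example: (Cohomology of a Finite Cyclic Group)

Suppose $G$ is cyclic of order $m$ with generator $\sigma$. Let $N = 1 + \sigma + \sigma^2 + \cdots + \sigma^{m-1} \in \mathbb{Z}G$.
Then $N(\sigma - 1) = (\sigma - 1)N = \sigma^m - 1 = 0$, and so we have a particularly simple free
resolution

$$\cdots \xrightarrow{\sigma - 1} \mathbb{Z}G \xrightarrow{N} \mathbb{Z}G \xrightarrow{\sigma - 1} \cdots \xrightarrow{N} \mathbb{Z}G \xrightarrow{\sigma - 1} \mathbb{Z}G \xrightarrow{\mathrm{aug}} \mathbb{Z} \to 0$$

where aug denotes the augmentation map (cf. Exercise 8). Taking $\mathbb{Z}G$-module homomorphisms
from the terms of this resolution to $A$ (replacing the first term by 0) and using the
identification $\mathrm{Hom}_{\mathbb{Z}G}(\mathbb{Z}G, A) = A$ gives the cochain complex

$$0 \to A \xrightarrow{\sigma - 1} A \xrightarrow{N} A \xrightarrow{\sigma - 1} A \xrightarrow{N} \cdots$$

whose cohomology computes the groups $H^n(G, A)$:

$$H^0(G, A) = A^G, \quad \text{and} \quad H^n(G, A) =
\begin{cases} A^G/NA & \text{if $n$ is even, } n \ge 2 \\ {}_N A/(\sigma - 1)A & \text{if $n$ is odd, } n \ge 1 \end{cases}$$

where ${}_N A = \{a \in A \mid Na = 0\}$ is the subgroup of $A$ annihilated by $N$, since the kernel of
multiplication by $\sigma - 1$ is $A^G$.

If in particular $G = \langle \sigma \rangle$ acts trivially on $A$, then $N \cdot a = ma$, so that in this case
$H^0(G, A) = A$, with $H^n(G, A) = A/mA$ for even $n \ge 2$, and $H^n(G, A) = {}_m A$, the
elements of $A$ of order dividing $m$, for odd $n \ge 1$. Specializing even further to $m = 1$ gives
Example 2 previously.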
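
The formulas for a finite cyclic group reduce the computation to kernels and images of two maps, which can be tabulated directly when $A$ is finite. The Python sketch below is a minimal illustration under an assumed setup not found in the text: $A = \mathbb{Z}/k\mathbb{Z}$ with the generator $\sigma$ acting as multiplication by a unit $u$ satisfying $u^m \equiv 1 \pmod k$. The function name is hypothetical.

```python
def cyclic_cohomology_orders(m, k, u):
    """Orders of H^0, H^n (n even, n >= 2) and H^n (n odd) for G cyclic of order m
    acting on A = Z/k with the generator acting as multiplication by u."""
    if pow(u, m, k) != 1 % k:
        raise ValueError("u must satisfy u^m = 1 mod k")
    A = range(k)
    norm = sum(pow(u, i, k) for i in range(m)) % k        # N = 1 + sigma + ... + sigma^(m-1)
    fixed = [a for a in A if (u * a - a) % k == 0]        # A^G = kernel of sigma - 1
    norm_image = {(norm * a) % k for a in A}              # N A
    norm_kernel = [a for a in A if (norm * a) % k == 0]   # _N A = kernel of N
    aug_image = {((u - 1) * a) % k for a in A}            # (sigma - 1) A
    return (len(fixed),
            len(fixed) // len(norm_image),
            len(norm_kernel) // len(aug_image))

# Order 2 acting on Z/4 by negation, then trivially, then on Z/3 by negation.
print(cyclic_cohomology_orders(2, 4, 3))
print(cyclic_cohomology_orders(2, 4, 1))
print(cyclic_cohomology_orders(2, 3, 2))
```

Only the orders of the groups are returned, which is enough to compare with the trivial-action formulas $A/mA$ and ${}_m A$ above; with $m = 1$ the function returns trivial higher cohomology, in agreement with Example 2.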


Proposition 20. Suppose $mA = 0$ for some integer $m \ge 1$ (i.e., the $G$-module $A$ has
exponent dividing $m$ as an abelian group). Then

$$mZ^n(G, A) = mB^n(G, A) = mH^n(G, A) = 0 \text{ for all } n \ge 0.$$

In particular, if $A$ has exponent $p$ for some prime $p$ then the abelian groups $Z^n(G, A)$,
$B^n(G, A)$ and $H^n(G, A)$ have exponent dividing $p$ and so these groups are all vector
spaces over the finite field $\mathbb{F}_p = \mathbb{Z}/p\mathbb{Z}$.

Proof: If $f \in C^n(G, A)$ is an $n$-cochain then $f \in A$ (if $n = 0$), in which case
$mf = 0$, or $f$ is a function from $G^n$ to $A$ (if $n \ge 1$), in which case $mf$ is a function
from $G^n$ to $mA = 0$, so again $mf = 0$. Hence $mZ^n(G, A) = mB^n(G, A) = 0$ since
these are subgroups of $C^n(G, A)$. Then $mH^n(G, A) = 0$ since $mZ^n(G, A) = 0$, and
the remaining statements in the proposition are immediate.

By Example 1, the long exact sequence in Theorem 10 written in terms of the
cohomology groups $H^n(G, A)$ becomes

Theorem 21. (Long Exact Sequence in Group Cohomology) Suppose

$$0 \to A \to B \to C \to 0$$

is a short exact sequence of $G$-modules. Then there is a long exact sequence:

$$
\begin{aligned}
0 \to A^G \to B^G \to C^G &\to H^1(G, A) \to H^1(G, B) \to H^1(G, C) \to \\
\cdots &\to H^n(G, A) \to H^n(G, B) \to H^n(G, C) \to H^{n+1}(G, A) \to \cdots
\end{aligned}
$$

of abelian groups.


Among many other uses of the long exact sequence in Theorem 21 is a technique
called dimension shifting which makes it possible to analyze the cohomology group
$H^{n+1}(G, A)$ of dimension $n + 1$ for $A$ by instead considering a cohomology group of
dimension $n$ for a different $G$-module. The technique is based on finding a $G$-module
almost all of whose cohomology groups are zero. Such modules are given a name:

Definition. A $G$-module $M$ is called cohomologically trivial for $G$ if $H^n(G, M) = 0$
for all $n \ge 1$.

Corollary 22. (Dimension Shifting) Suppose $0 \to A \to M \to C \to 0$ is a short exact
sequence of $G$-modules and that $M$ is cohomologically trivial for $G$. Then there is an
exact sequence

$$0 \to A^G \to M^G \to C^G \to H^1(G, A) \to 0$$

and

$$H^{n+1}(G, A) \cong H^n(G, C) \text{ for all } n \ge 1.$$

Proof: Since $M$ is cohomologically trivial for $G$, the portion

$$H^n(G, M) \to H^n(G, C) \to H^{n+1}(G, A) \to H^{n+1}(G, M)$$

of the long exact sequence in Theorem 21 reduces to

$$0 \to H^n(G, C) \to H^{n+1}(G, A) \to 0$$

which shows that $H^n(G, C) \cong H^{n+1}(G, A)$ for $n \ge 1$. Similarly, the first portion of
the long exact sequence in Theorem 21 gives the first statement in the corollary.

We now indicate a natural construction that produces a $G$-module given a module
over a subgroup $H$ of $G$. When $H = 1$ is the trivial group this construction produces
a cohomologically trivial module $M$ and an exact sequence as in Corollary 22 for any
$G$-module $A$.

Definition. If $H$ is a subgroup of $G$ and $A$ is an $H$-module, define the induced
$G$-module $M_H^G(A)$ to be $\mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A)$. In other words, $M_H^G(A)$ is the set of maps $f$
from $G$ to $A$ satisfying $f(hx) = hf(x)$ for every $x \in G$ and $h \in H$.

The action of an element $g \in G$ on $f \in M_H^G(A)$ is given by $(g \cdot f)(x) = f(xg)$ for
$x \in G$ (cf. Exercise 10 in Section 10.5).

Recall that if $H$ is a subgroup of $G$ and $A$ is an $H$-module, then the module
$\mathbb{Z}G \otimes_{\mathbb{Z}H} A$ obtained by extension of scalars from $\mathbb{Z}H$ to $\mathbb{Z}G$ is a $G$-module. For a finite
group $G$, or more generally if $H$ has finite index in $G$, we have $M_H^G(A) \cong \mathbb{Z}G \otimes_{\mathbb{Z}H} A$ (cf.
Exercise 10). When $G$ is infinite this need no longer be the case (cf. Exercise 11). The
module $\mathbb{Z}G \otimes_{\mathbb{Z}H} A$ is sometimes called the induced $G$-module and the module $M_H^G(A)$
is sometimes referred to as the coinduced $G$-module. For finite groups, associativity
of the tensor product shows that $M_H^G(M_K^H(A)) \cong M_K^G(A)$ for subgroups $K \le H \le G$,
and the same result holds in general (this follows from the definition using Exercise 7).

Examples 

(1) If $H$ is a subgroup of $G$ and $0 \to A \to B \to C \to 0$ is a short exact sequence of
$H$-modules then $0 \to M_H^G(A) \to M_H^G(B) \to M_H^G(C) \to 0$ is a short exact sequence
of $G$-modules, since $M_H^G(A) \cong \mathbb{Z}G \otimes_{\mathbb{Z}H} A$ and $\mathbb{Z}G$ is free, hence flat, over $\mathbb{Z}H$.

(2) When $G$ is finite and $A$ is the trivial $H$-module $\mathbb{Z}$, the module $M_H^G(\mathbb{Z})$ is a free
$\mathbb{Z}$-module of rank $m = |G : H|$. There is a basis $b_1, \dots, b_m$ such that $G$ permutes
these basis elements in the same way it permutes the left cosets of $H$ in $G$ by left
multiplication, i.e., if we let $b_i \leftrightarrow g_i H$ then $g b_i = b_j$ if and only if $g g_i H = g_j H$. The
module $M_H^G(\mathbb{Z})$ is the permutation module over $\mathbb{Z}$ for $G$ with stabilizer $H$. A special
case of interest is when $G = S_m$ and $H = S_{m-1}$ where $S_m$ permutes $\{1, 2, \dots, m\}$ as
usual. Permutation modules and induced modules over fields are studied in Part VI.

(3) Any abelian group $A$ is an $H$-module when $H = 1$ is the trivial group. The corresponding
induced $G$-module $M_1^G(A)$ is just the collection of all maps $f$ from $G$ into
$A$. For $g \in G$ the map $g \cdot f \in M_1^G(A)$ satisfies $(g \cdot f)(x) = f(xg)$ for $x \in G$.

(4) Suppose $A$ is a $G$-module. Then there is a natural map

$$\varphi : A \to M_1^G(A)$$

from $A$ into the induced $G$-module $M_1^G(A)$ in the previous example defined by mapping
$a \in A$ to the function $f_a$ with $f_a(x) = xa$ for all $x \in G$. It is clear that $\varphi$ is a group
homomorphism, and $f_{ga}(x) = x(ga) = (xg)a = f_a(xg) = (g \cdot f_a)(x)$ shows that $\varphi$ is
a $G$-module homomorphism as well. Since $f_a(1) = a$, it follows that $f_a$ is the zero
function on $G$ if and only if $a = 0$ in $A$, so that $\varphi$ is an injection. Hence we may
identify $A$ as a $G$-submodule of the induced module $M_1^G(A)$.

(5) More generally, if $A$ is a $G$-module and $H$ is any subgroup of $G$ then the function
$f_a(x)$ in the previous example is an element in the subgroup $M_H^G(A)$ since we have
$f_a(hx) = (hx)(a) = h(xa) = hf_a(x)$ for all $h \in H$. The associated map from $A$ to
$M_H^G(A)$ is an injective $G$-module homomorphism.

(6) The fixed points $(M_H^G(A))^G$ are maps $f$ from $G$ to $A$ with $gf = f$ for all $g \in G$, i.e.,
with $(gf)(x) = f(x)$ for all $g, x \in G$. By definition of the $G$-action on $M_H^G(A)$, this
is the equation $f(xg) = f(x)$ for all $g, x \in G$. Taking $x = 1$ shows that $f$ is constant
on all of $G$: $f(g) = f(1) = a \in A$. The constant function $f = a$ is an element of
$M_H^G(A)$ if and only if $a = f(hx) = hf(x) = ha$ for all $h \in H$, so $(M_H^G(A))^G = A^H$.
An element $f_a(x)$ in the previous example is contained in the subgroup $(M_H^G(A))^G$ if
and only if $xa$ is constant for $x \in G$, i.e., if and only if $a \in A^G$.
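
The description of $M_H^G(A)$ as functions with $f(hx) = hf(x)$, and the fixed-point computation in Example 6, can be verified by enumeration in a small case. The Python sketch below assumes data chosen for illustration only: $G = \mathbb{Z}/4\mathbb{Z}$ written additively, $H = \{0, 2\}$, and $A = \mathbb{Z}/4\mathbb{Z}$ with the nonzero element of $H$ acting by negation. The helper names are hypothetical.

```python
from itertools import product

# Assumed example: G = Z/4 (additive), H = {0, 2}, A = Z/4 with the
# nonzero element of H acting on A by negation.
K = 4
G = [0, 1, 2, 3]
H = [0, 2]

def h_act(h, a):
    """Action of h in H on a in A."""
    return a % K if h == 0 else (-a) % K

def in_induced(f):
    """f (a tuple indexed by G) lies in M_H^G(A) when f(hx) = h.f(x) for all h, x."""
    return all(f[(h + x) % 4] == h_act(h, f[x]) for h in H for x in G)

def translate(g, f):
    """The G-action (g.f)(x) = f(xg)."""
    return tuple(f[(x + g) % 4] for x in G)

induced = [f for f in product(range(K), repeat=4) if in_induced(f)]
fixed = [f for f in induced if all(translate(g, f) == f for g in G)]
A_H = [a for a in range(K) if all(h_act(h, a) == a for h in H)]

print(len(induced), fixed, A_H)
```

In this example $M_H^G(A)$ has $|A|^{|G:H|} = 16$ elements, and its $G$-fixed points are exactly the constant functions with value in $A^H = \{0, 2\}$.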

One of the important properties of the $G$-module $M_H^G(A)$ induced from the
$H$-module $A$ is that its cohomology with respect to $G$ is the same as the cohomology of $A$
with respect to $H$:

Proposition 23. (Shapiro's Lemma) For any subgroup $H$ of $G$ and any $H$-module $A$
we have $H^n(G, M_H^G(A)) \cong H^n(H, A)$ for $n \ge 0$.

Proof: Let $\cdots \to P_n \to \cdots \to P_0 \to \mathbb{Z} \to 0$ be a resolution of $\mathbb{Z}$ by
projective $G$-modules (for example, the standard resolution). The cohomology groups
$H^n(G, M_H^G(A))$ are computed by taking homomorphisms from this resolution into
$M_H^G(A) = \mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A)$. Since $\mathbb{Z}G$ is a free $\mathbb{Z}H$-module it follows that this
$G$-module resolution is also a resolution of $\mathbb{Z}$ by projective $H$-modules, hence by taking
homomorphisms into $A$ the same resolution may be used to compute the cohomology
groups $H^n(H, A)$. To see that these two collections of cohomology groups are
isomorphic, we use the natural isomorphism of abelian groups

$$\Phi : \mathrm{Hom}_{\mathbb{Z}G}(P_n, \mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A)) \cong \mathrm{Hom}_{\mathbb{Z}H}(P_n, A)$$

given by $\Phi(f)(p) = f(p)(1)$, for all $f \in \mathrm{Hom}_{\mathbb{Z}G}(P_n, \mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A))$ and $p \in P_n$.
The inverse isomorphism is defined by taking $\Psi(f')(p)$ to be the map from $\mathbb{Z}G$ to $A$ that
takes $g \in G$ to the element $f'(gp)$ in $A$ for all $f' \in \mathrm{Hom}_{\mathbb{Z}H}(P_n, A)$ and $p \in P_n$, i.e.,
$(\Psi(f')(p))(g) = f'(gp)$. Note this is well defined because $P_n$ is a $G$-module. (These
maps are a special case of an Adjoint Associativity Theorem, cf. Exercise 7.) Since
these isomorphisms commute with the cochain maps, they induce isomorphisms on the
corresponding cohomology groups, i.e., $H^n(G, M_H^G(A)) \cong H^n(H, A)$, as required.

Corollary 24. For any $G$-module $A$ the module $M_1^G(A)$ is cohomologically trivial for
$G$, i.e., $H^n(G, M_1^G(A)) = 0$ for all $n \ge 1$.

Proof: This follows immediately from the proposition applied with $H = 1$ together
with the computation of the cohomology of the trivial group in Example 2 preceding
Proposition 20.

By the corollary, the fourth example above gives us a short exact sequence of
$G$-modules

$$0 \to A \xrightarrow{\varphi} M \to C \to 0$$

where $M = M_1^G(A)$ is cohomologically trivial for $G$ and where $C$ is the quotient of
$M_1^G(A)$ by the image of $A$. The dimension shifting result in Corollary 22 then becomes:

Corollary 25. For any $G$-module $A$ we have $H^{n+1}(G, A) \cong H^n(G, M_1^G(A)/A)$ for
all $n \ge 1$.





We next consider several important maps relating various cohomology groups.
Some applications of the use of these homomorphisms appear in the following two
sections.

In general, suppose we have two groups $G$ and $G'$ and that $A$ is a $G$-module and
$A'$ is a $G'$-module. If $\varphi : G' \to G$ is a group homomorphism then $A$ becomes a
$G'$-module by defining $g' \cdot a = \varphi(g')a$ for $g' \in G'$ and $a \in A$. If now $\psi : A \to A'$
is a homomorphism of abelian groups then we consider whether $\psi$ is a $G'$-module
homomorphism:

Definition. Suppose $A$ is a $G$-module and $A'$ is a $G'$-module. The group
homomorphisms $\varphi : G' \to G$ and $\psi : A \to A'$ are said to be compatible if $\psi$ is a
$G'$-module homomorphism when $A$ is made into a $G'$-module by means of $\varphi$, i.e., if
$\psi(\varphi(g')a) = g'\psi(a)$ for all $g' \in G'$ and $a \in A$.

The point of compatible homomorphisms is that they induce group homomorphisms
on associated cohomology groups, as follows.

If $\varphi : G' \to G$ and $\psi : A \to A'$ are homomorphisms, then $\varphi$ induces a
homomorphism $\varphi^n : (G')^n \to G^n$, and so a homomorphism from $C^n(G, A)$ to $C^n(G', A)$ that
maps $f$ to $f \circ \varphi^n$. The map $\psi$ induces a homomorphism from $C^n(G', A)$ to $C^n(G', A')$
that maps $f$ to $\psi \circ f$. Taken together we obtain an induced homomorphism

$$\lambda_n : C^n(G, A) \to C^n(G', A'), \qquad f \mapsto \psi \circ f \circ \varphi^n.$$

If in addition $\varphi$ and $\psi$ are compatible homomorphisms, then it is easy to check that
the induced maps $\lambda_n$ commute with the coboundary operator:

$$\lambda_{n+1} \circ d_n = d_n \circ \lambda_n$$

for all $n \ge 0$. It follows that $\lambda_n$ maps cocycles to cocycles and coboundaries to
coboundaries, hence induces a group homomorphism on cohomology:

$$\lambda_n : H^n(G, A) \to H^n(G', A')$$

for $n \ge 0$.

We consider several instances of such maps: 

Examples 

(1) Suppose $G = G'$ and $\varphi$ is the identity map. Then to say that the group homomorphism
$\psi : A \to A'$ is compatible with $\varphi$ is simply the statement that $\psi$ is a $G$-module
homomorphism. Hence any $G$-module homomorphism from $A$ to $A'$ induces a group
homomorphism

$$H^n(G, A) \to H^n(G, A') \text{ for } n \ge 0.$$

In particular, if $0 \to A \to B \to C \to 0$ is a short exact sequence of $G$-modules we
obtain induced homomorphisms from $H^n(G, A)$ to $H^n(G, B)$ and from $H^n(G, B)$ to
$H^n(G, C)$ for $n \ge 0$. These are simply the homomorphisms in the long exact sequence
of Theorem 21.

(2) (The Restriction Homomorphism) If $A$ is a $G$-module, then $A$ is also an $H$-module for
any subgroup $H$ of $G$. The inclusion map $\varphi : H \to G$ of $H$ into $G$ and the identity
map $\psi : A \to A$ are compatible homomorphisms. The corresponding induced group
homomorphism on cohomology is called the restriction homomorphism:

$$\mathrm{Res} : H^n(G, A) \to H^n(H, A), \quad n \ge 0.$$

The terminology comes from the fact that the map on cochains from $C^n(G, A)$ to
$C^n(H, A)$ is simply restricting a map $f$ from $G^n$ to $A$ to the subgroup $H^n$ of $G^n$.

(3) (The Inflation Homomorphism) Suppose $H$ is a normal subgroup of $G$ and $A$ is a
$G$-module. The elements $A^H$ of $A$ that are fixed by $H$ are naturally a module for the
quotient group $G/H$ under the action defined by $(gH) \cdot a = g \cdot a$. It is then immediate
that the projection $\varphi : G \to G/H$ and the inclusion $\psi : A^H \to A$ are compatible
homomorphisms. The corresponding induced group homomorphism on cohomology
is called the inflation homomorphism:

$$\mathrm{Inf} : H^n(G/H, A^H) \to H^n(G, A), \quad n \ge 0.$$

(4) (The Corestriction Homomorphism) Suppose that $H$ is a subgroup of $G$ of index $m$
and that $A$ is a $G$-module. Let $g_1, \dots, g_m$ be representatives for the left cosets of $H$
in $G$. Define a map

$$\psi : M_H^G(A) \to A \quad \text{by} \quad f \mapsto \sum_{i=1}^{m} g_i \cdot f(g_i^{-1}).$$

Note that if we change any coset representative $g_i$ by $g_i h$, then $(g_i h) f((g_i h)^{-1}) =
g_i h f(h^{-1} g_i^{-1}) = g_i h h^{-1} f(g_i^{-1}) = g_i f(g_i^{-1})$ so the map $\psi$ is independent of the
choice of coset representatives. It is easy to see that $\psi$ is a $G$-module homomorphism
(and even that it is surjective), so we obtain a group homomorphism from
$H^n(G, M_H^G(A))$ to $H^n(G, A)$, for all $n \ge 0$. Since $A$ is also an $H$-module, by Shapiro's
Lemma we have an isomorphism $H^n(G, M_H^G(A)) \cong H^n(H, A)$. The composition of
these two homomorphisms is called the corestriction homomorphism

$$\mathrm{Cor} : H^n(H, A) \to H^n(G, A), \quad n \ge 0.$$

This homomorphism can be computed explicitly by composing the isomorphism $\Psi$
in the proof of Shapiro's Lemma for any resolution of $\mathbb{Z}$ by projective $G$-modules $P_n$
(note these are $G$-modules and not simply $H$-modules) with the map $\psi$, as follows.
For a cocycle $f \in \mathrm{Hom}_{\mathbb{Z}H}(P_n, A)$ representing a cohomology class $c \in H^n(H, A)$, a
cocycle $\mathrm{Cor}(f) \in \mathrm{Hom}_{\mathbb{Z}G}(P_n, A)$ representing $\mathrm{Cor}(c) \in H^n(G, A)$ is given by

$$\mathrm{Cor}(f)(p) = \sum_{i=1}^{m} g_i \cdot \Psi(f)(p)(g_i^{-1}) = \sum_{i=1}^{m} g_i f(g_i^{-1} p)$$

for $p \in P_n$. When $n = 0$ this is particularly simple since we can take $P_0 = \mathbb{Z}G$. In
this case $f \in \mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A) = M_H^G(A)$ is a cocycle if $f = a$ is constant for some
$a \in A^H$ and then $\mathrm{Cor}(f)$ is the constant function with value $\sum_{i=1}^{m} g_i \cdot a \in A^G$:

$$\mathrm{Cor} : H^0(H, A) = A^H \to A^G = H^0(G, A), \qquad a \mapsto \sum_{i=1}^{m} g_i \cdot a.$$
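
In degree 0 the corestriction is just a sum over coset representatives, so it can be tabulated directly. The Python sketch below is an illustration under assumed data not taken from the text: $G = \mathbb{Z}/4\mathbb{Z}$ (written additively) acts on $A = \mathbb{Z}/8\mathbb{Z}$ with the generator acting as multiplication by 3, and $H = \{0, 2\}$ has index $m = 2$. The names `act`, `fixed_by` and `cor0` are hypothetical helpers.

```python
# Assumed example: G = Z/4 (additive) acts on A = Z/8 with the generator
# acting as multiplication by 3; H = {0, 2} has index m = 2 in G.
K = 8
G = [0, 1, 2, 3]
H = [0, 2]

def act(g, a):
    """Action of g in G on a in A: multiplication by 3^g mod 8."""
    return (pow(3, g, K) * a) % K

def fixed_by(group):
    """Elements of A fixed by every element of the given subgroup."""
    return [a for a in range(K) if all(act(g, a) == a for g in group)]

def cor0(a, reps):
    """Degree-0 corestriction: a -> sum of g_i . a over left coset representatives g_i."""
    return sum(act(g, a) for g in reps) % K

A_H = fixed_by(H)
A_G = fixed_by(G)

print("A^H =", A_H)
print("A^G =", A_G)
print("Cor on A^H:", [cor0(a, [0, 1]) for a in A_H])
```

In this example the sum does not depend on whether the representatives $\{0, 1\}$ or $\{2, 3\}$ are used, its values lie in $A^G$, and on elements already fixed by $G$ it is multiplication by $m = 2$, since each term $g_i \cdot a$ then equals $a$.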

The next result establishes a fundamental relation between the restriction and
corestriction homomorphisms.

Proposition 26. Suppose $H$ is a subgroup of $G$ of index $m$. Then $\mathrm{Cor} \circ \mathrm{Res} = m$, i.e.,
if $c$ is a cohomology class in $H^n(G, A)$ for some $G$-module $A$, then

$$\mathrm{Cor}(\mathrm{Res}(c)) = mc \in H^n(G, A) \text{ for all } n \ge 0.$$

Proof: This follows from the explicit formula for corestriction in Example 4 above,
as follows. If $f \in \mathrm{Hom}_{\mathbb{Z}H}(P_n, A)$ were in $\mathrm{Hom}_{\mathbb{Z}G}(P_n, A)$, i.e., if $f$ were also a
$G$-module homomorphism, then $g_i f(g_i^{-1} p) = g_i g_i^{-1} f(p) = f(p)$, for $1 \le i \le m$. Since
restriction is the induced map on cohomology of the natural inclusion of $\mathrm{Hom}_{\mathbb{Z}G}(P_n, A)$
into $\mathrm{Hom}_{\mathbb{Z}H}(P_n, A)$, for such an $f$ we obtain

$$
\begin{array}{ccccc}
\mathrm{Hom}_{\mathbb{Z}G}(P_n, A) & \xrightarrow{\ \mathrm{Res}\ } & \mathrm{Hom}_{\mathbb{Z}H}(P_n, A) & \xrightarrow{\ \mathrm{Cor}\ } & \mathrm{Hom}_{\mathbb{Z}G}(P_n, A) \\
f & \mapsto & f & \mapsto & mf.
\end{array}
$$

It follows that $\mathrm{Cor} \circ \mathrm{Res}$ is multiplication by $m$ on the cohomology groups as well.

Corollary 27. Suppose the finite group $G$ has order $m$. Then $mH^n(G, A) = 0$ for all
$n \ge 1$ and any $G$-module $A$.

Proof: Let $H = 1$, so that $[G : H] = m$, in Proposition 26. Then for any class
$c \in H^n(G, A)$ we have $mc = \mathrm{Cor}(\mathrm{Res}(c))$. Since $\mathrm{Res}(c) \in H^n(H, A) = H^n(1, A)$,
we have $\mathrm{Res}(c) = 0$ for all $n \ge 1$ by the second example preceding Proposition 20.
Hence $mc = 0$ for all $n \ge 1$, which is the corollary.

Corollary 28. If $G$ is a finite group then $H^n(G, A)$ is a torsion abelian group for all
$n \ge 1$ and all $G$-modules $A$.

Proof: This is immediate from the previous corollary.

Corollary 29. Suppose $G$ is a finite group whose order is relatively prime to the
exponent of the $G$-module $A$. Then $H^n(G, A) = 0$ for all $n \ge 1$. In particular, if $A$ is
a finite abelian group with $(|G|, |A|) = 1$ then $H^n(G, A) = 0$ for all $n \ge 1$.

Proof: This follows since the abelian group $H^n(G, A)$ is annihilated by $|G|$ by
Corollary 27 and is annihilated by the exponent of $A$ by Proposition 20.

Note that the statements in the preceding corollaries are not in general true for
$n = 0$, since then $H^0(G, A) = A^G$, which need not even be torsion.

We mention without proof the following result. Suppose that $H$ is a normal subgroup
of $G$ and $A$ is a $G$-module. The cohomology groups $H^n(H, A)$ can be given
the structure of $G/H$-modules (cf. Exercise 17). It can be shown that there is an exact
sequence

$$0 \to H^1(G/H, A^H) \xrightarrow{\mathrm{Inf}} H^1(G, A) \xrightarrow{\mathrm{Res}} H^1(H, A)^{G/H} \xrightarrow{\mathrm{Tra}} H^2(G/H, A^H) \xrightarrow{\mathrm{Inf}} H^2(G, A)$$

where $H^1(H, A)^{G/H}$ denotes the fixed points of $H^1(H, A)$ under the action of $G/H$
and Tra is the so-called transgression homomorphism. This exact sequence relates the
cohomology groups for $G$ to the cohomology groups for the normal subgroup $H$ and
for the quotient group $G/H$. Put another way, the cohomology for $G$ is related to the
cohomology for the factors in the filtration $1 \le H \le G$ for $G$. More generally, one
could try to relate the cohomology for $G$ to the cohomology for the factors in a longer
filtration for $G$. This is the theory of spectral sequences and is an important tool in
homological algebra.

Galois Cohomology and Profinite Groups 

One important application of group cohomology occurs when the group $G$ is the Galois
group of a field extension $K/F$. In this case there are many groups of interest on which
$G$ acts, for example the additive group of $K$, the multiplicative group $K^\times$, etc. The
Galois group $G = \mathrm{Gal}(K/F)$ is the inverse limit $\varprojlim \mathrm{Gal}(L/F)$ of the Galois groups
of the finite extensions $L$ of $F$ contained in $K$ and is a compact topological group
with respect to its Krull topology (i.e., the group operations on $G$ are continuous with
respect to the topology defined by the subgroups $\mathrm{Gal}(K/L)$ of $G$ of finite index), cf.
Section 14.9. In this situation it is useful (and often essential) to take advantage of the
additional topological structure of $G$. For example the subfields of $K$ containing $F$
correspond bijectively with the closed subgroups of $G = \mathrm{Gal}(K/F)$, and the example
of the composite of the quadratic extensions of $\mathbb{Q}$ discussed in Section 14.9 shows
that in general there are many subgroups of $G$ that are not closed. Fortunately, the
modifications necessary to define the cohomology groups in this context are relatively
minor and apply to arbitrary inverse limits of finite groups (the profinite groups). If $G$
is a profinite group then $G = \varprojlim G/N$ where the inverse limit is taken over the open
normal subgroups $N$ of $G$ (cf. Exercise 23).

Definition. If $G$ is a profinite group then a discrete $G$-module $A$ is a $G$-module $A$
with the discrete topology such that the action of $G$ on $A$ is continuous, i.e., the map
$G \times A \to A$ mapping $(g, a)$ to $g \cdot a$ is continuous.

Since $A$ is given the discrete topology, every subset of $A$ is open, and in particular
every one-point set $\{a\}$ with $a \in A$ is open. The continuity of the action of $G$ on $A$ is then equivalent
to the statement that the stabilizer $G_a$ of $a$ in $G$ is an open subgroup of $G$, hence is
of finite index since $G$ is compact (cf. Exercise 22). This in turn is equivalent to the
statement that $A = \bigcup A^H$ where the union is over the open subgroups $H$ of $G$.

Some care must be taken in defining the cohomology groups $H^n(G, A)$ of a profinite
group $G$ acting on a discrete $G$-module $A$ since there are not enough projectives
in this category. For example, when $G$ is infinite, the free $G$-module $\mathbb{Z}G$ is not a
discrete $G$-module ($G$ does not act continuously, cf. Exercise 25). Nevertheless, the
explicit description of $H^n(G, A)$ given in this section (occasionally referred to as the
discrete cohomology groups) can be easily modified: it is only necessary to require
the cochains $C^n(G, A)$ to be continuous maps from $G^n$ to $A$. The definition of the
coboundary maps $d_n$ in equation (18) is precisely the same, as is the definition of the
groups of cocycles, coboundaries, and the corresponding cohomology groups. It is
customary not to introduce a separate notation for these cohomology groups, but to
specify which cohomology is meant in the terminology.

Definition. If $G$ is a profinite group and $A$ is a discrete $G$-module, the cohomology
groups $H^n(G, A)$ computed using continuous cochains are called the profinite or
continuous cohomology groups. When $G = \mathrm{Gal}(K/F)$ is the Galois group of a field
extension $K/F$ then the Galois cohomology groups $H^n(G, A)$ will always mean the
cohomology groups computed using continuous cochains.

When G is a finite group, every G-module is a discrete G-module so the discrete 
and continuous cohomology groups of G are the same. When G is infinite, this need 
not be the case as shown by the example mentioned previously of the free G-module 
ZG when G is an infinite profinite group. All the major results in this section remain 
valid for the continuous cohomology groups when “G-module” is replaced by “discrete
G-module” and “subgroup” is replaced by “closed subgroup.” For example, the Long
Exact Sequence in Group Cohomology remains true as stated, the restriction homomorphism
requires the subgroup H of G to be a closed subgroup (so that the restriction of
a continuous map on $G^n$ to $H^n$ remains continuous), Proposition 26 requires H to be
closed, etc. 

We can write $G = \varprojlim (G/N)$ and $A = \bigcup A^N$ where N runs over the open normal
subgroups of G (necessarily of finite index in G since G is compact). Then $A^N$ is a
discrete G/N-module and it is not difficult to show that

$$H^n(G, A) = \varinjlim_N H^n(G/N, A^N) \tag{17.19}$$

where the cohomology groups are continuous cohomology and the direct limit is taken
over the collection of all open normal subgroups N of G (cf. Exercise 24). Since
G/N is a finite group, the continuous cohomology groups $H^n(G/N, A^N)$ in this direct
limit are just the (discrete) cohomology groups considered earlier in this section. The
computation of the continuous cohomology for a profinite group G can therefore always 
be reduced to the consideration of finite group cohomology where there is no distinction 
between the continuous and discrete theories. 

EXERCISES 

1. Let $F_n = \mathbb{Z}G \otimes_{\mathbb{Z}} \mathbb{Z}G \otimes_{\mathbb{Z}} \cdots \otimes_{\mathbb{Z}} \mathbb{Z}G$ (n + 1 factors) for $n \ge 0$ with G-action defined on
simple tensors by $g \cdot (g_0 \otimes g_1 \otimes \cdots \otimes g_n) = (g g_0) \otimes g_1 \otimes \cdots \otimes g_n$.

(a) Prove that $F_n$ is a free ZG-module of rank $|G|^n$ with ZG basis $1 \otimes g_1 \otimes g_2 \otimes \cdots \otimes g_n$
with $g_i \in G$.

Denote the basis element $1 \otimes g_1 \otimes g_2 \otimes \cdots \otimes g_n$ in (a) by $(g_1, g_2, \dots, g_n)$ and define the
G-module homomorphisms $d_n$ for $n \ge 1$ on these basis elements by $d_1(g_1) = g_1 - 1$ and

$$d_n(g_1, \dots, g_n) = g_1 \cdot (g_2, \dots, g_n) + \sum_{i=1}^{n-1} (-1)^i (g_1, \dots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \dots, g_n) + (-1)^n (g_1, \dots, g_{n-1})$$

for $n \ge 2$. Define the Z-module contracting homomorphisms

$$\mathbb{Z} \xrightarrow{s_{-1}} F_0 \xrightarrow{s_0} F_1 \xrightarrow{s_1} F_2 \xrightarrow{s_2} \cdots$$

on a Z basis by $s_{-1}(1) = 1$ and $s_n(g_0 \otimes \cdots \otimes g_n) = 1 \otimes g_0 \otimes \cdots \otimes g_n$.

(b) Prove that

$$\epsilon s_{-1} = 1, \quad d_1 s_0 + s_{-1} \epsilon = 1, \quad d_{n+1} s_n + s_{n-1} d_n = 1 \quad \text{for all } n \ge 1$$

where the map $\epsilon = \mathrm{aug} : F_0 \to \mathbb{Z}$ is the augmentation map $\mathrm{aug}(\sum_{g \in G} \alpha_g g) = \sum_{g \in G} \alpha_g$.

(c) Prove that the maps $s_n$ are a chain homotopy (cf. Exercise 4 in Section 1) between
the identity (chain) map and the zero (chain) map from the chain

$$\cdots \to F_n \xrightarrow{d_n} F_{n-1} \xrightarrow{d_{n-1}} \cdots \xrightarrow{d_1} F_0 \xrightarrow{\mathrm{aug}} \mathbb{Z} \to 0 \tag{*}$$

of Z-modules to itself.

(d) Deduce from (c) that all Z-module homology groups of (*) are zero, i.e., (*) is an
exact sequence of Z-modules. Conclude that (*) is a projective G-module resolution
of Z.

2. Let $P_n$ denote the free Z-module with basis $(g_0, g_1, g_2, \dots, g_n)$ with $g_i \in G$ and define
an action of G on $P_n$ by $g \cdot (g_0, g_1, \dots, g_n) = (g g_0, g g_1, \dots, g g_n)$. For $n \ge 1$ define

$$d_n(g_0, g_1, g_2, \dots, g_n) = \sum_{i=0}^{n} (-1)^i (g_0, \dots, \hat{g_i}, \dots, g_n),$$

where $(g_0, \dots, \hat{g_i}, \dots, g_n)$ denotes the term $(g_0, g_1, g_2, \dots, g_n)$ with $g_i$ deleted.

(a) Prove that $P_n$ is a free ZG-module with basis $(1, g_1, g_2, \dots, g_n)$ where $g_i \in G$.

(b) Prove that $d_{n-1} \circ d_n = 0$ for n > 1. [Show that the term $(g_0, \dots, \hat{g_j}, \dots, \hat{g_k}, \dots, g_n)$
missing the entries $g_j$ and $g_k$ occurs twice in $d_{n-1} \circ d_n(g_0, g_1, g_2, \dots, g_n)$, with
opposite signs.]

(c) Prove that $\varphi : P_n \to F_n$ defined by

$$\varphi((g_0, g_1, g_2, \dots, g_n)) = g_0 \otimes (g_0^{-1} g_1) \otimes (g_1^{-1} g_2) \otimes \cdots \otimes (g_{n-1}^{-1} g_n)$$

is a G-module isomorphism with inverse $\psi : F_n \to P_n$ given by

$$\psi(g_0 \otimes g_1 \otimes \cdots \otimes g_n) = (g_0, g_0 g_1, g_0 g_1 g_2, \dots, g_0 g_1 g_2 \cdots g_n).$$

(d) Prove that if $\epsilon(g_0) = 1$ for all $g_0 \in G$ then

$$\cdots \to P_n \xrightarrow{d_n} P_{n-1} \to \cdots \to P_0 \xrightarrow{\epsilon} \mathbb{Z} \to 0 \tag{**}$$

is a free G-module resolution of Z. [Show that the isomorphisms in (c) take the
G-module resolutions (**) and (*) of the previous exercise into each other.]

3. Let $F_n$ and $P_n$ be as in the previous two exercises and let A be a G-module.

(a) Prove that $\mathrm{Hom}_{\mathbb{Z}G}(F_n, A)$ can be identified with the collection $C^n(G, A)$ of maps
from $G \times G \times \cdots \times G$ (n copies) to A and that under this identification the associated
coboundary maps from $C^n(G, A)$ to $C^{n+1}(G, A)$ are given by equation (18).

(b) Prove that $\mathrm{Hom}_{\mathbb{Z}G}(P_n, A)$ can be identified with the collection of maps f from n + 1
copies $G \times G \times \cdots \times G$ to A that satisfy $f(g g_0, g g_1, \dots, g g_n) = g f(g_0, g_1, \dots, g_n)$.


The group $C^n(G, A)$ is sometimes called the group of inhomogeneous n-cochains of G in A,
and the group in (b) of the previous exercise is called the group of homogeneous n-cochains
of G in A. The inhomogeneous cochains are easier to describe since there is no restriction
on the maps from $G^n$ to A, but the coboundary map $d_n$ on homogeneous cochains is less
complicated (and more naturally suggested in topological contexts) than the coboundary map
on inhomogeneous cochains. The results of the previous exercises show that the cohomology
groups $H^n(G, A)$ defined using either homogeneous or inhomogeneous cochains are the same
and indicate the origin of the coboundary maps $d_n$ used in the text. Historically, $H^n(G, A)$ was
originally defined using homogeneous cochains.

4. Suppose H is a normal subgroup of the group G and A is a G-module. For every $g \in G$
prove that the map $f(a) = ga$ for $a \in A^H$ defines an automorphism of the subgroup $A^H$.

5. Suppose the G-module A decomposes as a direct sum $A = A_1 \oplus A_2$ of G-submodules.
Prove that for all $n \ge 0$, $H^n(G, A) \cong H^n(G, A_1) \oplus H^n(G, A_2)$.

6. Suppose $0 \to A \to M_1 \to M_2 \to \cdots \to M_k \to C \to 0$ is an exact sequence of G-modules
where $M_1, M_2, \dots, M_k$ are cohomologically trivial. Prove that $H^{n+k}(G, A) \cong H^n(G, C)$
for all $n \ge 1$. [Decompose the exact sequence into a succession of short exact
sequences and use Corollary 22. For example, if $0 \to A \xrightarrow{\alpha} M_1 \xrightarrow{\beta} M_2 \xrightarrow{\gamma} C \to 0$ is
exact, show that $0 \to A \xrightarrow{\alpha} M_1 \to B \to 0$ and $0 \to B \to M_2 \xrightarrow{\gamma} C \to 0$ are both exact,
where $B = M_1/\mathrm{image}\,\alpha = M_1/\ker\beta \cong \mathrm{image}\,\beta = \ker\gamma$.]

7. (Adjoint Associativity) Let R, S and T be rings with 1, let P be a left S-module, let N be
a (T, S)-bimodule, and let A be a left T-module. Prove that

$$\Phi : \mathrm{Hom}_S(P, \mathrm{Hom}_T(N, A)) \to \mathrm{Hom}_T(N \otimes_S P, A)$$

defined by $\Phi(f)(n \otimes p) = f(p)(n)$ is an isomorphism of abelian groups. (See also
Theorem 43 in Section 10.5.)

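8. Suppose G is cyclic of order m with generator $\sigma$ and let $N = 1 + \sigma + \sigma^2 + \cdots + \sigma^{m-1} \in \mathbb{Z}G$.

(a) Prove that the augmentation map $\mathrm{aug}(\sum_{i=0}^{m-1} a_i \sigma^i) = \sum_{i=0}^{m-1} a_i$ is a G-module
homomorphism from ZG to Z.

(b) Prove that multiplication by N and by $\sigma - 1$ in ZG define a free G-module resolution
of Z: $\cdots \xrightarrow{\sigma - 1} \mathbb{Z}G \xrightarrow{N} \mathbb{Z}G \xrightarrow{\sigma - 1} \cdots \xrightarrow{N} \mathbb{Z}G \xrightarrow{\sigma - 1} \mathbb{Z}G \xrightarrow{\mathrm{aug}} \mathbb{Z} \to 0$.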

9. Suppose G is an infinite cyclic group with generator $\sigma$.

(a) Prove that multiplication by $\sigma - 1 \in \mathbb{Z}G$ defines a free G-module resolution of
Z: $0 \to \mathbb{Z}G \xrightarrow{\sigma - 1} \mathbb{Z}G \xrightarrow{\mathrm{aug}} \mathbb{Z} \to 0$.

(b) Show that $H^0(G, A) = A^G$, that $H^1(G, A) \cong A/(\sigma - 1)A$, and that $H^n(G, A) = 0$ for
all $n \ge 2$. Deduce that $H^1(G, \mathbb{Z}G) \cong \mathbb{Z}$ (so free modules need not be cohomologically
trivial).

10. Suppose H is a subgroup of finite index m in the group G and A is an H-module. Let
$x_1, \dots, x_m$ be a set of left coset representatives for H in G: $G = x_1 H \cup \cdots \cup x_m H$.

(a) Prove that $\mathbb{Z}G = \oplus_{i=1}^m x_i \mathbb{Z}H = \oplus_{i=1}^m \mathbb{Z}H x_i^{-1}$ and $\mathbb{Z}G \otimes_{\mathbb{Z}H} A = \oplus_{i=1}^m (x_i \otimes A)$ as
abelian groups.

(b) Let $f_{i,a}$ be the function from ZG to A defined by

$$f_{i,a}(x) = \begin{cases} ha & \text{if } x = h x_i^{-1} \text{ with } h \in H \\ 0 & \text{otherwise.} \end{cases}$$

Prove that $f_{i,a} \in M_H^G(A) = \mathrm{Hom}_{\mathbb{Z}H}(\mathbb{Z}G, A)$, i.e., $f_{i,a}(h'x) = h' f_{i,a}(x)$ for $h' \in H$.

(c) Prove that the map $\varphi(f) = \sum_{i=1}^m x_i \otimes f(x_i^{-1})$ from $M_H^G(A)$ to $\mathbb{Z}G \otimes_{\mathbb{Z}H} A$ is a G-module
homomorphism. [Write $x_i^{-1} g = h_i x_{i'}^{-1}$ for i = 1, ..., m and observe that
$x_i \otimes f(x_i^{-1} g) = x_i \otimes h_i f(x_{i'}^{-1}) = x_i h_i \otimes f(x_{i'}^{-1}) = g x_{i'} \otimes f(x_{i'}^{-1})$.]

(d) Prove that $\varphi$ gives a G-module isomorphism $\varphi : M_H^G(A) \cong \mathbb{Z}G \otimes_{\mathbb{Z}H} A$. [For the
injectivity observe that an H-module homomorphism f is 0 if and only if $f(x_i^{-1}) = 0$
for i = 1, ..., m. For the surjectivity prove that $\varphi(f_{i,a}) = x_i \otimes a$.]

11. Prove that the isomorphism $M_H^G(A) \cong \mathbb{Z}G \otimes_{\mathbb{Z}H} A$ in (d) of the previous exercise need not
hold if H is not of finite index in G. [If G is an infinite cyclic group show that Shapiro’s
Lemma implies $H^1(G, M_1^G(\mathbb{Z})) = 0$ while $H^1(G, \mathbb{Z}G) \cong \mathbb{Z}$ by Exercise 9.]

12. If H is a subgroup of G and A is an abelian group let $M_{G/H}(A)$ denote the abelian group
of all maps from the left cosets gH of H in G to A.

(a) Prove that $M_1^G(A) \cong M_1^H(M_{G/H}(A))$ as H-modules. [If $G = \cup_{i \in I}\, g_i H$ is a choice of left
coset representatives of H in G define the correspondence between $f \in M_1^G(A)$ and
$F : H \to M_{G/H}(A)$ by $F(h)(g_i H) = f(g_i h)$, and check that this is an isomorphism
of H-modules.]

(b) A G-module A such that $H^n(H, A) = 0$ for all $n \ge 1$ and all subgroups H of G is
called cohomologically trivial. Prove that $M_1^G(A)$ is cohomologically trivial for any
abelian group A.

(c) If G is finite, prove that $\mathbb{Z}G \otimes_{\mathbb{Z}} A$ is cohomologically trivial for all abelian groups A.

13. Suppose A is a G-module and H is a subgroup of G. Prove that the group homomorphism
from $H^n(G, A)$ to $H^n(G, M_H^G(A))$ for all $n \ge 0$ induced from the G-module
homomorphism from A to $M_H^G(A)$ in Example 3 following Corollary 22 composed with the
isomorphism $H^n(G, M_H^G(A)) \cong H^n(H, A)$ of Shapiro’s Lemma is the restriction
homomorphism from $H^n(G, A)$ to $H^n(H, A)$.

14. Suppose $\varphi : H \to G$ is the inclusion map of the subgroup H of G into G. If A is an
H-module and $M_H^G(A)$ the associated induced G-module, define the group homomorphism
$\psi : M_H^G(A) \to A$ by mapping f to its value at 1: $\psi(f) = f(1)$.

(a) Prove that $\varphi$ and $\psi$ are compatible homomorphisms.

(b) Prove that the induced group homomorphism from $H^n(G, M_H^G(A))$ to $H^n(H, A)$ for
$n \ge 0$ is the isomorphism in Shapiro’s Lemma.

15. Suppose H is a normal subgroup of G and A is a G-module. For fixed $g \in G$, let $\psi(a) = ga$
and $\varphi(h) = g^{-1}hg$ for $h \in H$.

(a) Prove that $\varphi$ and $\psi$ are compatible homomorphisms.

(b) For each $n \ge 0$, prove that the homomorphism $\Phi_g$ from $H^n(H, A)$ to $H^n(H, A)$
induced by the compatible homomorphisms $\varphi$ and $\psi$ is an automorphism of $H^n(H, A)$.
[Observe that both $\varphi$ and $\psi$ have inverses.]

(c) Show that $\Phi_g$ acting on $H^0(H, A)$ is the automorphism in Exercise 4.

16. Let A be a G-module and for $g \in G$ let $\Phi_g$ denote the automorphism of $H^n(G, A)$ defined
in the previous exercise.

(a) Prove that $\Phi_g$ acting on $H^0(G, A) = A^G$ is the identity map.

(b) Prove that $\Phi_g$ acting on $H^n(G, A)$ is the identity map for $n \ge 1$. [By induction on n and
dimension shifting. For n = 1, use the exact sequence in Corollary 22, together with
(a) applied to $\Phi_g$ on $C^G$. For $n \ge 2$ use the isomorphism $H^{n+1}(G, A) \cong H^n(G, C)$
in Corollary 22.]

17. Suppose that H is a normal subgroup of G and A is a G-module. For $n \ge 0$ prove
that $H^n(H, A)$ is a G/H-module where gH acts by the automorphism $\Phi_g$ induced by
conjugation by g on H and the natural action of g on A as in Exercise 15. [Use the
previous exercise to show this action of a coset is well defined.]

18. Suppose that G is cyclic of order m, that H is a subgroup of G of index d, and that Z is a
trivial G-module. Use the projective G-module resolution in Exercise 8 to prove

(a) that Cor : $H^n(H, \mathbb{Z}) \to H^n(G, \mathbb{Z})$ is multiplication by d from Z to Z for n = 0, from
0 to 0 if n is odd, and from Z/(m/d)Z to Z/mZ if n is even, $n \ge 2$, and

(b) that Res : $H^n(G, \mathbb{Z}) \to H^n(H, \mathbb{Z})$ is the identity map from Z to Z for n = 0, and
is the natural projection map from Z/mZ to Z/(m/d)Z or from 0 to 0, depending on
the parity of $n \ge 1$.

19. Let p be a prime and let P be a Sylow p-subgroup of the finite group G. Show that for
any G-module A and all $n \ge 0$ the map Res : $H^n(G, A) \to H^n(P, A)$ is injective on the
p-primary component of $H^n(G, A)$. Deduce that if $|A| = p^a$ then the restriction map is
injective on $H^n(G, A)$. [Use Proposition 26.]

20. Let p be a prime, let $G = \langle \sigma \rangle$ be cyclic of order $p^m$ and let W be a vector space of
dimension d > 0 over $\mathbb{F}_p$ on which $\sigma$ acts as a linear transformation. Assume W has a
basis such that the matrix of $\sigma$ is a $d \times d$ elementary Jordan block with eigenvalue 1.

(a) Prove that $d \le p^m$. [Use facts about the minimal polynomial of an elementary Jordan
block.]

(b) Prove that $\dim_{\mathbb{F}_p} W^G = 1$.

(c) Prove that $\dim_{\mathbb{F}_p} (\sigma - 1)W = d - 1$.

(d) If $N = 1 + \sigma + \cdots + \sigma^{p^m - 1}$ is the usual norm element, prove that NW is of dimension
1 if $d = p^m$ (respectively, of dimension 0 if $d < p^m$) and that the dimension of
${}_N W$ is d - 1 (respectively, d). [Let R be the group ring $\mathbb{F}_p G$, and show that every
nonzero R-submodule of R contains N. Note that W is a cyclic R-module and let
$\varphi : R \to W$ be a surjective homomorphism. Conclude that if $\varphi$ is not an isomorphism
then $N \in \ker \varphi$.]

(e) Deduce that if $d = p^m$ then $H^n(G, W) = 0$, and if $d < p^m$ then $H^n(G, W)$ has order
p, for all $n \ge 1$ (i.e., these cohomology groups are zero if and only if W is a free
$\mathbb{F}_p G$-module).

21. Let p be a prime, let $G = \langle \sigma \rangle$ be cyclic of order $p^m$ and let V be a G-module of exponent
p. Let $V = V_1 \oplus V_2 \oplus \cdots \oplus V_k$ be a decomposition of V giving the Jordan Canonical
Form of $\sigma$, where each $V_i$ is $\sigma$-invariant and a matrix of $\sigma$ on $V_i$ is a $d_i \times d_i$ elementary
Jordan block with eigenvalue 1, $d_i \ge 1$ (cf. Section 12.3). Prove that $|V^G| = p^k$ and
$|H^n(G, V)| = p^s$ where s is the number of $V_i$ of dimension less than $p^m$ over $\mathbb{F}_p$, for all
$n \ge 1$. [Use the preceding exercise and Exercise 5.]

22. Suppose G is a topological group, i.e., there is a topology on G such that the maps
$G \times G \to G$ defined by $(g_1, g_2) \mapsto g_1 g_2$ and $G \to G$ defined by $g \mapsto g^{-1}$ are continuous.

(a) If H is an open subgroup of G and $g \in G$, prove that the cosets gH and Hg and the
subgroup $g^{-1}Hg$ are also open.

(b) Prove that any open subgroup is also closed. [The complement is the union of cosets 
as in (a).] 

(c) Prove that a closed subgroup of finite index is open. 

(d) If G is compact prove that every open subgroup H is of finite index. 

23. Suppose G is a compact topological group. Prove the following are equivalent:

(i) G is profinite, i.e., $G = \varprojlim G_i$ is the inverse limit of finite groups $G_i$.

(ii) There exists a family $\{N_i\}$ ($i \in I$) of open normal subgroups $N_i$ in G such that
$\cap_i N_i = 1$ and in this case $G \cong \varprojlim (G/N_i)$.

(iii) There exists a family $\{H_j\}$ ($j \in J$) of open subgroups $H_j$ in G such that $\cap_j H_j = 1$.

[To show (iii) implies (ii), let H be open in G and use (d) of the previous exercise to show
that $N = \cap_{g \in G}\, g H g^{-1}$ is a finite intersection and conclude that $N \subseteq H \subseteq G$ and N is
open and normal in G.]

24. Suppose N and N' are open normal subgroups of the profinite group G and $N' \subseteq N$. Prove
that the projection homomorphism $\varphi : G/N' \to G/N$ and the injection $\psi : A^N \to A^{N'}$
are compatible homomorphisms and deduce there is an induced homomorphism from
$H^n(G/N, A^N)$ to $H^n(G/N', A^{N'})$.

25. If G is an infinite profinite group show that G does not act continuously on $A = \mathbb{Z}G$.
[Show that the stabilizer of $a \in A$ is not always of finite index in G.]

17.3 CROSSED HOMOMORPHISMS AND $H^1(G, A)$

In this section we consider in greater detail the cohomology group $H^1(G, A)$ where
G is a group and A is a G-module. From the definition of the coboundary map $d_1$ in
equation (18), if $f \in C^1(G, A)$ then

$$d_1(f)(g_1, g_2) = g_1 \cdot f(g_2) - f(g_1 g_2) + f(g_1).$$

Thus any function $f : G \to A$ is a 1-cocycle if and only if it satisfies the identity

$$f(gh) = f(g) + g f(h) \quad \text{for all } g, h \in G. \tag{17.20}$$

Equivalently, a 1-cocycle is determined by a collection $\{a_g\}_{g \in G}$ of elements in A satisfying
$a_{gh} = a_g + g a_h$ for $g, h \in G$ (and then the 1-cocycle f is the function sending g
to $a_g$). Note that if 1 denotes the identity of G, then $f(1) = f(1^2) = f(1) + 1 \cdot f(1) = 2f(1)$,
so f(1) = 0 is the identity in A. Thus 1-cocycles are necessarily “normalized”
at the identity. It then follows from the cocycle condition that $f(g^{-1}) = -g^{-1} f(g)$ for
all $g \in G$.

If A is a G-module on which G acts trivially, then the cocycle condition (20) is
simply $f(gh) = f(g) + f(h)$, i.e., f is simply a homomorphism from the multiplicative
group G to the additive group A. Because of this the functions from G to A satisfying
(20) are called crossed homomorphisms.

A 1-cochain f is a 1-coboundary if there is some $a \in A$ such that

$$f(g) = g \cdot a - a \quad \text{for all } g \in G, \tag{17.21}$$

(equivalently, $a_g = g \cdot a - a$ in the notation above). Note that since $-a \in A$, the coboundary
condition in (21) can also be phrased as $f(g) = a - g \cdot a$ for some fixed $a \in A$ and
all $g \in G$. The 1-coboundaries are called principal crossed homomorphisms. With this
terminology the cohomology group $H^1(G, A)$ is the group of crossed homomorphisms
modulo the subgroup of principal crossed homomorphisms.
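
The definitions of crossed and principal crossed homomorphisms can be checked by brute force on a small case. The following is a minimal sketch, not part of the text's development: it assumes G = Z/2Z acting by inversion on A = Z/4Z (both written additively in the code), and the names `act`, `is_cocycle`, `cocycles` and `coboundaries` are illustrative choices. It enumerates all functions from G to A, keeps those satisfying the cocycle identity (20), and lists the coboundaries from (21).

```python
from itertools import product

n_G, n_A = 2, 4          # assumed example: G = Z/2, A = Z/4, written additively
G = range(n_G)
A = range(n_A)

def act(g, a):
    """Action of G on A: the nontrivial element of G acts by inversion a -> -a."""
    return a % n_A if g == 0 else (-a) % n_A

def is_cocycle(f):
    """Check f(gh) = f(g) + g.f(h) for all g, h (the group law in G is addition mod 2)."""
    return all(f[(g + h) % n_G] == (f[g] + act(g, f[h])) % n_A
               for g in G for h in G)

# Every function G -> A, stored as the tuple (f(0), f(1)).
cochains = list(product(A, repeat=n_G))
cocycles = [f for f in cochains if is_cocycle(f)]

# Principal crossed homomorphisms g -> g.a - a, one for each a in A (with repeats removed).
coboundaries = sorted({tuple((act(g, a) - a) % n_A for g in G) for a in A})

h1_order = len(cocycles) // len(coboundaries)
print(len(cocycles), len(coboundaries), h1_order)
```

In this assumed example every cocycle vanishes at the identity, in line with the normalization remark above, and the quotient of the two counts gives the order of the first cohomology group.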

Example: (Hilbert’s Theorem 90)

Suppose $G = \mathrm{Gal}(K/F)$ is the Galois group of a finite Galois extension K/F of fields.
Then the multiplicative group $K^\times$ is a G-module and $H^1(G, K^\times) = 0$. To see this, let
$\{\alpha_\sigma\}$ be the values $f(\sigma)$ of a 1-cocycle f, so that $\alpha_\sigma \in K^\times$ and $\alpha_{\sigma\tau} = \alpha_\sigma\, \sigma(\alpha_\tau)$ (the
cocycle condition written multiplicatively for the group $K^\times$). By the linear independence
of automorphisms (Corollary 8 in Section 14.2), there is an element $\gamma \in K$ such that

$$\beta = \sum_{\tau \in G} \alpha_\tau\, \tau(\gamma)$$

is nonzero, i.e., $\beta \in K^\times$. Then for any $\sigma \in G$ we have

$$\sigma(\beta) = \sum_{\tau \in G} \sigma(\alpha_\tau)\, \sigma\tau(\gamma) = \alpha_\sigma^{-1} \sum_{\tau \in G} \alpha_{\sigma\tau}\, \sigma\tau(\gamma) = \alpha_\sigma^{-1} \beta$$

where the second equality comes from the cocycle condition. Hence $\alpha_\sigma = \beta/\sigma(\beta)$, which
is the multiplicative form of the coboundary condition (21) (for the element $a = \beta^{-1}$).

Since every 1-cocycle is a 1-coboundary, we have $H^1(G, K^\times) = 0$. The same result holds
for infinite Galois extensions by equation (19) in the previous section since $H^1(G, K^\times)$ is
the direct limit of trivial groups.

As a special case, suppose K/F is a Galois extension with cyclic Galois group G
having generator $\sigma$. The cohomology groups for G were computed explicitly in the
previous section, and in particular, $H^1(G, A) = {}_N A/(\sigma - 1)A$ for any G-module A (written
additively). Since this group is trivial in the present context, we see that an element $\alpha$ in
K is in the kernel of the norm map, i.e., $N_{K/F}(\alpha) = 1$, if and only if $\alpha = \sigma(\beta)/\beta$ for some
$\beta \in K$. (For a direct proof of this result in the cyclic case, cf. Exercise 23 in Section 14.2.)

This famous result for cyclic extensions was first proved by Hilbert and appears as
“Theorem 90” in his book (known as the “Zahlbericht”) on number theory in 1897. As a
result, the more general result $H^1(G, K^\times) = 0$ is referred to in the literature as “Hilbert’s
Theorem 90.” In general, the higher dimensional cohomology groups $H^n(G, K^\times)$ for
$n \ge 2$ can be nontrivial (cf. Exercise 13).
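
The cyclic case of Hilbert's Theorem 90 can be verified directly in a small finite field. The sketch below is only an illustration under stated assumptions: it takes F = F_7 and K = F_7[i] with i^2 = -1 (a field with 49 elements, since -1 is not a square modulo 7), with the Frobenius map as generator of the Galois group. The helper names `mul`, `sigma`, `norm` and `inv` are hypothetical. It compares the set of elements of norm 1 with the set of quotients of the form sigma(beta)/beta.

```python
p = 7  # assumed prime with p = 3 (mod 4), so x^2 + 1 is irreducible and K = F_p[i]

def mul(u, v):
    """Multiply x + y*i and c + d*i in F_p[i], with elements stored as pairs."""
    (a, b), (c, d) = u, v
    return ((a * c - b * d) % p, (a * d + b * c) % p)

def sigma(u):
    """Frobenius u -> u^p; on F_p[i] with p = 3 (mod 4) this is x + y*i -> x - y*i."""
    return (u[0], (-u[1]) % p)

def norm(u):
    """N(u) = u * sigma(u), an element of F_p (second coordinate 0)."""
    return mul(u, sigma(u))

def inv(u):
    """Inverse of a nonzero u, computed as sigma(u) / N(u)."""
    n_inv = pow(norm(u)[0], p - 2, p)   # inverse of N(u) in F_p by Fermat's little theorem
    s = sigma(u)
    return ((s[0] * n_inv) % p, (s[1] * n_inv) % p)

units = [(x, y) for x in range(p) for y in range(p) if (x, y) != (0, 0)]
norm_one = {u for u in units if norm(u) == (1, 0)}
quotients = {mul(sigma(b), inv(b)) for b in units}

print(norm_one == quotients, len(norm_one))
```

Replacing beta by its inverse turns sigma(beta)/beta into beta/sigma(beta), so the same check covers either way of writing the quotient.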

Example 

Suppose $G = \mathrm{Gal}(K/F)$ is the Galois group of a finite Galois extension K/F of fields as
in the previous example. Then the additive group K is also a G-module and $H^n(G, K) = 0$
for all $n \ge 1$. The proof of this in general uses the fact that there is a normal basis for K over
F, i.e., there is an element $\alpha \in K$ whose Galois conjugates give a basis for K as a vector
space over F, or, equivalently, $K \cong \mathbb{Z}G \otimes_{\mathbb{Z}} F$ as G-modules. The latter isomorphism
shows that K is induced as a G-module, and then $H^n(G, K) = 0$ follows from Corollary
24 in Section 2. For a direct proof in the case where G is cyclic, cf. Exercise 26 in Section
14.2.

If G acts trivially on A, then $g \cdot a - a = 0$, so 0 is the only principal crossed
homomorphism, i.e., $B^1(G, A) = 0$. This proves the following result:

Proposition 30. If A is a G-module on which G acts trivially then $H^1(G, A) = \mathrm{Hom}(G, A)$,
the group of all group homomorphisms from G to A.

If G is a profinite group, then the same result holds for the continuous cohomology
group $H^1(G, A)$ provided one takes the group of continuous homomorphisms from G
into A.

Examples 

(1) If G acts trivially on A then $H^1(G, A) = H^1(G/[G, G], A)$ since any group
homomorphism from G to the abelian group A factors through the quotient by the commutator
subgroup [G, G] (cf. Proposition 7(5) in Section 5.4), so computing $H^1$ for trivial G-action
reduces to computing $H^1$ for some abelian group.

(2) If G is a finite group acting trivially on Z, then $H^1(G, \mathbb{Z}) = 0$ because Z has no
nonzero elements of finite order so there is no nonzero group homomorphism from G
to Z.

(3) If A is cyclic of prime order p and G is a p-group then G must act trivially on A
(since the automorphism group of A has order p - 1), so in this case one always has
$H^1(G, A) = \mathrm{Hom}(G, A)$.

(4) If G is a finite group that acts trivially on Q/Z then $H^1(G, \mathbb{Q}/\mathbb{Z}) = \mathrm{Hom}(G, \mathbb{Q}/\mathbb{Z}) = \hat{G}$
is the dual group of G (cf. Exercise 14 in Section 5.2). Since Q/Z is abelian, any
homomorphism of G into Q/Z factors through the commutator quotient $G^{ab} = G/[G, G]$
of G, so $\mathrm{Hom}(G, \mathbb{Q}/\mathbb{Z}) = \mathrm{Hom}(G^{ab}, \mathbb{Q}/\mathbb{Z})$. It follows that $\mathrm{Hom}(G, \mathbb{Q}/\mathbb{Z}) \cong \widehat{G^{ab}}$
(which by Exercise 14 again is noncanonically isomorphic to $G^{ab}$).

If $0 \to A \to B \to C \to 0$ is a short exact sequence of G-modules then the long
exact sequence in group cohomology in Theorem 21 of the previous section begins with
terms

$$0 \to A^G \to B^G \to C^G \xrightarrow{\delta_0} H^1(G, A) \to \cdots$$

The connecting homomorphism $\delta_0$ is given explicitly as follows: if $c \in C^G$ then there is
an element $b \in B$ mapping to c and then $\delta_0(c)$ is the class in $H^1(G, A)$ of the 1-cocycle
given by

$$\delta_0(c) : G \to A, \qquad g \mapsto g \cdot b - b.$$

Note that $g \cdot b - b$ is (the image in B of) an element of A for all $g \in G$ since $c \in C^G$.
To verify directly that $f = \delta_0(c)$ satisfies the cocycle condition in (20), we compute

$$f(gh) = gh \cdot b - b = (g \cdot b - b) + g \cdot (h \cdot b - b) = f(g) + g f(h).$$

From the explicit expression $f(g) = g \cdot b - b$ it is also clear that $\delta_0(c) \in H^1(G, A)$
maps to 0 in the next term $H^1(G, B)$ of the long exact sequence above since f is the
coboundary for the element $b \in B$.

Example: (Kummer Theory)

Suppose that F is a field of characteristic 0 containing the group $\mu_n$ of all $n^{\text{th}}$ roots of
unity for some $n \ge 1$. Let K be an algebraic closure of F and let $G = \mathrm{Gal}(K/F)$.
The group G acts trivially on $\mu_n$ since $\mu_n \subset F$ by assumption, i.e., $\mu_n \cong \mathbb{Z}/n\mathbb{Z}$ as
G-modules. Hence the Galois cohomology group $H^1(G, \mu_n)$ is the group $\mathrm{Hom}_c(G, \mathbb{Z}/n\mathbb{Z})$
of continuous homomorphisms of G into Z/nZ. If $\chi$ is such a continuous homomorphism,
then $\ker \chi \subseteq G$ is a closed normal subgroup of G, hence corresponds by Galois theory to
a Galois extension $L_\chi/F$. Then $\mathrm{Gal}(L_\chi/F) \cong \mathrm{image}\,\chi$, so $L_\chi$ is a cyclic extension of F
of degree dividing n. Conversely, every such cyclic extension of F defines an element in
$\mathrm{Hom}_c(G, \mathbb{Z}/n\mathbb{Z})$, so there is a bijection between the elements of the Galois cohomology
group $H^1(G, \mu_n)$ and the cyclic extensions of F of degree dividing n.

The homomorphism of raising to the $n^{\text{th}}$ power is surjective on $K^\times$ (since we can
always extract $n^{\text{th}}$ roots in K) and has kernel $\mu_n$. Hence the sequence

$$1 \to \mu_n \to K^\times \xrightarrow{n} K^\times \to 1$$

is an exact sequence of discrete G-modules. The associated long exact sequence in Galois
cohomology gives

$$1 \to \mu_n^G \to (K^\times)^G \xrightarrow{n} (K^\times)^G \to H^1(G, \mu_n) \to H^1(G, K^\times) \to \cdots$$

We have $\mu_n^G = \mu_n$ and $(K^\times)^G = F^\times$ by Galois theory, and $H^1(G, K^\times) = 0$ by Hilbert’s
Theorem 90, so this exact sequence becomes

$$1 \to \mu_n \to F^\times \xrightarrow{n} F^\times \to H^1(G, \mu_n) \to 0,$$

which in turn is equivalent to the isomorphism

$$H^1(G, \mu_n) \cong F^\times / F^{\times n}$$

where $F^{\times n}$ denotes the group of $n^{\text{th}}$ powers of elements of $F^\times$. This isomorphism is made
explicit using the explicit form for the connecting homomorphism given above: for every
$\alpha \in F^\times$ and $\sigma \in G$, the element $\sqrt[n]{\alpha}$ in $K^\times$ maps to $\alpha$ in the exact sequence and

$$\chi_\alpha(\sigma) = \frac{\sigma(\sqrt[n]{\alpha})}{\sqrt[n]{\alpha}}$$

defines an element in $H^1(G, \mu_n)$ (cf. Exercise 11). The fixed field of the kernel of this
homomorphism $\chi_\alpha$ is the field $F(\sqrt[n]{\alpha})$. By the results of the previous paragraph, when F contains the $n^{\text{th}}$
roots of unity an extension L/F is Galois with cyclic Galois group of order dividing n if
and only if $L = F(\sqrt[n]{\alpha})$ for some $\alpha \in F^\times$. Furthermore, the class of $\alpha$ in $F^\times/F^{\times n}$ is
unique, i.e., $\alpha$ is unique up to an $n^{\text{th}}$ power of an element in F. Such an extension is called
a Kummer extension, cf. Section 14.7 and Exercise 12.

If the characteristic of F is a prime p, the same argument applies when n is not
divisible by p, replacing the algebraic closure of F with the separable closure of F (the
largest separable algebraic extension of F).


Example: (The Transfer Homomorphism)

Suppose G is a finite group and H is a subgroup. The corestriction defines a homomorphism
from $H^1(H, \mathbb{Q}/\mathbb{Z})$ to $H^1(G, \mathbb{Q}/\mathbb{Z})$, which by Example 4 above gives a homomorphism
from $\widehat{H^{ab}}$ to $\widehat{G^{ab}}$. This gives a homomorphism

$$\mathrm{Ver} : G^{ab} \to H^{ab}$$

called the transfer (or Verlagerungen) homomorphism (cf. Exercise 14). To make this
homomorphism explicit, consider the exact sequence

$$0 \to \mathbb{Q}/\mathbb{Z} \to M_1^G(\mathbb{Q}/\mathbb{Z}) \to C \to 0 \tag{17.22}$$

defined by the homomorphism mapping $a \in \mathbb{Q}/\mathbb{Z}$ to $f_a \in M_1^G(\mathbb{Q}/\mathbb{Z})$ in Example 4
preceding Proposition 23 in the previous section (so $f_a(g) = g \cdot a$ for $g \in G$). This
is a short exact sequence of G-modules and hence also of H-modules. The first portions
of the associated long exact sequences for the cohomology with respect to H and then G
give the rows in the commutative diagram

$$\begin{array}{ccccc}
C^H & \xrightarrow{\ \delta_0\ } & H^1(H, \mathbb{Q}/\mathbb{Z}) & \longrightarrow & 0 \\
\downarrow {\scriptstyle \mathrm{Cor}} & & \downarrow {\scriptstyle \mathrm{Cor}} & & \\
C^G & \xrightarrow{\ \delta_0\ } & H^1(G, \mathbb{Q}/\mathbb{Z}) & \longrightarrow & 0
\end{array}$$

since $H^1(H, M_1^G(\mathbb{Q}/\mathbb{Z})) = H^1(G, M_1^G(\mathbb{Q}/\mathbb{Z})) = 0$ (cf. Exercise 12 in Section 2). Let
$\chi \in H^1(H, \mathbb{Q}/\mathbb{Z})$ and suppose that $c \in C^H$ is an element mapping to $\chi$ by the surjective
connecting homomorphism $\delta_0$ in the first row of the diagram above. By the commutativity,
$\chi' = \mathrm{Cor}(\chi)$ is the image under the connecting homomorphism $\delta_0$ of $c' = \mathrm{Cor}(c) \in C^G$
in the second row of the diagram. By our explicit formula for the coboundary map $\delta_0$, if
$F \in M_1^G(\mathbb{Q}/\mathbb{Z})$ is any element mapping to $c'$ in (22) then $g \cdot F - F = f_{a'}$ for a unique
$a' \in \mathbb{Q}/\mathbb{Z}$, and we have $\chi'(g) = \delta_0(c')(g) = a'$ for $g \in G$. Since $f_{a'}(x) = x \cdot a' = a'$ for
any $x \in G$ because G acts trivially on Q/Z, the function $g \cdot F - F$ in fact has the constant
value $a'$, and so can be evaluated at any $x \in G$ to determine the value of $\chi'(g)$.

Since $c' = \sum_{i=1}^m g_i \cdot c \in C^G$ where $g_1, \dots, g_m$ are representatives of the left cosets
of H in G (cf. Example 4 preceding Proposition 26), such an element F is given by

$$F = \sum_{i=1}^m g_i \cdot f$$

where $f \in M_1^G(\mathbb{Q}/\mathbb{Z})$ is any element mapping to c in (22). This f can be used to compute
the explicit coboundary of c as before: $h \cdot f - f = f_a$ for a unique $a \in \mathbb{Q}/\mathbb{Z}$ and $\chi(h) = a$
for $h \in H$. As before, the function $h \cdot f - f = f_a$ has the constant value a and so can be
evaluated at any element x of G to determine the value of $\chi(h)$.

Computing $g \cdot F - F$ on the element $1 \in G$ it follows that

$$\chi'(g) = \sum_{i=1}^m f(g g_i) - \sum_{i=1}^m f(g_i).$$

For i = 1, ..., m, write

$$g g_i = g_j\, h(g, g_i) \quad \text{with } h(g, g_i) \in H, \tag{17.23}$$

noting that the resulting set of $g_j$ is some permutation of $\{g_1, \dots, g_m\}$. Then

$$\sum_{i=1}^m f(g g_i) - \sum_{i=1}^m f(g_i) = \sum_{i=1}^m \big( f(g_j\, h(g, g_i)) - f(g_j) \big) = \sum_{i=1}^m \chi(h(g, g_i))$$

since as noted above, $\chi(h) = f(xh) - f(x)$ for any $x \in G$. Hence

$$\chi'(g) = \chi\Big( \prod_{i=1}^m h(g, g_i) \Big)$$

and so the transfer homomorphism is given by the formula

$$\mathrm{Ver}(g) = \prod_{i=1}^m h(g, g_i) \tag{17.24}$$

with the elements $h(g, g_i) \in H$ defined by equation (23). Note that this proves in particular
that the map defined in (24) is a homomorphism from $G^{ab}$ to $H^{ab}$ that is independent of the
choice of representatives $g_i$ for H in G in (23). Proving that this map is a homomorphism
directly is not completely trivial. The same formula also defines the transfer homomorphism
when G is infinite and H is a subgroup of finite index in G.

As an example of the transfer, suppose H = nZ and G = Z and choose 0, 1, 2, ..., n - 1
as coset representatives for H in G. If g = 1, then all the elements $h(g, g_i)$ are 0 for
i = 1, 2, ..., n - 1 and h(1, n - 1) = n. Hence the transfer map from Z to nZ maps 1
to n, so is simply multiplication by the index. Similarly, the transfer map from any cyclic
group G to a subgroup H of index n is the $n^{\text{th}}$ power map. See also Exercise 8.

For the cyclic group $\mathbb{F}_p^\times$ for an odd prime p and subgroup $\{\pm 1\}$, it follows that the
transfer map is the homomorphism $\mathrm{Ver} : \mathbb{F}_p^\times \to \{\pm 1\}$ given by

$$\mathrm{Ver}(a) = a^{(p-1)/2} = \Big( \frac{a}{p} \Big) = \begin{cases} +1 & \text{if } a \text{ is a square} \\ -1 & \text{if } a \text{ is not a square} \end{cases}$$

(the symbol $\big( \frac{a}{p} \big)$ is called the Legendre symbol or the quadratic residue symbol). If instead
we take the elements 1, 2, ..., (p - 1)/2 as coset representatives for $\{\pm 1\}$ in $\mathbb{F}_p^\times$ we see
that

$$\Big( \frac{a}{p} \Big) = (-1)^{m(a)}$$

where m(a) is the number of elements among a, 2a, ..., (p - 1)a/2 whose least positive
remainder modulo p is greater than (p - 1)/2 (in which case the element differs by -1 from
one of our chosen coset representatives and contributes one factor of -1 to the product in
(24)). This result is known as Gauss’ Lemma in elementary number theory and can be used
to prove Gauss’ celebrated Quadratic Reciprocity Law (cf. also Exercise 15).
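
The transfer formula (24) for this pair of groups is easy to evaluate numerically. The following sketch is an illustration only (the function names `transfer`, `euler` and `is_square` are hypothetical, and p = 23 is an arbitrary choice): it computes the product of the elements h(a, g_i) using the coset representatives 1, 2, ..., (p - 1)/2 and compares the result with the power a^((p-1)/2) and with a direct test of whether a is a square.

```python
def transfer(a, p):
    """Ver(a) for G = F_p^x and H = {+1, -1}, computed as the product of h(a, g_i),
    where a*g_i = g_j*h(a, g_i) and the g_i are the representatives 1, ..., (p-1)/2."""
    half = (p - 1) // 2
    ver = 1
    for g in range(1, half + 1):
        r = (a * g) % p                  # least positive remainder of a*g_i
        h = 1 if r <= half else -1       # r is a representative g_j, or r = -g_j
        ver *= h
    return ver

def euler(a, p):
    """a^((p-1)/2) mod p, reported as +1 or -1."""
    return 1 if pow(a, (p - 1) // 2, p) == 1 else -1

def is_square(a, p):
    """Brute-force test for a being a nonzero square modulo p."""
    return any((x * x - a) % p == 0 for x in range(1, p))

p = 23  # arbitrary odd prime chosen for the illustration
for a in range(1, p):
    assert transfer(a, p) == euler(a, p) == (1 if is_square(a, p) else -1)

print([a for a in range(1, p) if transfer(a, p) == 1])
```

The number of factors equal to -1 in the loop is exactly the count m(a) in Gauss' Lemma, which is why the product agrees with the Legendre symbol.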

Next we give two important interpretations of $H^1(G, A)$ in terms of semidirect
products. If A is a G-module, let E be the semidirect product $E = A \rtimes G$, where A
is normal in E and the action of G (viewed as a subgroup of E) on A by conjugation
is the same as its G-module action: $g a g^{-1} = g \cdot a$. In the notation of Section 5.5,
$E = A \rtimes_\varphi G$, where $\varphi$ is the homomorphism of G into Aut(A) given by the G-module
action. In particular, E will be the direct product of A and G if and only if G acts
trivially on A. As in Section 5.5, we shall write the elements of E as (a, g) where
$a \in A$ and $g \in G$, with group operation

$$(a_1, g_1)(a_2, g_2) = (a_1 + g_1 \cdot a_2,\ g_1 g_2).$$

Definition. Let X be any group and let Y be a normal subgroup of X. The stability
group of the series $1 \trianglelefteq Y \trianglelefteq X$ is the group of all automorphisms of X that map Y to
itself and act as the identity on both of the factors Y and X/Y, i.e.,

$$\mathrm{Stab}(1 \trianglelefteq Y \trianglelefteq X) = \{ \sigma \in \mathrm{Aut}(X) \mid \sigma(y) = y \text{ for all } y \in Y \text{ and } \sigma(x) \equiv x \bmod Y \text{ for all } x \in X \}.$$

In the special case where Y is an abelian normal subgroup of X, conjugation by
elements of Y induces (inner) automorphisms of X that stabilize the series $1 \trianglelefteq Y \trianglelefteq X$,
and in this case $Y/C_Y(X)$ is isomorphic to a subgroup of $\mathrm{Stab}(1 \trianglelefteq Y \trianglelefteq X)$ (where
$C_Y(X)$ is the elements of Y in the center of X).

Proposition 31. Let A be a G-module and let E be the semidirect product $A \rtimes G$. For
each cocycle $f \in Z^1(G, A)$ define $\sigma_f : E \to E$ by

$$\sigma_f((a, g)) = (a + f(g), g).$$

Then the map $f \mapsto \sigma_f$ is a group isomorphism from $Z^1(G, A)$ onto $\mathrm{Stab}(1 \trianglelefteq A \trianglelefteq E)$.
Under this isomorphism the subgroup $B^1(G, A)$ of coboundaries maps onto the
subgroup $A/C_A(E)$ of the stability group.

Proof: It is an exercise to see that the cocycle condition implies σ_f is an automorphism
of E that stabilizes the chain 1 ⊴ A ⊴ E. Likewise one checks directly that
σ_{f_1 + f_2} = σ_{f_1} ∘ σ_{f_2}, so the map f ↦ σ_f is a group homomorphism. By definition of σ_f
this map is injective. Conversely, let σ ∈ Stab(1 ⊴ A ⊴ E). Since σ acts trivially on
E/A, each element (0, g) in this semidirect product maps under σ to another element
(a, g) in the same coset of A; define f_σ : G → A by letting f_σ(g) = a. If we identify
A with the elements of the form (a, 1) in E, then the group operation in E shows that

f_σ(g) = σ((0, g))(0, g)^{-1}.

Because σ is a stability automorphism of E, it is easy to check that f_σ satisfies the
cocycle condition. It follows immediately from the definitions that f_{σ_f} = f, so the
map f ↦ σ_f is an isomorphism.

Now f is a coboundary if and only if there is some x ∈ A such that f(g) = x − g · x
for all g ∈ G. Thus f is a coboundary if and only if σ_f((a, g)) = (a + x − g · x, g). But
conjugation in E by the element (x, 1) maps (a, g) to the same element (a + x − g · x, g),
so the automorphism σ_f is conjugation by (x, 1). This proves the remaining assertion
of the proposition.

Corollary 32. In the notation of Proposition 31 let φ_a denote the automorphism of E
given by conjugation by a for any a ∈ A. Then the cocycles f_1 and f_2 are in the same
cohomology class in H^1(G, A) if and only if σ_{f_1} = φ_a ∘ σ_{f_2} for some a ∈ A.

The proposition and corollary show that 1-cocycles may be computed by finding
automorphisms of E that stabilize the series 1 ⊴ A ⊴ E, and vice versa. The first
cohomology group is then given by taking these automorphisms modulo inner automorphisms,
i.e., is the group of “outer stability automorphisms” of this series.

Example 

Let G = Z_2 act by inversion on A = Z/4Z. The corresponding semidirect product
E = A ⋊ G is the dihedral group of order 8, which has automorphism group isomorphic
to D_8: viewing E as a normal (index 2) subgroup of D_16, conjugation in the latter group
restricted to E exhibits 8 distinct automorphisms of E (cf. Proposition 17 in Section 4.4).
The subgroup A of E is characteristic in E, hence every automorphism of E sends A to
itself, and therefore also acts on E/A (necessarily trivially since |E/A| = 2). Half the
automorphisms of E invert A and half centralize A; in fact, the cyclic subgroup of order 8
in D_16 (which contains A) maps to a cyclic group of order 4 of automorphisms centralizing
A. Thus Stab(1 ⊴ A ⊴ E) ≅ Z_4 ≅ Z^1(G, A). Since the center of E is a subgroup of A of
order 2, |A/Z(E)| = 2 = |B^1(G, A)|. This proves |H^1(G, A)| = 2.
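
The orders found in this example can be confirmed by direct enumeration. The following Python sketch is only an illustration and not part of the argument above: it models G = Z_2 additively as {0, 1} (an assumption of the sketch), lists the 1-cocycles and 1-coboundaries for the inversion action on Z/4Z, and checks that each map σ_f of Proposition 31 is an automorphism of E. The helper names act, mul, sigma and is_automorphism are ad hoc.

```python
from itertools import product

N = 4            # A = Z/4Z, written additively
G = [0, 1]       # G = Z_2 modeled additively: 0 is the identity (assumption of this sketch)

def act(g, a):
    """G-module action: the nonidentity element of G inverts A."""
    return a % N if g == 0 else (-a) % N

def mul(x, y):
    """Group law in E: (a1, g1)(a2, g2) = (a1 + g1.a2, g1 g2)."""
    (a1, g1), (a2, g2) = x, y
    return ((a1 + act(g1, a2)) % N, (g1 + g2) % 2)

def is_cocycle(f):
    """1-cocycle condition f(gh) = f(g) + g.f(h); f is the tuple (f(0), f(1))."""
    return all(f[(g + h) % 2] == (f[g] + act(g, f[h])) % N for g in G for h in G)

# All functions G -> A that satisfy the cocycle condition.
Z1 = [f for f in product(range(N), repeat=2) if is_cocycle(f)]
# Coboundaries g -> x - g.x for x in A (duplicates collapse in the set).
B1 = {tuple((x - act(g, x)) % N for g in G) for x in range(N)}

E = [(a, g) for a in range(N) for g in G]

def sigma(f):
    """The map sigma_f : (a, g) -> (a + f(g), g), stored as a dictionary on E."""
    return {(a, g): ((a + f[g]) % N, g) for (a, g) in E}

def is_automorphism(s):
    bijective = len(set(s.values())) == len(E)
    multiplicative = all(s[mul(x, y)] == mul(s[x], s[y]) for x in E for y in E)
    return bijective and multiplicative

print(len(Z1), len(B1), len(Z1) // len(B1))   # orders of Z^1, B^1 and H^1
print(all(is_automorphism(sigma(f)) for f in Z1))
```

The first printed line gives the orders 4, 2 and 2 obtained in the example.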

In the semidirect product E the subgroup G is a complement to A, i.e., E = AG
and A ∩ G = 1; moreover, every E-conjugate of G is also a complement to A. But A
may have complements in E that are not conjugate to G in E. Our second interpretation
of H^1(G, A) shows that this cohomology group characterizes the E-conjugacy classes
of complements of A in E.

Proposition 33. Let A be a G-module and let E be the semidirect product A ⋊ G. For
each 1-cocycle f let

G_f = {(f(g), g) | g ∈ G}.

Then G_f is a subgroup complement to A in E. The map f ↦ G_f is a bijection from
Z^1(G, A) to the set of complements to A in E. Two complements are conjugate in
E if and only if their corresponding 1-cocycles are in the same cohomology class in
H^1(G, A), so there is a bijection between H^1(G, A) and the set of E-conjugacy classes
of complements to A.

Proof: By the cocycle condition,

(f(g), g)(f(h), h) = (f(g) + g f(h) g^{-1}, gh) = (f(g) + g · f(h), gh) = (f(gh), gh),

and it follows that G_f is closed under the group operation in E. As observed earlier, each
cocycle necessarily has f(1) = 0, so G_f contains the identity (0, 1) of E. The inverse
to (f(g), g) in E is (f(g^{-1}), g^{-1}), so G_f is closed under inverses. This proves G_f is a
subgroup of E. Since the distinct elements of G_f represent the distinct cosets of A in
E, G_f is a complement to A in E. Distinct cocycles give different coset representatives,
hence they determine different complements.

Conversely, if C is any complement to A in E, then C contains a unique coset
representative a_g g of Ag for each g ∈ G. Since C is closed under the group operation
the element (a_g g)(a_h h) = (a_g g a_h g^{-1})gh represents the coset Agh, and so a_{gh} is
a_g g a_h g^{-1} = a_g (g · a_h) (written additively in A this becomes a_{gh} = a_g + (g · a_h)). This
shows that the map f : G → A given by f(g) = a_g is a cocycle, and so C = G_f.
Hence there is a bijection between 1-cocycles and complements to A in E.

Since Stab(1 ⊴ A ⊴ E) normalizes A it permutes the complements to A in E.
In the notation of Proposition 31, for 1-cocycles f_1 and f_2 it follows immediately
from the definition that σ_{f_1}(G_{f_2}) = G_{f_1 + f_2}. This shows that the permutation action of
Stab(1 ⊴ A ⊴ E) on the set of complements to A in E is the (left) regular representation
of this group. Furthermore, if a ∈ A and φ_a is the stability automorphism conjugation
by a, then

a G_f a^{-1} = φ_a(G_f) = G_{f + β_a} (17.25)

where β_a is the 1-coboundary β_a : g ↦ a − g · a. Since G_f is a complement to A, any
e ∈ E may be written as ag for some a ∈ A and g ∈ G_f. Then e G_f e^{-1} = a G_f a^{-1},
i.e., the E-conjugates of G_f are just the A-conjugates of G_f. Now the complements
G_{f_1} and G_{f_2} are conjugate in E if and only if G_{f_2} = a G_{f_1} a^{-1} = G_{f_1 + β_a} for some
a ∈ A by (25). This shows two complements are conjugate in E if and only if their
corresponding cocycles differ by a coboundary, i.e., represent the same cohomology
class in H^1(G, A), which completes the proof.

Corollary 34. Under the notation of Proposition 33, all complements to A are conjugate
in E if and only if H^1(G, A) = 0.

Corollary 35. If A is a finite abelian group whose order is relatively prime to |G| then
all complements to A in any semidirect product E = A ⋊ G are conjugate in E.

Examples 

(1) Let A = ⟨a⟩ and G = ⟨g⟩ both be cyclic of order 2. The group G must act trivially
on A, hence A ⋊ G = A × G is a Klein 4-group. Here A ⋊ G is abelian, so every
subgroup is conjugate only to itself, and since H^1(G, A) = Hom(Z_2, Z/2Z) has order
2, there are precisely two complements to A in E, namely ⟨g⟩ and ⟨ag⟩.

(2) If A = ⟨a⟩ is cyclic of order 2 and G = ⟨x⟩ × ⟨y⟩ is a Klein 4-group, then as before
G must act trivially on A, so H^1(G, A) = Hom(Z_2 × Z_2, Z/2Z) has order 4. The
four complements to A in A × G are G, ⟨ax, y⟩, ⟨x, ay⟩ and ⟨ax, ay⟩.

(3) Proposition 33 can also be used to compute H^1(G, A). Let A = ⟨r⟩ be cyclic of
order 4 and let G = ⟨s⟩ be cyclic of order 2 acting on A by inversion: s r s^{-1} = r^{-1}
as in the Example following Corollary 32. Then A ⋊ G is the dihedral group D_8 of
order 8. The subgroup A has four complements in D_8, namely the groups generated
by each of the four elements of order 2 not in A: ⟨s⟩, ⟨r^2 s⟩, ⟨rs⟩ and ⟨r^3 s⟩. The
former pair and the latter pair are conjugate in D_8 (in both cases via r), but ⟨s⟩ is
not conjugate to ⟨rs⟩. Thus A has 2 conjugacy classes of complements in A ⋊ G
and hence H^1(Z_2, Z/4Z) has order 2. This also follows from the computation of the
cohomology of cyclic groups in Section 2.
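
Example (3) is small enough to check by machine. The sketch below is an illustration under stated assumptions, not the text's method: it writes A = ⟨r⟩ additively as Z/4Z (the integer a stands for r^a), writes G = ⟨s⟩ as {0, 1}, forms the complements G_f of Proposition 33, and counts their conjugacy classes in D_8. All function names are ad hoc.

```python
N = 4                      # A = <r> = Z/4Z, written additively: a stands for r^a
G = [0, 1]                 # G = <s> = Z_2: 1 stands for s

def act(g, a):
    """Inversion action of s on A."""
    return a % N if g == 0 else (-a) % N

def mul(x, y):
    """Group law in E = A x| G (semidirect product)."""
    (a1, g1), (a2, g2) = x, y
    return ((a1 + act(g1, a2)) % N, (g1 + g2) % 2)

def inv(x):
    a, g = x
    # (a, g)^{-1} = (-(g^{-1}.a), g^{-1}); every g in Z_2 is its own inverse.
    return (act(g, -a), g)

E = [(a, g) for a in range(N) for g in G]
identity = (0, 0)

# For this action the 1-cocycles are f = (0, x) with x in A arbitrary,
# so the complements G_f = {(f(g), g)} are the sets {identity, (x, 1)}.
complements = [frozenset({identity, (x, 1)}) for x in range(N)]

# Cross-check: a complement has order 2 and meets A trivially, so it is
# generated by an element of order 2 outside A.
involutions_outside_A = [e for e in E if e[1] == 1 and mul(e, e) == identity]
assert {frozenset({identity, e}) for e in involutions_outside_A} == set(complements)

def conjugate(e, C):
    """The subgroup e C e^{-1}."""
    return frozenset(mul(mul(e, c), inv(e)) for c in C)

# Each class is the set of all E-conjugates of one complement.
classes = {frozenset(conjugate(e, C) for e in E) for C in complements}
print(len(complements), len(classes))
```

The two printed numbers are the number of complements (four) and the number of conjugacy classes of complements (two), the latter being the order of H^1(Z_2, Z/4Z).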


EXERCISES 

1. Let G be the cyclic group of order 2 and let A be a G-module. Compute the isomorphism
types of Z^1(G, A), B^1(G, A) and H^1(G, A) for each of the following:

(a) A = Z/4Z (trivial action),

(b) A = Z/2Z × Z/2Z (trivial action),

(c) A = Z/2Z × Z/2Z (any nontrivial action).

2. Let p be a prime and let P be a p-group. 

(a) Show that H^1(P, F_p) ≅ P/Φ(P), where Φ(P) is the Frattini subgroup of P (cf. the
exercises in Section 6.1).

(b) Deduce that the dimension of H^1(P, F_p) as a vector space over F_p equals the minimum
number of generators of P. [Use Exercise 26(c), Section 6.1.]

3. If G is the cyclic group of order 2 acting by inversion on Z show that |H^1(G, Z)| = 2.
[Show that in E = Z ⋊ G every element of E − Z has order 2, and there are two conjugacy
classes in this coset.]

4. Let A be the Klein 4-group and let G = Aut(A) ≅ S_3 act on A in the natural fashion. Prove
that H^1(G, A) = 0. [Show that in the semidirect product E = A ⋊ G, G is the normalizer
of a Sylow 3-subgroup of E. Apply Sylow’s Theorem to show all complements to A in E
are conjugate.]

5. Let G be the cyclic group of order 2 acting on an elementary abelian 2-group A of order
2^n. Show that H^1(G, A) = 0 if and only if n = 2k and |A^G| = 2^k. [In E = A ⋊ G show
that (a, x) is an element of order 2 if and only if a ∈ A^G, where G = ⟨x⟩. Then compare
the number of complements to A with the number of E-conjugates of x.]

6. (Thompson Transfer Lemma) Let G be a finite group of even order, let T be a Sylow
2-subgroup of G, let M ≤ T with |T : M| = 2, and let x be an element of order 2 in
G. Show that if G has no subgroup of index 2 then M contains some G-conjugate of x as
follows:

(a) Let Ver : G/[G, G] → T/[T, T] be the transfer homomorphism. Show that

Ver(x) = ∏_g g^{-1} x g mod [T, T]

where the product is over representatives g of the cosets gT that are fixed under left
multiplication by x.

(b) Show that under left multiplication x fixes an odd number of left cosets of T in G.

(c) Show that if G has no subgroup of index 2 then Ver(x) ∈ M/[T, T]. Deduce that for
some g ∈ G we must have g^{-1} x g ∈ M. [Consider the product Ver(x) in the group
T/M of order 2.]

7. Let H be a subgroup of G and let x ∈ G. The transfer Ver : G/[G, G] → H/[H, H]
may be computed as follows: let O_1, O_2, ..., O_k be the distinct orbits of x acting by
left multiplication on the left cosets of H in G, let O_i have length n_i and let g_i H be any
representative of O_i.

(a) Show that O_i = {g_i H, x g_i H, x^2 g_i H, ..., x^{n_i − 1} g_i H} and that g_i^{-1} x^{n_i} g_i ∈ H.

(b) Show that Ver(x) = ∏_{i=1}^{k} g_i^{-1} x^{n_i} g_i mod [H, H].

8. Assume the center, Z(G), of G is of index m. Prove that Ver(x) = x^m, for all x ∈ G,
where Ver is the transfer homomorphism from G/[G, G] to Z(G). [Use the preceding
exercise.]

9. Let p be a prime, let n > 3, and let V be an n-dimensional vector space over F_p with
basis v_1, v_2, ..., v_n. Let V be a module for the symmetric group S_n, where each π ∈ S_n
permutes the basis in the natural way: π(v_i) = v_{π(i)}.

(a) Show that |H^1(S_n, V)| = 1 if p ≠ 2, and |H^1(S_n, V)| = 2 if p = 2. [Use Shapiro’s Lemma.]

(b) Show that H^1(A_n, V) = 0 for all primes p.

10. Let V be the natural permutation module for S_n over F_2, n > 3, as described in the
preceding exercise, and let W = {a_1 v_1 + ··· + a_n v_n | a_1 + ··· + a_n = 0} (the “trace
zero” submodule of V). Show that if n is even then H^1(A_n, W) ≠ 0. [Show that in the
semidirect product V ⋊ A_n the element v_1 induces a nontrivial outer automorphism on
E = W ⋊ A_n that stabilizes the series 1 ⊴ W ⊴ E.]

11. Let F be a field of characteristic not dividing n and let α be any nonzero element in F.
Let K be a Galois extension of F containing the splitting field of x^n − α, and let α^{1/n} be
a fixed nth root of α in K.

(a) Prove that σ(α^{1/n})/α^{1/n} is an nth root of unity.

(b) Prove that the function f(σ) = σ(α^{1/n})/α^{1/n} is a 1-cocycle of G with values in the
group μ_n of nth roots of unity in K (note μ_n is not assumed to be contained in F).

(c) Prove that the 1-cocycle obtained by a different choice of nth root of α in K differs
from the 1-cocycle in (b) by a 1-coboundary.

12. Let F be a field of characteristic not dividing n that contains the nth roots of unity, and
suppose L/F is a Galois extension with abelian Galois group of exponent dividing n.
Prove that L is the composite of cyclic extensions of F whose degrees are divisors of n
and use this to prove that there is a bijection between the subgroups of the multiplicative
group F^×/(F^×)^n and such extensions L.

13. The Galois group of the extension C/R is the cyclic group G = ⟨τ⟩ of order 2 generated
by complex conjugation τ. Prove that H^2(G, C^×) ≅ R^×/R^+ ≅ Z/2Z where R^+ denotes
the positive real numbers.

14. For any group G let Ĝ = Hom(G, Q/Z) denote its dual group.

(a) If φ : G_1 → G_2 is a group homomorphism prove that composition with φ induces a
homomorphism φ̂ : Ĝ_2 → Ĝ_1 on their dual groups.

(b) For any fixed g in G, show that evaluation at g gives a homomorphism φ_g from Ĝ to
Q/Z.

(c) Prove that the map taking g ∈ G to φ_g in (b) defines a homomorphism from G to its
double dual, the dual of Ĝ.

(d) Prove that if G is a finite abelian group then the homomorphism in (c) is an isomorphism
of G with its double dual. (By Exercise 14 in Section 5.2 the group G is
(noncanonically) isomorphic to its dual Ĝ. This shows that G is canonically isomorphic
to its double dual: the isomorphism is independent of any choice of generators
for G.)

(e) If ψ : Ĝ_2 → Ĝ_1 is a homomorphism where G_1 and G_2 are finite abelian groups,
then by (a) and (d) there is an induced homomorphism φ : G_1 → G_2. Prove that
φ(g_1) = g_2 if χ(g_2) = χ'(g_1) for χ' = ψ(χ).

15. Use Gauss’ Lemma in the computation of the transfer map for F_p^× to {±1} to prove that
2 is a square modulo the odd prime p if and only if p ≡ ±1 mod 8. [Count how many
elements in 2, 4, ..., p − 1 are greater than (p − 1)/2.]


17.4 GROUP EXTENSIONS, FACTOR SETS AND H^2(G, A)


If A is a G-module then from the definition of the coboundary map d_2 in equation (18)
a function f from G × G to A is a 2-cocycle if it satisfies the identity

f(g, h) + f(gh, k) = g · f(h, k) + f(g, hk) for all g, h, k ∈ G. (17.26)

Equivalently, a 2-cocycle is determined by a collection {a_{g,h}}_{g,h ∈ G} of elements
in A satisfying a_{g,h} + a_{gh,k} = g · a_{h,k} + a_{g,hk} for g, h, k ∈ G (and then the
2-cocycle f is the function sending (g, h) to a_{g,h}).

A 2-cochain f is a coboundary if there is a function f_1 : G → A such that

f(g, h) = g · f_1(h) − f_1(gh) + f_1(g), for all g, h ∈ G (17.27)

i.e., f is the image under d_1 of the 1-cochain f_1.
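
Conditions (17.26) and (17.27) are finite checks when G and A are finite. As a small illustration (the choice of module is an assumption of this sketch, not taken from the text at this point), the following Python code takes G = Z_2 = {0, 1} acting by inversion on A = Z/4Z, enumerates all 2-cochains, and counts the 2-cocycles and 2-coboundaries. The names gmul, is_2cocycle and d1 are ad hoc.

```python
from itertools import product

N = 4            # A = Z/4Z, written additively
G = [0, 1]       # G = Z_2 modeled additively; 1 acts on A by inversion

def act(g, a):
    return a % N if g == 0 else (-a) % N

def gmul(g, h):
    """Group operation in G."""
    return (g + h) % 2

def is_2cocycle(f):
    """Condition (17.26): f(g,h) + f(gh,k) = g.f(h,k) + f(g,hk)."""
    return all(
        (f[g, h] + f[gmul(g, h), k]) % N == (act(g, f[h, k]) + f[g, gmul(h, k)]) % N
        for g, h, k in product(G, repeat=3)
    )

def d1(f1):
    """Formula (17.27): (d_1 f_1)(g,h) = g.f_1(h) - f_1(gh) + f_1(g)."""
    return {(g, h): (act(g, f1[h]) - f1[gmul(g, h)] + f1[g]) % N for g in G for h in G}

pairs = list(product(G, repeat=2))
cochains2 = [dict(zip(pairs, v)) for v in product(range(N), repeat=len(pairs))]
Z2 = [f for f in cochains2 if is_2cocycle(f)]
# Images of all 1-cochains f_1 (tuples (f_1(0), f_1(1))), stored in hashable form.
B2 = {tuple(sorted(d1(f1).items())) for f1 in product(range(N), repeat=len(G))}

assert all(is_2cocycle(dict(b)) for b in B2)   # every coboundary is a cocycle
print(len(Z2), len(B2), len(Z2) // len(B2))    # orders of Z^2, B^2 and H^2
```

For this module the quotient has order 2, in agreement with the formula A^G/NA for the second cohomology of a cyclic group.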

One of the main results of this section is to make a connection between the 2-cocycles
Z^2(G, A) and the factor sets associated to a group extension of G by A, which
arise when considering the effect of choosing different coset representatives in defining
the multiplication in the extension. In particular, we shall show that there is a bijection
between equivalence classes of group extensions of G by A (with the action of G on A
fixed) and the elements of H^2(G, A).

We first observe some basic facts about extensions. Let E be any group extension
of G by A,

1 → A →[ι] E →[π] G → 1. (17.28)

The extension (28) determines an action of G on A, as follows. For each g ∈ G let e_g
be an element of E mapping onto g by π (the choice of such a set of representatives
for G in E is called a set-theoretic section of π). The element e_g acts by conjugation
on the normal subgroup ι(A) of E, mapping ι(a) to e_g ι(a) e_g^{-1}. Any other element in
E that maps to g is of the form e_g ι(a_1) for some a_1 ∈ A, and since ι(A) is abelian,
conjugation by this element on ι(A) is the same as conjugation by e_g, so is independent
of the choice of representative for g. Hence G acts on ι(A), and so also on A since ι
is injective. Since conjugation is an automorphism, the extension (28) defines A as a
G-module.

Recall from Section 10.5 that two extensions 1 → A →[ι_1] E_1 →[π_1] G → 1 and
1 → A →[ι_2] E_2 →[π_2] G → 1 are equivalent if there is a group isomorphism β : E_1 → E_2
such that the following diagram commutes:

1 → A →[ι_1] E_1 →[π_1] G → 1
    ↓ id      ↓ β        ↓ id          (17.29)
1 → A →[ι_2] E_2 →[π_2] G → 1

In this case we simply say β is the equivalence between the two extensions. As noted
in Section 10.5, equivalence of extensions is reflexive, symmetric and transitive. We
also observe that

equivalent extensions define the same G-module structure on A.

To see this assume (29) is an equivalence, let g be any element of G and let e_g be any
element of E_1 mapping onto g by π_1. The action of g on A given by conjugation in
E_1 maps each a to ι_1^{-1}(e_g ι_1(a) e_g^{-1}). Let e'_g = β(e_g). Since the diagram commutes,
π_2(e'_g) = g, so the action of g on A in the second extension is given by conjugation
by e'_g. This conjugation maps a to ι_2^{-1}(e'_g ι_2(a) e'_g^{-1}). Since ι_1, ι_2 and β are injective,
the two actions of g on a are equal if and only if they result in the same image in E_2,
i.e., β ∘ ι_1(ι_1^{-1}(e_g ι_1(a) e_g^{-1})) = e'_g ι_2(a) e'_g^{-1}. This equality is now immediate from the
definition of e'_g and the commutativity of the diagram.

We next see how an extension as in (28) defines a 2-cocycle in Z^2(G, A). For
simplicity we identify A as a subgroup of E via ι and we identify G as E/A via π.

Definition. A map μ : G → E with π ∘ μ(g) = g and μ(1) = 0, i.e., so that for
each g ∈ G, μ(g) is a representative of the coset Ag of E and the identity of E (which
is the zero of A) represents the identity coset, is called a normalized section of π.

Fix a section μ of π in (28). Each element of E may be written uniquely in the
form aμ(g), where a ∈ A and g ∈ G. For g, h ∈ G the product μ(g)μ(h) in E lies in
the coset Agh, so there is a unique element f(g, h) in A such that

μ(g)μ(h) = f(g, h)μ(gh) for all g, h ∈ G. (17.30)

If in addition μ is normalized at the identity we also have

f(g, 1) = 0 = f(1, g) for all g ∈ G. (17.31)

Definition. The function f defined by equation (30) is called the factor set for the
extension E associated to the section μ. If f also satisfies (31) then f is called a
normalized factor set.

We shall see in the examples following that it is possible for different sections μ to
give the same factor set f.

We now verify that the factor set f is in fact a 2-cocycle. First note that the group
operation in E may be written

(a_1 μ(g))(a_2 μ(h)) = (a_1 + μ(g) a_2 μ(g)^{-1}) μ(g)μ(h)
                     = (a_1 + g · a_2)(μ(g)μ(h))                    (17.32)
                     = (a_1 + g · a_2 + f(g, h)) μ(gh)

where g · a_2 denotes the G-module action of g on a_2 given by conjugation in E. Now
use (32) and the associative law in E to compute the product μ(g)μ(h)μ(k) in two
different ways:

(μ(g)μ(h))μ(k) = (f(g, h) + f(gh, k)) μ(ghk)
                                                                    (17.33)
μ(g)(μ(h)μ(k)) = (g · f(h, k) + f(g, hk)) μ(ghk).

It follows that the factors in A of the two right hand sides in (33) are equal for every
g, h, k ∈ G, and this is precisely the 2-cocycle condition (26) for f. This shows that
the factor set associated to the extension E and any choice of section μ is an element
in Z^2(G, A).
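
To see (17.30) through (17.33) in a concrete case, one can compute a factor set by machine. The sketch below makes its own illustrative choices, none of which come from the text: E is the dihedral group of order 8 realized as Z/4Z ⋊ Z_2, A is its center (identified with Z/2Z), G = E/A is a Klein 4-group labeled by pairs (a mod 2, g), and μ is one particular normalized section.

```python
from itertools import product

N = 4

def act(g, a):
    """Inversion action used to build D_8 as Z/4Z x| Z_2."""
    return a % N if g == 0 else (-a) % N

def mul(x, y):
    (a1, g1), (a2, g2) = x, y
    return ((a1 + act(g1, a2)) % N, (g1 + g2) % 2)

def inv(x):
    a, g = x
    return (act(g, -a), g)

# G = E/A, a Klein 4-group; its elements are labeled (a mod 2, g).
G = list(product([0, 1], repeat=2))

def gmul(x, y):
    return ((x[0] + y[0]) % 2, (x[1] + y[1]) % 2)

def mu(x):
    """A normalized section G -> E: lift the label (u, g) to the element (u, g) of E."""
    return x

def factor_set(x, y):
    """Solve mu(x) mu(y) = f(x, y) mu(xy) for f(x, y) in A, as in (17.30)."""
    c = mul(mul(mu(x), mu(y)), inv(mu(gmul(x, y))))
    assert c in [(0, 0), (2, 0)]         # c lies in the central subgroup A
    return c[0] // 2                      # identify A = {(0,0), (2,0)} with Z/2Z

f = {(x, y): factor_set(x, y) for x in G for y in G}

# A is central, so G acts trivially and (17.26) reads f(g,h) + f(gh,k) = f(h,k) + f(g,hk).
cocycle_ok = all(
    (f[g, h] + f[gmul(g, h), k]) % 2 == (f[h, k] + f[g, gmul(h, k)]) % 2
    for g, h, k in product(G, repeat=3)
)
normalized = all(f[g, (0, 0)] == 0 == f[(0, 0), g] for g in G)
print(cocycle_ok, normalized)
```

Both checks succeed, as the argument above predicts. For this section the values f((1,0),(0,1)) and f((0,1),(1,0)) differ, reflecting the fact that E is nonabelian.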

We next see how the factor set f depends on the choice of section μ. Suppose μ' is
another section for the same extension E in (28), and let f' be its associated factor set.
Then for all g ∈ G both μ(g) and μ'(g) lie in the same coset Ag, so there is a function
f_1 : G → A such that μ'(g) = f_1(g)μ(g) for all g. Then

μ'(g)μ'(h) = f'(g, h)μ'(gh) = (f'(g, h) + f_1(gh))μ(gh).

We also have

μ'(g)μ'(h) = (f_1(g)μ(g))(f_1(h)μ(h)) = (f_1(g) + g · f_1(h))(μ(g)μ(h))
           = (f_1(g) + g · f_1(h) + f(g, h))μ(gh).

Equating the factors in A in these two expressions for μ'(g)μ'(h) shows that

f'(g, h) = f(g, h) + (g · f_1(h) − f_1(gh) + f_1(g)) for all g, h ∈ G,

in other words f and f' differ by the 2-coboundary of f_1 as in (27).

We have shown that the factor sets associated to the extension E corresponding to
different choices of sections give 2-cocycles in Z^2(G, A) that differ by a coboundary
in B^2(G, A). Hence associated to the extension E is a well defined cohomology class
in H^2(G, A) determined by the factor set in (30) for any choice of section μ.

If the extension E of G by A is a split extension (which is to say that E = A ⋊ G
is the semidirect product of G by A with the given conjugation action of G on A), then
there is a section μ of G that is a homomorphism from G to E. In this case the factor
set f in (30) is identically 0: f(g, h) = 0 for all g, h ∈ G. Hence the cohomology
class in H^2(G, A) defined by a split extension is the trivial class.

Suppose now that β is an equivalence between the extension in (28) and an extension

1 → A →[ι'] E' →[π'] G → 1.

If μ is a section of π, then μ' = β ∘ μ is a section of π', so what we have just proved can
be used to determine the cohomology class in H^2(G, A) corresponding to E'. Applying
the homomorphism β to equation (30) gives

β(μ(g))β(μ(h)) = β(f(g, h))β(μ(gh)) for all g, h ∈ G.

Since β restricts to the identity map on A, this is

μ'(g)μ'(h) = f(g, h)μ'(gh) for all g, h ∈ G,

which shows that the factor set for E' associated to μ' is the same as the factor set for
E associated to μ. This proves that equivalent extensions define the same cohomology
class in H^2(G, A).
We next show how this procedure may be reversed: Given a class in H^2(G, A)
we construct an extension E_f whose corresponding factor set is in the given class in
H^2(G, A). The process generalizes the semidirect product construction of Section 5.5
(which is the special case when f is the zero cocycle representing the trivial class).

Note first that any 2-cocycle arising from the factor set of an extension as above
where the section μ is normalized satisfies the condition in (31).

Definition. A 2-cocycle f such that f(g, 1) = 0 = f(1, g) for all g ∈ G is called a
normalized 2-cocycle.

The construction of E_f is a little simpler when f is a normalized cocycle and for
simplicity we indicate the construction in this case (the minor modifications necessary
when f is not normalized are indicated in Exercise 4).

We first see that any 2-cocycle f lies in the same cohomology class as a normalized
2-cocycle. Let d_1 f_1 be the 2-coboundary of the constant function f_1 on G whose value is
f(1, 1). Then f(1, 1) = d_1 f_1(1, 1), and one easily checks from the 2-cocycle condition
that f − d_1 f_1 is normalized.

We may therefore assume that our cohomology class in H^2(G, A) is represented
by the normalized 2-cocycle f. Let E_f be the set A × G, and define a binary operation
on E_f by

(a_1, g)(a_2, h) = (a_1 + g · a_2 + f(g, h), gh) (17.34)

where, as usual, g · a_2 denotes the module action of G on A. It is straightforward to
check that the group axioms hold: Since f is normalized, the identity element is (0, 1)
and inverses are given by

(a, g)^{-1} = (−g^{-1} · a − f(g^{-1}, g), g^{-1}). (17.35)

The cocycle condition implies the associative law by calculations similar to (32) and
(33) earlier; the details are left as exercises.
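
The group axioms for E_f can be verified exhaustively in a small case. The following sketch is an illustration with assumed data: G = Z_2 = {0, 1} acts trivially on A = Z/2Z, and f is the normalized 2-cocycle whose only nonzero value is f(1, 1) = 1 (here 1 denotes the nonidentity element of G). It implements (17.34) and (17.35) and checks associativity, the identity, and the inverse formula.

```python
from itertools import product

n = 2                       # A = Z/2Z (additive); G = Z_2 = {0, 1} acts trivially
G = [0, 1]
A = list(range(n))

def f(g, h):
    """Normalized 2-cocycle with f(1, 1) = 1 and all other values 0."""
    return 1 if (g, h) == (1, 1) else 0

def mul(x, y):
    """Operation (17.34) with trivial action: (a1,g)(a2,h) = (a1 + a2 + f(g,h), gh)."""
    (a1, g), (a2, h) = x, y
    return ((a1 + a2 + f(g, h)) % n, (g + h) % 2)

def inv(x):
    """Formula (17.35) with trivial action: (a,g)^{-1} = (-a - f(g^{-1}, g), g^{-1})."""
    a, g = x
    g_inv = (-g) % 2
    return ((-a - f(g_inv, g)) % n, g_inv)

E = list(product(A, G))
e = (0, 0)                  # the identity (0, 1) of the text, in additive notation for G

associative = all(mul(mul(x, y), z) == mul(x, mul(y, z)) for x, y, z in product(E, repeat=3))
identity_ok = all(mul(e, x) == x == mul(x, e) for x in E)
inverses_ok = all(mul(x, inv(x)) == e == mul(inv(x), x) for x in E)

def order(x):
    """Order of x in E_f."""
    k, y = 1, x
    while y != e:
        y, k = mul(y, x), k + 1
    return k

print(associative, identity_ok, inverses_ok, order((0, 1)))
```

The element (0, 1) has order 4, so for this cocycle E_f is cyclic of order 4 rather than a Klein 4-group.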

Since f is a normalized 2-cocycle, A* = {(a, 1) | a ∈ A} is a subgroup of E_f, and
the map ι* : a ↦ (a, 1) is an isomorphism from A to A*. Moreover, from (34) and
(35) it follows that

(0, g)(a, 1)(0, g)^{-1} = (g · a, 1) for all g ∈ G and all a ∈ A. (17.36)

Since E_f is generated by A* together with the set of elements (0, g) for g ∈ G, (36)
implies that A* is a normal subgroup of E_f. Furthermore, it is immediate from (34)
that the map π* : (a, g) ↦ g is a surjective homomorphism from E_f to G with kernel
A*, i.e., E_f/A* ≅ G. Thus

1 → A →[ι*] E_f →[π*] G → 1 (17.37)

is a specific extension of G by A, where (36) ensures also that the action of G on
A by conjugation in this extension is the module action specified in determining the
2-cocycle f in H^2(G, A). The extension sequence (37) shows that this extension has
the normalized section μ(g) = (0, g) whose corresponding normalized factor set is f.
Note that this proves not only that every cohomology class in H^2(G, A) arises from
some extension E, but that every normalized 2-cocycle arises as the normalized factor
set of some extension.

Finally, suppose f' is another normalized 2-cocycle in the same cohomology class
in H^2(G, A) as f and let E_{f'} be the corresponding extension. If f and f' differ by the
coboundary of f_1 : G → A then f(g, h) − f'(g, h) = g · f_1(h) − f_1(gh) + f_1(g) for
all g, h ∈ G. Setting g = h = 1 shows that f_1(1) = 0. Define

β : E_f → E_{f'} by β((a, g)) = (a + f_1(g), g).

It is immediate that β is a bijection, and

β((a_1, g)(a_2, h)) = β((a_1 + g · a_2 + f(g, h), gh))
                    = (a_1 + g · a_2 + f(g, h) + f_1(gh), gh)
                    = (a_1 + f_1(g) + g · (a_2 + f_1(h)) + f'(g, h), gh)
                    = (a_1 + f_1(g), g)(a_2 + f_1(h), h) = β((a_1, g))β((a_2, h))

shows that β is an isomorphism from E_f to E_{f'}.

The restriction of β to A is given by β((a, 1)) = (a + f_1(1), 1) = (a, 1), so β is
the identity map on A. Similarly β is the identity map on the second component of
(a, g), so β induces the identity map on the quotient G. It follows that β defines an
equivalence between the extensions E_f and E_{f'}. This shows that the equivalence class
of the extension E_f depends only on the cohomology class of f in H^2(G, A).

We summarize this discussion in the following theorem. 

Theorem 36. Let A be a G-module. Then

(1) A function f : G × G → A is a normalized factor set of some extension E of
G by A (with conjugation given by the G-module action on A) if and only if f
is a normalized 2-cocycle in Z^2(G, A).

(2) There is a bijection between the equivalence classes of extensions E as in (1)
and the cohomology classes in H^2(G, A). The bijection takes an extension E
into the class of a normalized factor set f for E associated to any normalized
section μ of G into E, and takes a cohomology class c in H^2(G, A) to the
extension E_f defined by the extension (37) for any normalized cocycle f in the
class c.

(3) Under the bijection in (2), split extensions correspond to the trivial cohomology 
class. 

Corollary 37. Every extension of G by the abelian group A splits if and only if
H^2(G, A) = 0.


Corollary 38. If A is a finite abelian group and (|A|, |G|) = 1 then every extension of
G by A splits.

Proof: This follows immediately from Corollary 29 in Section 2. 

We can use Corollary 38 to prove the same result without the restriction that A be 
an abelian group. 
Theorem 39. (Schur’s Theorem) If E is any finite group containing a normal subgroup
N whose order and index are relatively prime, then N has a complement in E.

Remark: Recall that a subgroup whose order and index are relatively prime is called
a Hall subgroup, so Schur’s Theorem says that every normal Hall subgroup has a
complement that splits the group as a semidirect product.

Proof: We use induction on the order of E. Since we may assume N ≠ 1, let p be
a prime dividing |N| and let P be a Sylow p-subgroup of N. Let E_0 be the normalizer
in E of P and let N_0 = N ∩ E_0. By Frattini’s Argument (Proposition 6 in Section 6.1)
E = E_0 N. It follows from the Second Isomorphism Theorem that N_0 is a (normal)
Hall subgroup of E_0 and |E_0 : N_0| = |E : N| (cf. Exercise 10 of Section 3.3).

If E_0 < E, then by induction applied to N_0 in E_0 we obtain that E_0 contains a
complement K to N_0. Since |K| = |E_0 : N_0|, K is also a complement to N in E, as
needed. Thus we may assume E_0 = E, i.e., P is normal in E.

Since the center of P, Z(P), is characteristic in P, it is normal in E (cf. Section
4.4). If Z(P) = N, then N is abelian and the theorem follows from Corollary 38. Thus
we may assume Z(P) ≠ N. Let bars denote passage to the quotient group E/Z(P).
Then N̄ is a normal Hall subgroup of Ē. By induction it has a complement K̄ in Ē.
Let E_1 be the complete preimage of K̄ in E. Then |E_1| = |K̄||Z(P)| = |E/N||Z(P)|,
so Z(P) is a normal Hall subgroup of E_1. By induction Z(P) has a complement in
E_1 which is seen by order considerations to also be a complement to N in E. This
completes the proof.

Examples 

(1) If G = Z_2 and A = Z/2Z then G acts trivially on A and so H^2(G, A) = A^G/NA =
Z/2Z by the computation of the cohomology of cyclic groups in Section 2, so by
Theorem 36 there are precisely two inequivalent extensions of G by A. These are
the cyclic group of order 4 and the Klein 4-group, the latter being split and hence
corresponding to the trivial class in H^2.

(2) If G = ⟨g⟩ ≅ Z_2 and A = ⟨a⟩ ≅ Z/4Z is a group of order 4 on which G acts
trivially, then H^2(G, A) = A/2A ≅ Z/2Z by the computation of the cohomology
of cyclic groups. As in the previous example there are two inequivalent extensions
of G by A; evidently these are the groups Z_8 and Z_4 × Z_2, the latter split extension
corresponding to the trivial cohomology class.

If E = ⟨r⟩ × ⟨s⟩ denotes the split extension of G by A, where |r| = 4 and
|s| = 2, then μ_i(g) = r^i s for i = 0, ..., 3 give the four normalized sections of G in
E. The sections μ_0, μ_2 both give the zero factor set f. The sections μ_1, μ_3 both give
the factor set f' with f'(g, g) = a^2 ∈ A. Both f and f' give normalized 2-cocycles
lying in the trivial cohomology class of H^2(G, A). The extension E_f corresponding
to the zero 2-cocycle f is the group with the elements (a, 1) and (1, g) as the usual
generators (of orders 4 and 2, respectively) for Z_4 × Z_2. In E_{f'}, however, (a, 1) has
order 4 but so does (1, g) since (1, g)^2 = (f'(g, g), g^2) = (a^2, 1). The 2-cocycles f
and f' differ by the coboundary of f_1 with f_1(1) = 1 and f_1(g) = a. The isomorphism
β((a, g)) = (a + f_1(g), g) from E_f to E_{f'} maps the generators (a, 1) and (1, g) of E_f
to the generators (a, 1) and (a, g) of E_{f'} and gives the explicit equivalence of these
two extensions.
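
The equivalence just described can be checked directly. The sketch below is an illustration in additive notation (an assumption of the sketch: the integer k stands for a^k in A, and G = {0, 1} with 1 standing for g), so the element written (1, g) above appears as (0, 1). It builds E_f and E_{f'} from (17.34), confirms that f and f' differ by the coboundary of f_1, and checks that β is an isomorphism restricting to the identity on A.

```python
from itertools import product

n = 4                        # A = <a> = Z/4Z written additively: k stands for a^k
G = [0, 1]                   # G = <g> = Z_2: 1 stands for g; the action on A is trivial

def make_mul(f):
    """Group law (17.34) on A x G for a 2-cocycle f (a dict) and trivial action."""
    def mul(x, y):
        (a1, g), (a2, h) = x, y
        return ((a1 + a2 + f[g, h]) % n, (g + h) % 2)
    return mul

f_zero = {(g, h): 0 for g in G for h in G}
f_prime = dict(f_zero)
f_prime[1, 1] = 2            # f'(g, g) = a^2

f1 = {0: 0, 1: 1}            # f_1 sends the identity of G to the identity of A and g to a
d1_f1 = {(g, h): (f1[h] - f1[(g + h) % 2] + f1[g]) % n for g in G for h in G}
differ_by_coboundary = all((f_zero[p] - f_prime[p]) % n == d1_f1[p] for p in f_zero)

mul_f, mul_fp = make_mul(f_zero), make_mul(f_prime)
E = list(product(range(n), G))

def beta(x):
    """The map (a, g) -> (a + f_1(g), g) from E_f to E_{f'}."""
    a, g = x
    return ((a + f1[g]) % n, g)

is_hom = all(beta(mul_f(x, y)) == mul_fp(beta(x), beta(y)) for x in E for y in E)
is_bijection = len({beta(x) for x in E}) == len(E)
fixes_A = all(beta((a, 0)) == (a, 0) for a in range(n))

def order(x, mul):
    k, y = 1, x
    while y != (0, 0):
        y, k = mul(y, x), k + 1
    return k

print(differ_by_coboundary, is_hom, is_bijection, fixes_A)
print(order((0, 1), mul_f), order((0, 1), mul_fp))
```

The last line prints the order of the element (1, g) in E_f and in E_{f'}, namely 2 and 4, matching the discussion above.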

The situation where G acts on A by inversion is handled in Exercise 3. 
(3) Suppose G = Z_2 and A is the Klein 4-group. If G acts nontrivially on A then G
interchanges two of the nonidentity elements, say a and b, of A and fixes the third
nonidentity element c. Then A^G = NA = {1, c} and so H^2(G, A) = 0, and so every
extension E of G by A splits. This can be seen directly, as follows. Since the action
is nontrivial, such a group must be nonabelian, hence must be D_8. From the lattice of
D_8 in Section 2.5 one sees that for each Klein 4-group there is a subgroup of order 2
in D_8 not contained in the 4-group and that subgroup splits the extension.

If G acts trivially on A then H^2(G, A) = A/2A = A, so there are 4 inequivalent
extensions of G by A in this case. These are considered in Exercise 1.

Example: (Groups of Order 8 and H^2(Z_2 × Z_2, Z/2Z))

Let G = {1, a, b, c} be the Klein 4-group and let A = Z/2Z. The 2-group G must act
trivially on A. The elements of H^2(G, A) classify extensions E of order 8 which have a
quotient group by some Z_2 subgroup that is isomorphic to the Klein 4-group. Although
there are, up to group isomorphism, only four such groups, we shall see that there are eight
inequivalent extensions.

Since G × G has 16 elements, we have |C^2(G, A)| = 2^16. The cocycle condition (26)
here reduces to

f(g, h) + f(gh, k) = f(h, k) + f(g, hk) for all g, h, k ∈ G. (17.38)

The following relations hold for the subgroup Z^2(G, A) of cocycles:

(1) f(g, 1) = f(1, g) = f(1, 1), for all g ∈ G

(2) f(g, 1) + f(g, a) + f(g, b) + f(g, c) = 0, for all g ∈ G

(3) f(1, h) + f(a, h) + f(b, h) + f(c, h) = 0, for all h ∈ G.

The first of these comes from (38) by setting h = k = 1 and by setting g = h = 1. The other
two relations come from (38) by setting g = h and h = k, respectively, using relations (1)
and (2). It follows that every 2-cocycle f can be represented by a vector (α, β, γ, δ, ε) in
F_2^5 where

α = f(1, g) = f(g, 1), for all g ∈ G,
β = f(a, a), γ = f(a, b), δ = f(b, a), ε = f(b, b)

because the relations above then determine the remaining values of f:

f(a, c) = α + β + γ    f(b, c) = α + δ + ε    f(c, a) = α + β + δ

f(c, b) = α + γ + ε    f(c, c) = α + β + γ + δ + ε.

It follows that |Z^2(G, A)| ≤ 2^5. Although one could eventually show that every function
satisfying these relations is a 2-cocycle (hence the order is exactly 32), this will follow
from other considerations below.

A cocycle $f$ is a coboundary if there is a function $f_1 : G \to A$ such that

$$f(g,h) = f_1(h) - f_1(gh) + f_1(g), \quad \text{for all } g,h \in G.$$

This coboundary condition is easily seen to be equivalent to the conditions:

(i) $f(g,1) = f(1,g) = f(g,g)$ for all $g \in G$, and

(ii) $f(g,h) = f(g',h')$ whenever $g, h$ are distinct nonidentity elements and so are $g', h'$.

These relations are equivalent to $\alpha = \beta = \epsilon$ and $\gamma = \delta$. Thus $B^2(G, A)$ consists of the
vectors $(\alpha, \alpha, \gamma, \gamma, \alpha)$, and so $H^2(G, A)$ has dimension at most 3 (i.e., order at most $2^3 = 8$).
It is easy to see that $\{(0, \beta, \gamma, 0, \epsilon)\}$ with $\beta$, $\gamma$, and $\epsilon$ in $\mathbb{F}_2$ gives a set of representatives
for $Z^2(G, A)/B^2(G, A)$, and each of these representative cocycles is normalized. We
now prove $|H^2(G, A)| = 8$ (and also that $|Z^2(G, A)| = 2^5$) by explicitly exhibiting eight
inequivalent group extensions.
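
The counts above can also be checked by brute force. The following Python sketch is only an illustration, not part of the argument: it encodes the Klein 4-group as pairs of bits (an encoding chosen here for convenience), filters all $2^{16}$ cochains by condition (17.38), and collects the images of the $2^4$ functions $f_1 : G \to A$ under the coboundary formula. All function and variable names are hypothetical.

```python
from itertools import product

# Klein 4-group encoded as bit pairs under componentwise XOR (an assumed
# encoding); the names 1, a, b, c follow the text.
ELEMS = {"1": (0, 0), "a": (1, 0), "b": (0, 1), "c": (1, 1)}
NAMES = {bits: name for name, bits in ELEMS.items()}
G = list(ELEMS)

def mul(g, h):
    """Product in the Klein 4-group."""
    x, y = ELEMS[g], ELEMS[h]
    return NAMES[(x[0] ^ y[0], x[1] ^ y[1])]

PAIRS = [(g, h) for g in G for h in G]  # the 16 elements of G x G

def is_cocycle(f):
    """Condition (17.38) with values in Z/2Z, where addition is XOR."""
    return all(
        (f[g, h] ^ f[mul(g, h), k]) == (f[h, k] ^ f[g, mul(h, k)])
        for g in G for h in G for k in G
    )

# Z^2(G, A): all cochains G x G -> Z/2Z that satisfy the cocycle condition.
cocycles = []
for values in product((0, 1), repeat=len(PAIRS)):
    f = dict(zip(PAIRS, values))
    if is_cocycle(f):
        cocycles.append(f)

# B^2(G, A): images of all functions f1 : G -> A under
# f(g, h) = f1(h) - f1(gh) + f1(g), computed mod 2.
coboundaries = set()
for values in product((0, 1), repeat=len(G)):
    f1 = dict(zip(G, values))
    coboundaries.add(tuple(f1[h] ^ f1[mul(g, h)] ^ f1[g] for g, h in PAIRS))

num_cocycles = len(cocycles)
num_coboundaries = len(coboundaries)
order_H2 = num_cocycles // num_coboundaries
print(num_cocycles, num_coboundaries, order_H2)
```

The enumeration is exhaustive, so it also gives a direct check of the formula for $f(c,c)$ in terms of $\alpha, \beta, \gamma, \delta, \epsilon$.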

Suppose $E$ is an extension of $G$ by $A$, where for simplicity we assume $A \le E$. If
$\mu : G \to E$ is a section, the factor set for $E$ associated to $\mu$ satisfies

$$\mu(g)\mu(h) = f(g,h)\mu(gh).$$

The group $E$ is generated by $\mu(a)$, $\mu(b)$ and $A$, and $A$ is contained in the center of $E$ since
$G$ acts trivially on $A$. Hence $E$ is abelian if and only if $\mu(a)\mu(b) = \mu(b)\mu(a)$, which by
the relation above occurs if and only if $f(a,b) = f(b,a)$. If $g$ is a nonidentity element in
$G$, we also see from the relation above that $\mu(g)$ is an element of order 2 in $E$ if and only if
$f(g,g) = 0$. Because $A$ is contained in the center of $E$, both elements in any nonidentity
coset $A\mu(g)$ have the same order (either 2 or 4).

There are four groups of order 8 containing a normal subgroup of order 2 with quotient
group isomorphic to the Klein 4-group: $Z_2 \times Z_2 \times Z_2$, $Z_4 \times Z_2$, $D_8$, and $Q_8$.

The group $E = Z_2 \times Z_2 \times Z_2$ is the split extension of $G$ by $A$ and has $f = 0$ as factor
set.

When $E = Q_8$, in the usual notation for the quaternion group $A = \langle -1 \rangle$. In this
(nonabelian) group every nonidentity coset consists of elements of order 4, and this property is
unique to $Q_8$, so the resulting factor set $f$ satisfies $f(g,g) \ne 0$ for all nonidentity elements
in $G$.

When $E = Z_4 \times Z_2 = \langle x \rangle \times \langle y \rangle$ we must have $A = \langle x^2 \rangle$. The cosets $Ax$ and $Axy$
both consist of elements of order 4, and the coset $Ay$ consists of elements of order 2, so
exactly one of $\mu(a)$, $\mu(b)$ or $\mu(c)$ is an element of order 2 and the other two must be of
order 4. This suggests three homomorphisms from $E$ to $G$, defined on generators by

$$\pi_1(y) = a \qquad \pi_1(x) = b$$
$$\pi_2(y) = b \qquad \pi_2(x) = a$$
$$\pi_3(y) = c \qquad \pi_3(x) = a.$$

Each of these homomorphisms maps surjectively onto $G$, has $A$ as kernel, and has $\mu(a)$
(respectively, $\mu(b)$, $\mu(c)$) an element of order 2 in $E$. Any isomorphism of $E$ with itself that
is the identity on $A$ must take the unique nonidentity coset $Ay$ of $A$ consisting of elements
of order 2 to itself. Hence any extension equivalent to the extension $E_1$ defined by $\pi_1$ also
maps $y$ to $a$ (since the equivalence is the identity on $G$). It follows that the three extensions
defined by $\pi_1$, $\pi_2$ and $\pi_3$ are inequivalent.

The situation when $E = D_8 = \langle r, s \rangle$ is similar. In this case $A = \langle r^2 \rangle$, the cosets $As$
and $Asr$ consist of elements of order 2, and the coset $Ar$ consists of elements of order 4.
In this case exactly one of $\mu(a)$, $\mu(b)$ or $\mu(c)$ is an element of order 4 and the other two
are of order 2, suggesting the three homomorphisms defined on generators by

$$\pi_1(r) = a \qquad \pi_1(s) = b$$
$$\pi_2(r) = b \qquad \pi_2(s) = a$$
$$\pi_3(r) = c \qquad \pi_3(s) = a.$$

As before, the corresponding extensions are inequivalent.

The existence of 8 inequivalent extensions of $G$ by $A$ proves that $|H^2(G, A)| = 8$,
and hence that these are a complete list of all the inequivalent extensions. In particular,
the extension $E_1' = Z_4 \times Z_2$ defined by the homomorphism $\pi_1'$ mapping $y$ to $a$ and $x$ to $c$
must be equivalent to the extension $E_1$ above (and similarly for the other two extensions
isomorphic to $Z_4 \times Z_2$ and the three extensions for $D_8$). This proves the existence of
certain outer automorphisms for these groups, cf. Exercise 9.

Remark: For any prime $p$ the cohomology groups of the elementary abelian group $E_{p^m}$ with
coefficients in the finite field $\mathbb{F}_p$ may be determined by relating them to the cohomology
groups of the factors in the direct product as mentioned at the end of Section 2. In general,
$H^2(E_{p^m}, \mathbb{F}_p)$ is a vector space over $\mathbb{F}_p$ of dimension $\frac{1}{2}m(m+1)$. When $p = 2$ and $m = 2$
this is the result $H^2(Z_2 \times Z_2, \mathbb{Z}/2\mathbb{Z}) \cong (\mathbb{Z}/2\mathbb{Z})^3$ above.


Crossed Product Algebras and the Brauer Group 

Suppose F is a field. Recall that an F-algebra B is a ring containing the field F in its 
center and the identity of B is the identity of F, cf. Section 10.1. 

Definition. An F-algebra A is said to be simple if A contains no nontrivial proper 
(two sided) ideals. A central simple F -algebra A is a simple F-algebra whose center 
is F. 

Among the easiest central simple $F$-algebras are the matrix algebras $M_n(F)$ of
$n \times n$ matrices with coefficients in $F$.

If $K/F$ is a finite Galois extension of fields with Galois group $G = \text{Gal}(K/F)$,
then we can use the normalized 2-cocycles in $Z^2(G, K^\times)$ to construct certain central
simple $F$-algebras. The construction of these algebras from 2-cocycles and their
classification in terms of $H^2(G, K^\times)$ (cf. Theorem 42 below) are important applications
of cohomological methods in number theory. Their construction in the case when $G$ is
cyclic was one of the precursors leading to the development of abstract cohomology.

Suppose $f = \{a_{\sigma,\tau}\}_{\sigma,\tau \in G}$ is a normalized 2-cocycle in $Z^2(G, K^\times)$. Let $B_f$ be the
vector space over $K$ having basis $u_\sigma$ for $\sigma \in G$:

$$B_f = \Big\{ \sum_{\sigma \in G} \alpha_\sigma u_\sigma \ \Big|\ \alpha_\sigma \in K \Big\}. \tag{17.39}$$

Define a multiplication on $B_f$ by

$$u_\sigma \alpha = \sigma(\alpha)\, u_\sigma \qquad u_\sigma u_\tau = a_{\sigma,\tau}\, u_{\sigma\tau} \tag{17.40}$$

for $\alpha \in K$ and $\sigma, \tau \in G$. The second equation shows that the $a_{\sigma,\tau}$ give a "factor
set" for the elements in $B_f$ and is one reason this terminology is used. Using this
multiplication we find

$$(u_\sigma u_\tau) u_\rho = a_{\sigma,\tau}\, a_{\sigma\tau,\rho}\, u_{\sigma\tau\rho} \quad \text{and} \quad u_\sigma (u_\tau u_\rho) = \sigma(a_{\tau,\rho})\, a_{\sigma,\tau\rho}\, u_{\sigma\tau\rho}.$$

Since $a_{\sigma,\tau}\, a_{\sigma\tau,\rho} = \sigma(a_{\tau,\rho})\, a_{\sigma,\tau\rho}$ is the multiplicative form of the cocycle condition (26),
it follows that the multiplication defined in (40) is associative.

Since the cocycle is normalized we have $a_{1,\sigma} = a_{\sigma,1} = 1$ for all $\sigma \in G$ and it
follows from (40) that the element $u_1$ is an identity in $B_f$. Identifying $K$ with the
elements $\alpha u_1$ in $B_f$, we see that $B_f$ is an $F$-algebra containing the field $K$ and having
dimension $n^2$ over $F$ if $n = [K : F] = |G|$.
Proposition 40. The $F$-algebra $B_f$ with $K$-vector space basis $u_\sigma$ in (39) and
multiplication defined by (40) is a central simple $F$-algebra.

Proof: It remains to show that the center of $B_f$ is $F$ and that $B_f$ contains no
nonzero proper ideals. Suppose $x = \sum_{\sigma \in G} \alpha_\sigma u_\sigma$ is an element in the center of $B_f$.

Then $x\beta = \beta x$ for $\beta \in K$ shows that $\sigma(\beta) = \beta$ if $\alpha_\sigma \ne 0$. Since there is an element
$\beta \in K$ not fixed by $\sigma$ for any $\sigma \ne 1$, this shows that $\alpha_\sigma = 0$ for all $\sigma \ne 1$, so $x = \alpha_1 u_1$.
Then $x u_\tau = u_\tau x$ if and only if $\tau(\alpha_1) = \alpha_1$, so if this is true for all $\tau$ then we must have
$\alpha_1 = a \in F$. Hence $x = a u_1$ and the center of $B_f$ is $F$.

To show that $B_f$ is simple, suppose $I$ is a nonzero ideal in $B_f$ and let

$$x = \alpha_{\sigma_1} u_{\sigma_1} + \cdots + \alpha_{\sigma_m} u_{\sigma_m}$$

be a nonzero element of $I$ with the minimal number $m$ of nonzero terms. If $m > 1$ there
is an element $\beta \in K^\times$ with $\sigma_m(\beta) \ne \sigma_{m-1}(\beta)$. Then the element $x - \sigma_m(\beta)\, x\, \beta^{-1}$ would
be an element of the ideal $I$ with the nonzero element $(1 - \sigma_m(\beta)\sigma_{m-1}(\beta)^{-1})\, \alpha_{\sigma_{m-1}}$ as
coefficient of $u_{\sigma_{m-1}}$, and would have fewer nonzero terms than $x$ since the coefficient
of $u_{\sigma_m}$ is 0. It follows that $m = 1$ and $x = \alpha u_\sigma$ for some $\alpha \in K$ and some $\sigma$. This
element is a unit, with inverse $\sigma^{-1}(\alpha^{-1})\, u_{\sigma^{-1}}$, so $I = B_f$, completing the proof.

Definition. The central simple $F$-algebra $B_f$ defined by (39) and (40) is called the
crossed product algebra for the factor set $\{a_{\sigma,\tau}\}$.

If $f' = \{a'_{\sigma,\tau}\}$ is a normalized cocycle in the same cohomology class in $H^2(G, K^\times)$
as $\{a_{\sigma,\tau}\}$ then there are elements $b_\sigma \in K^\times$ with

$$a'_{\sigma,\tau} = a_{\sigma,\tau}\, (\sigma(b_\tau)\, b_{\sigma\tau}^{-1}\, b_\sigma)$$

(the multiplicative form of the coboundary condition (27)). If $B_{f'}$ is the $F$-algebra
with $K$-basis $u'_\sigma$ defined from this cocycle as in (39) and (40), then the $K$-vector space
homomorphism $\varphi$ defined by mapping $u'_\sigma$ to $b_\sigma u_\sigma$ satisfies

$$\varphi(u'_\sigma u'_\tau) = \varphi(a'_{\sigma,\tau}\, u'_{\sigma\tau}) = a'_{\sigma,\tau}\, b_{\sigma\tau}\, u_{\sigma\tau} = b_\sigma\, \sigma(b_\tau)\, u_\sigma u_\tau$$

$$= (b_\sigma u_\sigma)(b_\tau u_\tau) = \varphi(u'_\sigma)\, \varphi(u'_\tau).$$

It follows that $\varphi$ is an $F$-algebra isomorphism from $B_{f'}$ to $B_f$.

We have shown that every cohomology class $c$ in $H^2(G, K^\times)$ defines an isomorphism
class of central simple $F$-algebras, namely the isomorphism class of any crossed
product algebra for a normalized cocycle $\{a_{\sigma,\tau}\}$ representing the class $c$. The next
result shows that the trivial cohomology class corresponds to the isomorphism class
containing $M_n(F)$.

Proposition 41. The crossed product algebra for the trivial cohomology class in
$H^2(G, K^\times)$ is isomorphic to the matrix algebra $M_n(F)$ where $n = [K : F]$.

Proof: If $\alpha \in K$ then multiplication by $\alpha$ defines a linear transformation $T_\alpha$ of
$K$ viewed as an $n$-dimensional vector space over $F$. Similarly, every automorphism
$\sigma \in G$ defines an $F$-linear transformation $T_\sigma$ of $K$, and we may view both $T_\alpha$ and $T_\sigma$ as
elements of $M_n(F)$ by choosing a basis for $K$ over $F$. If $B_0$ denotes the crossed product
algebra for the trivial factor set ($a_{\sigma,\tau} = 1$ for all $\sigma, \tau \in G$), consider the additive map
$\varphi : B_0 \to M_n(F)$ defined by $\varphi(\alpha u_\sigma) = T_\alpha T_\sigma$. Since $T_{a\alpha} = a T_\alpha$ for $a \in F$, the map $\varphi$
is an $F$-vector space homomorphism. If $x \in K$, we have

$$T_\sigma T_\alpha(x) = T_\sigma(\alpha x) = \sigma(\alpha x) = \sigma(\alpha)\sigma(x) = T_{\sigma(\alpha)} T_\sigma(x),$$

so $T_\sigma T_\alpha = T_{\sigma(\alpha)} T_\sigma$ as linear transformations on $K$. It then follows from $u_\sigma u_\tau = u_{\sigma\tau}$
that

$$\varphi((\alpha u_\sigma)(\beta u_\tau)) = \varphi(\alpha\sigma(\beta)\, u_{\sigma\tau}) = T_{\alpha\sigma(\beta)} T_{\sigma\tau} = T_\alpha T_{\sigma(\beta)} T_\sigma T_\tau$$
$$= T_\alpha T_\sigma T_\beta T_\tau = \varphi(\alpha u_\sigma)\, \varphi(\beta u_\tau)$$

which shows that $\varphi$ is an $F$-algebra homomorphism from $B_0$ to $M_n(F)$. Since $\ker \varphi$
is an ideal in $B_0$ and $\varphi \ne 0$, it follows from Proposition 40 that $\ker \varphi = 0$ and $\varphi$ is
an injection. Since both $B_0$ and $M_n(F)$ have dimension $n^2$ as vector spaces over $F$, it
follows that $\varphi$ is an $F$-algebra isomorphism, proving the proposition.
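
As a concrete sanity check of this proof, one can take $K = \mathbb{C}$ and $F = \mathbb{R}$, so that $G$ is generated by complex conjugation $\tau$ and $n = 2$. The Python sketch below is an illustration under that assumption only: it writes $T_\alpha$ and $T_\tau$ as $2 \times 2$ real matrices with respect to the basis $\{1, i\}$ (a basis chosen here, not fixed by the text) and tests the relation $T_\sigma T_\alpha = T_{\sigma(\alpha)} T_\sigma$ and the multiplicativity of $\varphi$ on a few sample elements. The helper names are hypothetical.

```python
def t_mult(alpha):
    """Matrix of multiplication by alpha on C in the R-basis {1, i}."""
    a, b = alpha.real, alpha.imag
    return ((a, -b), (b, a))

# Matrix of complex conjugation tau in the same basis.
T_TAU = ((1, 0), (0, -1))

def mat_mul(A, B):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def mat_add(A, B):
    """Sum of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(A[i][j] + B[i][j] for j in range(2)) for i in range(2))

def phi(x):
    """phi(x0*u_1 + x1*u_tau) = T_{x0} + T_{x1} T_tau."""
    x0, x1 = x
    return mat_add(t_mult(x0), mat_mul(t_mult(x1), T_TAU))

def b0_mul(x, y):
    """Product in B_0, using u_tau*z = conj(z)*u_tau and u_tau*u_tau = u_1."""
    x0, x1 = x
    y0, y1 = y
    return (x0 * y0 + x1 * y1.conjugate(), x0 * y1 + x1 * y0.conjugate())

# Sample elements with small integer coordinates, so float arithmetic is exact.
samples = [
    (complex(a, b), complex(c, d))
    for a, b, c, d in [(1, 2, 3, 4), (0, -1, 2, 5), (-3, 1, 0, 2)]
]

commutation_holds = all(
    mat_mul(T_TAU, t_mult(x0)) == mat_mul(t_mult(x0.conjugate()), T_TAU)
    for x0, _ in samples
)
is_multiplicative = all(
    phi(b0_mul(x, y)) == mat_mul(phi(x), phi(y))
    for x in samples for y in samples
)
print(commutation_holds, is_multiplicative)
```

A finite set of samples cannot replace the proof; the check only confirms that the formulas are consistent in this smallest case.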

Example 

If $K = \mathbb{C}$ and $F = \mathbb{R}$, then $G = \text{Gal}(\mathbb{C}/\mathbb{R})$ is of order 2 and generated by complex
conjugation $\tau$. We have $|H^2(G, \mathbb{C}^\times)| = 2$. The central simple $\mathbb{R}$-algebra $B_0$ corresponding
to the trivial class is $\mathbb{C}u_1 \oplus \mathbb{C}u_\tau$ with $u_\tau(a + bi) = (a - bi)u_\tau$ and $u_\tau^2 = u_1$. This is
isomorphic to the matrix algebra $M_2(\mathbb{R})$ under the map

$$\varphi((a + bi)u_1 + (c + di)u_\tau) = aI + bT_i + cT_\tau + dT_iT_\tau = \begin{pmatrix} a + c & -b + d \\ b + d & a - c \end{pmatrix}.$$

A normalized cocycle $f$ representing the nontrivial cohomology class is defined by the
values $a_{1,1} = a_{1,\tau} = a_{\tau,1} = 1$ and $a_{\tau,\tau} = -1$. The corresponding central simple $\mathbb{R}$-algebra
$B_f$ is given by $\mathbb{C}v_1 \oplus \mathbb{C}v_\tau$. The element $v_1$ is the identity of $B_f$, and we have the relations
$v_\tau(a + bi) = (a - bi)v_\tau$ and $v_\tau^2 = -v_1$. Letting $v_1 = 1$ and $v_\tau = j$ we see that $B_f$ is
isomorphic as an $\mathbb{R}$-algebra to the real Hamilton Quaternions $\mathbb{R} + \mathbb{R}i + \mathbb{R}j + \mathbb{R}k$.

There is a rich theory of simple algebras and we mention without proof the following
results. Let $A$ be a central simple $F$-algebra of finite dimension over $F$.

I. If $F \subseteq B \subseteq A$ where $B$ is a simple $F$-algebra define the centralizer $B^c$ of $B$ in $A$ to
be the elements of $A$ that commute with all the elements of $B$. Define the opposite
algebra $B^{opp}$ to be the set $B$ with opposite multiplication, i.e., the product $b_1 b_2$ in
$B^{opp}$ is given by the product $b_2 b_1$ in $B$. Both $B^c$ and $B^{opp}$ are simple $F$-algebras
and we have

a. $(\dim_F B)(\dim_F B^c) = \dim_F A$

b. $A \otimes_F B^{opp} \cong M_r(B^c)$ as $F$-algebras, where $r = \dim_F B$

c. $B \otimes_F B^c \cong A$ if $B$ is a central simple $F$-algebra.

II. If $A'$ is an Artinian (satisfies D.C.C. on left ideals) simple $F$-algebra, then $A \otimes_F A'$
is an Artinian simple $F$-algebra with center $(A')^c$.

III. We have $A \cong M_r(\Delta)$ for some division ring $\Delta$ whose center is $F$ and some integer
$r \ge 1$. The division ring $\Delta$ and $r$ are uniquely determined by $A$. The same
statement holds for any Artinian simple $F$-algebra.

The last result is part of Wedderburn's Theorem described in greater detail in the
following chapter.
Definition. If $A$ is a central simple $F$-algebra then a field $L$ containing $F$ is said to
split $A$ if $A \otimes_F L \cong M_m(L)$ for some $m \ge 1$.

It follows from (I) that every maximal commutative subalgebra of $\Delta$ is a field $E$
with $E = E^c = E^{opp}$; if $[E : F] = m$ we obtain $\dim_F \Delta = m^2$. Applying (I) to
$A = \Delta$ and $B = E$ we also see that $\Delta \otimes_F E \cong M_m(E)$. It can also be shown that a
maximal subfield $E$ of the central simple $F$-algebra $A$ also satisfies $E = E^c = E^{opp}$
and so again by (I) it follows that $A \otimes_F E \cong M_r(E)$ ($r^2 = \dim_F A$).

If $A = M_r(\Delta)$ then the field $L$ splits $A$ if and only if $L$ splits $\Delta$, as follows. If
$\Delta \otimes_F L \cong M_n(L)$ then

$$A \otimes_F L \cong M_r(\Delta) \otimes_F L \cong M_r(\Delta \otimes_F L) \cong M_r(M_n(L)) \cong M_{rn}(L).$$

Conversely if $A \otimes_F L \cong M_n(L)$ then

$$M_n(L) \cong M_r(\Delta) \otimes_F L \cong M_r(\Delta \otimes_F L).$$

By (II) and (III), $\Delta \otimes_F L \cong M_s(\Delta')$ for some division ring $\Delta'$. Together with the
previous isomorphism, the uniqueness statement in (III) shows that $\Delta' = L$ and then
the isomorphism $\Delta \otimes_F L \cong M_s(L)$ shows that $L$ splits $\Delta$.

We see from the discussion above that a maximal commutative subfield of $\Delta$ splits
both $\Delta$ and $A = M_r(\Delta)$ for any $r \ge 1$. It is not too difficult to show from this that
every central simple $F$-algebra of finite dimension over $F$ can be split by a finite Galois
extension of $F$.

Applying (I) by taking $A$ to be the crossed product algebra $B_f$ and taking $B = K$
shows that $K = K^c = K^{opp}$ and $B_f \otimes_F K \cong M_n(K)$. In particular, the crossed product
algebras $B_f$ are always split by $K$.

Example 

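In the example of the Hamilton Quaternions above we have $B_f \otimes_{\mathbb{R}} \mathbb{C} \cong M_2(\mathbb{C})$. We have

$B_f \otimes_{\mathbb{R}} \mathbb{C} = \mathbb{C} + \mathbb{C}i + \mathbb{C}j + \mathbb{C}k$ and an explicit isomorphism $\varphi$ to $M_2(\mathbb{C})$ is given by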



and extending C linearly. 
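
The two crossed product algebras for $\mathbb{C}/\mathbb{R}$ are small enough to compute with directly. The Python sketch below is a hypothetical illustration: an element $z_0 v_1 + z_1 v_\tau$ is stored as a pair of complex numbers, and the parameter `a` plays the role of the cocycle value $a_{\tau,\tau}$ (the representation as pairs and all names are assumptions made here). With `a = -1` it checks Hamilton's relations among $i$, $j = v_\tau$ and $k = ij$; with `a = +1` it exhibits a pair of zero divisors, as expected for an algebra isomorphic to $M_2(\mathbb{R})$.

```python
def cp_mul(x, y, a):
    """Product in the crossed product algebra for C/R.

    x and y are pairs (z0, z1) standing for z0*v_1 + z1*v_tau, and a is the
    cocycle value a_{tau,tau}. The rules applied are v_tau*z = conj(z)*v_tau
    and v_tau*v_tau = a*v_1.
    """
    x0, x1 = x
    y0, y1 = y
    return (x0 * y0 + a * x1 * y1.conjugate(),
            x0 * y1 + x1 * y0.conjugate())

def neg(x):
    """Additive inverse of a pair."""
    return (-x[0], -x[1])

ONE = (1 + 0j, 0j)
I = (1j, 0j)               # the complex number i, viewed inside the algebra
J = (0j, 1 + 0j)           # the basis element v_tau, called j in the text
K = cp_mul(I, J, -1)       # k = i*j

# Nontrivial class (a = -1): i^2 = j^2 = k^2 = -1 and j*i = -k.
quaternion_relations_hold = (
    cp_mul(I, I, -1) == neg(ONE)
    and cp_mul(J, J, -1) == neg(ONE)
    and cp_mul(K, K, -1) == neg(ONE)
    and cp_mul(J, I, -1) == neg(K)
)

# Trivial class (a = +1): (1 - u_tau)(1 + u_tau) = 0, so B_0 is not a
# division ring.
zero_divisor_product = cp_mul((1 + 0j, -1 + 0j), (1 + 0j, 1 + 0j), +1)

print(quaternion_relations_hold, zero_divisor_product)
```

The contrast between the two values of `a` mirrors the two classes in $H^2(G, \mathbb{C}^\times)$: one gives a matrix algebra and the other a division algebra.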

By (III) every central simple $F$-algebra $A$ is isomorphic as an $F$-algebra to $M_r(\Delta)$
for some division ring $\Delta$ uniquely determined up to $F$-isomorphism, called the division
ring part of $A$.

Definition. Two central simple $F$-algebras $A$ and $B$ are similar if $A \cong M_r(\Delta)$ and
$B \cong M_s(\Delta)$ for the same division ring $\Delta$, i.e., if $A$ and $B$ have the same division ring
parts.

Let $[A]$ denote the similarity class of $A$. By (II), if $A$ and $B$ are central simple
$F$-algebras then $A \otimes_F B$ is again a central simple $F$-algebra, so we may define a
multiplication on similarity classes by $[A][B] = [A \otimes_F B]$. The class $[F]$ is an
identity for this multiplication and associativity of the tensor product shows that the
multiplication is associative. By (Ib) applied with $B = A$ (so then $B^c = F$ since $A$ is
central) we have $[A][A^{opp}] = [F]$, so inverses exist with this multiplication.

Definition. The group of similarity classes of central simple $F$-algebras with
multiplication $[A][B] = [A \otimes_F B]$ is called the Brauer group of $F$ and is denoted $\text{Br}(F)$.

If $L$ is any extension field of $F$ then by (II) the algebra $A \otimes_F L$ is a central
simple $L$-algebra. It is easy to check that the map $[A] \mapsto [A \otimes_F L]$ is a well defined
homomorphism from $\text{Br}(F)$ to $\text{Br}(L)$. The kernel of this homomorphism consists of
the classes of the algebras $A$ with $A \otimes_F L \cong M_m(L)$ for some $m \ge 1$, i.e., the algebras
$A$ that are split by $L$.

Definition. If $L/F$ is a field extension then the relative Brauer group $\text{Br}(L/F)$ is the
group of similarity classes of central simple $F$-algebras that are split by $L$. Equivalently,
$\text{Br}(L/F)$ is the kernel of the homomorphism $[A] \mapsto [A \otimes_F L]$ from $\text{Br}(F)$ to $\text{Br}(L)$.

The following theorem summarizes some major results in this area and shows 
the fundamental connection between Brauer groups and the crossed product algebras 
constructed above. 

Theorem 42. Suppose $K/F$ is a Galois extension of degree $n$ with $G = \text{Gal}(K/F)$.

(1) The central simple $F$-algebra $A$ with $\dim_F A = n^2$ is split by $K$ if and only if
$A \otimes_F K \cong M_n(K)$ if and only if $A$ is isomorphic to a crossed product algebra
$B_f$ as in (39) and (40).

(2) There is a bijection between the $F$-isomorphism classes of central simple
$F$-algebras $A$ with $A \otimes_F K \cong M_n(K)$ and the elements of $H^2(G, K^\times)$. Under
this bijection the class $c \in H^2(G, K^\times)$ containing the normalized cocycle $f$
corresponds to the isomorphism class of the crossed product algebra $B_f$ defined
in (39) and (40), and the trivial cohomology class corresponds to $M_n(F)$.

(3) Every central simple $F$-algebra of finite dimension over $F$ and split by $K$ is
similar to one of dimension $n^2$ split by $K$. The bijection in (2) also establishes
a bijection between $\text{Br}(K/F)$ and $H^2(G, K^\times)$ which is also an isomorphism
of groups.

(4) There is a bijection between the collection of $F$-isomorphism classes of central
simple division algebras over $F$ that are split by $K$ and $H^2(G, K^\times)$.

As previously mentioned, every central simple $F$-algebra of finite dimension over
$F$ can be split by some finite Galois extension of $F$, and it follows that

$$\text{Br}(F) = \bigcup_K \text{Br}(K/F)$$

where the union is over all finite Galois extensions of $F$. It follows that there is a
bijection between $\text{Br}(F)$ and $H^2(\text{Gal}(F^s/F), (F^s)^\times)$ where $F^s$ denotes a separable
algebraic closure of $F$. Here $\text{Gal}(F^s/F)$ is considered as a profinite group and the
cohomology group refers to continuous Galois cohomology.

One consequence of this result and Theorem 42 is that a full set of representatives 
for the F-isomorphism classes of central simple division algebras A over F can be 
obtained from the division algebra parts of the crossed product algebras for finite Galois 
extensions of F. Those division algebras that are split over K occur for the crossed 
product algebras for K/F. 
Example

Since $H^2(\text{Gal}(\mathbb{F}_{q^d}/\mathbb{F}_q), \mathbb{F}_{q^d}^\times) = 0$ (cf. Exercise 10), we have $\text{Br}(\mathbb{F}_{q^d}/\mathbb{F}_q) = 0$ and hence
also $\text{Br}(\mathbb{F}_q) = 0$. As a consequence, every finite division algebra is a field (cf. Exercise
13 in Section 13.6 for a direct proof), and every finite central simple $\mathbb{F}_q$-algebra is
isomorphic to a full matrix ring $M_r(\mathbb{F}_q)$.


EXERCISES 

1. Let $A = \{1, a, b, c\}$ be the Klein 4-group and let $G = \langle g \rangle$ be the cyclic group of order 2
acting trivially on $A$.

(a) Prove that $|C^2(G, A)| = 2^8$.

(b) Show that coboundaries are constant functions, and deduce that $|B^2(G, A)| = 4$.

(c) Use the cocycle condition to show that $|Z^2(G, A)| \le 2^4$.

(d) If $E = Z_4 \times Z_2 = \langle x \rangle \times \langle y \rangle$, prove that the extensions $1 \to A \xrightarrow{\iota_i} E \xrightarrow{\pi} G \to 1$
defined by $\pi(x) = g$, $\pi(y) = 1$ and $\iota_1(a) = x^2$, $\iota_1(b) = y$ (respectively, $\iota_2(b) = x^2$,
$\iota_2(a) = y$, and $\iota_3(c) = x^2$, $\iota_3(a) = y$), together with the split extension $Z_2 \times Z_2 \times Z_2$
give 4 inequivalent extensions of $Z_2$ by the Klein 4-group. Deduce that $H^2(G, A)$
has order 4 by explicitly exhibiting the corresponding cocycles.

2. Let $A = \mathbb{Z}/4\mathbb{Z}$ and let $G$ be the cyclic group of order 2 acting trivially on $A$.

(a) Prove that $|C^2(G, A)| = 2^8$.

(b) Use the coboundary condition to show that $|B^2(G, A)| = 2^3$.

(c) Use the cocycle condition to show that $|Z^2(G, A)| \le 2^4$.

(d) Show that $|H^2(G, A)| = 2$ by exhibiting two inequivalent extensions of $G$ by $A$ and
their corresponding cocycles.

3. Let $A = \mathbb{Z}/4\mathbb{Z}$ and let $G$ be the cyclic group of order 2 acting by inversion on $A$.

(a) Show that there are four coboundaries and that only the zero coboundary is normalized.

(b) Prove by a direct computation of cocycle and coboundary groups that $|H^2(G, A)| = 2$.

(c) Exhibit two distinct cohomology classes and their corresponding extension groups.

(d) Show that for a given extension of $G$ by $A$ with extension group isomorphic to $D_8$
there are four normalized sections, all of which have the zero 2-cocycle as their factor
set.

(e) Show that for a given extension of $G$ by $A$ with extension group isomorphic to $Q_8$
there are sixteen sections, four of which are normalized, and all of the latter have the
same factor set.

4. For a non-normalized 2-cocycle $f$ one defines the extension group $E_f$ on the set $A \times G$
by the same binary operation in equation (34). Verify two of the group axioms in this case
by showing that identity is now $(-f(1,1), 1)$ and inverses are given by

$$(a, x)^{-1} = (-x^{-1} \cdot a - f(x^{-1}, x) - f(1,1),\ x^{-1}).$$

(Verification of the associative law is essentially the same as for normalized 2-cocycles.)
Prove also that the set $A^{**} = \{(a - f(1,1), 1) \mid a \in A\}$ is a subgroup of $E_f$ and the map
$\iota^{**} : a \mapsto (a - f(1,1), 1)$ is an isomorphism from $A$ to $A^{**}$. Show that this extension $E_f$,
with the injection $\iota^{**}$ and the usual projection map $\pi^*$ onto $G$, is equivalent to an extension
derived from a normalized cocycle in the same class as $f$.

5. Show that the set of equivalences of a given extension $1 \to A \xrightarrow{\iota} E \xrightarrow{\pi} G \to 1$ with itself
form a group under composition, and that this group is isomorphic to the stability group
$\text{Stab}(1 \trianglelefteq \iota(A) \trianglelefteq E)$. (Thus Proposition 31 implies $Z^1(G, A)$ is the group of equivalences
of the extension with itself).

6. (Gaschütz's Theorem) Let $p$ be a prime, let $A$ be an abelian normal $p$-subgroup of a finite
group $G$, and let $P$ be a Sylow $p$-subgroup of $G$. Prove that $G$ is a split extension of $G/A$
by $A$ if and only if $P$ is a split extension of $P/A$ by $A$. (Note that $A \le P$ by Exercise
37 in Section 4.5). [Use Sylow's Theorem to show if $G$ splits over $A$ then so too does $P$.
Conversely, show that a normalized 2-cocycle associated to the extension of $P/A$ by $A$ via
Theorem 36 is the image of a normalized 2-cocycle in $H^2(G/A, A)$ under the restriction
homomorphism $\text{Res} : H^2(G/A, A) \to H^2(P/A, A)$. Then use Proposition 26 and the
fact that multiplication by $|G : P|$ is an automorphism of $A$.]

7. (a) Prove that $H^2(A_4, \mathbb{Z}/2\mathbb{Z}) \ne 0$ by exhibiting a nonsplit extension of $A_4$ by a cyclic
group of order 2. [See Exercise 11, Section 4.5.]

(b) Prove that $H^2(A_5, \mathbb{Z}/2\mathbb{Z}) \ne 0$ by showing that $SL_2(\mathbb{F}_5)$ is a nonsplit extension of $A_5$
by a cyclic group of order 2. [Use Propositions 21 and 23 in Section 4.5.]

8. The Schur multiplier of a finite group $G$ is defined as the group $H^2(G, \mathbb{C}^\times)$, where the
multiplicative group $\mathbb{C}^\times$ of complex numbers is a trivial $G$-module. Prove that the Schur
multiplier is a finite group. [Show that every cohomology class contains a cocycle whose
values lie in the $n$th roots of unity, where $n = |G|$, as follows: If $f$ is any cocycle then
by Corollary 27, $f^n \in B^2(G, \mathbb{C}^\times)$. Define $k \in C^2(G, \mathbb{C}^\times)$ by $k(g_1, g_2) = f(g_1, g_2)^{1/n}$
(take any $n$th roots). Show that $k \in B^2(G, \mathbb{C}^\times)$ and $f k^{-1}$ takes values in the group of $n$th
roots of 1.]

9. Use the classification of the extensions of the Klein 4-group by Z2 in the example following 
Theorem 39 to prove the following (in the notation of that example): 

(a) There is an (outer) automorphism of Z4 x Z2 which interchanges the cosets Ax and 
Axy and fixes the coset Ay. 

(b) There is an outer automorphism of $D_8$ which interchanges the cosets $As$ and $Asr$ and
fixes the coset $Ar$.

10. Suppose $\mathbb{F}_q$ is a finite field with $G = \text{Gal}(\mathbb{F}_{q^d}/\mathbb{F}_q) = \langle \sigma_q \rangle$ where $\sigma_q$ is the Frobenius
automorphism, and let $N$ be the usual norm element for the cyclic group $G$.

(a) Use Hilbert's Theorem 90 to prove that $|{}_N(\mathbb{F}_{q^d}^\times)| = (q^d - 1)/(q - 1)$, and deduce that
the norm map from $\mathbb{F}_{q^d}$ to $\mathbb{F}_q$ is surjective.

(b) Prove that $H^n(G, \mathbb{F}_{q^d}^\times) = 0$ for all $n \ge 1$.
Part VI


INTRODUCTION TO 
THE REPRESENTATION THEORY 
OF FINITE GROUPS 


The final two chapters are an introduction to the representation theory of finite
groups together with some applications. We have already seen in Part I how actions of
groups on sets, namely permutation representations, are a fundamental tool for
unravelling the structure of groups. Cayley's Theorem and Sylow's Theorem as well as many
of the results and applications in Sections 6.1 and 6.2 are based on groups acting on sets.
The chapter on Galois Theory developed one of the most beautiful correspondences in
mathematics where the action of a group as automorphisms of a field gives rise to a
correspondence between the lattice of subgroups of the Galois group and the lattice of
subfields of a Galois extension of fields. In these final two chapters we study groups
acting as linear transformations on vector spaces. We shall be primarily interested in
utilizing these linear actions to provide information about the groups themselves.

In Part III we saw that modules are the "representation objects" for rings in the
sense that the axioms for an $R$-module specify a "ring action" of $R$ on some abelian
group $M$ which preserves the abelian group structure of $M$. In the case where $M$
was an $F[x]$-module, $x$ acted as a linear transformation from the vector space $M$ to
itself. In Chapter 12 the classification of finitely generated modules over Principal Ideal
Domains gave us a great deal of information about these linear transformations of $M$
(e.g., canonical forms). In Chapter 16 we used the ideal structure in Dedekind Domains
to generalize the results of Chapter 12 to the classification of finitely generated modules
over such domains. In this part we follow a process similar to the study of $F[x]$-modules,
replacing the polynomial ring with the group ring $FG$ of $G$ and classifying all finitely
generated $FG$-modules for certain fields $F$ (Wedderburn's Theorem). We then use this
classification to derive some results about finite groups such as Burnside's Theorem on
the solvability of groups of order $p^a q^b$ in Chapter 19.
CHAPTER 18


Representation Theory 
and Character Theory 


18.1 LINEAR ACTIONS AND MODULES OVER GROUP RINGS 

For the remainder of the book the groups we consider will be finite groups, unless
explicitly mentioned otherwise. Throughout this section $F$ is a field and $G$ is a finite
group. We first introduce the basic terminology. Recall that if $V$ is a vector space
over $F$, then $GL(V)$ is the group of nonsingular linear transformations from $V$ to itself
(under composition), and if $n \in \mathbb{Z}^+$, then $GL_n(F)$ is the group of invertible $n \times n$
matrices with entries from $F$ (under matrix multiplication).

Definition. Let $G$ be a finite group, let $F$ be a field and let $V$ be a vector space over $F$.

(1) A linear representation of $G$ is any homomorphism from $G$ into $GL(V)$. The
degree of the representation is the dimension of $V$.

(2) Let $n \in \mathbb{Z}^+$. A matrix representation of $G$ is any homomorphism from $G$ into
$GL_n(F)$.

(3) A linear or matrix representation is faithful if it is injective.

(4) The group ring of $G$ over $F$ is the set of all formal sums of the form

$$\sum_{g \in G} \alpha_g g, \qquad \alpha_g \in F$$

with componentwise addition and multiplication $(\alpha g)(\beta h) = (\alpha\beta)(gh)$ (where
$\alpha$ and $\beta$ are multiplied in $F$ and $gh$ is the product in $G$) extended to sums via
the distributive law (cf. Section 7.2).

Unless we are specifically discussing permutation representations the term “repre- 
sentation” will always mean “linear representation.” When we wish to emphasize the 
field F we shall say F -representation, or representation of G on V over F. 

Recall that if $V$ is a finite dimensional vector space of dimension $n$, then by fixing
a basis of $V$ we obtain an isomorphism $GL(V) \cong GL_n(F)$. In this way any linear
representation of $G$ on a finite dimensional vector space gives a matrix representation
and vice versa. For the most part our linear representations will be of finite degree and we
shall pass freely between linear representations and matrix representations (specifying a
basis when we wish to give an explicit correspondence between the two). Furthermore,
given a linear representation $\varphi : G \to GL(V)$ of finite degree, a corresponding matrix
representation provides numerical invariants (such as the determinant of $\varphi(g)$ for $g \in G$)
which are independent of the choice of basis giving the isomorphism between
$GL(V)$ and $GL_n(F)$. The exploitation of such invariants will be fundamental to our
development.

Before giving examples of representations we recall the group ring $FG$ in greater
detail (group rings were introduced in Section 7.2, and some notation and examples
were discussed in that section). Suppose the elements of $G$ are $g_1, g_2, \ldots, g_n$. Each
element of $FG$ is of the form

$$\sum_{i=1}^n \alpha_i g_i, \qquad \alpha_i \in F.$$

Two formal sums¹ are equal if and only if all corresponding coefficients of group
elements are equal. Addition and multiplication in $FG$ are defined as follows:

$$\sum_{i=1}^n \alpha_i g_i + \sum_{i=1}^n \beta_i g_i = \sum_{i=1}^n (\alpha_i + \beta_i) g_i$$

$$\Big(\sum_{i=1}^n \alpha_i g_i\Big)\Big(\sum_{i=1}^n \beta_i g_i\Big) = \sum_{k=1}^n \Big(\sum_{\substack{i,j \\ g_i g_j = g_k}} \alpha_i \beta_j\Big) g_k$$

where addition and multiplication of the coefficients $\alpha_i$ and $\beta_j$ is performed in $F$. Note
that by definition of multiplication,

FG is a commutative ring if and only if G is an abelian group.

The group $G$ appears in $FG$ (identifying $g_i$ with $1g_i$) and the field $F$ appears in
$FG$ (identifying $\beta$ with $\beta g_1$, where $g_1$ is the identity of $G$). Under these identifications

$$\beta\Big(\sum_{i=1}^n \alpha_i g_i\Big) = \sum_{i=1}^n (\beta\alpha_i) g_i, \quad \text{for all } \beta \in F.$$

In this way

FG is a vector space over F with the elements of G as a basis.

In particular, $FG$ is a vector space over $F$ of dimension equal to $|G|$. The elements of
$F$ commute with all elements of $FG$, i.e., $F$ is in the center of $FG$. When we wish to
emphasize the latter two properties we shall say that $FG$ is an $F$-algebra (in general, an
$F$-algebra is a ring $R$ which contains $F$ in its center, so $R$ is both a ring and an $F$-vector
space).

Note that the operations in $FG$ are similar to those in the $F$-algebra $F[x]$ (although
$F[x]$ is infinite dimensional over $F$). In some works $FG$ is denoted by $F[G]$, although
the latter notation is currently less prevalent.

¹ The formal sum displayed above is a way of writing the function from $G$ to $F$ which takes the
value $\alpha_i$ on the group element $g_i$. This same "formality" was used in the construction of free modules
(see Theorem 6 in Section 10.3).
Examples

(1) If $G = \langle g \rangle$ is cyclic of order $n \in \mathbb{Z}^+$, then the elements of $FG$ are of the form

$$\sum_{i=0}^{n-1} \alpha_i g^i.$$

The map $F[x] \to F\langle g \rangle$ which sends $x^k$ to $g^k$ for all $k \ge 0$ extends by $F$-linearity to
a surjective ring homomorphism with kernel equal to the ideal generated by $x^n - 1$.
Thus

$$F\langle g \rangle \cong F[x]/(x^n - 1).$$

This is an isomorphism of $F$-algebras, i.e., is a ring isomorphism which is $F$-linear.

(2) Under the notation of the preceding example let $r = 1 + g + g^2 + \cdots + g^{n-1}$, so $r$
is a nonzero element of $F\langle g \rangle$. Note that $rg = g + g^2 + \cdots + g^{n-1} + 1 = r$, hence
$r(1 - g) = 0$. Thus the ring $F\langle g \rangle$ contains zero divisors (provided $n > 1$). More
generally, if $G$ is any group of order $> 1$, then for any nonidentity element $g \in G$,
$F\langle g \rangle$ is a subring of $FG$, so $FG$ also contains zero divisors.

(3) Let $G = S_3$ and $F = \mathbb{Q}$. The elements $r = 5(1\ 2) - 7(1\ 2\ 3)$ and $s = -4(1\ 2\ 3) + 12(1\ 3\ 2)$
are typical members of $\mathbb{Q}S_3$. Their sum and product are seen to be

$$r + s = 5(1\ 2) - 11(1\ 2\ 3) + 12(1\ 3\ 2)$$
$$rs = -20(2\ 3) + 28(1\ 3\ 2) + 60(1\ 3) - 84$$

(recall that products (compositions) of permutations are computed from right to left).
An explicit example of a sum and product of two elements in the group ring $\mathbb{Q}D_8$
appears in Section 7.2.
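
The computations in Examples (2) and (3) are easy to reproduce mechanically. In the Python sketch below, which is an illustration only, a group ring element is stored as a dictionary from group elements to coefficients, permutations of $\{1, 2, 3\}$ are stored as tuples of images, and the cyclic group of order $n$ is modelled by exponents mod $n$; these encodings and all names are assumptions made for the example.

```python
from collections import defaultdict

def ring_add(r, s):
    """Add two group ring elements stored as {group element: coefficient}."""
    out = defaultdict(int)
    for elem in (r, s):
        for g, coeff in elem.items():
            out[g] += coeff
    return {g: c for g, c in out.items() if c != 0}

def ring_mul(r, s, op):
    """Multiply group ring elements; op(g, h) is the product gh in the group."""
    out = defaultdict(int)
    for g, alpha in r.items():
        for h, beta in s.items():
            out[op(g, h)] += alpha * beta
    return {g: c for g, c in out.items() if c != 0}

def compose(p, q):
    """Return the permutation 'q first, then p' (right-to-left product pq)."""
    return tuple(p[q[i] - 1] for i in range(len(q)))

# Elements of S_3 as tuples (image of 1, image of 2, image of 3).
E = (1, 2, 3)
T12, T13, T23 = (2, 1, 3), (3, 2, 1), (1, 3, 2)
C123, C132 = (2, 3, 1), (3, 1, 2)

# Example (3): r = 5(1 2) - 7(1 2 3) and s = -4(1 2 3) + 12(1 3 2).
r = {T12: 5, C123: -7}
s = {C123: -4, C132: 12}
r_plus_s = ring_add(r, s)
r_times_s = ring_mul(r, s, compose)

# Example (2) with n = 6: the element 1 + g + ... + g^(n-1) times (1 - g).
n = 6
def cyclic_op(i, j):
    """Product g^i * g^j in the cyclic group of order n, as an exponent."""
    return (i + j) % n

norm_element = {k: 1 for k in range(n)}
one_minus_g = {0: 1, 1: -1}
zero = ring_mul(norm_element, one_minus_g, cyclic_op)

print(r_plus_s, r_times_s, zero)
```

The empty dictionary returned for the last product represents the zero element of the group ring.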


Before giving specific examples of representations we discuss the correspondence 
between representations of G and FG-modules (after which we can simultaneously give 
examples of both). This discussion closely parallels the treatment of F[jc]-modules in 
Section 10.1. 

Suppose first that tp : G GL(V) is a representation of G on the vector space V 
over F. As above, write G = {gi, . . . , g„ }, so for each i e { 1 , . . . , n), <p(gi) is a linear 
transformation from V to itself. Make V into an FG-module by defining the action of 
a ring element on an element of V as follows: 

( n v n n 

) - v = J2 aiV(gt)(v), for all e FG, v € V. 

1=1 ' i = 1 ;=i 

We verify a special case of axiom 2(b) of a module (see Section 10.1) which shows
precisely where the fact that φ is a group homomorphism is needed:

\begin{align*}
(g_i g_j) \cdot v &= \varphi(g_i g_j)(v) && \text{(by definition of the action)} \\
&= (\varphi(g_i) \circ \varphi(g_j))(v) && \text{(since $\varphi$ is a group homomorphism)} \\
&= \varphi(g_i)(\varphi(g_j)(v)) && \text{(by definition of a composition of linear transformations)} \\
&= g_i \cdot (g_j \cdot v) && \text{(by definition of the action).}
\end{align*}

This argument extends by linearity to arbitrary elements of FG to prove that axiom 2(b)
of a module holds in general. It is an exercise to check that the remaining module axioms
hold.

Note that F is a subring of FG and the action of the field element α on a vector is
the same as the action of the ring element α1 on a vector, i.e., the FG-module action
extends the F action on V.
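
As an illustration of this construction, the following Python sketch makes ℚ^3 into a ℚS_3-module via permutation matrices and checks axiom 2(b) for two ring elements, not just for group elements. It is a sketch under stated assumptions: the representation phi (permutation matrices acting by e_j ↦ e_{p(j)}), the sample elements r, s and the vector v are choices made here for the example, and all function names are hypothetical.

```python
from fractions import Fraction

def compose(p, q):
    """Right-to-left product of permutations in one-line notation."""
    return tuple(p[q[x] - 1] for x in range(len(q)))

def phi(p):
    """Permutation matrix of p: column j has its 1 in row p(j), so e_j -> e_{p(j)}."""
    n = len(p)
    return [[1 if p[j] == i + 1 else 0 for j in range(n)] for i in range(n)]

def apply(m, v):
    """Matrix times column vector."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]

def act(r, v):
    """Module action: (sum_g a_g g) . v = sum_g a_g phi(g)(v)."""
    out = [Fraction(0)] * len(v)
    for g, a in r.items():
        image = apply(phi(g), v)
        out = [x + a * y for x, y in zip(out, image)]
    return out

def ring_mul(r, s):
    """Product of two group ring elements stored as {permutation: coefficient}."""
    out = {}
    for g, a in r.items():
        for h, b in s.items():
            gh = compose(g, h)
            out[gh] = out.get(gh, 0) + a * b
    return out

r = {(2, 1, 3): Fraction(5), (2, 3, 1): Fraction(-7)}     # 5(1 2) - 7(1 2 3)
s = {(2, 3, 1): Fraction(-4), (3, 1, 2): Fraction(12)}    # -4(1 2 3) + 12(1 3 2)
v = [Fraction(1), Fraction(2), Fraction(3)]

lhs = act(ring_mul(r, s), v)                      # (rs) . v
rhs = act(r, act(s, v))                           # r . (s . v)
scalar = act({(1, 2, 3): Fraction(7, 2)}, v)      # the ring element (7/2) * 1 acting on v
```

The last line illustrates the remark above: the ring element α1 acts exactly as the scalar α.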

Suppose now that conversely we are given an FG-module V. We obtain an associated
vector space over F and representation of G as follows. Since V is an FG-module,
it is an F-module, i.e., it is a vector space over F. Also, for each g ∈ G we obtain a
map from V to V, denoted by φ(g), defined by

\[ \varphi(g)(v) = g \cdot v \qquad \text{for all } v \in V, \]

where g · v is the given action of the ring element g on the element v of V. Since the
elements of F commute with each g ∈ G it follows by the axioms for a module that for
all v, w ∈ V and all α, β ∈ F we have

\begin{align*}
\varphi(g)(\alpha v + \beta w) &= g \cdot (\alpha v + \beta w) \\
&= g \cdot (\alpha v) + g \cdot (\beta w) \\
&= \alpha (g \cdot v) + \beta (g \cdot w) \\
&= \alpha \varphi(g)(v) + \beta \varphi(g)(w),
\end{align*}

that is, for each g ∈ G, φ(g) is a linear transformation. Furthermore, it follows by
axiom 2(b) of a module that

\[ \varphi(g_i g_j)(v) = (\varphi(g_i) \circ \varphi(g_j))(v) \]

(this is essentially the calculation above with the steps reversed). This proves that φ is
a group homomorphism (in particular, φ(g^{-1}) = φ(g)^{-1}, so every element of G maps
to a nonsingular linear transformation, i.e., φ : G → GL(V)).

This discussion shows there is a bijection between FG-modules and pairs (V, φ):

\[ \{\, V \text{ an } FG\text{-module} \,\} \;\longleftrightarrow\; \{\, V \text{ a vector space over } F \text{ and } \varphi : G \to GL(V) \text{ a representation} \,\}. \]

Giving a representation φ : G → GL(V) on a vector space V over F is therefore
equivalent to giving an FG-module V. Under this correspondence we shall say that
the module V affords the representation φ of G.


Recall from Section 10.1 that if a vector space M is made into an F[x]-module
via the linear transformation T, then the F[x]-submodules of M are precisely the
T-stable subspaces of M. In the current situation if V is an FG-module affording the
representation φ, then a subspace U of V is called G-invariant or G-stable if g · u ∈ U
for all g ∈ G and all u ∈ U (i.e., if φ(g)(u) ∈ U for all g ∈ G and all u ∈ U). It
follows easily that


the FG-submodules of V are precisely the G-stable subspaces of V.

Examples

(1) Let V be a 1-dimensional vector space over F and make V into an FG-module by
letting gv = v for all g ∈ G and v ∈ V. This module affords the representation
φ : G → GL(V) defined by φ(g) = I = the identity linear transformation, for all
g ∈ G. The corresponding matrix representation (with respect to any basis of V) is
the homomorphism of G into GL_1(F) which sends every group element to the 1 × 1
identity matrix. We shall henceforth refer to this as the trivial representation of G.
The trivial representation has degree 1 and if |G| > 1, it is not faithful.

(2) Let V = FG and consider this ring as a left module over itself. Then V affords a
representation of G of degree equal to |G|. If we take the elements of G as a basis of
V, then each g ∈ G permutes these basis elements under the left regular permutation
representation:

\[ g \cdot g_i = g g_i. \]

With respect to this basis of V the matrix of the group element g has a 1 in row i
and column j if g g_j = g_i, and has 0's in all other positions. This (linear or matrix)
representation is called the regular representation of G. Note that each nonidentity
element of G induces a nonidentity permutation on the basis of V so the regular
representation is always faithful.

(3) Let n ∈ ℤ^+, let G = S_n and let V be an n-dimensional vector space over F with basis
e_1, e_2, ..., e_n. Let S_n act on V by defining for each σ ∈ S_n

\[ \sigma \cdot e_i = e_{\sigma(i)}, \qquad 1 \le i \le n, \]

i.e., σ acts by permuting the subscripts of the basis elements. This provides an (injective)
homomorphism of S_n into GL(V) (i.e., a faithful representation of S_n of degree
n), hence makes V into an FS_n-module. As in the preceding example, the matrix of
σ with respect to the basis e_1, ..., e_n has a 1 in row i and column j if σ · e_j = e_i (and
has 0 in all other entries). Thus σ has a 1 in row i and column j if σ(j) = i.

For an example of the ring action, consider the action of FS_3 on the 3-dimensional
vector space over F with basis e_1, e_2, e_3. Let σ be the transposition (1 2), let τ be the
3-cycle (1 2 3) and let r = 2σ − 3τ ∈ FS_3. Then

\begin{align*}
r \cdot (\alpha e_1 + \beta e_2 + \gamma e_3) &= 2(\alpha e_{\sigma(1)} + \beta e_{\sigma(2)} + \gamma e_{\sigma(3)}) - 3(\alpha e_{\tau(1)} + \beta e_{\tau(2)} + \gamma e_{\tau(3)}) \\
&= 2(\alpha e_2 + \beta e_1 + \gamma e_3) - 3(\alpha e_2 + \beta e_3 + \gamma e_1) \\
&= (2\beta - 3\gamma) e_1 - \alpha e_2 + (2\gamma - 3\beta) e_3.
\end{align*}

(4) If ψ : H → GL(V) is any representation of H and φ : G → H is any group
homomorphism, then the composition ψ ∘ φ is a representation of G. For example,
let V be the FS_n-module of dimension n described in the preceding example. If
π : G → S_n is any permutation representation of G, the composition of π with the
representation above gives a linear representation of G. In other words, V becomes
an FG-module under the action

\[ g \cdot e_i = e_{\pi(g)(i)}, \qquad \text{for all } g \in G. \]

Note that the regular representation, (2), is just the special case of this where n = |G|
and π is the left regular permutation representation of G.

(5) Any homomorphism of G into the multiplicative group F^× = GL_1(F) is a degree
1 (matrix) representation. For example, suppose G = ⟨g⟩ ≅ Z_n is the cyclic group
of order n and ζ is a fixed nth root of 1 in F. Let g^i ↦ ζ^i, for all i ∈ ℤ. This
representation of ⟨g⟩ is a faithful representation if and only if ζ is a primitive nth root
of 1.

(6) In many situations it is easier to specify an explicit matrix representation of a group
G rather than to exhibit an FG-module. For example, recall that the dihedral group
D_{2n} has the presentation

\[ D_{2n} = \langle\, r, s \mid r^n = s^2 = 1,\ rs = sr^{-1} \,\rangle. \]

If R and S are any matrices satisfying the relations R^n = S^2 = I and RS = SR^{-1}
then the map r ↦ R and s ↦ S extends uniquely to a homomorphism from D_{2n} to the
matrix group generated by R and S, hence gives a representation of D_{2n}. An explicit
example of matrices R, S ∈ M_2(ℝ) may be obtained as follows. If a regular n-gon is
drawn on the x, y plane centered at the origin with the line y = x as one of its lines
of symmetry then the matrix R that rotates the plane through 2π/n radians and the
matrix S that reflects the plane about the line y = x both send this n-gon onto itself.
It follows that these matrices act as symmetries of the n-gon and so satisfy the above
relations. These matrices are readily computed (cf. Exercise 25, Section 1.6) and so
the maps

\[ r \mapsto \begin{pmatrix} \cos 2\pi/n & -\sin 2\pi/n \\ \sin 2\pi/n & \cos 2\pi/n \end{pmatrix}, \qquad s \mapsto \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \]

extend uniquely to a (degree 2) representation of D_{2n} into GL_2(ℝ). Since the matrices
R and S have orders n and 2 respectively, it follows that they generate a subgroup of
GL_2(ℝ) of order 2n and hence this representation is faithful.

(7) By using the usual generators and relations for the quaternion group

\[ Q_8 = \langle\, i, j \mid i^4 = j^4 = 1,\ i^2 = j^2,\ i^{-1} j i = j^{-1} \,\rangle \]

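one may similarly obtain (cf. Exercise 26, Section 1.6) a representation φ from Q_8 to
GL_2(ℂ) defined by

\[ \varphi(i) = \begin{pmatrix} \sqrt{-1} & 0 \\ 0 & -\sqrt{-1} \end{pmatrix} \qquad \text{and} \qquad \varphi(j) = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}. \]

This representation of Q_8 is faithful.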

(8) A 4-dimensional representation of the quaternion group Q_8 may be obtained from
the real Hamilton quaternions, ℍ (cf. Section 7.1). The group Q_8 is a subgroup of
the multiplicative group of units of ℍ and each of the elements of Q_8 acts by left
multiplication on the 4-dimensional real vector space ℍ. Since the real numbers are in
the center of ℍ (i.e., since ℍ is an ℝ-algebra), left multiplication is ℝ-linear. This linear
action thus gives a homomorphism from Q_8 into GL_4(ℝ). One can easily write out
the explicit matrices of each of the elements of Q_8 with respect to the basis 1, i, j, k of
ℍ. For example, left multiplication by i acts by 1 ↦ i, i ↦ −1, j ↦ k and k ↦ −j
and left multiplication by j acts by 1 ↦ j, i ↦ −k, j ↦ −1 and k ↦ i so

\[ i \mapsto \begin{pmatrix} 0 & -1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 0 \end{pmatrix} \qquad \text{and} \qquad j \mapsto \begin{pmatrix} 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}. \]

This representation of Q_8 is also faithful.

(9) Suppose that H is a normal subgroup of the group G and suppose that H is an
elementary abelian p-group for some prime p. Then V = H is a vector space over 𝔽_p,
where the scalar a acts on the vector v by av = v^a (see Section 10.1). The action
of each element of G by conjugation on V is 𝔽_p-linear because gv^a g^{-1} = (gvg^{-1})^a
and this action of G on V makes V into an 𝔽_pG-module (the automorphisms of
elementary abelian p-groups were discussed in Sections 4.4 and 10.1). The kernel of
this representation is the set of elements of G that commute with every element of
H, C_G(H) (which always contains the abelian group H itself). Thus the action of a
group on subsets of itself often affords linear representations over finite fields.
Representations of groups over finite fields are called modular representations and these
are fundamental to the study of the internal structure of groups.

(10) For an example of an FG-submodule, let G = S_n and let V be the FS_n-module
described in Example 3. Let N be the subspace of V consisting of vectors all of whose
coordinates are equal, i.e.,

\[ N = \{\, \alpha_1 e_1 + \alpha_2 e_2 + \cdots + \alpha_n e_n \mid \alpha_1 = \alpha_2 = \cdots = \alpha_n \,\} \]

(this is a 1-dimensional S_n-stable subspace). Each σ ∈ S_n fixes each vector in N so the
submodule N affords the trivial representation of S_n. As an exercise, one may show
that if n ≥ 3 then N is the unique 1-dimensional subspace of V which is S_n-stable,
i.e., N is the unique 1-dimensional FS_n-submodule (N is called the trace submodule
of FS_n).

Another FS_n-submodule of V is the subspace I of all vectors whose coordinates
sum to zero:

\[ I = \{\, \alpha_1 e_1 + \alpha_2 e_2 + \cdots + \alpha_n e_n \mid \alpha_1 + \alpha_2 + \cdots + \alpha_n = 0 \,\}. \]

Again I is an S_n-stable subspace (since each σ ∈ S_n permutes the coordinates of each
vector in V, each σ leaves the sum of the coefficients unchanged). Since I is the
kernel of the linear transformation from V onto F which sends a vector to the sum
of its coefficients (called the augmentation map; cf. Section 7.3), I has dimension
n − 1.

(11) If V = FG is the regular representation of G described in Example 2 above, then V
has FG-submodules of dimensions 1 and |G| − 1 as in the preceding example:

\begin{align*}
N &= \{\, \alpha_1 g_1 + \alpha_2 g_2 + \cdots + \alpha_n g_n \mid \alpha_1 = \alpha_2 = \cdots = \alpha_n \,\} \\
I &= \{\, \alpha_1 g_1 + \alpha_2 g_2 + \cdots + \alpha_n g_n \mid \alpha_1 + \alpha_2 + \cdots + \alpha_n = 0 \,\}.
\end{align*}

In fact N and I are 2-sided ideals of FG (not just left ideals; note that N is in the
center of FG). The ideal I is called the augmentation ideal of FG and N is called the
trace ideal of FG.
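
The matrix representations in Examples 6 and 8 can be sanity-checked numerically. The following Python sketch is an illustration only: it verifies the defining relations of D_{2n} for the rotation and reflection matrices (in floating point, with a tolerance chosen here), and the relations of Q_8 for the two 4 × 4 integer matrices of Example 8. It also counts the matrices generated by the latter pair. All function and variable names are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two matrices given as nested sequences."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def matpow(a, e):
    out = identity(len(a))
    for _ in range(e):
        out = matmul(out, a)
    return out

def close(a, b, tol=1e-9):
    """Entrywise comparison up to a floating point tolerance."""
    return all(abs(x - y) <= tol for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))

def dihedral_relations_hold(n):
    """Check R^n = S^2 = I and RS = SR^{-1} for the matrices of Example 6."""
    t = 2 * math.pi / n
    R = [[math.cos(t), -math.sin(t)], [math.sin(t), math.cos(t)]]
    R_inv = [[math.cos(t), math.sin(t)], [-math.sin(t), math.cos(t)]]  # rotation by -t
    S = [[0, 1], [1, 0]]
    I2 = identity(2)
    return (close(matpow(R, n), I2) and close(matmul(S, S), I2)
            and close(matmul(R, S), matmul(S, R_inv)))

# Left multiplication by i and by j on the basis 1, i, j, k (Example 8).
QI = [[0, -1, 0, 0], [1, 0, 0, 0], [0, 0, 0, -1], [0, 0, 1, 0]]
QJ = [[0, 0, -1, 0], [0, 0, 0, 1], [1, 0, 0, 0], [0, -1, 0, 0]]

def generated_group(gens):
    """Closure of the identity under right multiplication by the generators."""
    def freeze(m):
        return tuple(tuple(row) for row in m)
    seen = {freeze(identity(len(gens[0])))}
    frontier = list(seen)
    while frontier:
        m = frontier.pop()
        for g in gens:
            prod = freeze(matmul(m, g))
            if prod not in seen:
                seen.add(prod)
                frontier.append(prod)
    return seen

# i^4 = j^4 = 1, i^2 = j^2, i^{-1} j i = j^{-1} (using i^{-1} = i^3 and j^{-1} = j^3).
quaternion_ok = (matpow(QI, 4) == identity(4) and matpow(QJ, 4) == identity(4)
                 and matpow(QI, 2) == matpow(QJ, 2)
                 and matmul(matmul(matpow(QI, 3), QJ), QI) == matpow(QJ, 3))
image_order = len(generated_group([QI, QJ]))
```

Since the two integer matrices generate exactly 8 distinct matrices, the image has the same order as Q_8, which is consistent with the faithfulness asserted in Example 8.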

Recall that in the study of a linear transformation T of a vector space V to itself we
made V into an F[x]-module (where x acted as T on V); our goal was to decompose V
into a direct sum of cyclic submodules. In this way we were able to find a basis of V for
which the matrix of T with respect to this basis was in some canonical form. Changing
the basis of V did not change the module V but changed the matrix representation of
T by similarity (i.e., changed the isomorphism between GL(V) and GL_n(F)). We
introduce the analogous terminology to describe when two FG-modules are the same
up to a change of basis.

Definition. Two representations of G are equivalent (or similar) if the FG-modules
affording them are isomorphic modules. Representations which are not equivalent are
called inequivalent.

Suppose φ : G → GL(V) and ψ : G → GL(W) are equivalent representations
(here V and W must be vector spaces over the same field F). Let T : V → W be
an FG-module isomorphism between them. Since T is, in particular, an F-module
isomorphism, T is a vector space isomorphism, so V and W must have the same
dimension. Furthermore, for all g ∈ G, v ∈ V we have T(g · v) = g · (T(v)), since
T is an isomorphism of FG-modules. By definition of the action of ring elements this
means T(φ(g)v) = ψ(g)(T(v)), that is

\[ T \circ \varphi(g) = \psi(g) \circ T \qquad \text{for all } g \in G. \]

In particular, if we identify V and W as vector spaces, then two representations φ and
ψ of G on a vector space V are equivalent if and only if there is some T ∈ GL(V) such
that T ∘ φ(g) ∘ T^{-1} = ψ(g) for all g ∈ G. This T is a simultaneous change of basis
for all φ(g), g ∈ G.

In matrix terminology, two representations φ and ψ are equivalent if there is a fixed
invertible matrix P such that

\[ P \varphi(g) P^{-1} = \psi(g) \qquad \text{for all } g \in G. \]

The linear transformation T or the matrix P above is said to intertwine the
representations φ and ψ (it gives the "rule" for changing φ into ψ).
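
A small concrete instance may help fix the idea of intertwining. The following Python sketch takes two degree 2 representations of the cyclic group of order 2 over ℚ and checks that a particular invertible matrix P satisfies Pφ(g) = ψ(g)P for all g, which is the condition Pφ(g)P^{-1} = ψ(g) written without inverses. The two representations, the matrix P and the helper names are assumptions chosen for this illustration; they do not come from the text.

```python
def matmul(a, b):
    """Product of two matrices given as nested lists."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def intertwines(P, phi, psi):
    """True when P phi(g) = psi(g) P for every group element g.

    phi and psi are dictionaries with the same keys (the group elements).
    """
    return all(matmul(P, phi[g]) == matmul(psi[g], P) for g in phi)

# Two representations of the group {1, g} with g^2 = 1.
phi = {"1": [[1, 0], [0, 1]], "g": [[0, 1], [1, 0]]}     # g swaps the two basis vectors
psi = {"1": [[1, 0], [0, 1]], "g": [[1, 0], [0, -1]]}    # g acts diagonally
P = [[1, 1], [1, -1]]                                    # invertible over Q (det = -2)

equivalent = det2(P) != 0 and intertwines(P, phi, psi)

# The same P does not intertwine phi with two copies of the trivial representation.
trivial = {"1": [[1, 0], [0, 1]], "g": [[1, 0], [0, 1]]}
not_via_P = intertwines(P, phi, trivial)
```

Note that the check against the trivial representation only shows that this particular P fails; inequivalence in general requires an argument that no invertible P works.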

In order to study the decomposition of an FG-module into (direct sums of)
submodules we shall need some terminology. We state these definitions for arbitrary rings
since we shall be discussing direct sum decompositions in greater generality in the next
section.

Definition. Let R be a ring and let M be a nonzero R-module.

(1) The module M is said to be irreducible (or simple) if its only submodules are 0
and M; otherwise M is called reducible.

(2) The module M is said to be indecomposable if M cannot be written as M_1 ⊕ M_2
for any nonzero submodules M_1 and M_2; otherwise M is called decomposable.

(3) The module M is said to be completely reducible if it is a direct sum of irreducible
submodules.

(4) A representation is called irreducible, reducible, indecomposable, decomposable
or completely reducible according to whether the FG-module affording it
has the corresponding property.

(5) If M is a completely reducible R-module, any direct summand of M is called
a constituent of M (i.e., N is a constituent of M if there is a submodule N′ of
M such that M = N ⊕ N′).

An irreducible module is, by definition, both indecomposable and completely
reducible. We shall shortly give examples of indecomposable modules that are not
irreducible.

If R = FG, an irreducible FG-module V is a nonzero F-vector space with no
nontrivial, proper G-invariant subspaces. For example, if dim_F V = 1 then V is necessarily
irreducible (its only subspaces are 0 and V).

Suppose V is a finite dimensional FG-module and V is reducible. Let U be a
proper, nonzero G-invariant subspace. Form a basis of V by taking a basis of U and
enlarging it to a basis of V. Then for each g ∈ G the matrix, φ(g), of g acting on V with
respect to this basis is of the form

\[ \varphi(g) = \begin{pmatrix} \varphi_1(g) & \psi(g) \\ 0 & \varphi_2(g) \end{pmatrix} \]

where φ_1 = φ|_U (with respect to the chosen basis of U) and φ_2 is the representation
of G on V/U (and ψ is not necessarily a homomorphism; ψ(g) need not be a
square matrix). So reducible representations are those with a corresponding matrix
representation whose matrices are in block upper triangular form.

Assume further that the FG-module V is decomposable, V = U ⊕ U′. Take for
a basis of V the union of a basis of U and a basis of U′. With this choice of basis the
matrix for each g ∈ G is of the form

\[ \varphi(g) = \begin{pmatrix} \varphi_1(g) & 0 \\ 0 & \varphi_2(g) \end{pmatrix} \]

(i.e., ψ(g) = 0 for all g ∈ G). Thus decomposable representations are those with a
corresponding matrix representation whose matrices are in block diagonal form.

Examples 

(1) As noted above, all degree 1 representations are irreducible, indecomposable and 
completely reducible. In particular, this applies to the trivial representation and to the 
representations described in Example 5 above. 

(2) If |G| > 1, the regular representation of G is reducible (the augmentation ideal and the
trace ideal are proper nonzero submodules). We shall later determine the conditions 
under which this representation is completely reducible and how it decomposes into 
a direct sum. 

(3) For n > 1 the FS_n-module described in Example 10 above is reducible since N and I
are proper, nonzero submodules. The module N is irreducible (being 1-dimensional)
and if the characteristic of the field F does not divide n, then I is also irreducible.

(4) The degree 2 representation of the dihedral group D_{2n} = G described in Example
6 above is irreducible for n ≥ 3. There are no G-invariant 1-dimensional subspaces
since a rotation by 2π/n radians sends no line in ℝ^2 to itself. Similarly, the degree 2
complex representation of Q_8 described in Example 7 is irreducible since the given
matrix φ(i) has exactly two 1-dimensional eigenspaces (corresponding to its distinct
eigenvalues ±√−1) and these are not invariant under the matrix φ(j). The degree 4
representation φ : Q_8 → GL_4(ℝ) described in Example 8 can also be shown to be
irreducible (see the exercises). We shall see, however, that if we view φ as a complex
representation φ : Q_8 → GL_4(ℂ) (just by considering the real entries of the matrices
to be complex entries) then there is a complex matrix P such that P^{-1}φ(g)P is a direct
sum of 2 × 2 block matrices for all g ∈ Q_8. Thus an irreducible representation over a
field F may become reducible when the field is extended.

(5) Let G = ⟨g⟩ be cyclic of order n and assume F contains all the nth roots of 1. As
noted in Example 1 in the set of examples of group algebras, F⟨g⟩ ≅ F[x]/(x^n − 1).
Thus the FG-modules are precisely the F[x]-modules annihilated by x^n − 1. The
latter (finite dimensional) modules are described, up to equivalence, by the Jordan
Canonical Form Theorem.

If the minimal polynomial of g acting on an F⟨g⟩-module V has distinct roots in
F, there is a basis of V such that g (hence all its powers) is represented by a diagonal
matrix (cf. Corollary 25, Section 12.3). In this case, V is a completely reducible
F⟨g⟩-module (being a direct sum of 1-dimensional ⟨g⟩-invariant subspaces). In general,
the minimal polynomial of g acting on V divides x^n − 1 so if x^n − 1 has distinct roots
in F, then V is a completely reducible F⟨g⟩-module. The polynomial x^n − 1 has
distinct roots in F if and only if the characteristic of F does not divide n. This gives
a sufficient condition for every F⟨g⟩-module to be completely reducible.

If the minimal polynomial of g acting on V does not have distinct roots (so
the characteristic of F does divide n), the Jordan canonical form of g must have an
elementary Jordan block of size > 1. Since every linear transformation has a unique
Jordan canonical form, g cannot be represented by a diagonal matrix, i.e., V is not
completely reducible. It follows from results on cyclic modules in Section 12.3 that
the (1-dimensional) eigenspace of g in any Jordan block of size > 1 admits no
⟨g⟩-invariant complement, i.e., V is reducible but not completely reducible.

Specifically, let p be a prime, let F = 𝔽_p and let g be of order p. Let V be the
2-dimensional space over 𝔽_p with basis v, w and define an action of g on V by

\[ g \cdot v = v \qquad \text{and} \qquad g \cdot w = v + w. \]

This endomorphism of V does have order p (in GL(V)) and the matrix of g with
respect to this basis is the elementary Jordan block

\[ \varphi(g) = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}. \]

Now V is reducible (span{v} is a ⟨g⟩-invariant subspace) but V is indecomposable
(the above 2 × 2 elementary Jordan matrix is not similar to a diagonal matrix).
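
For small primes the last claim can be confirmed by brute force. The following Python sketch is an illustration only: it writes a vector av + bw as the pair (a, b) with entries mod p, computes the order of g, and lists every 1-dimensional ⟨g⟩-invariant subspace. A decomposition of V into two proper submodules would need two distinct invariant lines, so finding only one confirms indecomposability. The function names are hypothetical.

```python
def jordan_action(x, p):
    """Image of x = (a, b) = a*v + b*w under g, where g.v = v and g.w = v + w."""
    a, b = x
    return ((a + b) % p, b % p)

def order_of_g(p):
    """Smallest k >= 1 with g^k = identity, found by tracking the images of v and w."""
    start = [(1, 0), (0, 1)]
    images = list(start)
    k = 0
    while True:
        images = [jordan_action(x, p) for x in images]
        k += 1
        if images == start:
            return k

def invariant_lines(p):
    """All 1-dimensional <g>-invariant subspaces, each given by a spanning vector.

    Every line in F_p^2 is spanned by exactly one of (1, t) for t in F_p, or (0, 1).
    """
    representatives = [(1, t) for t in range(p)] + [(0, 1)]
    stable = []
    for a, b in representatives:
        c, d = jordan_action((a, b), p)
        if (a * d - b * c) % p == 0:      # g.x is a scalar multiple of x
            stable.append((a, b))
    return stable
```

For each prime tested, order_of_g(p) equals p and the only invariant line is the one spanned by v, i.e., the pair (1, 0).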

The first fundamental result in the representation theory of finite groups shows how 
Example 5 generalizes to noncyclic groups. 

Theorem 1. (Maschke's Theorem) Let G be a finite group and let F be a field whose
characteristic does not divide |G|. If V is any FG-module and U is any submodule of
V, then V has a submodule W such that V = U ⊕ W (i.e., every submodule is a direct
summand).

Remark : The hypothesis of Maschke’s Theorem applies to any finite group when F has 
characteristic 0. 

Proof: The idea of the proof of Maschke's Theorem is to produce an FG-module
homomorphism

\[ \pi : V \to U \]

which is a projection onto U, i.e., which satisfies the following two properties:

(i) π(u) = u for all u ∈ U

(ii) π(π(v)) = π(v) for all v ∈ V (i.e., π^2 = π)

(in fact (ii) is implied by (i) and the fact that π(V) ⊆ U).

Suppose first that we can produce such an FG-module homomorphism and let
W = ker π. Since π is a module homomorphism, W is a submodule. We see that W is
a direct sum complement to U as follows. If v ∈ U ∩ W then by (i), v = π(v) whereas
by definition of W, π(v) = 0. This shows U ∩ W = 0. To show V = U + W let v be
an arbitrary element of V and write v = π(v) + (v − π(v)). By definition, π(v) ∈ U.
By property (ii) of π,

\[ \pi(v - \pi(v)) = \pi(v) - \pi(\pi(v)) = \pi(v) - \pi(v) = 0, \]

i.e., v − π(v) ∈ W. This shows V = U + W and hence V = U ⊕ W. To establish
Maschke's Theorem it therefore suffices to find such an FG-module projection π.

Since U is a subspace it has a vector space direct sum complement W_0 in V (take
a basis B_1 of U, build it up to a basis B of V and let W_0 be the span of B − B_1). Thus
V = U ⊕ W_0 as vector spaces but W_0 need not be G-stable (i.e., need not be an
FG-submodule). Let π_0 : V → U be the vector space projection of V onto U associated
to this direct sum decomposition, i.e., π_0 is defined by

\[ \pi_0(u + w) = u \qquad \text{for all } u \in U,\ w \in W_0. \]

The key idea of the proof is to "average" π_0 over G to form an FG-module projection
π. For each g ∈ G define

\[ g \pi_0 g^{-1} : V \to U \qquad \text{by} \qquad g \pi_0 g^{-1}(v) = g \cdot \pi_0(g^{-1} \cdot v), \quad \text{for all } v \in V \]

(here · denotes the action of elements of the ring FG). Since π_0 maps V into U and U is
stable under the action of g we have that gπ_0g^{-1} maps V into U. Both g and g^{-1} act as
F-linear transformations, so gπ_0g^{-1} is a linear transformation. Furthermore, if u is in
the G-stable space U then so is g^{-1}u, and by definition of π_0 we have π_0(g^{-1}u) = g^{-1}u.
From this we obtain that for all g ∈ G,

\[ g \pi_0 g^{-1}(u) = u \qquad \text{for all } u \in U \]

(i.e., gπ_0g^{-1} is also a vector space projection of V onto U).

Let n = |G| and view n as an element of F (n = 1 + ··· + 1, n times). By
hypothesis n is not zero in F and so has an inverse in F. Define

\[ \pi = \frac{1}{n} \sum_{g \in G} g \pi_0 g^{-1}. \]

Since π is a scalar multiple of a sum of linear transformations from V to U, it is also
a linear transformation from V to U. Furthermore, each term in the sum defining π
restricts to the identity map on the subspace U and so π|_U is 1/n times the sum of n
copies of the identity. These observations prove the following:

π : V → U is a linear transformation
π(u) = u for all u ∈ U
π^2(v) = π(v) for all v ∈ V.
It remains to show that π is an FG-module homomorphism (i.e., is FG-linear). It
suffices to prove that for all h ∈ G, π(hv) = hπ(v), for v ∈ V. In this case

\begin{align*}
\pi(hv) &= \frac{1}{n} \sum_{g \in G} g \pi_0(g^{-1} h v) \\
&= \frac{1}{n} \sum_{g \in G} h (h^{-1} g) \pi_0((g^{-1} h) v) \\
&= \frac{1}{n} \sum_{\substack{k = h^{-1} g \\ g \in G}} h (k \pi_0(k^{-1} v)) \\
&= h \pi(v)
\end{align*}

(as g runs over all elements of G, so does k = h^{-1}g and the module element h may
be brought outside the summation by the distributive law in modules). This establishes
the existence of the FG-module projection π and so completes the proof.
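
The averaging step in the proof can be carried out explicitly in a small case. The following Python sketch is an illustration under stated assumptions: G = S_3 acts on V = ℚ^3 by permutation matrices, U is the line spanned by e_1 + e_2 + e_3, and the vector space complement W_0 is taken to be the span of e_2 and e_3. These choices, and all names in the code, are made here for the example and are not taken from the text.

```python
from fractions import Fraction
from itertools import permutations

def perm_matrix(p):
    """Matrix of p acting by e_j -> e_{p(j)} (p in one-line notation)."""
    n = len(p)
    return [[Fraction(1 if p[j] == i + 1 else 0) for j in range(n)] for i in range(n)]

def inverse(p):
    """Inverse permutation in one-line notation."""
    inv = [0] * len(p)
    for x, y in enumerate(p):
        inv[y - 1] = x + 1
    return tuple(inv)

def matmul(a, b):
    return [[sum(a[i][t] * b[t][j] for t in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

G = list(permutations((1, 2, 3)))     # the six elements of S_3
n = len(G)

# pi_0 projects onto U along W_0. Writing v = v1 (e1 + e2 + e3) + (element of W_0)
# shows pi_0(v) = v1 (e1 + e2 + e3), so every row of its matrix is (1, 0, 0).
pi0 = [[Fraction(1), Fraction(0), Fraction(0)] for _ in range(3)]

# pi = (1/n) * sum over g in G of g pi_0 g^{-1}.
pi = [[Fraction(0)] * 3 for _ in range(3)]
for g in G:
    term = matmul(matmul(perm_matrix(g), pi0), perm_matrix(inverse(g)))
    pi = [[pi[i][j] + term[i][j] / n for j in range(3)] for i in range(3)]

def commutes_with_G(m):
    """True when m is FG-linear, i.e., commutes with the action of every h in G."""
    return all(matmul(m, perm_matrix(h)) == matmul(perm_matrix(h), m) for h in G)
```

In this case the averaged projection sends a vector to the average of its coordinates times e_1 + e_2 + e_3, so its kernel is the subspace of vectors with coordinate sum zero, the G-stable complement to U.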


The applications of Maschke's Theorem will be to finitely generated FG-modules.
Unlike the situation of F[x]-modules, however, finitely generated FG-modules are
automatically finite dimensional vector spaces (the difference being that FG itself is
finite dimensional, whereas F[x] is not). Let V be an FG-module. If V is a finite
dimensional vector space over F, then a fortiori V is finitely generated as an
FG-module (any F basis gives a set of generators over FG). Conversely, if V is finitely
generated as an FG-module, say by v_1, ..., v_k, then one easily sees that V is spanned
as a vector space by the finite set {g · v_i | g ∈ G, 1 ≤ i ≤ k}. Thus

an FG-module is finitely generated if and only if it is finite dimensional.


Corollary 2. If G is a finite group and F is a field whose characteristic does not divide
|G|, then every finitely generated FG-module is completely reducible (equivalently,
every F-representation of G of finite degree is completely reducible).

Proof: Let V be a finitely generated FG-module. As noted above, V is finite
dimensional over F, so we may proceed by induction on its dimension. If V is
irreducible, it is completely reducible and the result holds. Suppose therefore that V has a
proper, nonzero FG-submodule U. By Maschke's Theorem U has an FG-submodule
complement W, i.e., V = U ⊕ W. By induction, each of U and W is a direct sum of
irreducible submodules, hence so is V. This completes the induction.

Corollary 3. Let G be a finite group, let F be a field whose characteristic does not
divide |G| and let φ : G → GL(V) be a representation of G of finite degree. Then
there is a basis of V such that for each g ∈ G the matrix of φ(g) with respect to this
basis is block diagonal:

\[ \begin{pmatrix} \varphi_1(g) & & & \\ & \varphi_2(g) & & \\ & & \ddots & \\ & & & \varphi_m(g) \end{pmatrix} \]

where φ_i is an irreducible matrix representation of G, 1 ≤ i ≤ m.

Proof: By Corollary 2 we may write V = U_1 ⊕ U_2 ⊕ ··· ⊕ U_m, where U_i is an
irreducible FG-submodule of V. Let B_i be a basis of U_i and let B be the union of the
B_i's. For each g ∈ G, the matrix of φ(g) with respect to the basis B is of the form in
the corollary, where φ_i(g) is the matrix of φ(g)|_{U_i} with respect to the basis B_i.
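
To see the block diagonal form concretely, consider again the permutation module of S_3 over ℚ, which decomposes as N ⊕ I with N and I as in Example 10 of the second set of examples. The following Python sketch is an illustration only: the basis b_0 = e_1 + e_2 + e_3, b_1 = e_1 − e_2, b_2 = e_2 − e_3 is a choice made here, the coordinate formulas assume characteristic 0, and the function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

BASIS = [(1, 1, 1), (1, -1, 0), (0, 1, -1)]    # b0 spans N; b1 and b2 span I

def act(p, w):
    """sigma . e_i = e_{sigma(i)}: the coefficient of e_i moves to position sigma(i)."""
    out = [0] * len(w)
    for i, coeff in enumerate(w):
        out[p[i] - 1] = coeff
    return out

def coords(w):
    """Coordinates of w with respect to BASIS."""
    c0 = Fraction(sum(w), 3)
    u = [x - c0 for x in w]          # u has coordinate sum 0, so u lies in I
    return [c0, u[0], -u[2]]         # u = u[0] * b1 + (-u[2]) * b2

def matrix_in_basis(p):
    """Matrix of p with respect to BASIS; column k holds the coordinates of p . b_k."""
    cols = [coords(act(p, b)) for b in BASIS]
    return [[cols[k][i] for k in range(3)] for i in range(3)]

def is_block_diagonal(m):
    """Blocks of sizes 1 and 2: both off-diagonal blocks must vanish."""
    return m[0][1] == m[0][2] == m[1][0] == m[2][0] == 0

matrices = {p: matrix_in_basis(p) for p in permutations((1, 2, 3))}
```

Every matrix has a 1 × 1 block (the trivial representation on N) and a 2 × 2 block (the representation on I), as the corollary predicts.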

The converse of Maschke's Theorem is also true. Namely, if the characteristic
of F does divide |G|, then G possesses (finitely generated) FG-modules which are
not completely reducible. Specifically, the regular representation (i.e., the module FG
itself) is not completely reducible.

In Section 18.2 we shall discuss the question of uniqueness of the constituents in
direct sum decompositions of FG-modules into irreducible submodules.

EXERCISES 

Let F be a field, let G be a finite group and let n ∈ ℤ^+.

1. Prove that if φ : G → GL(V) is any representation, then φ gives a faithful representation
of G/ker φ.

2. Let φ : G → GL_n(F) be a matrix representation. Prove that the map g ↦ det(φ(g)) is a
degree 1 representation.

3. Prove that the degree 1 representations of G are in bijective correspondence with the degree
1 representations of the abelian group G/G′ (where G′ is the commutator subgroup of G).

4. Let V be a (possibly infinite dimensional) FG-module (G is a finite group). Prove that
for each v ∈ V there is an FG-submodule containing v of dimension ≤ |G|.

5. Prove that if |G| > 1 then every irreducible FG-module has dimension < |G|.

6. Write out the matrices φ(g) for every g ∈ G for each of the following representations that
were described in the second set of examples:

(a) the representation of S_3 described in Example 3 (let n = 3 in that example)

(b) the representation of D_8 described in Example 6 (i.e., let n = 4 in that example and
write out the values of all the sines and cosines, for all group elements)

(c) the representation of Q_8 described in Example 7

(d) the representation of Q_8 described in Example 8.

7. Let V be the 4-dimensional permutation module for S_4 described in Example 3 of the
second set of examples. Let π : D_8 → S_4 be the permutation representation of D_8
obtained from the action of D_8 by left multiplication on the set of left cosets of its subgroup
⟨s⟩. Make V into an FD_8-module via π as described in Example 4 and write out the 4 × 4
matrices for r and s given by this representation with respect to the basis e_1, ..., e_4.

8. Let V be the FS_n-module described in Examples 3 and 10 in the second set of examples.

(a) Prove that if v is any element of V such that σ · v = v for all σ ∈ S_n then v is an
F-multiple of e_1 + e_2 + ··· + e_n.

(b) Prove that if n ≥ 3, then V has a unique 1-dimensional submodule, namely the
submodule N consisting of all F-multiples of e_1 + e_2 + ··· + e_n.

9. Prove that the 4-dimensional representation of Q_8 on ℍ described in Example 8 in the
second set of examples is irreducible. [Show that any Q_8-stable subspace is a left ideal.]

10. Prove that GL_2(ℝ) has no subgroup isomorphic to Q_8. [This may be done by direct
computation using generators and relations for Q_8. Simplify these calculations by putting
one generator in rational canonical form.]

11. Let φ : S_n → GL_n(F) be the matrix representation given by the permutation module
described in Example 3 in the second set of examples, where the matrices are computed
with respect to the basis e_1, ..., e_n. Prove that det φ(σ) = ε(σ) for all σ ∈ S_n, where ε(σ)
is the sign of the permutation σ. [Check this on transpositions.]

12. Assume the characteristic of F is not 2. Let H be the set of T ∈ M_n(F) such that T
has exactly one nonzero entry in each row and each column and zeros elsewhere, and the
nonzero entries are ±1. Prove that H is a subgroup of GL_n(F) and that H is isomorphic
to E_{2^n} ⋊ S_n (semidirect product), where E_{2^n} is the elementary abelian group of order 2^n.

The next few exercises explore an important result known as Schur's Lemma and some of its
consequences.

13. Let R be a ring and let M and N be simple (i.e., irreducible) R-modules.

(a) Prove that every nonzero R-module homomorphism from M to N is an isomorphism.
[Consider its kernel and image.]

(b) Prove Schur's Lemma: if M is a simple R-module then Hom_R(M, M) is a division
ring (recall that Hom_R(M, M) is the ring of all R-module homomorphisms from M
to M, where multiplication in this ring is composition).

14. Let φ : G → GL(V) be a representation of G. The centralizer of φ is defined to be the set
of all linear transformations, A, from V to itself such that Aφ(g) = φ(g)A for all g ∈ G
(i.e., the linear transformations of V which commute with all φ(g)'s).

(a) Prove that a linear transformation A from V to V is in the centralizer of φ if and only
if it is an FG-module homomorphism from V to itself (so the centralizer of φ is the
same as the ring Hom_{FG}(V, V)).

(b) Show that if z is in the center of G then φ(z) is in the centralizer of φ.

(c) Assume φ is an irreducible representation (so V is a simple FG-module). Prove
that if H is any finite abelian subgroup of GL(V) such that Aφ(g) = φ(g)A for
all A ∈ H then H is cyclic (in other words, any finite abelian subgroup of the
multiplicative group of units in the ring Hom_{FG}(V, V) is cyclic). [By the preceding
exercise, Hom_{FG}(V, V) is a division ring, so this reduces to proving that a finite
abelian subgroup of the multiplicative group of nonzero elements in a division ring
is cyclic. Show that the division subring generated by an abelian subgroup of any
division ring is a field and use Proposition 18, Section 9.5.]

(d) Show that if φ is a faithful irreducible representation then the center of G is cyclic.

(e) Deduce from (d) that if G is abelian and φ is any irreducible representation then
G/ker φ is cyclic.

15. Exhibit all 1-dimensional complex representations of a finite cyclic group; make sure to
decide which are inequivalent.

16. Exhibit all 1-dimensional complex representations of a finite abelian group. Deduce that
the number of inequivalent degree 1 complex representations of a finite abelian group
equals the order of the group. [First decompose the abelian group into a direct product of
cyclic groups, then use the preceding exercise.]

17. Prove the following variant of Schur’s Lemma for complex representations of abelian
groups: if $G$ is abelian, any irreducible complex representation, $\varphi$, of $G$ is of degree 1
and $G/\ker\varphi$ is cyclic. [This can be done without recourse to Exercise 14 by using the
observation that for any $g \in G$ the eigenspaces of $\varphi(g)$ are $G$-stable. Your proof that $\varphi$
has degree 1 should also work for infinite abelian groups.]

18. Prove the following general form of Schur’s Lemma for complex representations: if
$\varphi : G \to GL_n(\mathbb{C})$ is an irreducible matrix representation and $A$ is an $n \times n$ matrix
commuting with $\varphi(g)$ for all $g \in G$, then $A$ is a scalar matrix. Deduce that if $\varphi$ is a faithful,
irreducible, complex representation then the center of $G$ is cyclic and $\varphi(z)$ is a scalar
matrix for all elements $z$ in the center of $G$. [As in the preceding exercise, the eigenspaces
of $A$ are $G$-stable.]

19. Prove that if G is an abelian group then any finite dimensional complex representation of G 
is equivalent to a representation into diagonal matrices (i.e., any finite group of commuting 
matrices over C can be simultaneously diagonalized). [This can be done without recourse 
to Maschke’s Theorem by looking at eigenspaces.] 

20. Prove that the number of degree 1 complex representations of any finite group G equals 
| G : G' |, where G' is the commutator subgroup of G. [Use Exercises 3 and 16.] 

21. Let $G$ be a noncyclic abelian group acting by conjugation on an elementary abelian $p$-group
$V$, where $p$ is a prime not dividing the order of $G$.

(a) Prove that if $W$ is an irreducible $\mathbb{F}_pG$-submodule of $V$ then there is some nonidentity
element $g \in G$ such that $W \le C_V(g)$ (here $C_V(g)$ is the subgroup of elements of $V$
that are fixed by $g$ under conjugation).

(b) Prove that $V$ is generated by the subgroups $C_V(g)$ as $g$ runs over all nonidentity
elements of $G$.

22. Let $p$ be a prime, let $P$ be a $p$-group and let $F$ be a field of characteristic $p$. Prove that
the only irreducible representation of $P$ over $F$ is the trivial representation. [Do this for a
group of order $p$ first using the fact that $F$ contains all $p$-th roots of 1 (namely 1 itself). If
$P$ is not of order $p$, let $z$ be an element of order $p$ in the center of $P$, prove that $z$ is in the
kernel of the irreducible representation and apply induction to $P/\langle z \rangle$.]

23. Let p be a prime, let P be a nontrivial p-group and let F be a field of characteristic p. Prove 
that the regular representation is not completely reducible. [Use the preceding exercise.] 

24. Let p be a prime, let P be a nontrivial p-group and let F be a field of characteristic p. 
Prove that the regular representation is indecomposable. 


18.2 WEDDERBURN’S THEOREM AND SOME CONSEQUENCES 

In this section we give a famous classification theorem due to Wedderburn which
describes, in particular, the structure of the group algebra $FG$ when the characteristic
of $F$ does not divide the order of $G$. From this classification theorem we shall derive
various consequences, including the fact that for each finite group $G$ there are only a
finite number of nonisomorphic irreducible $FG$-modules. This result, together with
Maschke’s Theorem, in some sense completes the Hölder Program for representation
theory of finite groups over such fields. The remainder of the book is concerned with
developing techniques for determining and working with the irreducible representations
as well as applying this knowledge to obtain group-theoretic information.

Theorem 4. (Wedderburn’s Theorem) Let $R$ be a nonzero ring with 1 (not necessarily
commutative). Then the following are equivalent:

(1) every $R$-module is projective

(2) every $R$-module is injective

(3) every $R$-module is completely reducible

(4) the ring $R$ considered as a left $R$-module is a direct sum:

$R = L_1 \oplus L_2 \oplus \cdots \oplus L_n$,

where each $L_i$ is a simple module (i.e., a simple left ideal) with $L_i = Re_i$, for
some $e_i \in R$ with

(i) $e_i e_j = 0$ if $i \ne j$

(ii) $e_i^2 = e_i$ for all $i$

(iii) $\sum_{i=1}^{n} e_i = 1$

(5) as rings, $R$ is isomorphic to a direct product of matrix rings over division rings,
i.e., $R = R_1 \times R_2 \times \cdots \times R_r$ where $R_j$ is a two-sided ideal of $R$ and $R_j$ is
isomorphic to the ring of all $n_j \times n_j$ matrices with entries in a division ring $\Delta_j$,
$j = 1, 2, \ldots, r$. The integer $r$, the integers $n_j$, and the division rings $\Delta_j$ (up to
isomorphism) are uniquely determined by $R$.

Proof: A proof of Wedderburn’s Theorem is outlined in Exercises 1 to 10.

Definition. A ring R satisfying any of the (equivalent) properties in Theorem 4 is 
called semisimple with minimum condition. 

Rings $R$ satisfying any of the equivalent conditions of Theorem 4 also satisfy the
minimum condition or descending chain condition (D.C.C.) on left ideals:

if $I_1 \supseteq I_2 \supseteq \cdots$ is a descending chain of left ideals of $R$

then there is an $N \in \mathbb{Z}^+$ such that $I_k = I_N$ for all $k \ge N$

(which explains the use of this term in the definition above). The rings we deal with
will all have this minimum condition. For example, group algebras always have this
property since in any strictly descending chain of ideals the vector space dimensions of
the ideals (which are $F$-subspaces of $FG$) are strictly decreasing, hence the length of a
strictly descending chain is at most the dimension of $FG$ ($= |G|$). We shall therefore use
the term “semisimple” to mean “semisimple with minimum condition.” The rings $R_i$ in
conclusion (5) of Wedderburn’s Theorem are called the Wedderburn components of $R$
and the direct product decomposition of $R$ is called its Wedderburn decomposition. Note
that Wedderburn’s Theorem for commutative rings is a consequence of the classification
of Artinian rings in Section 16.1. A commutative semisimple ring with minimum
condition is an Artinian ring with Jacobson radical equal to zero and so is a direct
product of fields (which are its Wedderburn components).

One should note that condition (5) is a two-sided condition which describes the
overall structure of $R$ completely (the ring operations in this direct product of rings are
componentwise addition and multiplication). In particular it implies that a semisimple
ring also has the minimum condition on right ideals. A useful way of thinking of the
elements of the direct product $R_1 \times \cdots \times R_r$ in conclusion (5) is as $n \times n$ (block diagonal)
matrices of the form

$\begin{pmatrix} A_1 & & & \\ & A_2 & & \\ & & \ddots & \\ & & & A_r \end{pmatrix}$

where $A_i$ is an arbitrary $n_i \times n_i$ matrix with entries from $\Delta_i$ (here $n = \sum_{i=1}^{r} n_i$).

Recall from Section 10.5 that an $R$-module $Q$ is injective if whenever $Q$ is a
submodule of any $R$-module $M$, then $M$ has a submodule $N$ such that $M = Q \oplus N$.
Maschke’s Theorem therefore implies:

Corollary 5. If $G$ is a finite group and $F$ is a field whose characteristic does not divide
$|G|$, then the group algebra $FG$ is a semisimple ring.

Before obtaining more precise information about how the invariants $n$, $r$, $\Delta_i$, etc.,
relate to invariants in group rings $FG$ for certain fields $F$, we first study the structure
of matrix rings (i.e., the rings described in conclusions (4) and (5) of Wedderburn’s
Theorem). We introduce some terminology which is used extensively in ring theory.
Recall that the center of the ring $R$ is the subring of elements commuting with all
elements in $R$; it will be denoted by $Z(R)$ (the center will contain 1 if the ring has a 1).

Definition.

(1) A nonzero element $e$ in a ring $R$ is called an idempotent if $e^2 = e$.

(2) Idempotents $e_1$ and $e_2$ are said to be orthogonal if $e_1 e_2 = e_2 e_1 = 0$.

(3) An idempotent $e$ is said to be primitive if it cannot be written as a sum of two
(commuting) orthogonal idempotents.

(4) The idempotent $e$ is called a primitive central idempotent if $e \in Z(R)$ and $e$
cannot be written as a sum of two orthogonal idempotents in the ring $Z(R)$.

Proposition 6 describes the ideal structure of a matrix ring and Proposition 8 extends 
these results to direct products of matrix rings. 

Proposition 6. Let $\Delta$ be a division ring, let $n \in \mathbb{Z}^+$, let $R$ be the ring of all $n \times n$
matrices with entries from $\Delta$ and let $I$ be the identity matrix (= the 1 of $R$).

(1) The only two-sided ideals of $R$ are 0 and $R$.

(2) The center of $R$ consists of the scalar matrices $\alpha I$, where $\alpha$ is in the center of $\Delta$:
$Z(R) = \{\alpha I \mid \alpha \in Z(\Delta)\}$, and this is a field isomorphic to $Z(\Delta)$. In particular,
if $\Delta$ is a field, the center of $R$ is the subring of all scalar matrices. The only
central idempotent in $R$ is $I$ (in particular, $I$ is primitive).

(3) Let $e_i$ be the matrix with a 1 in position $i, i$ and zeros elsewhere. Then $e_1, \ldots, e_n$
are orthogonal primitive idempotents and $\sum_{i=1}^{n} e_i = I$.

(4) $L_i = Re_i$ is the left ideal consisting of arbitrary entries in column $i$ and zeros
in all other columns. $L_i$ is a simple left $R$-module. Every simple left $R$-module
is isomorphic to $L_1$ (in particular, all $L_i$ are isomorphic $R$-modules) and as a
left $R$-module we have $R = L_1 \oplus \cdots \oplus L_n$.

Before proving this proposition it will be useful to have the following result.

Lemma 7. Let $R$ be an arbitrary nonzero ring.

(1) If $M$ and $N$ are simple $R$-modules and $\varphi : M \to N$ is a nonzero $R$-module
homomorphism, then $\varphi$ is an isomorphism.

(2) (Schur’s Lemma) If $M$ is a simple $R$-module, then $\mathrm{Hom}_R(M, M)$ is a division
ring.

Proof of Lemma 7: To prove (1) note that since $\varphi$ is nonzero, $\ker\varphi$ is a proper
submodule of $M$. By simplicity of $M$ we have $\ker\varphi = 0$. Similarly, the image of $\varphi$
is a nonzero submodule of the simple module $N$, hence $\varphi(M) = N$. This proves $\varphi$ is
bijective, so (1) holds.

By part (1), every nonzero element of the ring $\mathrm{Hom}_R(M, M)$ is an isomorphism,
hence has an inverse. This gives (2).

Proof of Proposition 6: Let $A$ be an arbitrary matrix in $R$ whose $i, j$ entry is $a_{ij}$.
Let $E_{ij}$ be the matrix with a 1 in position $i, j$ and zeros elsewhere. The following
straightforward computations are left as exercises:

(i) $E_{ij}A$ is the matrix whose $i$-th row equals the $j$-th row of $A$ and all other rows are
zero.

(ii) $AE_{ij}$ is the matrix whose $j$-th column equals the $i$-th column of $A$ and all other
columns are zero.

(iii) $E_{pq}AE_{rs}$ is the matrix whose $p, s$ entry is $a_{qr}$ and all other entries are zero.
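
The three computations (i), (ii) and (iii) can be spot-checked numerically. The following is a minimal sketch, assuming rational entries (so the division ring is taken to be the field of rationals); the helper names `unit`, `mul` and `check_identities` are illustrative and not part of the text.

```python
from fractions import Fraction
from itertools import product

def unit(n, i, j):
    """Matrix unit E_ij (1-based): 1 in position (i, j) and 0 elsewhere."""
    return [[Fraction(int((r, c) == (i - 1, j - 1))) for c in range(n)]
            for r in range(n)]

def mul(X, Y):
    """Product of two n x n matrices stored as lists of rows."""
    n = len(X)
    return [[sum(X[r][k] * Y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def check_identities(A):
    """Return True if A satisfies (i), (ii) and (iii) for all index choices."""
    n = len(A)
    idx = range(1, n + 1)
    for i, j in product(idx, repeat=2):
        left, right = mul(unit(n, i, j), A), mul(A, unit(n, i, j))
        for r, c in product(range(n), repeat=2):
            # (i): row i of E_ij A is row j of A; all other rows are zero
            if left[r][c] != (A[j - 1][c] if r == i - 1 else 0):
                return False
            # (ii): column j of A E_ij is column i of A; all other columns are zero
            if right[r][c] != (A[r][i - 1] if c == j - 1 else 0):
                return False
    for p, q, r, s in product(idx, repeat=4):
        M = mul(mul(unit(n, p, q), A), unit(n, r, s))
        for a, b in product(range(n), repeat=2):
            # (iii): the only possibly nonzero entry is a_qr, in position (p, s)
            if M[a][b] != (A[q - 1][r - 1] if (a, b) == (p - 1, s - 1) else 0):
                return False
    return True

# Sample matrix with rational entries (an arbitrary illustrative choice).
A = [[Fraction(3 * r + c + 1, 2) for c in range(3)] for r in range(3)]
print(check_identities(A))
```

A check on one sample matrix is of course not a proof; it only guards against index slips when the identities are used below.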

To prove (1) suppose $J$ is any nonzero 2-sided ideal of $R$ and let $A$ be an element
of $J$ with a nonzero entry in position $q, r$. Given any $p, s \in \{1, \ldots, n\}$ we obtain from
(iii) that

$E_{ps} = \frac{1}{a_{qr}} E_{pq} A E_{rs} \in J$.

Since the $\Delta$-linear combinations of $\{E_{ps} \mid 1 \le p \le n,\ 1 \le s \le n\}$ give all of $R$, it
follows that $J = R$. This proves (1).

To prove (2) assume $A \in Z(R)$. Thus for all $i, j$ we have $E_{ij}A = AE_{ij}$. From
(i) and (ii) above it follows immediately that all off-diagonal entries of $A$ are zero and
all diagonal entries of $A$ are equal. Thus $A = \alpha I$ for some $\alpha \in \Delta$. Furthermore, $A$
must also commute with the set of all scalar matrices $\beta I$, $\beta \in \Delta$, i.e., $\alpha$ must commute
with all elements of $\Delta$. Finally, since $Z(R)$ is a field, it is immediate that it contains a
unique idempotent (namely $I$). This establishes all parts of (2).
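
For a small finite case the statement of part (2) can be confirmed by exhaustive search. The sketch below assumes the illustrative choice of $2 \times 2$ matrices over the field with 3 elements (81 matrices in all); the names `P`, `R`, `center` and `scalars` are introduced here only for the example.

```python
from itertools import product

P = 3  # work over the field Z/3Z (an illustrative choice)

def mul(X, Y):
    """Multiply 2 x 2 matrices stored as tuples (a, b, c, d) = [[a, b], [c, d]] mod P."""
    a, b, c, d = X
    e, f, g, h = Y
    return ((a * e + b * g) % P, (a * f + b * h) % P,
            (c * e + d * g) % P, (c * f + d * h) % P)

# All 2 x 2 matrices over Z/3Z.
R = list(product(range(P), repeat=4))

# The center: matrices commuting with every element of R.
center = [Z for Z in R if all(mul(Z, X) == mul(X, Z) for X in R)]

# The scalar matrices alpha * I.
scalars = [(a, 0, 0, a) for a in range(P)]

# Nonzero central elements e with e^2 = e.
central_idempotents = [Z for Z in center if any(Z) and mul(Z, Z) == Z]

print(sorted(center) == sorted(scalars), central_idempotents)
```

In this example the center should coincide with the three scalar matrices and the identity should be the only central idempotent, in agreement with part (2).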

In part (3) it is clear that $e_1, \ldots, e_n$ are orthogonal idempotents whose sum is $I$.
We defer proving that they are primitive until we have established (4).

Next we prove (4). From (ii) above it follows that $Re_i = RE_{ii}$ is the set of matrices
with arbitrary entries in the $i$-th column and zeros in all other columns. Furthermore,
if $A$ is any nonzero element of $Re_i$, then certainly $RA \subseteq Re_i$. The reverse inclusion
holds because if $a_{pi}$ is a nonzero entry of $A$, then by (i) above

$e_i = E_{ii} = \frac{1}{a_{pi}} E_{ip} A \in RA$.

This proves $Re_i = RA$ for any nonzero element $A \in Re_i$, and so $Re_i$ must be a simple
$R$-module.

Let $M$ be any simple $R$-module. Since $1m = m$ for all $m \in M$ and since $I =
\sum_{i=1}^{n} e_i$, there exists some $i$ and some $m \in M$ such that $e_i m \ne 0$. For this $i$ and $m$ the
map $re_i \mapsto re_i m$ is a nonzero $R$-module homomorphism from the simple $R$-module
$Re_i$ to the simple module $M$. By Lemma 7(1) it is an isomorphism. By (ii), the map
$r \mapsto rE_{i1}$ gives $Re_i \cong Re_1$. Finally, every matrix is the direct sum of its columns so
$R = L_1 \oplus \cdots \oplus L_n$. This completes the proof of (4).

It remains to prove that the idempotents in part (3) are primitive. If $e_i = a + b$, for
some orthogonal idempotents $a$ and $b$, then we shall see that

$L_i = Re_i = Ra \oplus Rb$.

This will contradict the fact that $L_i$ is a simple $R$-module. To establish the above direct
sum note first that since $ab = ba = 0$, we have $ae_i = a \in Re_i$ and $be_i = b \in Re_i$. For
all $r \in R$ we have $re_i = ra + rb$, hence $Re_i = Ra + Rb$. Moreover, $Ra \cap Rb = 0$
because if $ra = sb$ for some $r, s \in R$, then $ra = raa = sba = 0$ (recall $a = a^2$ and
$ba = 0$). This completes all parts of the proof.

Proposition 8. Let $R = R_1 \times R_2 \times \cdots \times R_r$, where $R_i$ is the ring of $n_i \times n_i$ matrices
over the division ring $\Delta_i$, for $i = 1, 2, \ldots, r$.

(1) Identify $R_i$ with the $i$-th component of the direct product. Let $z_i$ be the $r$-tuple
with the identity of $R_i$ in position $i$ and zero in all other positions. Then
$R_i = z_i R$ and for any $a \in R_i$, $z_i a = a$ and $z_j a = 0$ for all $j \ne i$. The elements
$z_1, \ldots, z_r$ are all of the primitive central idempotents of $R$. They are pairwise
orthogonal and $\sum_{i=1}^{r} z_i = 1$.

(2) Let $N$ be any left $R$-module and let $z_i N = \{z_i x \mid x \in N\}$, $1 \le i \le r$. Then
$z_i N$ is a left $R$-submodule of $N$, each $z_i N$ is an $R_i$-module on which $R_j$ acts
trivially for all $j \ne i$, and

$N = z_1 N \oplus z_2 N \oplus \cdots \oplus z_r N$.

(3) The simple $R$-modules are the simple $R_i$-modules on which $R_j$ acts trivially
for $j \ne i$ in the following sense. Let $M_i$ be the unique simple $R_i$-module (cf.
Proposition 6). We may consider $M_i$ as an $R$-module by letting $R_j$ act trivially
for all $j \ne i$. Then $M_1, \ldots, M_r$ are pairwise nonisomorphic simple $R$-modules
and any simple $R$-module is isomorphic to one of $M_1, \ldots, M_r$. Explicitly, the
$R$-module $M_i$ is isomorphic to the simple left ideal $(0, \ldots, 0, L^{(i)}, 0, \ldots, 0)$ of
all elements of $R$ whose $i$-th component, $L^{(i)}$, consists of matrices with arbitrary
entries in the first column and zeros elsewhere.

(4) For any $R$-module $N$ the $R$-submodule $z_i N$ is a direct sum of simple $R$-modules,
each of which is isomorphic to the module $M_i$ in (3). In particular, if $M$ is a
simple $R$-module, then there is a unique $i$ such that $z_i M = M$ and for this index
$i$ we have $M \cong M_i$; for all $j \ne i$, $z_j M = 0$.

(5) If each $\Delta_i$ equals the field $F$, then $R$ is a vector space over $F$ of dimension
$\sum_{i=1}^{r} n_i^2$ and $\dim_F Z(R) = r$.

Proof: In part (1) since multiplication in the direct product of rings is componentwise
it is clear that $z_i$ times the element $(a_1, \ldots, a_r)$ of $R$ is the $r$-tuple with $a_i$ in
position $i$ and zeros elsewhere. Thus $R_i = z_i R$, $z_i$ is the identity in $R_i$ and $z_i a = 0$ if
$a \in R_j$ for any $j \ne i$. It is also clear that $z_1, \ldots, z_r$ are pairwise orthogonal central
idempotents whose sum is the identity of $R$. The central idempotents of $R$ are, by
definition, the idempotents in $Z(R) = F_1 \times F_2 \times \cdots \times F_r$, where $F_i$ is the center of $R_i$.
By Proposition 6, $F_i$ is the field $Z(\Delta_i)$. If $w = (w_1, \ldots, w_r)$ is any central idempotent
then $w_i \in F_i$ for all $i$, and since $w^2 = w$ we have $w_i^2 = w_i$ in the field $F_i$. Since 0 and 1
are the only solutions to $x^2 = x$ in a field, the only central idempotents in $R$ are $r$-tuples
whose entries are 0’s and 1’s. Thus $z_1, \ldots, z_r$ are primitive central idempotents and
since every central idempotent is a sum of these, they are the complete set of primitive
central idempotents of $R$. This proves (1).
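
The counting argument in part (1) can be illustrated by brute force in the simplest commutative case. The sketch below assumes $r = 3$ and takes every factor to be the field with 5 elements (so all $n_i = 1$ and $Z(R) = R$); these choices, and the names `idempotents`, `is_primitive` and `primitive`, are made only for the example.

```python
from itertools import product

P, r = 5, 3  # R = F_5 x F_5 x F_5, an illustrative commutative example
R = list(product(range(P), repeat=r))
zero = (0,) * r

def mul(x, y):
    """Componentwise product in the direct product of rings."""
    return tuple((a * b) % P for a, b in zip(x, y))

def add(x, y):
    """Componentwise sum in the direct product of rings."""
    return tuple((a + b) % P for a, b in zip(x, y))

# Idempotents are nonzero by definition.
idempotents = [w for w in R if w != zero and mul(w, w) == w]

def is_primitive(e):
    """True if e is not a sum of two orthogonal idempotents."""
    return not any(add(a, b) == e and mul(a, b) == zero
                   for a in idempotents for b in idempotents)

primitive = [e for e in idempotents if is_primitive(e)]
print(len(idempotents), primitive)
```

Here the idempotents should be exactly the seven nonzero tuples of 0's and 1's, and the primitive ones should be the three tuples with a single 1, which play the role of $z_1, z_2, z_3$.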

To prove (2) let $N$ be any left $R$-module. First note that for any $z \in Z(R)$ the
set $\{zx \mid x \in N\}$ is an $R$-submodule of $N$. In particular, $z_i N$ is an $R$-submodule.
Let $z_i x \in z_i N$ and let $a \in R_j$ for some $j \ne i$. By (1) we have that $a = az_j$ and
so $az_i x = (az_j)(z_i x) = az_i z_j x = 0$ because $z_i z_j = 0$. Thus the $R$-submodule
$z_i N$ is acted on trivially by $R_j$ for all $j \ne i$. For each $x \in N$ we have by (1) that
$x = 1x = z_1 x + \cdots + z_r x$, hence $N = z_1 N + \cdots + z_r N$. Finally, this sum is direct
because if, for instance, $x \in z_1 N \cap (z_2 N + \cdots + z_r N)$, then $x = z_1 x$ whereas $z_1$ times
any element of $z_2 N + \cdots + z_r N$ is zero. This proves (2).

In part (3) first note that an $R_i$-module $M$ becomes an $R$-module when $R_j$ is defined
to act trivially on $M$ for all $j \ne i$. For such a module $M$ the $R$-submodules are the same
as the $R_i$-submodules. Thus $M_i$ is a simple $R$-module for each $i$ since it is a simple
$R_i$-module.

Next, let $M$ be a simple $R$-module. By (2), $M = z_1 M \oplus \cdots \oplus z_r M$. Since $M$
has no nontrivial proper $R$-submodules, there must be a unique $i$ such that $M = z_i M$
and $z_j M = 0$ for all $j \ne i$. Thus the simple $R$-module $M$ is annihilated by $R_j$ for all
$j \ne i$. This implies that the $R$-submodules of $M$ are the same as the $R_i$-submodules
of $M$, so $M$ is therefore a simple $R_i$-module. By Proposition 6, $M$ is isomorphic as an
$R_i$-module to $M_i$. Since $R_j$ acts trivially on both $M$ and $M_i$ for all $j \ne i$, it follows
that the $R_i$-module isomorphism may be viewed as an $R$-module isomorphism as well.

Suppose $i \ne j$ and suppose $\varphi : M_i \to M_j$ is an $R$-module isomorphism. If $s_i \in M_i$
then $s_i = z_i s_i$ so

$\varphi(s_i) = \varphi(z_i s_i) = z_i \varphi(s_i) = 0$,

since $\varphi(s_i) \in M_j$ and $z_i$ acts trivially on $M_j$. This contradicts the fact that $\varphi$ is an
isomorphism and proves that $M_1, \ldots, M_r$ are pairwise nonisomorphic simple $R$-modules.

Finally, the left ideal of $R$ described in (3) is acted on trivially by $R_j$ for all $j \ne i$
and, by Proposition 6, it is up to isomorphism the unique simple $R_i$-module. This left
ideal is therefore a simple $R$-module which is isomorphic to $M_i$. This proves (3).

For part (4) we have already proved that if $M$ is any simple $R$-module then there is
a unique $i$ such that $z_i M = M$ and $z_j M = 0$ for all $j \ne i$. Furthermore, we have shown
that for this index $i$ the simple $R$-module $M$ is isomorphic to $M_i$. Now let $N$ be any
$R$-module. Then $z_i N$ is a module over $R_i$ which is acted on trivially by $R_j$ for all $j \ne i$.
By Wedderburn’s Theorem $z_i N$ is a direct sum of simple $R$-modules. Since each of
these simple summands is acted on trivially by $R_j$ for all $j \ne i$, each is isomorphic to
$M_i$. This proves (4).

In part (5) if each $\Delta_i$ equals the field $F$, then as an $F$-vector space

$R \cong M_{n_1}(F) \oplus M_{n_2}(F) \oplus \cdots \oplus M_{n_r}(F)$.

Each matrix ring $M_{n_i}(F)$ has dimension $n_i^2$ over $F$, hence $R$ has dimension $\sum_{i=1}^{r} n_i^2$ over
$F$. Furthermore, the center of each $M_{n_i}(F)$ is 1-dimensional (since by Proposition 6(2)
it is isomorphic to $F$), hence $Z(R)$ has dimension $r$ over $F$. This completes the proof
of the proposition.

We now apply Wedderburn’s Theorem (and the above ring-theoretic calculations)
to the group algebra $FG$. First of all, in order to apply Wedderburn’s Theorem we
need the characteristic of $F$ not to divide $|G|$. In fact, since we shall be dealing with
numerical data in the sections to come it will be convenient to have the characteristic of
$F$ equal to 0. Secondly, it will simplify matters if we force all the division rings which
will appear in the Wedderburn decomposition of $FG$ to equal the field $F$; we shall
prove that imposing the condition that $F$ be algebraically closed is sufficient to ensure
this. To simplify notation we shall therefore take $F = \mathbb{C}$ for most of the remainder of
the text. The reader can easily check that any algebraically closed field of characteristic
0 (e.g., the field of all algebraic numbers) can be used throughout in place of $\mathbb{C}$.

By Corollary 5 the ring $\mathbb{C}G$ is semisimple so by Wedderburn’s Theorem

$\mathbb{C}G \cong R_1 \times R_2 \times \cdots \times R_r$

where $R_i$ is the ring of $n_i \times n_i$ matrices over some division ring $\Delta_i$. Thinking of the
elements of this direct product as $n \times n$ block matrices ($n = \sum_{i=1}^{r} n_i$) where the $i$-th
block has entries from $\Delta_i$, the field $\mathbb{C}$ appears in this direct product as scalar matrices
and is contained in the center of $\mathbb{C}G$. Note that each $\Delta_i$ is a vector space over $\mathbb{C}$ of
finite dimension (at most $\dim_{\mathbb{C}} \mathbb{C}G = |G|$). The next result shows that this implies each $\Delta_i = \mathbb{C}$.

Proposition 9. If $\Delta$ is a division ring that is a finite dimensional vector space over an
algebraically closed field $F$ and $F \subseteq Z(\Delta)$, then $\Delta = F$.

Proof: Since $F \subseteq Z(\Delta)$, for each $\alpha \in \Delta$ the division ring generated by $\alpha$ and $F$
is a field. Also, since $\Delta$ is finite dimensional over $F$ the field $F(\alpha)$ is a finite extension
of $F$. Because $F$ is algebraically closed it has no nontrivial finite extensions, hence
$F(\alpha) = F$ for all $\alpha \in \Delta$, i.e., $\Delta = F$.

This proposition proves that each $R_i$ in the Wedderburn decomposition of $\mathbb{C}G$ is a
matrix ring over $\mathbb{C}$:

$R_i \cong M_{n_i}(\mathbb{C})$.

Now Proposition 8(5) implies that

$\sum_{i=1}^{r} n_i^2 = |G|$.

The final application in this section is to prove that $r$ (= the number of Wedderburn
components in $\mathbb{C}G$) equals the number of conjugacy classes of $G$. To see this, first
note that Proposition 8(5) asserts that $r = \dim_{\mathbb{C}} Z(\mathbb{C}G)$. We compute this dimension
in another way.

Let $\mathcal{K}_1, \ldots, \mathcal{K}_s$ be the distinct conjugacy classes of $G$ (recall that these partition
$G$). For each conjugacy class $\mathcal{K}_i$ of $G$ let

$X_i = \sum_{g \in \mathcal{K}_i} g \in \mathbb{C}G$.

Note that $X_i$ and $X_j$ have no common terms for $i \ne j$, hence they are linearly independent
elements of $\mathbb{C}G$. Furthermore, since conjugation by a group element permutes the
elements of each class, $h^{-1}X_i h = X_i$, i.e., $X_i$ commutes with all group elements. This
proves that $X_i \in Z(\mathbb{C}G)$.

We show the $X_i$’s form a basis of $Z(\mathbb{C}G)$, which will prove $s = \dim_{\mathbb{C}} Z(\mathbb{C}G) = r$.
Since the $X_i$’s are linearly independent it remains to show they span $Z(\mathbb{C}G)$. Let
$X = \sum_{g \in G} \alpha_g g$ be an arbitrary element of $Z(\mathbb{C}G)$. Since $h^{-1}Xh = X$,

$\sum_{g \in G} \alpha_g h^{-1}gh = \sum_{g \in G} \alpha_g g$.

Since the elements of $G$ form a basis of $\mathbb{C}G$ the coefficients of $g$ in the above two sums
are equal:

$\alpha_{hgh^{-1}} = \alpha_g$.

Since $h$ was arbitrary, every element in the same conjugacy class of a fixed group
element $g$ has the same coefficient in $X$, hence $X$ can be written as a linear combination
of the $X_i$’s.
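
The class sums can be computed explicitly in a small case. The sketch below assumes the group $S_3$, with permutations stored as tuples and elements of the group algebra stored as dictionaries from group elements to integer coefficients; all function names are illustrative choices made for this example.

```python
from itertools import permutations

G = list(permutations(range(3)))  # S_3; p[i] is the image of i under p

def compose(p, q):
    """The permutation 'p after q'."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """The inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def conjugacy_classes(G):
    """Partition G into conjugacy classes {h g h^-1 : h in G}."""
    classes, seen = [], set()
    for g in G:
        if g not in seen:
            cls = frozenset(compose(compose(h, g), inverse(h)) for h in G)
            classes.append(cls)
            seen |= cls
    return classes

def algebra_mul(X, Y):
    """Multiply group algebra elements stored as {group element: coefficient}."""
    Z = {}
    for g, a in X.items():
        for h, b in Y.items():
            gh = compose(g, h)
            Z[gh] = Z.get(gh, 0) + a * b
    return {g: c for g, c in Z.items() if c != 0}

def is_central(X):
    """Test whether X commutes with every group element (hence with all of the algebra)."""
    return all(algebra_mul(X, {h: 1}) == algebra_mul({h: 1}, X) for h in G)

classes = conjugacy_classes(G)
class_sums = [{g: 1 for g in cls} for cls in classes]
print(len(classes), [is_central(X) for X in class_sums])
```

For $S_3$ this should report three classes (of sizes 1, 2 and 3) with every class sum central, whereas a single transposition taken alone is not central.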

We summarize these results in the following theorem. 

Theorem 10. Let $G$ be a finite group.

(1) $\mathbb{C}G \cong M_{n_1}(\mathbb{C}) \times M_{n_2}(\mathbb{C}) \times \cdots \times M_{n_r}(\mathbb{C})$.

(2) $\mathbb{C}G$ has exactly $r$ distinct isomorphism types of irreducible modules and these
have complex dimensions $n_1, n_2, \ldots, n_r$ (and so $G$ has exactly $r$ inequivalent
irreducible complex representations of the corresponding degrees).

(3) $\sum_{i=1}^{r} n_i^2 = |G|$.

(4) $r$ equals the number of conjugacy classes in $G$.
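
Parts (3) and (4) together often pin down the degrees with very little work. The following sketch lists the nondecreasing degree sequences compatible with a given order and number of conjugacy classes; it also uses the fact that the trivial representation always supplies a degree equal to 1. The function name `degree_sequences` is hypothetical, and the search says nothing about which sequences actually occur for a given group.

```python
def degree_sequences(order, r):
    """Nondecreasing lists [n_1, ..., n_r] of positive integers with n_1 = 1
    and n_1^2 + ... + n_r^2 = order."""
    results = []

    def extend(prefix, remaining):
        slots = r - len(prefix)
        if slots == 0:
            if remaining == 0:
                results.append(prefix)
            return
        n = prefix[-1]
        # Every remaining degree is at least n, so n^2 * slots must fit.
        while n * n * slots <= remaining:
            extend(prefix + [n], remaining - n * n)
            n += 1

    if r >= 1 and order >= 1:
        extend([1], order - 1)
    return results

print(degree_sequences(6, 3))   # a group of order 6 with 3 conjugacy classes
print(degree_sequences(8, 5))   # a group of order 8 with 5 conjugacy classes
```

When the returned list has a single entry, as in the two sample calls, the degrees are forced by the numerical data alone.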

Corollary 11.

(1) Let $A$ be a finite abelian group. Every irreducible complex representation
of $A$ is 1-dimensional (i.e., is a homomorphism from $A$ into $\mathbb{C}^\times$) and $A$ has
$|A|$ inequivalent irreducible complex representations. Furthermore, every finite
dimensional complex matrix representation of $A$ is equivalent to a representation
into a group of diagonal matrices.

(2) The number of inequivalent (irreducible) degree 1 complex representations of
any finite group $G$ equals $|G/G'|$.

Proof: If $A$ is abelian, $\mathbb{C}A$ is a commutative ring. Since a $k \times k$ matrix ring is not
commutative whenever $k > 1$ we must have each $n_i = 1$. Thus $r = |A|$ (= the number
of conjugacy classes of $A$). Since every $\mathbb{C}A$-module is a direct sum of irreducible
submodules, there is a basis such that the matrices are diagonal with respect to this
basis. This establishes the first part of the corollary.
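
For a concrete abelian group the $|A|$ one-dimensional representations can be written down by sending each cyclic generator to a root of unity. The sketch below assumes $A = C_2 \times C_3$, with elements stored as exponent tuples; the function `characters` and the variable names are illustrative only.

```python
import cmath
from itertools import product

def characters(orders):
    """One homomorphism into the nonzero complex numbers per tuple (k_1, ..., k_m)
    with 0 <= k_i < d_i: the i-th generator is sent to exp(2*pi*i*k_i/d_i).
    Each result is a function on exponent tuples (a_1, ..., a_m)."""
    chars = []
    for ks in product(*(range(d) for d in orders)):
        def chi(a, ks=ks):
            angle = sum(2 * cmath.pi * k * ai / d for k, ai, d in zip(ks, a, orders))
            return cmath.exp(1j * angle)
        chars.append(chi)
    return chars

orders = (2, 3)                                   # A = C_2 x C_3, of order 6
elements = list(product(*(range(d) for d in orders)))
chars = characters(orders)
print(len(chars) == len(elements))
```

The number of functions produced equals the order of the group, and each is multiplicative up to floating-point rounding, which is consistent with the count $r = |A|$ obtained in the proof.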

For a general group $G$, every degree 1 representation, $\varphi$, is a homomorphism of $G$
into $\mathbb{C}^\times$. Thus $\varphi$ factors through $G/G'$. Conversely, every degree 1 representation of
$G/G'$ gives, by composition with the natural projection $G \to G/G'$, a degree 1
representation of $G$. The degree 1 representations of $G$ are therefore precisely the irreducible
representations of the abelian group $G/G'$. Part (2) is now immediate from (1).
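
The index $|G/G'|$ in part (2) is easy to compute by machine for small permutation groups. The following sketch assumes the symmetric group $S_3$ and builds the commutator subgroup by closing the set of commutators under multiplication; the helper names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """The permutation 'p after q' (p[i] is the image of i)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """The inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def generated_subgroup(gens, identity):
    """Close a set of permutations under composition; in a finite group
    this closure is the subgroup generated by gens."""
    subgroup = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in subgroup:
                subgroup.add(h)
                frontier.append(h)
    return subgroup

def commutator_subgroup(G):
    """The subgroup generated by all commutators x^-1 y^-1 x y."""
    identity = tuple(range(len(G[0])))
    commutators = {compose(compose(inverse(x), inverse(y)), compose(x, y))
                   for x in G for y in G}
    return generated_subgroup(commutators, identity)

S3 = list(permutations(range(3)))
index = len(S3) // len(commutator_subgroup(S3))
print(index)
```

For $S_3$ the commutator subgroup has order 3, so the index is 2, matching the two degree 1 representations of $S_3$ (the trivial and the sign representation).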


Examples 

(1) The irreducible complex representations of a finite abelian group $A$ (i.e., the
homomorphisms from $A$ into $\mathbb{C}^\times$) can be explicitly described as follows: decompose $A$ into
a direct product of cyclic groups

$A \cong C_1 \times \cdots \times C_n$

where $|C_i| = |\langle x_i \rangle| = d_i$. Map each $x_i$ to a (not necessarily primitive) $d_i$-th root of
1 and extend this to all powers of $x_i$ to give a homomorphism. Since there are $d_i$
choices for the image of each $x_i$, the number of distinct homomorphisms of $A$ into
$\mathbb{C}^\times = GL_1(\mathbb{C})$ defined by this process equals $|A|$. By Corollary 11, these are all the
irreducible representations of $A$. Note that it is necessary that the field contain the
appropriate roots of 1 in order to realize these representations. An exercise below
explores the irreducible representations of cyclic groups over $\mathbb{Q}$.

(2) Let $G = S_3$. By Theorem 10 the number of irreducible complex representations of
$G$ is three (= the number of conjugacy classes of $S_3$). Since the sum of the squares
of the degrees is 6, the degrees must be 1, 1 and 2. The two degree 1 representations
are immediately evident: the trivial representation and the representation of $S_3$ into
$\{\pm 1\}$ given by mapping a permutation to its sign (i.e., $\sigma \mapsto +1$ if $\sigma$ is an even
permutation and $\sigma \mapsto -1$ if $\sigma$ is an odd permutation). The degree 2 representation can be
found by decomposing the permutation representation on 3 basis vectors (described
in Section 1) into irreducibles as follows: let $S_3$ act on the basis vectors $e_1, e_2, e_3$ of a
vector space $V$ by permuting their indices. The vector $t = e_1 + e_2 + e_3$ is a nonzero
fixed vector, so $t$ spans a 1-dimensional $G$-invariant subspace (which is a copy of the
trivial representation). By Maschke’s Theorem there is a 2-dimensional $G$-invariant
complement, $I$. Note that the permutation representation is not a sum of degree 1
representations: otherwise it could be represented by diagonal matrices and the
permutations would commute in their action, which is impossible since the representation
is faithful and $G$ is non-abelian. Thus $I$ cannot be decomposed further, so $I$ affords the
irreducible 2-dimensional representation. Indeed, $I$ is the “augmentation” submodule
described in Section 1:

$I = \{w \in V \mid w = \alpha_1 e_1 + \alpha_2 e_2 + \alpha_3 e_3 \text{ with } \alpha_1 + \alpha_2 + \alpha_3 = 0\}$.

Clearly $e_1 - e_2$ and $e_2 - e_3$ are independent vectors in $I$, hence they form a basis for this
2-dimensional space. With respect to this basis of $I$ we obtain a matrix representation
of $S_3$ and, for example, this matrix representation on two elements of $S_3$ is

$(1\,2) \mapsto \begin{pmatrix} -1 & 1 \\ 0 & 1 \end{pmatrix}$ and $(1\,2\,3) \mapsto \begin{pmatrix} 0 & -1 \\ 1 & -1 \end{pmatrix}$.

(3) We decompose the regular representation over $\mathbb{C}$ of an arbitrary finite group. Recall
that this is the representation afforded by the left $\mathbb{C}G$-module $\mathbb{C}G$ itself. By Theorem
10, $\mathbb{C}G$ is first of all a direct product of two-sided ideals:

$\mathbb{C}G \cong M_{n_1}(\mathbb{C}) \times M_{n_2}(\mathbb{C}) \times \cdots \times M_{n_r}(\mathbb{C})$.

Now by Proposition 6(4) each $M_{n_i}(\mathbb{C})$ decomposes further as a direct sum of $n_i$
isomorphic simple left ideals. These left ideals give a complete set of isomorphism
classes of irreducible $\mathbb{C}G$-modules. Thus the regular representation (over $\mathbb{C}$) of $G$
decomposes as the direct sum of all irreducible representations of $G$, each appearing
with multiplicity equal to the degree of that irreducible representation.
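
The degree 2 representation of $S_3$ in Example (2) can be recomputed directly from the action on the basis $e_1 - e_2$, $e_2 - e_3$ of the augmentation submodule. The sketch below assumes the convention that column $j$ of a matrix holds the coordinates of the image of the $j$-th basis vector, and uses 0-based indices in the code; the function `matrix_on_I` is introduced only for this illustration.

```python
from itertools import permutations

def compose(p, q):
    """The permutation 'p after q' (p[i] is the image of i)."""
    return tuple(p[q[i]] for i in range(len(q)))

def matrix_on_I(p):
    """Matrix of p acting on I in the basis v1 = e_1 - e_2, v2 = e_2 - e_3
    (indices 0, 1, 2 in the code), returned as a tuple of rows."""
    columns = []
    for plus, minus in ((0, 1), (1, 2)):
        w = [0, 0, 0]
        w[p[plus]] += 1    # e_plus is sent to e_{p[plus]}
        w[p[minus]] -= 1   # e_minus is sent to e_{p[minus]}
        # A vector w in I equals c1*v1 + c2*v2 = (c1, c2 - c1, -c2).
        columns.append((w[0], -w[2]))
    (a, c), (b, d) = columns
    return ((a, b), (c, d))

def matmul(X, Y):
    """Product of two 2 x 2 integer matrices given as tuples of rows."""
    return tuple(tuple(sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

S3 = list(permutations(range(3)))
print(matrix_on_I((1, 0, 2)), matrix_on_I((1, 2, 0)))
```

Here `(1, 0, 2)` stands for the transposition $(1\,2)$ and `(1, 2, 0)` for the 3-cycle $(1\,2\,3)$. Under the stated column convention the map $p \mapsto$ `matrix_on_I(p)` is multiplicative and takes six distinct values, as expected of a faithful representation.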

We record one additional property of $\mathbb{C}G$ which we shall prove in Section 19.2.

Theorem 12. The degree of each complex irreducible representation of a finite group
$G$ divides the order of $G$, i.e., in the notation of Theorem 10, each $n_i$ divides $|G|$ for
$i = 1, 2, \ldots, r$.

In the next section we shall describe the primitive central idempotents of $\mathbb{C}G$ in
terms of the group elements.


EXERCISES 

Let $G$ be a finite group and let $R$ be a ring with 1.

1. Prove that conditions (1) and (2) of Wedderburn’s Theorem are equivalent.

2. Prove that (3) implies (2) in Wedderburn’s Theorem. [Let $Q$ be a submodule of an
$R$-module $N$. Use Zorn’s Lemma to show there is a submodule $M$ maximal with respect to
$Q \cap M = 0$. If $Q + M = N$, then (2) holds; otherwise let $M_1$ be the complete preimage
in $N$ of some simple module in $N/M$ not contained in $(Q + M)/M$, and argue that $M_1$
contradicts the maximality of $M$.]

3. Prove that (4) implies (3) in Wedderburn’s Theorem. [Let $N$ be a nonzero $R$-module. First
show $N$ contains simple submodules by considering a cyclic submodule. Then use Zorn’s
Lemma applied to the set of direct sums of simple submodules (appropriately ordered) to
show that $N$ contains a maximal completely reducible submodule $M$. If $M \ne N$ let $M_1$
be the complete preimage in $N$ of a simple module in $N/M$ and contradict the maximality
of $M$.]

4. Prove that (5) implies (4) in Wedderburn’s Theorem. [Use the methods in the proofs of
Propositions 6 and 8 to decompose each $R_j$ as a left $R$-module.]

The next six exercises establish some general results about rings and modules that imply the
remaining implication of Wedderburn’s Theorem: (2) implies (5). In these exercises assume
$R$ satisfies (2): every $R$-module is injective.

5. Show that $R$ has the descending chain condition (D.C.C.) on left ideals. Deduce that $R$ is
a finite direct sum of left ideals. [If not, then show that as a left $R$-module $R$ is a direct
sum of an infinite number of nonzero submodules. Derive a contradiction by writing the
element 1 in this direct sum.]

6. Show that $R = R_1 \times R_2 \times \cdots \times R_r$ where $R_j$ is a 2-sided ideal and a simple ring (i.e.,
has no proper, nonzero 2-sided ideals). Show each $R_j$ has an identity and satisfies D.C.C.
on left ideals. [Use the preceding exercise to show $R$ has a minimal 2-sided ideal $R_1$. As
a left $R$-module $R = R_1 \oplus R'$ for some left ideal $R'$. Show $R'$ is a right ideal and proceed
inductively using D.C.C.]

7. Let $S$ be a simple ring with 1 satisfying D.C.C. on left ideals and let $L$ be a minimal
left ideal in $S$. Show that $S \cong L^n$ as left $S$-modules, where $L^n = L \oplus \cdots \oplus L$ with $n$
factors. [Argue by simplicity that $LS = S$ so $1 = l_1 s_1 + \cdots + l_n s_n$ for some $l_i \in L$
and $s_i \in S$ with $n$ minimal. Show that the map $(x_1, \ldots, x_n) \mapsto x_1 s_1 + \cdots + x_n s_n$ is a
surjective homomorphism of left $S$-modules; use the minimality of $L$ and $n$ to show it is
an injection.]

8. Let $A$ be any ring with 1, let $L$ be any left $A$-module and let $L^n$ be the direct sum of $n$
copies of $L$ with itself.

(a) Prove the ring isomorphism $\mathrm{Hom}_A(L^n, L^n) \cong M_n(D)$, where $D = \mathrm{Hom}_A(L, L)$
(multiplication in the ring $\mathrm{Hom}_A(X, X)$ is function composition, cf. Proposition 2(4)
in Section 10.2).

(b) Deduce that if $L$ is a simple $A$-module, then $\mathrm{Hom}_A(L^n, L^n)$ is isomorphic to a matrix
ring over a division ring. [Use Schur’s Lemma and (a).]

(c) Prove the ring isomorphism $\mathrm{Hom}_A(A, A) \cong A^{opp}$, where $A^{opp}$ is the opposite ring to
$A$ (the elements and addition are the same as in $A$ but the value of the product $x \cdot y$
in $A^{opp}$ is $yx$, computed in $A$), cf. the end of Section 17.4. [Any homomorphism is
determined by its value on 1.]

9. Prove that if $S$ is a simple ring with 1 satisfying D.C.C. on left ideals then $S \cong M_n(\Delta)$ for
some division ring $\Delta$. (This result together with Exercise 6 completes the existence part of
the proof that (2) implies (5) in Wedderburn’s Theorem.) [Use Exercises 7 and 8 to show
$S^{opp} \cong \mathrm{Hom}_S(L^n, L^n) \cong M_n(D)$ for some division ring $D$. Then show $S \cong M_n(\Delta)$,
where $\Delta$ is the division ring $D^{opp}$.]

10. Prove that $\Delta$ and $n$ in the isomorphism $S \cong M_n(\Delta)$ of the previous exercise are uniquely
determined by $S$ (proving the uniqueness statement in Wedderburn’s Theorem), as follows.
Suppose $S = M_n(\Delta) \cong M_{n'}(\Delta')$ as rings, where $\Delta$ and $\Delta'$ are division rings.

(a) Prove that $\Delta^{opp} \cong \mathrm{Hom}_S(L, L)$ where $L$ is a minimal left ideal in $S$. Deduce that
$\Delta \cong \Delta'$. [Use Proposition 6(4).]

(b) Prove that a finitely generated (left) module over a division ring $\Delta$ has a “basis” (a
linearly independent generating set), and that any two bases have the same cardinality.
Deduce that $n = n'$. [Mimic the proof of Corollary 4(2) of Section 11.1.]

11. Prove that if $R$ is a ring with 1 such that every $R$-module is free then $R$ is a division ring.

12. Let $F$ be a field, let $f(x) \in F[x]$ and let $R = F[x]/(f(x))$. Find necessary and sufficient
conditions on the factorization of $f(x)$ in $F[x]$ so that $R$ is a semisimple ring. When $R$ is
semisimple, describe its Wedderburn decomposition. [See Proposition 16 in Section 9.5.]

13. Let $G$ be the cyclic group of order $n$ and let $R = \mathbb{Q}G$. Describe the Wedderburn
decomposition of $R$ and find the number and the degrees of the irreducible representations of
$G$ over $\mathbb{Q}$. In particular, show that if $n = p$ is a prime then $G$ has exactly one nontrivial
irreducible representation over $\mathbb{Q}$ and this representation has degree $p - 1$. [Recall from
the first example in Section 1 that $\mathbb{Q}G \cong \mathbb{Q}[x]/(x^n - 1)$. Use Proposition 16 in Section
9.5 and results from Section 13.6.]

14. Let $p$ be a prime and let $F = \mathbb{F}_p$ be the field of order $p$. Let $G$ be the cyclic group of order
3 and let $R = FG$. For each of $p = 2$ and $p = 7$ describe the Wedderburn decomposition
of $R$ and find the number and the degrees of the irreducible representations of $G$ over $F$.

15 . Prove that if P is a p-group for some prime p, then P has a faithful irreducible complex 
representation if and only if Z(P) is cyclic. [Use Exercise 18 in Section 1, Theorem 6.1 (2) 
and Example 3.] 

16. Prove that if $V$ is an irreducible $FG$-module and $F$ is an algebraically closed field then
$\mathrm{Hom}_{FG}(V, V)$ is isomorphic to $F$ (as a ring).

17. Let $F$ be a field, let $R = M_n(F)$ and let $M$ be the unique irreducible $R$-module. Prove
that $\mathrm{Hom}_R(M, M)$ is isomorphic to $F$ (as a ring).

18. Find all 2-sided ideals of $M_n(\mathbb{Z})$.

18.3 CHARACTER THEORY AND THE ORTHOGONALITY RELATIONS 

In general, for groups of large order the representations are difficult to compute and 
unwieldy if not impossible to write down. For example, a matrix representation of 
degree 100 involves matrices with 10,000 entries, and a number of 100 × 100 matrices
may be required to describe the representation, even on a set of generators for the
group. There are, however, some striking examples where large degree representations
have been computed and used effectively. One instance of this is a construction of
the simple group $J_1$ by Z. Janko in 1965 (the existence problem for simple groups
was discussed at the end of Section 6.2). Janko was investigating certain properties
of simple groups and he found that if any simple group possessed these properties,
then it would necessarily have order 175,560 and would be generated by two elements.
Furthermore, he proved that a hypothetical simple group with these properties must
have a 7-dimensional representation over the field $\mathbb{F}_{11}$ with two generators mapping to
the two matrices


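\[
\begin{pmatrix}
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0
\end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix}
-3 & 2 & -1 & -1 & -3 & -1 & -3 \\
-2 & 1 & 1 & 3 & 1 & 3 & 3 \\
-1 & -1 & -3 & -1 & -3 & -3 & 2 \\
-1 & -3 & -1 & -3 & -3 & 2 & -1 \\
-3 & -1 & -3 & -3 & 2 & -1 & -1 \\
1 & 3 & 3 & -2 & 1 & 1 & 3 \\
3 & 3 & -2 & 1 & 1 & 3 & 1
\end{pmatrix}
\]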


(note that for any simple group $S$, every representation of $S$ into $GL_n(F)$ which does
not map all group elements to the identity matrix is a faithful representation, so $S$ is
isomorphic to its image in $GL_n(F)$). In particular, Janko's calculations showed that
the simple group satisfying his properties was unique, if it existed. M. Ward was able
to show that these two matrices do generate a subgroup of $GL_7(\mathbb{F}_{11})$ of order 175,560
and it follows that there does exist a simple group satisfying Janko's properties.
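As a small illustration of how such explicit generators can be handled by machine, the following sketch reduces the two 7 × 7 matrices modulo 11 and searches for their multiplicative orders by repeated multiplication. The helper names `mat_mul` and `mult_order` and the search bound are assumptions made for this example; the text above does not state the orders of the generators, so the script only reports what it finds and comes nowhere near verifying the order of the group they generate.

```python
P = 11  # entries are read modulo 11, i.e. in the field F_11

def mat_mul(a, b, p=P):
    """Product of two square matrices with entries reduced mod p."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) % p for j in range(n)]
            for i in range(n)]

def mult_order(m, p=P, bound=1000):
    """Smallest k >= 1 with m^k = I (mod p), or None if none is found up to bound."""
    n = len(m)
    identity = [[int(i == j) for j in range(n)] for i in range(n)]
    power = [[entry % p for entry in row] for row in m]
    for k in range(1, bound + 1):
        if power == identity:
            return k
        power = mat_mul(power, m, p)
    return None

# First generator: the cyclic permutation matrix displayed above.
Y = [[int(j == (i + 1) % 7) for j in range(7)] for i in range(7)]

# Second generator, transcribed from the display above.
Z = [
    [-3,  2, -1, -1, -3, -1, -3],
    [-2,  1,  1,  3,  1,  3,  3],
    [-1, -1, -3, -1, -3, -3,  2],
    [-1, -3, -1, -3, -3,  2, -1],
    [-3, -1, -3, -3,  2, -1, -1],
    [ 1,  3,  3, -2,  1,  1,  3],
    [ 3,  3, -2,  1,  1,  3,  1],
]

if __name__ == "__main__":
    print("order of Y mod 11:", mult_order(Y))
    print("order of Z mod 11:", mult_order(Z))
```

The first matrix is a permutation matrix of a 7-cycle, so its order is 7 over any field; the order of the second depends on the entries being transcribed correctly.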

In a similar vein, S. Norton, R. Parker and J. Thackray constructed the simple group
$J_4$ of order 86,775,571,046,077,562,880 using a 112-dimensional representation over
$\mathbb{F}_2$. This group was shown to be generated by two elements, and explicit matrices in
$GL_{112}(\mathbb{F}_2)$ for these two generators were computed in the course of their analysis.

In 1981, R. Griess constructed the largest of the sporadic groups, the so called
Monster, of order

\[ 2^{46} \cdot 3^{20} \cdot 5^{9} \cdot 7^{6} \cdot 11^{2} \cdot 13^{3} \cdot 17 \cdot 19 \cdot 23 \cdot 29 \cdot 31 \cdot 41 \cdot 47 \cdot 59 \cdot 71. \]

His proof involves calculations of automorphisms of an algebra over C of dimension 
196,884 and leads to a construction of the Monster by means of a representation of this 
degree. 

By analogy, in general it is difficult to write out the explicit permutations associated
to a permutation representation $\pi : G \to S_n$ for large degrees $n$. There are, however,
numerical invariants such as the signs and the cycle types of the permutations $\pi(g)$ and
these numerical invariants might be easier to compute than the permutations themselves
(i.e., it may be possible to determine the cycle types of elements without actually having
to write out the permutations themselves, as in the computation of Galois groups over
$\mathbb{Q}$ in Section 14.8). These invariants alone may provide enough information in a given
situation to carry out some analysis, such as prove that a given group is not simple (as
illustrated in Section 6.2). Furthermore, the invariants just mentioned do not depend on
the labelling of the set $\{1, 2, \ldots, n\}$ (i.e., they are independent of a "change of basis"
in $S_n$) and they are the same for elements that are conjugate in $G$.

In this section we show how to attach numerical invariants to linear representations.
These invariants depend only on the equivalence class (isomorphism type) of the
representation. In other words, for each representation $\varphi : G \to GL_n(F)$ we shall attach
an element of $F$ to each matrix $\varphi(g)$ and we shall see that this number can, in many
instances, be computed without knowing the matrix $\varphi(g)$. Moreover, we shall see that
these invariants are independent of the similarity class of $\varphi$ (i.e., are the same for a fixed
$g \in G$ if the representation $\varphi$ is replaced by an equivalent representation) and that they,
in some sense, characterize the similarity classes of representations of $G$.

Throughout this section $G$ is a finite group and, for the moment, $F$ is an arbitrary
field. All representations considered are assumed to be finite dimensional.

Definition.

(1) A class function is any function from $G$ into $F$ which is constant on the
conjugacy classes of $G$, i.e., $f : G \to F$ such that $f(g^{-1}xg) = f(x)$ for all
$g, x \in G$.

(2) If $\varphi$ is a representation of $G$ afforded by the $FG$-module $V$, the character of $\varphi$
is the function

$\chi : G \to F$ defined by $\chi(g) = \mathrm{tr}\,\varphi(g)$,

where $\mathrm{tr}\,\varphi(g)$ is the trace of the matrix of $\varphi(g)$ with respect to some basis of
$V$ (i.e., the sum of the diagonal entries of that matrix). The character is called
irreducible or reducible according to whether the representation is irreducible
or reducible, respectively. The degree of a character is the degree of any
representation affording it.

In the notation of the second part of this definition we shall also refer to $\chi$ as the
character afforded by the $FG$-module $V$. In general, a character is not a homomorphism
from a group into either the additive or multiplicative group of the field.

Examples 

(1) The character of the trivial representation is the function $\chi(g) = 1$ for all $g \in G$. This
character is called the principal character of $G$.

(2) For degree 1 representations, the character and the representation are usually identified 
(by identifying a 1 x 1 matrix with its entry). Thus for abelian groups, irreducible 
complex representations and their characters are the same (cf. Corollary 11). 

(3) Let $\pi : G \to S_n$ be a permutation representation and let $\varphi$ be the resulting linear
representation on the basis $e_1, \ldots, e_n$ of the vector space $V$:

\[ \varphi(g)(e_i) = e_{\pi(g)(i)} \]

(cf. Example 4 of Section 1). With respect to this basis the matrix of $\varphi(g)$ has a 1
in the diagonal entry $i, i$ if $\pi(g)$ fixes $i$; otherwise, the matrix of $\varphi(g)$ has a zero in
position $i, i$. Thus if $\pi$ is the character of $\varphi$ then

\[ \pi(g) = \text{the number of fixed points of } g \text{ on } \{1, 2, \ldots, n\}. \]

In particular, if $\pi$ is the permutation representation obtained from left multiplication
on the set of left cosets of some subgroup $H$ of $G$ then the resulting character is called
the permutation character of $G$ on $H$.

(4) The special case of Example 3 when $\pi$ is the regular permutation representation of
$G$ is worth recording: if $\varphi$ is the regular representation of $G$ (afforded by the module
$FG$) and $\rho$ is its character:

\[ \rho(g) = \begin{cases} 0 & \text{if } g \neq 1 \\ |G| & \text{if } g = 1. \end{cases} \]

The character of the regular representation of $G$ is called the regular character of $G$.
Note that this provides specific examples where a character takes on the value 0 and
is not a group homomorphism from $G$ into either $F$ or $F^\times$.

(5) Let $\varphi : D_{2n} \to GL_2(\mathbb{R})$ be the explicit matrix representation described in Example 6
in the second set of examples of Section 1. If $\chi$ is the character of $\varphi$ then, by taking
traces of the given 2 × 2 matrices one sees that $\chi(r) = 2\cos(2\pi/n)$ and $\chi(s) = 0$.
Since $\varphi$ takes the identity of $D_{2n}$ to the 2 × 2 identity matrix, $\chi(1) = 2$.

(6) Let $\varphi : Q_8 \to GL_2(\mathbb{C})$ be the explicit matrix representation described in Example 7
in the second set of examples of Section 1. If $\chi$ is the character of $\varphi$ then, by taking
traces of the given 2 × 2 matrices, $\chi(i) = 0$ and $\chi(j) = 0$. Since the element $-1 \in Q_8$
maps to minus the 2 × 2 identity matrix, $\chi(-1) = -2$. Since $\varphi$ takes the identity of
$Q_8$ to the 2 × 2 identity matrix, $\chi(1) = 2$.

(7) Let $\varphi : Q_8 \to GL_4(\mathbb{R})$ be the matrix representation described in Example 8 in the
second set of examples of Section 1. If $\chi$ is the character of $\varphi$ then, by inspection of
the matrices exhibited, $\chi(i) = \chi(j) = 0$. Since $\varphi$ takes the identity of $Q_8$ to the 4 × 4
identity matrix, $\chi(1) = 4$.
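Examples 3 and 4 can be checked by brute force for a small group. The sketch below takes $G = S_3$ acting on $\{0, 1, 2\}$ (an indexing chosen for convenience, not taken from the text), builds the permutation matrices, and compares their traces with the number of fixed points; it then computes the regular character by counting the fixed points of left multiplication on the group itself. All function names are illustrative.

```python
from itertools import permutations

def perm_matrix(perm):
    """Matrix of the linear map e_i -> e_{perm[i]} in the basis e_0, ..., e_{n-1}."""
    n = len(perm)
    return [[int(perm[col] == row) for col in range(n)] for row in range(n)]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

def fixed_points(perm):
    return sum(1 for i, image in enumerate(perm) if i == image)

def compose(p, q):
    """The permutation p∘q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

group = list(permutations(range(3)))   # the six elements of S_3
identity = (0, 1, 2)

# Example 3: the permutation character is the trace of the permutation matrix.
perm_character = {g: trace(perm_matrix(g)) for g in group}

# Example 4: G permutes its own elements by left multiplication, and the
# regular character counts the h in G with g*h = h.
def regular_character(g):
    return sum(1 for h in group if compose(g, h) == h)

if __name__ == "__main__":
    for g in group:
        print(g, perm_character[g], regular_character(g))
```

Since $gh = h$ forces $g = 1$, the second column printed is $|G| = 6$ at the identity and 0 elsewhere, as in Example 4.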


For $n \times n$ matrices $A$ and $B$, direct computation shows that $\mathrm{tr}\,AB = \mathrm{tr}\,BA$. If $A$ is
invertible, this implies that

\[ \mathrm{tr}\,A^{-1}BA = \mathrm{tr}\,B. \]

Thus the character of a representation is independent of the choice of basis of the vector
space affording it, i.e.,

equivalent representations have the same character. (18.1)

Let $\varphi$ be a representation of $G$ of degree $n$ with character $\chi$. Since $\varphi(g^{-1}xg)$ is
$\varphi(g)^{-1}\varphi(x)\varphi(g)$ for all $g, x \in G$, taking traces shows that

the character of a representation is a class function. (18.2)

Since the trace of the $n \times n$ identity matrix is $n$ and $\varphi$ takes the identity of $G$ to the
identity linear transformation (or matrix),

$\chi(1)$ is the degree of $\varphi$. (18.3)
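Statements (18.1) to (18.3) are easy to test numerically on a small matrix group. The sketch below assumes the $n = 4$ case of the representation in Example 5, with $r$ sent to rotation by 90° and $s$ to the reflection interchanging the two coordinates (this particular choice of matrices is an assumption of the example), generates the eight matrices of the image of $D_8$, and checks that the trace is a class function whose value at the identity is the degree.

```python
def mat_mul(a, b):
    """Product of two 2 x 2 integer matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def trace(a):
    return a[0][0] + a[1][1]

I2 = ((1, 0), (0, 1))
r = ((0, -1), (1, 0))   # assumed image of r: rotation by 90 degrees
s = ((0, 1), (1, 0))    # assumed image of s: reflection swapping coordinates

def generate(generators):
    """Close {I} under right multiplication by the generators."""
    elements = {I2}
    frontier = [I2]
    while frontier:
        x = frontier.pop()
        for g in generators:
            y = mat_mul(x, g)
            if y not in elements:
                elements.add(y)
                frontier.append(y)
    return elements

group = generate([r, s])

def inverse(a):
    """Inverse of a inside the finite group, found by search."""
    return next(b for b in group if mat_mul(a, b) == I2)

def is_class_function():
    """Check tr(g^{-1} x g) = tr(x) for all g, x in the group."""
    return all(
        trace(mat_mul(mat_mul(inverse(g), x), g)) == trace(x)
        for g in group for x in group
    )

if __name__ == "__main__":
    print(len(group), is_class_function(), trace(I2))
```

Searching for inverses is wasteful but keeps the example free of any division, so all arithmetic stays in the integers.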

If $V$ is an $FG$-module whose corresponding representation has character $\chi$, then
each element of the group ring $FG$ acts as a linear transformation from $V$ to $V$. Thus
each $\sum_{g \in G} \alpha_g g \in FG$ has a trace when it is considered as a linear transformation from
$V$ to $V$. The trace of $g \in G$ acting on $V$ is, by definition, $\chi(g)$. Since the trace of
any linear combination of matrices is the linear combination of the traces, the trace of
$\sum_{g \in G} \alpha_g g$ acting on $V$ is $\sum_{g \in G} \alpha_g \chi(g)$. Note that this trace function on $FG$ is the
unique extension of the character $\chi$ of $G$ to an $F$-linear transformation from $FG$ to $F$.
In this way we shall consider characters of $G$ as also being defined on the group ring
$FG$.

Notice in Example 3 above that if the field $F$ has characteristic $p > 0$, the values of
the character mod $p$ might be zero even though the number of fixed points is nonzero.
In order to circumvent such anomalies and to use the consequences of Wedderburn's
Theorem obtained when $F$ is algebraically closed we again specialize the field to be the
complex numbers (or any algebraically closed field of characteristic 0). By the results
of the previous section

\[ \mathbb{C}G \cong M_{n_1}(\mathbb{C}) \times M_{n_2}(\mathbb{C}) \times \cdots \times M_{n_r}(\mathbb{C}). \tag{18.4} \]

For the remainder of this section fix the following notation:

$M_1, M_2, \ldots, M_r$ are the inequivalent irreducible $\mathbb{C}G$-modules,

$\chi_i$ is the character afforded by $M_i$, $1 \le i \le r$.

Thus $r$ is the number of conjugacy classes of $G$ and we may relabel $M_1, \ldots, M_r$ if
necessary so that the degree of $\chi_i$ is $n_i$ for all $i$ (which is also the dimension of $M_i$ over
$\mathbb{C}$).

Now every (finite dimensional) $\mathbb{C}G$-module $M$ is isomorphic (equivalent) to a direct
sum of irreducible modules:

\[ M \cong a_1 M_1 \oplus a_2 M_2 \oplus \cdots \oplus a_r M_r, \tag{18.6} \]

where $a_i$ is a nonnegative integer indicating the multiplicity of the irreducible module
$M_i$ in this direct sum decomposition, i.e.,

\[ a_i M_i = \underbrace{M_i \oplus \cdots \oplus M_i}_{a_i \text{ times}}. \]


Note that if the representation $\varphi$ is afforded by the module $M$ and $M = M_1 \oplus M_2$, then
we may choose a basis of $M$ consisting of a basis of $M_1$ together with a basis of $M_2$.
The matrix representation with respect to this basis is of the form

\[ \varphi(g) = \begin{pmatrix} \varphi_1(g) & 0 \\ 0 & \varphi_2(g) \end{pmatrix} \]

where $\varphi_i$ is the representation afforded by $M_i$, $i = 1, 2$. One sees immediately that if
$\psi$ is the character of $\varphi$ and $\psi_i$ is the character of $\varphi_i$, then $\psi(g) = \psi_1(g) + \psi_2(g)$, i.e.,
$\psi = \psi_1 + \psi_2$. By induction we obtain:


the character of a representation is the sum of the characters
of the constituents appearing in a direct sum decomposition. (18.7)

If $\psi$ is the character afforded by the module $M$ in (6) above, this gives

\[ \psi = a_1\chi_1 + a_2\chi_2 + \cdots + a_r\chi_r. \tag{18.8} \]

Thus every (complex) character is a nonnegative integral sum of irreducible (complex) 
characters. Conversely, by taking direct sums of modules one sees that every such sum 
of characters is the character of some complex representation of G. 

We next prove that the correspondence between characters and equivalence classes
of complex representations is bijective. Let $z_1, z_2, \ldots, z_r$ be the primitive central
idempotents of $\mathbb{C}G$ described in the preceding section. Since these are orthogonal (or
equivalently, since they are the $r$-tuples in the decomposition of $\mathbb{C}G$ into a direct product of $r$
subrings which have a 1 in one position and zeros elsewhere), $z_1, \ldots, z_r$ are $\mathbb{C}$-linearly
independent elements of $\mathbb{C}G$. As above, each irreducible character $\chi_i$ is a function on
$\mathbb{C}G$. By Proposition 8(3) we have

(a) if $j \neq i$ then $z_j M_i = 0$, i.e., $z_j$ acts as the zero matrix on $M_i$, hence $\chi_i(z_j) = 0$,
and

(b) $z_i$ acts as the identity on $M_i$, hence $\chi_i(z_i) = n_i$.

Thus $\chi_1, \ldots, \chi_r$ are multiples of the dual basis to the independent set $z_1, \ldots, z_r$, hence
are linearly independent functions. Now if the $\mathbb{C}G$-module $M$ described in (6) above
can be decomposed in a different fashion into irreducibles, say,

\[ M \cong b_1 M_1 \oplus b_2 M_2 \oplus \cdots \oplus b_r M_r, \]

then we would obtain a relation

\[ a_1\chi_1 + a_2\chi_2 + \cdots + a_r\chi_r = b_1\chi_1 + b_2\chi_2 + \cdots + b_r\chi_r. \]

By linear independence of the irreducible characters, $b_i = a_i$ for all $i \in \{1, \ldots, r\}$.
Thus, in any decomposition of $M$ into a direct sum of irreducibles, the multiplicity of
the irreducible $M_i$ is the same, $1 \le i \le r$. In particular,

two representations are equivalent if and only if they have the same character. (18.9)

This uniqueness can be seen in an alternate way. First, use Proposition 8(2) to
decompose an arbitrary finite dimensional $\mathbb{C}G$-module $M$ uniquely as

\[ M = z_1 M \oplus z_2 M \oplus \cdots \oplus z_r M. \]

By part (4) of the same proposition, $z_i M$ is a direct sum of simple modules, each of
which is isomorphic to $M_i$. The multiplicity of $M_i$ in a direct sum decomposition of $z_i M$
is, by counting dimensions, equal to $\dfrac{\dim z_i M}{\dim M_i}$. This proves that the multiplicity of $M_i$
in any direct sum decomposition of $M$ into simple submodules is uniquely determined.

Note that, as with decompositions of $F[x]$-modules into cyclic submodules, a
$\mathbb{C}G$-module may have many direct sum decompositions into irreducibles; only
the multiplicities are unique (see also the exercises). More precisely, comparing
with the Jordan canonical form of a single linear transformation, the direct summand
$a_i M_i = M_i \oplus \cdots \oplus M_i$ ($a_i$ times) which equals the submodule $z_i M$ is the analogue of
the generalized eigenspace corresponding to a single eigenvalue. This submodule of
$M$ is unique (as is a generalized eigenspace) and is called the $\chi_i^{\text{th}}$ isotypic component
of $M$. Within the $\chi_i^{\text{th}}$ isotypic component, the summands $M_i$ are analogous to the
1-dimensional eigenspaces and, just as with the eigenspace of an endomorphism, there is
no unique basis for the eigenspace. If $G = \langle g \rangle$ is a finite cyclic group, the isotypic
components of $G$ are the same as the generalized eigenspaces of $g$.

Observe that the vector space of all (complex valued) class functions on $G$ has a
basis consisting of the functions which are 1 on a given class and zero on all other
classes. There are $r$ of these, where $r$ is the number of conjugacy classes of $G$, so the
dimension of the complex vector space of class functions is $r$. Since the number of
(complex) irreducible characters of $G$ equals the number of conjugacy classes and these
are linearly independent class functions, we see that

the irreducible characters are a basis for the space of all complex class functions. (18.10)

The next step in the theory of characters is to put an Hermitian inner product
structure on the space of class functions and prove that the irreducible characters form
an orthonormal basis with respect to this inner product. For class functions $\theta$ and $\psi$
define

\[ (\theta, \psi) = \frac{1}{|G|} \sum_{g \in G} \theta(g)\overline{\psi(g)} \]

(where the bar denotes complex conjugation). One easily checks that $( \ , \ )$ is Hermitian:
for $\alpha, \beta \in \mathbb{C}$

(a) $(\alpha\theta_1 + \beta\theta_2, \psi) = \alpha(\theta_1, \psi) + \beta(\theta_2, \psi)$,

(b) $(\theta, \alpha\psi_1 + \beta\psi_2) = \overline{\alpha}(\theta, \psi_1) + \overline{\beta}(\theta, \psi_2)$, and

(c) $(\theta, \psi) = \overline{(\psi, \theta)}$.

Our principal aim is to show that the irreducible characters form an orthonormal
basis for the space of complex class functions with respect to this Hermitian form (we
already know that they are a basis). This fact will follow from the orthogonality of
the primitive central idempotents, once we have explicitly determined these in the next
proposition.

Proposition 13. Let $z_1, \ldots, z_r$ be the orthogonal primitive central idempotents in $\mathbb{C}G$
labelled in such a way that $z_i$ acts as the identity on the irreducible $\mathbb{C}G$-module $M_i$, and
let $\chi_i$ be the character afforded by $M_i$. Then

\[ z_i = \frac{\chi_i(1)}{|G|} \sum_{g \in G} \chi_i(g^{-1})\, g. \]

Proof: Let $z = z_i$ and write

\[ z = \sum_{g \in G} \alpha_g g. \]

Recall from Example 4 in this section that if $\rho$ is the regular character of $G$ then

\[ \rho(g) = \begin{cases} 0 & \text{if } g \neq 1 \\ |G| & \text{if } g = 1 \end{cases} \tag{18.11} \]

and recall from the last example in Section 2 that

\[ \rho = \sum_{j=1}^{r} \chi_j(1)\chi_j. \tag{18.12} \]

To find the coefficient $\alpha_g$, apply $\rho$ to $zg^{-1}$ and use linearity of $\rho$ together with equation
(11) to obtain

\[ \rho(zg^{-1}) = \alpha_g |G|. \]

Computing $\rho(zg^{-1})$ using (12) then gives

\[ \sum_{j=1}^{r} \chi_j(1)\chi_j(zg^{-1}) = \alpha_g |G|. \tag{18.13} \]

Let $\varphi_j$ be the irreducible representation afforded by $M_j$, $1 \le j \le r$. Since we may
consider $\varphi_j$ as an algebra homomorphism from $\mathbb{C}G$ into $\mathrm{End}(M_j)$, we obtain $\varphi_j(zg^{-1}) =
\varphi_j(z)\varphi_j(g^{-1})$. Also, we have already observed that $\varphi_j(z)$ is 0 if $j \neq i$ and $\varphi_i(z)$ is the
identity endomorphism on $M_i$. Thus

\[ \varphi_j(zg^{-1}) = \begin{cases} 0 & \text{if } j \neq i \\ \varphi_i(g^{-1}) & \text{if } j = i. \end{cases} \]

This proves $\chi_j(zg^{-1}) = \chi_i(g^{-1})\delta_{ij}$, where $\delta_{ij}$ is zero if $i \neq j$ and is 1 if $i = j$ (called
the Kronecker delta). Substituting this into equation (13) gives $\alpha_g = \dfrac{1}{|G|}\chi_i(1)\chi_i(g^{-1})$.
This is the coefficient of $g$ in the statement of the proposition, completing the proof.
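Proposition 13 can be tested directly in a small group algebra. The sketch below works in $\mathbb{Q}S_3$ with exact rational arithmetic, using the three irreducible characters of $S_3$ (the principal character, the sign character, and the degree 2 character "fixed points minus 1"; the sign character is standard but is supplied here as an assumption, since the text does not list it). Elements of the group algebra are stored as dictionaries from group elements to coefficients; all names are illustrative.

```python
from fractions import Fraction
from itertools import permutations

group = list(permutations(range(3)))   # S_3 acting on {0, 1, 2}
identity = (0, 1, 2)

def compose(p, q):
    """The permutation p∘q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(3))

def inverse(p):
    inv = [0] * 3
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def fixed_points(p):
    return sum(1 for i, image in enumerate(p) if i == image)

def sign(p):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if p[i] > p[j])
    return -1 if inversions % 2 else 1

# The irreducible characters of S_3 (assumed known for this illustration).
characters = {
    "principal": lambda g: 1,
    "sign": sign,
    "degree2": lambda g: fixed_points(g) - 1,
}

def idempotent(chi):
    """z = (chi(1)/|G|) * sum over g of chi(g^{-1}) g, as in Proposition 13."""
    return {g: Fraction(chi(identity) * chi(inverse(g)), len(group)) for g in group}

def multiply(a, b):
    """Product in the group algebra: (sum a_g g)(sum b_h h) = sum a_g b_h (gh)."""
    out = {g: Fraction(0) for g in group}
    for g, a_g in a.items():
        for h, b_h in b.items():
            out[compose(g, h)] += a_g * b_h
    return out

idempotents = {name: idempotent(chi) for name, chi in characters.items()}
zero = {g: Fraction(0) for g in group}
one = {g: Fraction(int(g == identity)) for g in group}

if __name__ == "__main__":
    for name, z in idempotents.items():
        print(name, multiply(z, z) == z)
```

The three elements are pairwise orthogonal idempotents whose sum is the identity of the group algebra, which is exactly what the decomposition (18.4) predicts.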


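The orthonormality of the irreducible characters will follow directly from the
orthogonality of the central primitive idempotents via the following calculation:

\[
z_i\delta_{ij} = z_i z_j
= \frac{\chi_i(1)}{|G|}\frac{\chi_j(1)}{|G|} \sum_{g,h \in G} \chi_i(g^{-1})\chi_j(h^{-1})\, gh
= \frac{\chi_i(1)}{|G|}\frac{\chi_j(1)}{|G|} \sum_{y \in G} \Big[ \sum_{x \in G} \chi_i(xy^{-1})\chi_j(x^{-1}) \Big] y
\]

(to get the latter sum from the former substitute $y$ for $gh$ and $x$ for $h$). Since the
elements of $G$ are a basis of $\mathbb{C}G$ we may equate coefficients with those of $z_i$ found in
Proposition 13 to get (the coefficient of $g$)

\[ \delta_{ij}\frac{\chi_i(1)}{|G|}\chi_i(g^{-1}) = \frac{\chi_i(1)\chi_j(1)}{|G|^2} \sum_{x \in G} \chi_i(xg^{-1})\chi_j(x^{-1}). \]

Simplifying (and replacing $g$ by $g^{-1}$) gives

\[ \delta_{ij}\frac{\chi_i(g)}{\chi_j(1)} = \frac{1}{|G|} \sum_{x \in G} \chi_i(xg)\chi_j(x^{-1}) \quad \text{for all } g \in G. \tag{18.14} \]

Taking $g = 1$ in (14) gives

\[ \delta_{ij} = \frac{1}{|G|} \sum_{x \in G} \chi_i(x)\chi_j(x^{-1}). \tag{18.15} \]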

The sum on the right side would be precisely the inner product $(\chi_i, \chi_j)$ if $\chi_j(x^{-1})$ were
equal to $\overline{\chi_j(x)}$; this is the content of the next proposition.

Proposition 14. If $\psi$ is any character of $G$ then $\psi(x)$ is a sum of roots of 1 in $\mathbb{C}$ and
$\psi(x^{-1}) = \overline{\psi(x)}$ for all $x \in G$.


Proof: Let $\varphi$ be a representation whose character is $\psi$, fix an element $x \in G$ and
let $|x| = k$. Since the minimal polynomial of $\varphi(x)$ divides $X^k - 1$ (hence has distinct
roots), there is a basis of the underlying vector space such that the matrix of $\varphi(x)$ with
respect to this basis is a diagonal matrix with $k$-th roots of 1 on the diagonal. Since $\psi(x)$
is the sum of the diagonal entries (and does not depend on the choice of basis), $\psi(x)$
is a sum of roots of 1. Moreover, if $\epsilon$ is a root of 1, $\epsilon^{-1} = \overline{\epsilon}$. Thus the inverse of a
diagonal matrix with roots of 1 on the diagonal is the diagonal matrix with the complex
conjugates of those roots of 1 on the diagonal. Since the complex conjugate of a sum
is the sum of the complex conjugates, $\psi(x^{-1}) = \mathrm{tr}\,\varphi(x^{-1}) = \overline{\mathrm{tr}\,\varphi(x)} = \overline{\psi(x)}$.

Keep in mind that in the proof of Proposition 14 we first fixed a group element $x$
and then chose a basis of the representation space so that $\varphi(x)$ was a diagonal matrix.
It is always possible to diagonalize a single element but it is possible to simultaneously
diagonalize all $\varphi(x)$'s if and only if $\varphi$ is similar to a sum of degree 1 representations.
Combining the above proposition with equation (15) proves:

Theorem 15. (The First Orthogonality Relation for Group Characters) Let $G$ be a
finite group and let $\chi_1, \ldots, \chi_r$ be the irreducible characters of $G$ over $\mathbb{C}$. Then with
respect to the inner product $( \ , \ )$ above we have

\[ (\chi_i, \chi_j) = \delta_{ij} \]

and the irreducible characters are an orthonormal basis for the space of class functions.
In particular, if $\theta$ is any class function then

\[ \theta = \sum_{i=1}^{r} (\theta, \chi_i)\chi_i. \]

Proof: We have just established that the irreducible characters form an orthonormal
basis for the space of class functions. If $\theta$ is any class function, write $\theta = \sum_{i=1}^{r} \alpha_i\chi_i$,
for some $\alpha_i \in \mathbb{C}$. It follows from linearity of the Hermitian product that $\alpha_i = (\theta, \chi_i)$,
as stated.
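The First Orthogonality Relation gives a practical recipe: the multiplicity of $\chi_i$ in any character $\theta$ is the inner product $(\theta, \chi_i)$. The sketch below carries this out for $S_3$ by summing over all six group elements, again assuming the standard list of irreducible characters of $S_3$ (principal, sign, and "fixed points minus 1"). The function names are illustrative, and the conjugation is kept in the formula even though every value here is an integer.

```python
from fractions import Fraction
from itertools import permutations

group = list(permutations(range(3)))   # S_3 acting on {0, 1, 2}
identity = (0, 1, 2)

def fixed_points(p):
    return sum(1 for i, image in enumerate(p) if i == image)

def sign(p):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if p[i] > p[j])
    return -1 if inversions % 2 else 1

# Irreducible characters of S_3 (assumed known for this illustration).
irreducibles = {
    "principal": lambda g: 1,
    "sign": sign,
    "degree2": lambda g: fixed_points(g) - 1,
}

def inner(theta, psi):
    """(theta, psi) = (1/|G|) * sum over g of theta(g) * conj(psi(g))."""
    total = sum(theta(g) * psi(g).conjugate() for g in group)
    return Fraction(total, len(group))

def decompose(theta):
    """Coefficients (theta, chi_i) of theta in the basis of irreducible characters."""
    return {name: inner(theta, chi) for name, chi in irreducibles.items()}

def regular(g):
    return len(group) if g == identity else 0

if __name__ == "__main__":
    print(decompose(fixed_points))   # the degree 3 permutation character
    print(decompose(regular))        # the regular character
```

For the regular character each multiplicity equals the degree of the corresponding irreducible character, in agreement with equation (18.12).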

We list without proof the Second Orthogonality Relation; we shall not require it 
for the applications in this book. 

Theorem 16. (The Second Orthogonality Relation for Group Characters) Under the
notation above, for any $x, y \in G$

\[ \sum_{i=1}^{r} \chi_i(x)\overline{\chi_i(y)} = \begin{cases} |C_G(x)| & \text{if } x \text{ and } y \text{ are conjugate in } G \\ 0 & \text{otherwise.} \end{cases} \]

Definition. For $\theta$ any class function on $G$ the norm of $\theta$ is $(\theta, \theta)^{1/2}$ and will be denoted
by $\|\theta\|$.

When a class function is written in terms of the irreducible characters, $\theta = \sum \alpha_i\chi_i$,
its norm is easily calculated as $\|\theta\| = (\sum |\alpha_i|^2)^{1/2}$. It follows that

a character has norm 1 if and only if it is irreducible.

Finally, observe that computations of the inner product of characters $\theta$ and $\psi$ may be
simplified as follows. If $\mathcal{K}_1, \ldots, \mathcal{K}_r$ are the conjugacy classes of $G$ with sizes $d_1, \ldots, d_r$
and representatives $g_1, \ldots, g_r$ respectively, then the value $\theta(g_i)\overline{\psi(g_i)}$ appears $d_i$ times
in the sum for $(\theta, \psi)$, once for each element of $\mathcal{K}_i$. Collecting these terms gives

\[ (\theta, \psi) = \frac{1}{|G|} \sum_{i=1}^{r} d_i\, \theta(g_i)\overline{\psi(g_i)}, \]

a sum only over representatives of the conjugacy classes of $G$. In particular, the norm
of $\theta$ is given by

\[ \|\theta\|^2 = (\theta, \theta) = \frac{1}{|G|} \sum_{i=1}^{r} d_i\, |\theta(g_i)|^2. \]

Examples 

(1) Let $G = S_3$ and let $\pi$ be the permutation character of degree 3 described in the
examples at the beginning of this section. Recall that $\pi(\sigma)$ equals the number of
elements in $\{1, 2, 3\}$ fixed by $\sigma$. The conjugacy classes of $S_3$ are represented by 1,
(1 2) and (1 2 3) of sizes 1, 3 and 2 respectively, and $\pi(1) = 3$, $\pi((1\,2)) = 1$,
$\pi((1\,2\,3)) = 0$. Hence

\[ \|\pi\|^2 = \tfrac{1}{6}\left[ 1 \cdot \pi(1)^2 + 3 \cdot \pi((1\,2))^2 + 2 \cdot \pi((1\,2\,3))^2 \right] = \tfrac{1}{6}(9 + 3 + 0) = 2. \]

This implies that $\pi$ is a sum of two distinct irreducible characters, each appearing with
multiplicity 1 (if $\pi = \sum a_i\chi_i$ with nonnegative integers $a_i$, then $\sum a_i^2 = 2$ forces exactly
two of the $a_i$ to equal 1). Let $\chi_1$ be the principal character of $S_3$, so that $\chi_1(\sigma) = \overline{\chi_1(\sigma)} = 1$ for
all $\sigma \in S_3$. Then

\[ (\pi, \chi_1) = \tfrac{1}{6}\left[ 1 \cdot \pi(1)\overline{\chi_1(1)} + 3 \cdot \pi((1\,2))\overline{\chi_1((1\,2))} + 2 \cdot \pi((1\,2\,3))\overline{\chi_1((1\,2\,3))} \right] = \tfrac{1}{6}(3 + 3 + 0) = 1 \]

so the principal character appears as a constituent of $\pi$ with multiplicity 1. This proves
$\pi = \chi_1 + \chi_2$ for some irreducible character $\chi_2$ of $S_3$ of degree 2 (and agrees with our
earlier decomposition of this representation). This also shows that the value of $\chi_2$ on
$\sigma \in S_3$ is the number of fixed points of $\sigma$ minus 1.

(2) Let $G = S_4$ and let $\pi$ be the natural permutation character of degree 4 (so again $\pi(\sigma)$
is the number of fixed points of $\sigma$). The conjugacy classes of $S_4$ are represented by 1,
(1 2), (1 2 3), (1 2 3 4) and (1 2)(3 4) of sizes 1, 6, 8, 6 and 3 respectively. Again we
compute:

\[ \|\pi\|^2 = \tfrac{1}{24}\left[ 1 \cdot \pi(1)^2 + 6 \cdot \pi((1\,2))^2 + 8 \cdot \pi((1\,2\,3))^2 + 6 \cdot \pi((1\,2\,3\,4))^2 + 3 \cdot \pi((1\,2)(3\,4))^2 \right] = \tfrac{1}{24}(16 + 24 + 8 + 0 + 0) = 2 \]

so $\pi$ has two distinct irreducible constituents. If $\chi_1$ is the principal character of $S_4$,
then

\[ (\pi, \chi_1) = \tfrac{1}{24}\left[ 1 \cdot \pi(1) + 6 \cdot \pi((1\,2)) + 8 \cdot \pi((1\,2\,3)) + 6 \cdot \pi((1\,2\,3\,4)) + 3 \cdot \pi((1\,2)(3\,4)) \right] = \tfrac{1}{24}(4 + 12 + 8 + 0 + 0) = 1. \]

This proves that the degree 4 permutation character is the sum of the principal character
and an irreducible character of degree 3.

(3) Let $G = D_8$, where

\[ D_8 = \langle\, r, s \mid s^2 = r^4 = 1,\ rs = sr^{-1} \,\rangle. \]

The conjugacy classes of $D_8$ are represented by 1, $s$, $r$, $r^2$ and $sr$ and have sizes 1, 2,
2, 1 and 2, respectively. Let $\varphi$ be the degree 2 matrix representation of $D_8$ obtained
as in Example 6 in Section 1 from embedding a square in $\mathbb{R}^2$:

\[ \varphi(s) = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \varphi(r) = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \quad \varphi(r^2) = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}, \quad \varphi(sr) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \]

Let $\psi$ be the character of this representation (where we consider the real matrices as
a subset of the complex matrices). Again, since $\psi$ is real valued one computes

\[ \|\psi\|^2 = \tfrac{1}{8}\left[ \psi(1)^2 + 2\psi(s)^2 + 2\psi(r)^2 + \psi(r^2)^2 + 2\psi(sr)^2 \right] = \tfrac{1}{8}(4 + 0 + 0 + 4 + 0) = 1. \]

This proves the representation $\varphi$ is irreducible (even if we allow similarity
transformations by complex matrices).
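The class-by-class computations in Examples 1 and 2 can be reproduced mechanically. The following sketch finds the conjugacy classes of $S_n$ by brute force and evaluates $\|\pi\|^2 = \frac{1}{|G|}\sum_i d_i\,|\pi(g_i)|^2$ for the natural permutation character, using one representative per class. The points are labelled $0, \ldots, n-1$ rather than $1, \ldots, n$, and the function names are assumptions of the example.

```python
from fractions import Fraction
from itertools import permutations

def compose(p, q):
    """The permutation p∘q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def fixed_points(p):
    return sum(1 for i, image in enumerate(p) if i == image)

def conjugacy_classes(group):
    """Partition the group into conjugacy classes by direct conjugation."""
    remaining = set(group)
    classes = []
    while remaining:
        x = min(remaining)   # deterministic choice of representative
        cls = {compose(compose(inverse(g), x), g) for g in group}
        classes.append(sorted(cls))
        remaining -= cls
    return classes

def permutation_character_norm_squared(n):
    """Return (||pi||^2, sorted class sizes) for the natural character of S_n."""
    group = list(permutations(range(n)))
    classes = conjugacy_classes(group)
    # One representative per class, weighted by the class size d_i.
    total = sum(len(cls) * fixed_points(cls[0]) ** 2 for cls in classes)
    return Fraction(total, len(group)), sorted(len(cls) for cls in classes)

if __name__ == "__main__":
    for n in (3, 4):
        print(n, permutation_character_norm_squared(n))
```

The class sizes recovered this way match those quoted in Examples 1 and 2, and the norm squared is 2 in both cases.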


We have seen that the sum of two characters is again a character. Specifically, if
$\psi_1$ and $\psi_2$ are characters of representations $\varphi_1$ and $\varphi_2$, then $\psi_1 + \psi_2$ is the character of
$\varphi_1 + \varphi_2$.

Proposition 17. If $\psi_1$ and $\psi_2$ are characters, then so is their product $\psi_1\psi_2$.


Proof: Let $V_1$ and $V_2$ be $\mathbb{C}G$-modules affording characters $\psi_1$ and $\psi_2$ and define
$W = V_1 \otimes_{\mathbb{C}} V_2$. Since each $g \in G$ acts as a linear transformation on $V_1$ and $V_2$, the
action of $g$ on simple tensors by $g(v_1 \otimes v_2) = (gv_1) \otimes (gv_2)$ extends by linearity to a
well defined linear transformation on $W$ by Proposition 17 in Section 11.2. One easily
checks that this action also makes $W$ into a $\mathbb{C}G$-module. By Exercise 38 in Section
11.2 the character afforded by $W$ is $\psi_1\psi_2$.
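The proof can be made concrete with Kronecker products: in the basis $e_j \otimes e_l$ of $V_1 \otimes V_2$, the matrix of $g$ is the Kronecker product of its matrices on $V_1$ and $V_2$, and the trace of a Kronecker product is the product of the traces. The sketch below checks this for the permutation representation of $S_3$ tensored with itself and then decomposes the resulting character $\pi^2$ using inner products. The list of irreducible characters of $S_3$ and all function names are assumptions of the example.

```python
from fractions import Fraction
from itertools import permutations

group = list(permutations(range(3)))   # S_3 acting on {0, 1, 2}

def perm_matrix(perm):
    n = len(perm)
    return [[int(perm[col] == row) for col in range(n)] for row in range(n)]

def kron(a, b):
    """Kronecker product: rows indexed by (i, k), columns by (j, l)."""
    return [
        [a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
        for i in range(len(a)) for k in range(len(b))
    ]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

def fixed_points(p):
    return sum(1 for i, image in enumerate(p) if i == image)

def sign(p):
    inversions = sum(1 for i in range(3) for j in range(i + 1, 3) if p[i] > p[j])
    return -1 if inversions % 2 else 1

irreducibles = {
    "principal": lambda g: 1,
    "sign": sign,
    "degree2": lambda g: fixed_points(g) - 1,
}

def tensor_square_character(g):
    """Trace of g acting on V ⊗ V, where V is the permutation module."""
    m = perm_matrix(g)
    return trace(kron(m, m))

def multiplicities(theta):
    """Inner products (theta, chi_i); all values here are real integers."""
    return {
        name: Fraction(sum(theta(g) * chi(g) for g in group), len(group))
        for name, chi in irreducibles.items()
    }

if __name__ == "__main__":
    print(multiplicities(tensor_square_character))
```

The multiplicities found are 2, 1 and 3, and the degrees add up correctly: $2 \cdot 1 + 1 \cdot 1 + 3 \cdot 2 = 9 = \pi(1)^2$.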

The next chapter will contain further explicit character computations as well as
some applications of group characters to proving theorems about certain classes of
groups.

Some Remarks on Fourier Analysis and Group Characters

This brief discussion is intended to indicate some connections of the results above with 
other areas of mathematics. 

The theory of group representations described to this point is a special branch of 
an area of mathematics called Harmonic Analysis. Readers may already be familiar 
with the basic theory of Fourier series which also falls into this realm. We make some 
observations which show how representation theory for finite groups corresponds to 
“Fourier series” for some infinite groups (in particular, to Fourier series on the circle). 
To be mathematically precise one needs the Lebesgue integral to ensure completeness of 
certain (Hilbert) spaces but readers may get the flavor of things by replacing “Lebesgue” 
by “Riemann.” 

Let $G$ be the multiplicative group of points on the unit circle in $\mathbb{C}$:

\[ G = \{ z \in \mathbb{C} \mid |z| = 1 \}. \]

We shall usually view $G$ as the interval $[0, 2\pi]$ in $\mathbb{R}$ with the two end points identified,
i.e., as the additive group $\mathbb{R}/2\pi\mathbb{Z}$ (the isomorphism is: the real number $x$ corresponds
to the complex number $e^{ix}$). Note that $G$ has a translation invariant measure, namely
the Lebesgue measure, and the measure of the circle is $2\pi$. For finite groups, the
counting measure is the translation invariant measure (so the measure of a subset $H$ is
the number of elements in that subset, $|H|$) and integrals on a finite group with respect
to this counting measure are just finite sums.

The space

\[ L^2(G) = \{ f : G \to \mathbb{C} \mid f \text{ is measurable and } |f|^2 \text{ is integrable over } G \} \]

plays the role of the group algebra of the infinite group $G$. This space becomes a
commutative ring with 1 under the convolution of functions: for $f, g \in L^2(G)$ the
product $f * g : G \to \mathbb{C}$ is defined by

\[ (f * g)(x) = \frac{1}{2\pi} \int_0^{2\pi} f(x - y)g(y)\, dy \quad \text{for all } x \in G. \]

(Recall that for a finite group $H$, the group algebra is also formally the ring of $\mathbb{C}$-valued
functions on $H$ under a convolution multiplication and that these functions are written
as formal sums: the element $\sum \alpha_g g \in \mathbb{C}H$ denotes the function which sends $g$ to
$\alpha_g \in \mathbb{C}$ for all $g \in H$.)

The complete set of continuous homomorphisms of $G$ into $GL_1(\mathbb{C})$ is given by

\[ e_n(x) = e^{inx}, \quad x \in [0, 2\pi], \quad n \in \mathbb{Z}. \]

(Recall that for a finite abelian group, all irreducible representations are 1 -dimensional 
and for 1 -dimensional representations, characters and representations may be identi- 
fied.) 

The ring $L^2(G)$ admits an Hermitian inner product: for $f, g \in L^2(G)$

\[ (f, g) = \frac{1}{2\pi} \int_0^{2\pi} f(x)\overline{g(x)}\, dx. \]



Under this inner product, $\{e_n \mid n \in \mathbb{Z}\}$ is an orthonormal basis (where the term "basis" is
used in the analytic sense that these are independent and 0 is the only function orthogonal
to all of them). Moreover,

\[ L^2(G) = \widehat{\bigoplus_{n \in \mathbb{Z}}} E_n \]

where $E_n$ is the 1-dimensional subspace spanned by $e_n$, the hat over the direct sum
denotes taking the closure of the direct sum in the $L^2$-topology, and equality indicates
equality in the $L^2$ sense. (Recall that the group algebra of a finite abelian group is the
direct sum of the irreducible 1-dimensional submodules, each occurring with
multiplicity one.) These facts imply the well known result from Fourier analysis that every
square integrable function $f(x)$ on $[0, 2\pi]$ has a Fourier series

\[ \sum_{n=-\infty}^{\infty} c_n e^{inx} \]

where the Fourier coefficients, $c_n$, are given by

\[ c_n = (f, e_n) = \frac{1}{2\pi} \int_0^{2\pi} f(t)e^{-int}\, dt. \]
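The finite analogue of these formulas can be run directly. For the cyclic group $\mathbb{Z}/n\mathbb{Z}$ the irreducible characters are $e_k(x) = e^{2\pi i kx/n}$ for $0 \le k < n$, the integral becomes an average over the group, and the "Fourier series" of a function is its expansion in the orthonormal basis of characters, as in Theorem 15. The sketch below is an illustration under these assumptions; it is a discrete Fourier transform normalized by $1/n$, and the function names are not from the text.

```python
import cmath

def character(k, n):
    """Values of the character e_k(x) = exp(2*pi*i*k*x/n) on Z/nZ."""
    return [cmath.exp(2j * cmath.pi * k * x / n) for x in range(n)]

def inner(f, g):
    """(f, g) = (1/n) * sum over x of f(x) * conj(g(x)); counting measure on Z/nZ."""
    n = len(f)
    return sum(a * b.conjugate() for a, b in zip(f, g)) / n

def fourier_coefficients(f):
    """c_k = (f, e_k) for k = 0, ..., n-1."""
    n = len(f)
    return [inner(f, character(k, n)) for k in range(n)]

def reconstruct(coefficients):
    """f(x) = sum over k of c_k * e_k(x)."""
    n = len(coefficients)
    chars = [character(k, n) for k in range(n)]
    return [sum(c * chars[k][x] for k, c in enumerate(coefficients)) for x in range(n)]

if __name__ == "__main__":
    f = [3, 1, 4, 1, 5, 9]
    c = fourier_coefficients(f)
    print([round(abs(v - w), 12) for v, w in zip(reconstruct(c), f)])
```

Because the characters are orthonormal for this inner product, reconstruction recovers the original function up to floating-point rounding.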

This brief description indicates how the representation theory of finite groups
extends to certain infinite groups and the results we have proved may already be familiar in
the latter context. In fact, there is a completely analogous theory for arbitrary (not
necessarily abelian) compact Lie groups; here the irreducible (complex) representations
need not be 1-dimensional but they are all finite dimensional and $L^2(G)$ decomposes
as a direct sum of them, each appearing with multiplicity equal to its degree. The
emphasis (at least at the introductory level) in this theory is often on the importance of 
being able to represent functions as (Fourier) series and then using these series to solve 
other problems (e.g., solve differential equations). The underlying group provides the 
“symmetry” on which to build this “harmonic analysis,” rather than being itself the 
principal object of study. 


EXERCISES 

Let G be a finite group. Unless stated otherwise all representations and characters are over C. 

1. Prove that $\mathrm{tr}\,AB = \mathrm{tr}\,BA$ for $n \times n$ matrices $A$ and $B$ with entries from any commutative
ring.

2. In each of (a) to (c) let $\psi$ be the character afforded by the specified representation $\varphi$.

(a) Let $\varphi$ be the degree 2 representation of $D_{10}$ described in Example 6 in the second
set of examples in Section 1 (here $n = 5$) and show that $\|\psi\|^2 = 1$ (hence $\varphi$ is
irreducible).

(b) Let $\varphi$ be the degree 2 representation of $Q_8$ described in Example 7 in the second set
of examples in Section 1 and show that $\|\psi\|^2 = 1$ (hence $\varphi$ is irreducible).

(c) Let $\varphi$ be the degree 4 representation of $Q_8$ described in Example 8 in the second set of
examples in Section 1 and show that $\|\psi\|^2 = 4$ (hence even though $\varphi$ is irreducible
over $\mathbb{R}$, $\varphi$ decomposes over $\mathbb{C}$ as twice an irreducible representation of degree 2).

3. If $\chi$ is an irreducible character of $G$, prove that the $\chi$-isotypic subspace of a $\mathbb{C}G$-module
is unique.

4. Prove that if $N$ is any irreducible $\mathbb{C}G$-module and $M = N \oplus N$, then $M$ has infinitely
many direct sum decompositions into two copies of $N$.

5. Prove that a class function is a character if and only if it is a positive integral linear 
combination of irreducible characters. 

6. Let $\varphi : G \to GL(V)$ be a representation with character $\psi$. Let $W$ be the subspace
$\{v \in V \mid \varphi(g)(v) = v \text{ for all } g \in G\}$ of $V$ fixed pointwise by all elements of $G$. Prove that
dim $W = (\psi, \chi_1)$, where $\chi_1$ is the principal character of $G$.

7. Assume $V$ is a $\mathbb{C}G$-module on which $G$ acts by permuting the basis $B = \{e_1, \ldots, e_n\}$.
Write $B$ as a disjoint union of the orbits $B_1, \ldots, B_t$ of $G$ on $B$.

(a) Prove that $V$ decomposes as a $\mathbb{C}G$-module as $V_1 \oplus \cdots \oplus V_t$, where $V_i$ is the span of
$B_i$.

(b) Prove that if $v_i$ is the sum of the vectors in $B_i$ then the 1-dimensional subspace of $V_i$
spanned by $v_i$ is the unique $\mathbb{C}G$-submodule of $V_i$ affording the trivial representation
(in other words, any vector in $V_i$ that is fixed under the action of $G$ is a multiple of
$v_i$). [Use the fact that $G$ is transitive on $B_i$. See also Exercise 8 in Section 1.]

(c) Let $W = \{v \in V \mid \varphi(g)(v) = v \text{ for all } g \in G\}$ be the subspace of $V$ fixed pointwise
by all elements of $G$. Deduce that dim $W = t$ = the number of orbits of $G$ on $B$.

8. Prove the following result (sometimes called Burnside's Lemma although its origin is with
Frobenius): let $G$ be a subgroup of $S_n$ and for each $\sigma \in G$ let Fix($\sigma$) denote the number
of fixed points of $\sigma$ on $\{1, \ldots, n\}$. Let $t$ be the number of orbits of $G$ on $\{1, \ldots, n\}$. Then

$$t\,|G| = \sum_{g \in G} \mathrm{Fix}(g).$$

[Use the preceding two exercises.]

9. Let $G$ be a nontrivial, transitive group of permutations on the finite set $\Omega$ and let $\psi$ be the
character afforded by the linear representation over $\mathbb{C}$ obtained from $\Omega$ (cf. Example 4 in
Section 1) so $\psi(\sigma)$ is the number of fixed points of $\sigma$ on $\Omega$. Now let $G$ act on the set
$\Omega \times \Omega$ by $g \cdot (\omega_1, \omega_2) = (g \cdot \omega_1, g \cdot \omega_2)$ and let $\pi$ be the character afforded by the linear
representation obtained from this action.

(a) Prove that $\pi = \psi^2$.

(b) Prove that the number of orbits of $G$ on $\Omega \times \Omega$ is given by the inner product $(\psi, \psi)$.
[By the preceding exercises, the number of orbits on $\Omega \times \Omega$ is equal to $(\pi, \chi_1)$, where
$\chi_1$ is the principal character.]

(c) Recall that $G$ is said to be doubly transitive on $\Omega$ if it has precisely 2 orbits in its
action on $\Omega \times \Omega$ (it always has at least 2 orbits since the diagonal, $\{(\omega, \omega) \mid \omega \in \Omega\}$,
is one orbit). Prove that if $G$ is doubly transitive on $\Omega$ then $\psi = \chi_1 + \chi_2$, where $\chi_1$
is the principal character and $\chi_2$ is a nonprincipal irreducible character of $G$.

(d) Let $\Omega = \{1, 2, \ldots, n\}$ and let $G = S_n$ act on $\Omega$ in the natural fashion. Show that the
character of the associated linear representation decomposes as the principal character
plus an irreducible character of degree $n - 1$.

10. Let $\psi$ be the character of any 2-dimensional representation of a group $G$ and let $x$ be an
element of order 2 in $G$. Prove that $\psi(x) = 2$, $0$ or $-2$. Generalize this to $n$-dimensional
representations.

11. Let $\chi$ be an irreducible character of $G$. Prove that for every element $z$ in the center of $G$
we have $\chi(z) = \varepsilon\chi(1)$, where $\varepsilon$ is some root of 1 in $\mathbb{C}$. [Use Schur's Lemma.]

12. Let $\psi$ be the character of some representation $\varphi$ of $G$. Prove that for $g \in G$ the following
hold:

(a) if $\psi(g) = \psi(1)$ then $g \in \ker \varphi$, and



(b) if $|\psi(g)| = \psi(1)$ and $\varphi$ is faithful then $g \in Z(G)$ (where $|\psi(g)|$ is the complex
absolute value of $\psi(g)$). [Use the method of proof of Proposition 14.]

13. Let $\varphi : G \to GL(V)$ be a representation and let $\chi : G \to \mathbb{C}^\times$ be a degree 1 representation.
Prove that $\chi\varphi : G \to GL(V)$ defined by $\chi\varphi(g) = \chi(g)\varphi(g)$ is a representation (note that
multiplication of the linear transformation $\varphi(g)$ by the complex number $\chi(g)$ is well
defined). Show that $\chi\varphi$ is irreducible if and only if $\varphi$ is irreducible. Show that if $\psi$ is the
character afforded by $\varphi$ then $\chi\psi$ is the character afforded by $\chi\varphi$. Deduce that the product
of any irreducible character with a character of degree 1 is also an irreducible character.

The next few exercises study the notion of algebraically conjugate characters. These exercises 
may be considered as extensions of Proposition 14 and some consequences of these extensions. 
In particular we obtain a group-theoretic characterization of the conditions under which all 
irreducible characters of a group take values in $\mathbb{Q}$.

Let F be the subfield of C of all elements that are algebraic over Q (the field of algebraic 
numbers). Thus F is the algebraic closure of Q contained in C and all the results established 
over C hold without change over F. 

14. Note that since $F \subseteq \mathbb{C}$, every representation $\varphi : G \to GL_n(F)$ may also be considered
as a complex representation. Prove that if $\varphi$ is a representation over $F$ that is irreducible
over $F$, then $\varphi$ is also irreducible when considered over the larger field $\mathbb{C}$ (note that this is
not true if $F$ is not algebraically closed; cf. Exercise 2(c) above). Show that the set of
irreducible characters of $G$ over $F$ is the same as the set of irreducible characters over $\mathbb{C}$
(i.e., these are exactly the same set of class functions on $G$). Deduce that every complex
representation is equivalent to a representation over $F$. [Since $F$ is algebraically closed
of characteristic 0, the irreducible characters over either $F$ or $\mathbb{C}$ are characterized by the
first orthogonality relation.]

Let $\varphi : G \to GL_n(F)$ be any representation with character $\psi$. Let $\mathbb{Q}(\varphi)$ denote the subfield
of $F$ generated by all the entries of the matrices $\varphi(g)$ for all $g \in G$.

15. Prove that $\mathbb{Q}(\varphi)$ is a finite extension of $\mathbb{Q}$.

Now let $K$ be any Galois extension of $\mathbb{Q}$ containing $\mathbb{Q}(\varphi)$ and let $\sigma \in \mathrm{Gal}(K/\mathbb{Q})$. In fact,
since every automorphism of $K$ extends to an automorphism of $F$, we may assume $\sigma$ is any
automorphism of $F$. The map $\varphi^\sigma : G \to GL_n(F)$ is defined by letting $\varphi^\sigma(g)$ be the $n \times n$
matrix whose entries are obtained by applying the field automorphism $\sigma$ to the entries of the
matrix $\varphi(g)$.

16. Prove that $\varphi^\sigma$ is a representation. Prove also that the character of $\varphi^\sigma$ is $\psi^\sigma$, where
$\psi^\sigma(g) = \sigma(\psi(g))$.

17. Prove that $\varphi$ is irreducible if and only if $\varphi^\sigma$ is irreducible.

The representation $\varphi^\sigma$ (or character $\psi^\sigma$) is called the algebraic conjugate of $\varphi$ by $\sigma$ (or of
$\psi$, respectively); two representations $\varphi_1$ and $\varphi_2$ (or characters $\psi_1$ and $\psi_2$) are said to be
algebraically conjugate if there is some automorphism $\sigma$ of $F$ such that $\varphi_1^\sigma = \varphi_2$ (or $\psi_1^\sigma = \psi_2$,
respectively). Some care needs to be taken with this (standard) notation since the exponential
notation usually denotes a right action whereas automorphisms of $F$ act on the left on
representations: $\varphi^{(\sigma\tau)} = (\varphi^\tau)^\sigma$.

Let $\mathbb{Q}(\psi)$ be the subfield of $F$ generated by the numbers $\psi(g)$ for all $g \in G$. Let $|G| = n$
and let $\varepsilon$ be a primitive $n$th root of 1 in $F$.

18. Prove that $\mathbb{Q}(\psi) \subseteq \mathbb{Q}(\varepsilon)$. Deduce that $\mathbb{Q}(\psi)$ is a Galois extension of $\mathbb{Q}$ with abelian Galois
group. [See Proposition 14.]

Recall from Section 14.5 that $\mathrm{Gal}(\mathbb{Q}(\varepsilon)/\mathbb{Q}) \cong (\mathbb{Z}/n\mathbb{Z})^\times$, where the Galois automorphisms are
given on the generator $\varepsilon$ by $\sigma_a : \varepsilon \mapsto \varepsilon^a$, where $a$ is an integer relatively prime to $n$.

19. Prove that if $\sigma_a \in \mathrm{Gal}(\mathbb{Q}(\varepsilon)/\mathbb{Q})$ is the field automorphism defined above, then for all
$g \in G$ we have $\psi^{\sigma_a}(g) = \psi(g^a)$. [Use the method of Proposition 14.]

20. Prove that if $g$ is an element of $G$ which is conjugate to $g^a$ for all integers $a$ relatively
prime to $n$, then $\psi(g) \in \mathbb{Q}$ for every character $\psi$ of $G$. [Use the preceding exercise and
the fact that $\mathbb{Q}$ is the field fixed by all $\sigma_a$'s.]

21. Prove that an element $g \in G$ is conjugate to $g^a$ for all integers $a$ relatively prime to $|G|$ if
and only if $g$ is conjugate to $g^d$ for all integers $d$ relatively prime to $|g|$.

22. Show for any positive integer $n$ that every character of the symmetric group $S_n$ is rational
valued (i.e., $\psi(g) \in \mathbb{Q}$ for all $g \in S_n$ and all characters $\psi$ of $S_n$).

The next two exercises establish the converse to Exercise 20. 

23. Prove that elements $x$ and $y$ are conjugate in a group $G$ if and only if $\chi(x) = \chi(y)$ for all
irreducible characters $\chi$ of $G$.

24. Let $g \in G$ and assume that every irreducible character of $G$ is rational valued on $g$. Prove
that $g$ is conjugate to $g^a$ for every integer $a$ relatively prime to $|G|$. [If $g$ is not conjugate to
$g^a$ for some $a$ relatively prime to $|G|$ then by the preceding exercise there is an irreducible
character $\chi$ such that $\chi(g) \neq \chi(g^a)$. Derive a contradiction from the hypothesis that
$\chi(g) \in \mathbb{Q}$.]

25. Describe which irreducible characters of the cyclic group of order n are algebraically 
conjugate. 

26. Prove that every irreducible character of both $Q_8$ and $D_8$ is rational valued. Prove that
$D_{10}$ has an irreducible character that is not rational valued.

27. Let $G = H \times K$ and let $\varphi : H \to GL(V)$ be an irreducible representation of $H$ with
character $\chi$. Then $G \xrightarrow{\pi_H} H \xrightarrow{\varphi} GL(V)$ gives an irreducible representation of $G$, where
$\pi_H$ is the natural projection; the character, $\tilde{\chi}$, of this representation is $\tilde{\chi}((h, k)) = \chi(h)$.
Likewise any irreducible character $\psi$ of $K$ gives an irreducible character $\tilde{\psi}$ of $G$ with
$\tilde{\psi}((h, k)) = \psi(k)$.

(a) Prove that the product $\tilde{\chi}\tilde{\psi}$ is an irreducible character of $G$. [Show it has norm 1.]

(b) Prove that every irreducible character of $G$ is obtained from such products of irreducible
characters of the direct factors. [Use Theorem 10, either (3) or (4).]

28. (Finite subgroups of $GL_2(\mathbb{Q})$) Let $G$ be a finite subgroup of $GL_2(\mathbb{Q})$.

(a) Show that $GL_2(\mathbb{Q})$ does not contain an element of order $n$ for $n = 5$, $7$, or $n \geq 9$.
Deduce that $|G| = 2^a 3^b$. [Use rational canonical forms.]

(b) Show that the Klein 4-group is the only noncyclic abelian subgroup of $GL_2(\mathbb{Q})$.
Deduce from this and (a) that $|G|$ divides 24.

(c) Show that the only finite subgroups of $GL_2(\mathbb{Q})$ are the cyclic groups of order 1, 2, 3,
4, and 6, the Klein 4-group, and the dihedral groups of order 6, 8, and 12. [Use the
classifications of groups of small order in Section 4.5 and Exercise 10 of Section 1
to restrict $G$ to this list. Show conversely that each group listed has a 2-dimensional
faithful rational representation.]


CHAPTER 19


Examples and Applications 
of Character Theory 


19.1 CHARACTERS OF GROUPS OF SMALL ORDER 

The character table of a finite group is the table of character values formatted as follows:
list representatives of the $r$ conjugacy classes along the top row and list the irreducible
characters down the first column. The entry in the table in row $\chi_i$ and column $g_j$ is
$\chi_i(g_j)$. The character table of a finite group is unique up to a permutation of its rows
and columns. It is customary to make the principal character the first row and the
identity the first column and to list the characters in increasing order by degrees. In
our examples we shall list the size of the conjugacy classes under each class so the
entire table will have $r + 1$ rows and $r + 1$ columns (although strictly speaking, the
character table is the $r \times r$ matrix of character values). This will enable one to easily
check the “orthogonality of rows” using the first orthogonality relation: if the classes
are represented by $g_1, \ldots, g_r$ of sizes $d_1, \ldots, d_r$ then

$$(\chi_i, \chi_j) = \frac{1}{|G|} \sum_{k=1}^{r} d_k\, \chi_i(g_k)\, \overline{\chi_j(g_k)}.$$

The second orthogonality relation says that the Hermitian product of any two distinct 
columns of a character table is zero (i.e., it gives an “orthogonality of columns”). 

A number of character tables are given in the Atlas of Finite Groups by Conway, 
Curtis, Norton, Parker and Wilson, Clarendon Press, 1985. These include the character 
table of the Monster simple group, M. The group M has 194 irreducible characters. 
The smallest degree of a nonprincipal irreducible character of $M$ is 196883 and the
largest degree is on the order of $2 \times 10^{26}$. Nonetheless, it is possible to compute the
values of all these characters on all conjugacy classes of M. 

For the first example of a character table let $G = \langle x \rangle$ be the cyclic group of order
2. Then $G$ has 2 conjugacy classes and two irreducible characters:


classes:     1      x
sizes:       1      1
$\chi_1$     1      1
$\chi_2$     1     -1

Character Table of $Z_2$

The characters and representations of this abelian group are the same, and the irreducible
representations of any abelian group are described in Example 1 at the end of Section
18.2.

Similarly, if $G = \langle x \rangle$ is cyclic of order 3, and $\zeta$ is a fixed primitive cube root of 1
(so $\zeta^2 = \bar{\zeta}$), then the character table of $G$ is the following:


classes:     1       $x$         $x^2$
sizes:       1       1           1
$\chi_1$     1       1           1
$\chi_2$     1       $\zeta$     $\zeta^2$
$\chi_3$     1       $\zeta^2$   $\zeta$

Character Table of $Z_3$

Next we construct the character table of $S_3$. Recall from Example 2 in Section 18.2
that $S_3$ has 3 irreducible characters whose values are described in that example and in
Example 1 at the end of Section 18.3.


classes:     1     (1 2)    (1 2 3)
sizes:       1       3         2
$\chi_1$     1       1         1
$\chi_2$     1      -1         1
$\chi_3$     2       0        -1

Character Table of $S_3$
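
The two orthogonality relations can be checked mechanically on a table this small. The following is a minimal sketch in Python (not part of the original text; the names `inner` and `column_product` are illustrative only). It assumes the table of $S_3$ above and uses the fact that all entries are real, so complex conjugation is omitted.

```python
from fractions import Fraction

# Character table of S_3 as listed above.
sizes = [1, 3, 2]                 # classes of 1, (1 2), (1 2 3)
table = [
    [1,  1,  1],                  # chi_1 (principal character)
    [1, -1,  1],                  # chi_2 (sign character)
    [2,  0, -1],                  # chi_3 (degree 2)
]
order = sum(sizes)                # |S_3| = 6

def inner(chi, psi):
    """First orthogonality pairing (chi, psi); entries are real, so no conjugate."""
    return Fraction(sum(d * a * b for d, a, b in zip(sizes, chi, psi)), order)

def column_product(j, k):
    """Sum over the irreducible characters of chi(g_j) * chi(g_k)."""
    return sum(row[j] * row[k] for row in table)

# Rows are orthonormal; distinct columns are orthogonal.
rows = [[inner(a, b) for b in table] for a in table]
cols = [[column_product(j, k) for k in range(3)] for j in range(3)]
```

Here `rows` is the 3 x 3 identity matrix, and the diagonal of `cols` consists of the centralizer orders $|G|/d_k$ = 6, 2, 3.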

Next we consider $D_8$, adopting the notation of Example 3 of Section 18.3. By
Corollary 11, $D_8$ has four characters of degree 1. Also, in Example 3 we constructed
an irreducible degree 2 representation. Since the sum of the squares of the degrees
of these representations is 8, this accounts for all irreducible representations (or, since
there are 5 conjugacy classes, there are 5 irreducible representations). If we let bars
denote passage to the commutator quotient group (which is the Klein 4-group), then
$\bar{1} = \overline{r^2}$. The degree 1 representations (= their characters) are computed by sending
generators $\bar{s}$ and $\bar{r}$ to $\pm 1$ (and the product class is mapped to the product of the values).
Matrices for the degree 2 irreducible representation were computed in Example 3 of
Section 18.3 and the character of this representation can be read directly from these
matrices. The character table of $D_8$ is therefore the following:


classes:     1     $r^2$     $s$     $r$     $sr$
sizes:       1      1         2       2       2
$\chi_1$     1      1         1       1       1
$\chi_2$     1      1        -1       1      -1
$\chi_3$     1      1         1      -1      -1
$\chi_4$     1      1        -1      -1       1
$\chi_5$     2     -2         0       0       0


Character Table of $D_8$

Now we compute the character table of the quaternion group of order 8. We use
the usual presentation

$$Q_8 = \langle\, i, j \mid i^4 = 1,\ i^2 = j^2,\ i^{-1} j i = j^{-1} \,\rangle$$

and let $k = ij$ and $i^2 = -1$. The conjugacy classes of $Q_8$ are represented by $1$, $-1$, $i$,
$j$ and $k$ of sizes 1, 1, 2, 2 and 2, respectively. Since the commutator quotient of $Q_8$ is
the Klein 4-group, there are four characters of degree 1. The one remaining irreducible
character must have degree 2 in order that the sum of the squares of the degrees be 8. Let
$\chi_5$ be the degree 2 irreducible character of $Q_8$. One may check that the representation
$\varphi : Q_8 \to GL_2(\mathbb{C})$ described explicitly in Example 7 in the second set of examples of
Section 18.1 affords $\chi_5$, but we show how the orthogonality relations give the values
of $\chi_5$ without knowing these explicit matrices. If $\varphi$ is an irreducible representation of
degree 2, by Schur's Lemma (cf. Exercise 18 in Section 18.1) $\varphi(-1)$ is a $2 \times 2$ scalar
matrix and so is $\pm$ the identity matrix since $-1$ has order 2 in $Q_8$. Hence $\chi_5(-1) = \pm 2$.
Let $\chi_5(i) = a$, $\chi_5(j) = b$ and $\chi_5(k) = c$. The orthogonality relations give

$$1 = (\chi_5, \chi_5) = \frac{1}{8}\left(2^2 + (\pm 2)^2 + 2a\bar{a} + 2b\bar{b} + 2c\bar{c}\right).$$

Since $a\bar{a}$, $b\bar{b}$ and $c\bar{c}$ are nonnegative real numbers, they must all be zero. Also, since
$\chi_5$ is orthogonal to the principal character we get

$$0 = (\chi_1, \chi_5) = \frac{1}{8}\left(2 + (\pm 2) + 0 + 0 + 0\right),$$

hence $\chi_5(-1) = -2$. The complete character table of $Q_8$ is the following:


classes:     1     $-1$     $i$     $j$     $k$
sizes:       1      1        2       2       2
$\chi_1$     1      1        1       1       1
$\chi_2$     1      1       -1       1      -1
$\chi_3$     1      1        1      -1      -1
$\chi_4$     1      1       -1      -1       1
$\chi_5$     2     -2        0       0       0

Character Table of $Q_8$

Observe that $D_8$ and $Q_8$ have the same character table, hence

nonisomorphic groups may have the same character table.

Note that the values of the degree 2 representation of $Q_8$ could also have been easily
calculated by applying the second orthogonality relation to each column of the character
table. We leave this check as an exercise. Also note that although the degree 2 irreducible
characters of $D_8$ and $Q_8$ have the same (real number) values, the degree 2 representation
of $D_8$ may be realized by real matrices whereas it may be shown that $Q_8$ has no faithful
2-dimensional representation over $\mathbb{R}$ (cf. Exercise 10 in Section 18.1).
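
One way to carry out the column computation is sketched below in Python (an illustration, not the book's solution). It assumes only the four degree 1 characters of $Q_8$ and the degree 2 forced by the sum of squares, and applies the second orthogonality relation between the identity column and each other column: $\sum_i \chi_i(1)\chi_i(g) = 0$ for $g \neq 1$. The variable names are hypothetical.

```python
from fractions import Fraction

# Degree 1 characters of Q_8 on the classes 1, -1, i, j, k.
linear = [
    [1, 1,  1,  1,  1],
    [1, 1, -1,  1, -1],
    [1, 1,  1, -1, -1],
    [1, 1, -1, -1,  1],
]
sizes = [1, 1, 2, 2, 2]
degree = 2  # forced by 1 + 1 + 1 + 1 + degree**2 == 8

# Orthogonality of the identity column with the column of g != 1 gives
#   degree * chi_5(g) = -(sum of the degree 1 characters at g),
# since each degree 1 character takes the value 1 at the identity.
chi5 = [degree] + [-sum(row[c] for row in linear) // degree for c in range(1, 5)]

# Sanity check with the first orthogonality relation (entries are real).
norm5 = Fraction(sum(d * v * v for d, v in zip(sizes, chi5)), sum(sizes))
```

The list `chi5` reproduces the last row of the table, and `norm5` equals 1.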

For the next example we construct the character table of $S_4$. The conjugacy classes
of $S_4$ are represented by 1, (1 2), (1 2 3), (1 2 3 4) and (1 2)(3 4) with sizes 1, 6, 8, 6,
and 3 respectively. Since $S_4' = A_4$, there are two characters of degree 1: the principal
character and the character whose values are the sign of the permutation.

To obtain a degree 2 irreducible character let $V$ be the normal subgroup of order
4 generated by (1 2)(3 4) and (1 3)(2 4). Any representation $\varphi$ of $S_4/V \cong S_3$ gives,
by composition with the natural projection $S_4 \to S_4/V$, a representation of $S_4$; if the
former is irreducible, so is the latter. Let $\varphi$ be the composition of the projection with the
irreducible 2-dimensional representation of $S_3$, and let $\chi_3$ be its character. The classes
of 1 and (1 2)(3 4) map to the identity in the $S_3$ quotient, (1 2) and (1 2 3 4) map to
transpositions and (1 2 3) maps to a 3-cycle. The values of $\chi_3$ can thus be read directly
from the values of the character of degree 2 in the table for $S_3$.

Since $S_4$ has 5 irreducible characters and the sum of the squares of the degrees is
24, there must be two remaining irreducible characters, each of degree 3. In Example 2
of Section 18.3 one of these was calculated, call it $\chi_4$. Recall that

$\chi_4(\sigma)$ = (the number of fixed points of $\sigma$) $-\ 1$.

The remaining irreducible character, $\chi_5$, is $\chi_4\chi_2$. One can either use Proposition 17 in
Section 18.3 or Exercise 13 in Section 18.3 to see that this product is indeed a character.
The first orthogonality relation verifies that it is irreducible.


classes:     1     (1 2)    (1 2 3)    (1 2 3 4)    (1 2)(3 4)
sizes:       1       6         8           6             3
$\chi_1$     1       1         1           1             1
$\chi_2$     1      -1         1          -1             1
$\chi_3$     2       0        -1           0             2
$\chi_4$     3       1         0          -1            -1
$\chi_5$     3      -1         0           1            -1

Character Table of $S_4$
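
The construction of this table can be retraced numerically. The sketch below (Python, illustrative names only) assumes the class data stated in the text: the images of the classes of $S_4$ in $S_4/V \cong S_3$ and the number of fixed points of each class representative on $\{1, 2, 3, 4\}$. It rebuilds $\chi_3$, $\chi_4$ and $\chi_5$ and then checks the first orthogonality relation on all five rows.

```python
from fractions import Fraction

# Classes of S_4 in the order 1, (1 2), (1 2 3), (1 2 3 4), (1 2)(3 4).
sizes = [1, 6, 8, 6, 3]
order = sum(sizes)  # 24

# Degree 2 character of S_3, keyed by the type of element of S_3.
s3_degree2 = {"identity": 2, "transposition": 0, "3-cycle": -1}
# Image in S_4/V = S_3 of each class of S_4, as described in the text.
image = ["identity", "transposition", "3-cycle", "transposition", "identity"]
# Number of fixed points of each class representative on {1, 2, 3, 4}.
fixed_points = [4, 2, 1, 0, 0]

chi1 = [1] * 5                                  # principal character
chi2 = [1, -1, 1, -1, 1]                        # sign character
chi3 = [s3_degree2[t] for t in image]           # inflated from S_3
chi4 = [f - 1 for f in fixed_points]            # fixed points minus 1
chi5 = [a * b for a, b in zip(chi4, chi2)]      # chi_4 times the sign character
table = [chi1, chi2, chi3, chi4, chi5]

def inner(chi, psi):
    """(chi, psi) for real-valued class functions on S_4."""
    return Fraction(sum(d * a * b for d, a, b in zip(sizes, chi, psi)), order)

gram = [[inner(a, b) for b in table] for a in table]
```

The matrix `gram` is the 5 x 5 identity, confirming that the five rows are distinct irreducible characters.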


From the character table of $S_4$ one can easily compute the character table of $A_4$.
Note that $A_4$ has 4 conjugacy classes. Also $|A_4 : A_4'| = 3$, so $A_4$ has three characters
of degree 1 with $V = A_4'$ in the kernel of each degree 1 representation. The remaining
irreducible character must have degree 3. One checks directly from the orthogonality
relation applied in $A_4$ that the character $\chi_4$ of $S_4$ restricted to $A_4$ ($= \chi_5|_{A_4}$) is irreducible.
This irreducibility check is really necessary since an irreducible representation of a
group need not restrict to an irreducible representation of a subgroup (for instance, the
irreducible degree 2 representation of $S_3$ must become reducible when restricted to any
proper subgroup, since these are all abelian). The character table of $A_4$ is the following:


classes:     1     (1 2)(3 4)    (1 2 3)      (1 3 2)
sizes:       1         3            4            4
$\chi_1$     1         1            1            1
$\chi_2$     1         1         $\zeta$      $\zeta^2$
$\chi_3$     1         1         $\zeta^2$    $\zeta$
$\chi_4$     3        -1            0            0

Character Table of $A_4$
where $\zeta$ is a primitive cube root of 1 in $\mathbb{C}$.
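
The two restriction statements above can be compared side by side. The following Python sketch (hypothetical helper name `norm_squared`) assumes the character values from the tables of $S_4$ and $S_3$ and the class sizes of $A_4$ and $A_3$, and computes the squared norm of each restricted character inside the subgroup.

```python
from fractions import Fraction

def norm_squared(values, sizes):
    """||chi||^2 = (1/|G|) * sum of (class size) * chi(g)^2, for real-valued chi."""
    return Fraction(sum(d * v * v for d, v in zip(sizes, values)), sum(sizes))

# chi_4 of S_4 restricted to A_4, on the classes 1, (1 2)(3 4), (1 2 3), (1 3 2).
a4_norm = norm_squared([3, -1, 0, 0], [1, 3, 4, 4])

# Degree 2 character of S_3 restricted to A_3 = {1, (1 2 3), (1 3 2)}.
a3_norm = norm_squared([2, -1, -1], [1, 1, 1])
```

The first norm is 1, so the restriction to $A_4$ stays irreducible; the second is 2, so the restriction to $A_3$ is a sum of two distinct irreducible characters.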





As a final example we construct the following character table of $S_5$.


classes:     1    (1 2)   (1 2 3)   (1 2 3 4)   (1 2 3 4 5)   (1 2)(3 4)   (1 2)(3 4 5)
sizes:       1     10       20         30           24            15            20
$\chi_1$     1      1        1          1            1             1             1
$\chi_2$     1     -1        1         -1            1             1            -1
$\chi_3$     4      2        1          0           -1             0            -1
$\chi_4$     4     -2        1          0           -1             0             1
$\chi_5$     5     -1       -1          1            0             1            -1
$\chi_6$     5      1       -1         -1            0             1             1
$\chi_7$     6      0        0          0            1            -2             0

Character Table of $S_5$

The conjugacy classes and their sizes were computed in Section 4.3. Since $|S_5 : S_5'| = 2$,
there are two degree 1 characters: the principal character and the “sign” character.

The natural permutation of $S_5$ on 5 points gives rise to a permutation character of
degree 5. As with $S_4$ and $S_3$ the orthogonality relations show that the square of its norm
is 2 and it contains the principal character. Thus $\chi_3$ is the permutation character minus
the principal character (and, as with the smaller symmetric groups, $\chi_3(\sigma)$ is the number
of fixed points of $\sigma$ minus 1). As argued with $S_4$, it follows that $\chi_4 = \chi_3\chi_2$ is also an
irreducible character.

To obtain $\chi_5$ recall that $S_5$ has six Sylow 5-subgroups. Its action by conjugation on
these gives a faithful permutation representation of degree 6. If $\psi$ is the character of
the associated linear representation, then since $\sigma \in S_5$ fixes a Sylow 5-subgroup if and
only if it normalizes that subgroup, we have

$\psi(\sigma)$ = the number of Sylow 5-subgroups normalized by $\sigma$.

The normalizer in $S_5$ of the Sylow 5-subgroup $\langle (1\,2\,3\,4\,5) \rangle$ is $\langle (1\,2\,3\,4\,5), (2\,3\,5\,4) \rangle$
and all normalizers of Sylow 5-subgroups are conjugate in $S_5$ to this group. This
normalizer contains only the identity, 5-cycles, 4-cycles and products of two disjoint
transpositions. No other cycle type normalizes any Sylow 5-subgroup so on any other
class, $\psi$ is zero. To compute $\psi$ on the remaining three nonidentity classes note (by
inspection in $S_6$) that in any faithful action on 6 points the following hold: an element
of order 5 must be a 5-cycle (hence fixes 1 point); any element of order 4 which fixes
one point must be a 4-cycle (hence fixes 2 points); an element of order 2 which is the
square of an element of order 4 fixes exactly 2 points also. This gives all the values of
$\psi$. Now direct computation shows that

$\|\psi\|^2 = 2$ and $(\chi_1, \psi) = 1$.

Thus $\chi_5 = \psi - \chi_1$ is irreducible of degree 5. By the same theory as for $\chi_4$ one gets
that $\chi_6 = \chi_5\chi_2$ is another irreducible character.
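
The values of $\psi$ can also be confirmed by brute force, independently of the inspection argument. The Python sketch below is an illustration under stated assumptions: permutations act on $\{0, \ldots, 4\}$ rather than $\{1, \ldots, 5\}$, each Sylow 5-subgroup is stored as the set of powers of a 5-cycle, and all function names are hypothetical. It counts, for each element, the Sylow 5-subgroups it normalizes.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p) on {0, ..., 4}."""
    return tuple(p[q[x]] for x in range(5))

def inverse(p):
    inv = [0] * 5
    for x, y in enumerate(p):
        inv[y] = x
    return tuple(inv)

def cycle_type(p):
    """Cycle lengths of p (including fixed points), largest first."""
    seen, lengths = set(), []
    for start in range(5):
        if start not in seen:
            length, x = 0, start
            while x not in seen:
                seen.add(x)
                x = p[x]
                length += 1
            lengths.append(length)
    return tuple(sorted(lengths, reverse=True))

def generated_by(c):
    """The cyclic subgroup generated by c, as a frozenset of permutations."""
    identity = tuple(range(5))
    elems, power = {identity}, c
    while power != identity:
        elems.add(power)
        power = compose(power, c)
    return frozenset(elems)

group = list(permutations(range(5)))
sylow5 = {generated_by(p) for p in group if cycle_type(p) == (5,)}

def psi(sigma):
    """Number of Sylow 5-subgroups P with sigma P sigma^{-1} = P."""
    s_inv = inverse(sigma)
    return sum(
        1 for P in sylow5
        if frozenset(compose(compose(sigma, x), s_inv) for x in P) == P
    )

# psi is a class function, so one value per cycle type records it completely.
psi_by_type = {}
for sigma in group:
    psi_by_type.setdefault(cycle_type(sigma), psi(sigma))
```

In the column order of the table, the dictionary `psi_by_type` gives $\psi$ = 6, 0, 0, 2, 1, 2, 0, so $\psi - \chi_1$ is the row $\chi_5$.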

Since there are 7 conjugacy classes, there is one remaining irreducible character
and its degree is 6. Its values can be obtained immediately from the decomposition of
the regular character, $\rho$ (cf. Example 3 in Section 18.2 and Example 4 in Section 18.3):

$$\chi_7 = \frac{\rho - \chi_1 - \chi_2 - 4\chi_3 - 4\chi_4 - 5\chi_5 - 5\chi_6}{6}.$$

A direct calculation by the orthogonality relations checks that $\chi_7$ is irreducible. Note
that the values of the character $\chi_7$ were computed without explicitly exhibiting a
representation with this character.
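
The last row of the table can be recomputed from the other six. The Python sketch below (illustrative variable names) assumes the six rows $\chi_1, \ldots, \chi_6$ of the table of $S_5$ and the fact that the regular character $\rho$ equals $\sum_i \chi_i(1)\chi_i$, takes the value $|G|$ at the identity and vanishes elsewhere.

```python
from fractions import Fraction

# Classes of S_5 in the column order of the table.
sizes = [1, 10, 20, 30, 24, 15, 20]
order = sum(sizes)  # 120
known = [
    [1,  1,  1,  1,  1, 1,  1],   # chi_1
    [1, -1,  1, -1,  1, 1, -1],   # chi_2
    [4,  2,  1,  0, -1, 0, -1],   # chi_3
    [4, -2,  1,  0, -1, 0,  1],   # chi_4
    [5, -1, -1,  1,  0, 1, -1],   # chi_5
    [5,  1, -1, -1,  0, 1,  1],   # chi_6
]
regular = [order] + [0] * 6       # rho(1) = |G|, rho(g) = 0 for g != 1
degree7 = 6                       # the remaining degree: 120 - 84 = 6**2

# chi_7 = (rho - sum of chi_i(1) * chi_i over the known characters) / 6.
chi7 = [
    (regular[c] - sum(chi[0] * chi[c] for chi in known)) // degree7
    for c in range(7)
]
norm7 = Fraction(sum(d * v * v for d, v in zip(sizes, chi7)), order)
```

The result agrees with the row $\chi_7$ of the table, and `norm7` equals 1.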


EXERCISES 

1. Calculate the character tables of $Z_2 \times Z_2$, $Z_2 \times Z_3$ and $Z_2 \times Z_2 \times Z_2$. Explain why the
table of $Z_2 \times Z_3$ contains primitive 6th roots of 1.

2. Compute the degrees of the irreducible characters of $D_{16}$.

3. Compute the degrees of the irreducible characters of $A_5$. Deduce that the degree 6
irreducible character of $S_5$ is not irreducible when restricted to $A_5$. [The conjugacy classes of
$A_5$ are worked out in Section 4.3.]

4. Using the character tables in this section, for each of parts (a) to (d) use the first
orthogonality relation to write the specified permutation character (cf. Example 3, Section 18.3)
as a sum of irreducible characters:

(a) the permutation character of the subgroup $A_3$ of $S_3$

(b) the permutation character of the subgroup $\langle (1\,2\,3\,4) \rangle$ of $S_4$

(c) the permutation character of the subgroup $V_4$ of $S_4$

(d) the permutation character of the subgroup $\langle (1\,2\,3), (1\,2), (4\,5) \rangle$ of $S_5$ (this subgroup
is the normalizer of a Sylow 3-subgroup of $S_5$).

5. Assume that for any character $\psi$ of a group, $\psi^2$ is also a character (where $\psi^2(g) = (\psi(g))^2$);
this is a special case of Proposition 17 in Section 18.3. Using the character tables in this
section, for each of parts (a) to (e) write out the values of the square, $\chi^2$, of the specified
character $\chi$ and use the first orthogonality relation to write $\chi^2$ as a sum of irreducible
characters:

(a) $\chi = \chi_3$, the degree 2 character in the table of $S_3$

(b) $\chi = \chi_5$, the degree 2 character in the table of $Q_8$

(c) $\chi = \chi_5$, the last character in the table of $S_4$

(d) $\chi = \chi_4$, the second degree 4 character in the table of $S_5$

(e) $\chi = \chi_7$, the last character in the table of $S_5$.

6. Calculate the character table of $A_5$.

7. Show that $S_6$ has an irreducible character of degree 5.

8. Calculate the character table of $D_{10}$. (This table contains nonreal entries.)

9. Calculate the character table of $D_{12}$.

10. Calculate the character table of $S_3 \times S_3$.

11. Calculate the character table of $Z_3 \times S_3$.

12. Calculate the character table of $Z_2 \times S_4$.

13. Calculate the character table of $S_3 \times S_4$.

14. Let $n$ be an integer with $n \geq 3$. Show that every irreducible character of $D_{2n}$ has degree 1
or 2 and find the number of irreducible characters of each degree. [The conjugacy classes
of $D_{2n}$ were found in Exercises 31 and 32 of Section 4.3 and its commutator subgroup
was computed in Section 5.4.]

15. Prove that the character table is an invertible matrix. [Use the orthogonality relations.] 

16. For each of $A_5$ and $D_{10}$ describe which irreducible characters are algebraically conjugate
(cf. the exercises in Section 18.3).

17. Let $p$ be any prime and let $P$ be a non-abelian group of order $p^3$ (up to isomorphism there
are two choices for $P$; for odd $p$ these were constructed when the groups of order $p^3$ were
classified in Section 5.5). This exercise determines the character table of $P$ and shows that
both isomorphism types have the same character table (the argument includes the $p = 2$
case worked out in this section).

(a) Prove that $P$ has $p^2$ characters of degree 1.

(b) Prove that $P$ has $p - 1$ irreducible characters of degree $p$ and that these together with
the $p^2$ degree 1 characters are all the irreducible characters of $P$. [Use Theorem 10(3)
and Theorem 12 in Section 18.2.]

(c) Deduce that (regardless of the isomorphism type) the group $P$ has $p^2 + p - 1$ conjugacy
classes, $p$ of which are of size 1 (i.e., are central classes) and $p^2 - 1$ of which each
have size $p$. Deduce also that the classes of size $p$ are precisely the nonidentity cosets
of the center of $P$ (i.e., if $x \in P - Z(P)$ then the conjugacy class of $x$ is the set of $p$
elements in the coset $xZ(P)$).

(d) Prove that if $\chi$ is an irreducible character of degree $p$ then the representation affording
$\chi$ is faithful.

(e) Fix a generator, $z$, of the center of $P$ and let $\varepsilon$ be a fixed primitive $p$th root of 1 in
$\mathbb{C}$. Prove that if $\chi$ is an irreducible character of degree $p$ then $\chi(z) = p\varepsilon^i$ for some
$i \in \{1, 2, \ldots, p - 1\}$. Prove further that $\chi(x) = 0$ for all $x \in P - Z(P)$. (Note then
that the degree $p$ characters are all algebraically conjugate.) [Use the same reasoning
as in the construction of the character table of $Q_8$.]

(f) Prove that for each $i \in \{1, 2, \ldots, p - 1\}$ there is a unique irreducible character
$\chi_i$ of degree $p$ such that $\chi_i(z) = p\varepsilon^i$. Deduce that the character table of $P$ is
uniquely determined, and describe it. [Recall from Section 6.1 that regardless of the
isomorphism type, $P' = Z(P)$ and $P/P' \cong Z_p \times Z_p$. From this one can write out
the degree 1 characters. Part (e) describes the degree $p$ characters.]


19.2 THEOREMS OF BURNSIDE AND HALL 

In this section we give a “theoretical” application of character theory: Burnside's $p^a q^b$
Theorem. We also prove Philip Hall's characterization of finite solvable groups, which
is a group-theoretic proof relying on Burnside's Theorem as the first step in its induction.

Burnside's Theorem

The following result was proved by Burnside in 1904. Although purely group-theoretic
proofs of it were discovered recently (see Theorem 2.8 in Finite Groups III by B.
Huppert and N. Blackburn, Springer-Verlag, 1982) the original proof by Burnside
presented here is very accessible, elegant, and quite brief (given our present knowledge of
representation theory).

Theorem 1. (Burnside) For $p$ and $q$ primes, every group of order $p^a q^b$ is solvable.

Before undertaking the proof of Burnside's Theorem itself we establish some results
of a general nature. An easy consequence of these preliminary propositions is that the
degrees of the irreducible characters of any finite group divide its order. The particular
results that lead directly to the proof of Burnside's Theorem appear in Lemmas 6 and 7.
It follows quite easily that a counterexample to Burnside's Theorem of minimal order
is a non-abelian simple group, and it is these two character-theoretic lemmas that give
the contradiction by proving the existence of a normal subgroup.

We first recall from Section 15.3 the definition of algebraic integers.

Definition. An element $\alpha \in \mathbb{C}$ is called an algebraic integer if it is a root of a monic
polynomial with coefficients from $\mathbb{Z}$.

The basic results needed for the proof of Burnside's Theorem are: 

Proposition 2. Let $\alpha \in \mathbb{C}$.

(1) The following are equivalent:

(i) $\alpha$ is an algebraic integer,

(ii) $\alpha$ is algebraic over $\mathbb{Q}$ and the minimal polynomial of $\alpha$ over $\mathbb{Q}$ has
integer coefficients, and

(iii) $\mathbb{Z}[\alpha]$ is a finitely generated $\mathbb{Z}$-module (where $\mathbb{Z}[\alpha]$ is the subring of $\mathbb{C}$
generated by $\mathbb{Z}$ and $\alpha$, i.e., is the ring of all $\mathbb{Z}$-linear combinations of
nonnegative powers of $\alpha$).

(2) The algebraic integers in $\mathbb{C}$ form a ring and the algebraic integers in $\mathbb{Q}$ are the
elements of $\mathbb{Z}$.

Proof: These are established in Section 15.3. (The portion of Section 15.3 consist- 
ing of integral extensions and properties of algebraic integers may be read independently 
from the rest of Chapter 15.) 

Corollary 3. For every character $\psi$ of the finite group $G$, $\psi(x)$ is an algebraic integer
for all $x \in G$.

Proof: By Proposition 14 in Section 18.3, $\psi(x)$ is a sum of roots of 1. Each root
of 1 is an algebraic integer, so the result follows immediately from Proposition 2(2).

We shall also need some preliminary character-theoretic lemmas before beginning
the main proof. Adopt the following notation for the arbitrary finite group $G$: $\chi_1, \ldots, \chi_r$
are the distinct irreducible (complex) characters of $G$, $\mathcal{K}_1, \ldots, \mathcal{K}_r$ are the conjugacy
classes of $G$ and $\varphi_i$ is an irreducible matrix representation whose character is $\chi_i$ for
each $i$.


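Proposition 4. Define the complex valued function $\omega_i$ on $\{\mathcal{K}_1, \ldots, \mathcal{K}_r\}$ for each $i$ by

$$\omega_i(\mathcal{K}_j) = \frac{|\mathcal{K}_j|\,\chi_i(g)}{\chi_i(1)}$$

where $g$ is any element of $\mathcal{K}_j$. Then $\omega_i(\mathcal{K}_j)$ is an algebraic integer for all $i$ and $j$.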
Proof: We first prove that if $I$ is the identity matrix, then

$$\sum_{g \in \mathcal{K}_j} \varphi_i(g) = \omega_i(\mathcal{K}_j)\, I. \tag{19.1}$$

To see this let $X$ be the left hand side of (1). As we saw in Section 18.2, each $x \in G$
acting by conjugation permutes the elements of $\mathcal{K}_j$ and so $X$ commutes with $\varphi_i(g)$ for
all $g$. By Schur's Lemma (Exercise 18 in Section 18.1) $X$ is a scalar matrix:

$X = \alpha I$ for some $\alpha \in \mathbb{C}$.

It remains to show that $\alpha = \omega_i(\mathcal{K}_j)$. But

$$\operatorname{tr} X = \sum_{g \in \mathcal{K}_j} \operatorname{tr} \varphi_i(g) = \sum_{g \in \mathcal{K}_j} \chi_i(g) = |\mathcal{K}_j|\,\chi_i(g).$$

Thus $\alpha\chi_i(1) = \operatorname{tr} X = |\mathcal{K}_j|\,\chi_i(g)$, as needed to establish (1).

Now let $g$ be a fixed element of $\mathcal{K}_s$ and define $a_{ijs}$ to be the number of ordered
pairs $g_i, g_j$ with $g_i \in \mathcal{K}_i$, $g_j \in \mathcal{K}_j$ and $g_i g_j = g$. Notice that $a_{ijs}$ is an integer. It is
independent of the choice of $g$ in $\mathcal{K}_s$ because if $x^{-1}gx$ is a conjugate of $g$, every ordered
pair $g_i, g_j$ whose product is $g$ gives rise to an ordered pair $x^{-1}g_i x$, $x^{-1}g_j x$ whose
product is $x^{-1}gx$ (and vice versa).

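Next we prove that for all $i, j, t \in \{1, \dots, r\}$ 

$$\omega_t(\mathcal{K}_i)\,\omega_t(\mathcal{K}_j) = \sum_{s=1}^{r} a_{ijs}\,\omega_t(\mathcal{K}_s). \tag{19.2}$$

To see this note that by (1), the left hand side of (2) is the diagonal entry of the scalar 
matrix on the left of the following equation: 

$$\begin{aligned}
\Big(\sum_{g \in \mathcal{K}_i} \varphi_t(g)\Big)\Big(\sum_{g \in \mathcal{K}_j} \varphi_t(g)\Big)
  &= \sum_{g_i \in \mathcal{K}_i} \sum_{g_j \in \mathcal{K}_j} \varphi_t(g_i g_j) \\
  &= \sum_{s=1}^{r} \sum_{g \in \mathcal{K}_s} a_{ijs}\,\varphi_t(g) \\
  &= \sum_{s=1}^{r} a_{ijs} \sum_{g \in \mathcal{K}_s} \varphi_t(g)
     && \text{(since $a_{ijs}$ is independent of $g \in \mathcal{K}_s$)} \\
  &= \sum_{s=1}^{r} a_{ijs}\,\omega_t(\mathcal{K}_s)\, I
     && \text{(by (1)).}
\end{aligned}$$

Comparing entries of these scalar matrices gives (2). 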



Now (2) implies that the subring of $\mathbb{C}$ generated by $\mathbb{Z}$ and $\omega_t(\mathcal{K}_1), \dots, \omega_t(\mathcal{K}_r)$ is a 
finitely generated $\mathbb{Z}$-module for each $t \in \{1, \dots, r\}$ (it is generated as a $\mathbb{Z}$-module by 
$1, \omega_t(\mathcal{K}_1), \dots, \omega_t(\mathcal{K}_r)$). Since $\mathbb{Z}$ is a Principal Ideal Domain the submodule $\mathbb{Z}[\omega_t(\mathcal{K}_j)]$ 
is also a finitely generated $\mathbb{Z}$-module, hence $\omega_t(\mathcal{K}_j)$ is an algebraic integer by Proposition 
2. This completes the proof. 
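
The class-sum identity used in this proof can be checked by brute force in a small case. The following is only an illustrative sketch, not part of the original argument: it takes $G = S_3$, uses the standard facts that the irreducible characters of $S_3$ are the trivial character, the sign character and the character "number of fixed points minus 1", and verifies that the values $|\mathcal{K}_j|\chi(g)/\chi(1)$ are integers satisfying relation (19.2). All function names are hypothetical helpers introduced for this illustration.

```python
from fractions import Fraction
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[k]] for k in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for k, image in enumerate(p):
        inv[image] = k
    return tuple(inv)

G = list(permutations(range(3)))  # the six elements of S_3

def conjugacy_class(g):
    return frozenset(compose(compose(x, g), inverse(x)) for x in G)

# Classes sorted by size: identity (1), 3-cycles (2), transpositions (3).
classes = sorted({conjugacy_class(g) for g in G}, key=len)

def sign(g):
    inversions = sum(1 for a in range(3) for b in range(a + 1, 3) if g[a] > g[b])
    return -1 if inversions % 2 else 1

def fixed_points(g):
    return sum(1 for k in range(3) if g[k] == k)

# Assumed (standard) irreducible characters of S_3.
characters = [lambda g: 1, sign, lambda g: fixed_points(g) - 1]

def omega(chi, K):
    """The value |K| * chi(g) / chi(1) for any g in the class K."""
    g = next(iter(K))
    return Fraction(len(K) * chi(g), chi(tuple(range(3))))

def structure_constant(Ki, Kj, Ks):
    """Number of pairs (gi, gj) in Ki x Kj with gi*gj equal to a fixed g in Ks."""
    g = next(iter(Ks))
    return sum(1 for gi in Ki for gj in Kj if compose(gi, gj) == g)

def relation_19_2_holds():
    return all(
        omega(chi, Ki) * omega(chi, Kj)
        == sum(structure_constant(Ki, Kj, Ks) * omega(chi, Ks) for Ks in classes)
        for chi in characters for Ki in classes for Kj in classes
    )
```

Exact rational arithmetic (`Fraction`) is used so that integrality of the computed values is visible directly rather than up to rounding.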

Corollary 5. The degree of each complex irreducible representation of a finite group 
$G$ divides the order of $G$, i.e., $\chi_i(1) \mid |G|$ for $i = 1, 2, \dots, r$. 
Proof: Under the notation of Proposition 4 and with $g_j \in \mathcal{K}_j$ we have 

$$\frac{|G|}{\chi_i(1)} = \frac{|G|}{\chi_i(1)}\,(\chi_i, \chi_i)
  = \sum_{j=1}^{r} \frac{|\mathcal{K}_j|\,\chi_i(g_j)\,\overline{\chi_i(g_j)}}{\chi_i(1)}
  = \sum_{j=1}^{r} \omega_i(\mathcal{K}_j)\,\overline{\chi_i(g_j)}.$$

The right hand side is an algebraic integer and the left hand side is rational, hence is an 
integer. This proves the corollary. 
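
As a quick sanity check of this identity (an illustration that is not in the original text, and which assumes the standard character table of $S_3$), take $G = S_3$ and let $\chi$ be the irreducible character of degree 2, with values $2, -1, 0$ on the classes of the identity, the 3-cycles and the transpositions, of sizes $1, 2, 3$ respectively. Then $\omega(\mathcal{K}_j) = |\mathcal{K}_j|\chi(g_j)/\chi(1)$ takes the values $1, -1, 0$ and

```latex
\frac{|G|}{\chi(1)} = \frac{6}{2} = 3,
\qquad
\sum_{j=1}^{3} \omega(\mathcal{K}_j)\,\overline{\chi(g_j)}
  = (1)(2) + (-1)(-1) + (0)(0) = 3 .
```

Both sides agree, and the common value is the integer $|G|/\chi(1)$ as the corollary predicts.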


The next two lemmas lead directly to Burnside’s Theorem. 


Lemma 6. If $G$ is any group that has a conjugacy class $\mathcal{K}$ and an irreducible matrix 
representation $\varphi$ with character $\chi$ such that $(|\mathcal{K}|, \chi(1)) = 1$, then for $g \in \mathcal{K}$ either 
$\chi(g) = 0$ or $\varphi(g)$ is a scalar matrix. 


Proof: By hypothesis there exist $s, t \in \mathbb{Z}$ such that $s|\mathcal{K}| + t\chi(1) = 1$. Thus 

$$s|\mathcal{K}|\chi(g) + t\chi(1)\chi(g) = \chi(g).$$

Divide both sides of this by $\chi(1)$ and note that by Corollary 3 and Proposition 4 both 
$\chi(g)$ and $\dfrac{|\mathcal{K}|\chi(g)}{\chi(1)}$ are algebraic integers, hence so is $\dfrac{\chi(g)}{\chi(1)}$. Let $a_1 = \dfrac{\chi(g)}{\chi(1)}$ and 
let $a_1, a_2, \dots, a_n$ be all its algebraic conjugates over $\mathbb{Q}$ (i.e., the roots of the minimal 
polynomial of $a_1$ over $\mathbb{Q}$). Since $a_1$ is a sum of $\chi(1)$ roots of 1 divided by the integer 
$\chi(1)$, each $a_i$ is also a sum of $\chi(1)$ roots of 1 divided by $\chi(1)$. Thus $a_i$ has complex 
absolute value $\le 1$ for all $i$. Now $b = \prod_{i=1}^{n} a_i \in \mathbb{Q}$ and $b$ is an algebraic integer ($\pm b$ 
is the constant term of the irreducible polynomial of $a_1$), hence $b \in \mathbb{Z}$. But 

$$|b| = \prod_{i=1}^{n} |a_i| \le 1,$$

so $b = 0, \pm 1$. Since all $a_i$'s are conjugate, $b = 0 \Leftrightarrow a_1 = 0 \Leftrightarrow \chi(g) = 0$. Also, 
$b = \pm 1 \Leftrightarrow |a_i| = 1$ for all $i$. Thus either $\chi(g) = 0$ or $|\chi(g)| = \chi(1)$. In the former 
situation the lemma is established, so assume $|\chi(g)| = \chi(1)$. 

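Let $\varphi_1$ be a matrix representation equivalent to $\varphi$ in which $\varphi_1(g)$ is a diagonal 
matrix: 

$$\varphi_1(g) = \begin{pmatrix} \epsilon_1 & & & \\ & \epsilon_2 & & \\ & & \ddots & \\ & & & \epsilon_n \end{pmatrix}.$$

Thus $\chi(g) = \epsilon_1 + \cdots + \epsilon_n$. By the triangle inequality if $\epsilon_i \ne \epsilon_j$ for some $i, j$, then 
$|\epsilon_1 + \cdots + \epsilon_n| < n = \chi(1)$. Since this is not the case we must have $\varphi_1(g) = \epsilon I$ (where 
$\epsilon = \epsilon_i$ for all $i$). Since scalar matrices are similar only to themselves, $\varphi(g) = \epsilon I$ as 
well. This completes the proof. 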
Lemma 7. If $|\mathcal{K}|$ is a power of a prime for some nonidentity conjugacy class $\mathcal{K}$ of $G$, 
then $G$ is not a non-abelian simple group. 

Proof: Suppose to the contrary that $G$ is a non-abelian simple group and let 
$|\mathcal{K}| = p^c$. Let $g \in \mathcal{K}$. If $c = 0$ then $g \in Z(G)$, contrary to a non-abelian simple 
group having a trivial center. As above, let $\chi_1, \dots, \chi_r$ be all the irreducible characters 
of $G$ with $\chi_1$ the principal character and let $\rho$ be the regular character of $G$. By 
decomposing $\rho$ into irreducibles we obtain 

$$0 = \rho(g) = 1 + \sum_{i=2}^{r} \chi_i(1)\chi_i(g). \tag{19.3}$$

If $p \mid \chi_j(1)$ for every $j > 1$ with $\chi_j(g) \ne 0$, then write $\chi_j(1) = p\,d_j$. In this case 
(3) becomes 

$$0 = 1 + p \sum_{j} d_j \chi_j(g).$$

Thus $\sum_j d_j \chi_j(g) = -1/p$ is an algebraic integer, a contradiction. This proves there is 
some $j$ such that $p$ does not divide $\chi_j(1)$ and $\chi_j(g) \ne 0$. If $\varphi$ is a representation whose 
character is $\chi_j$, then $\varphi$ is faithful (because $G$ is assumed to be simple) and, by Lemma 6, 
$\varphi(g)$ is a scalar matrix. Since $\varphi(g)$ commutes with all matrices, $\varphi(g) \in Z(\varphi(G))$. This 
forces $g \in Z(G)$, contrary to $G$ being a non-abelian simple group. The proof of the 
lemma is complete. 

We now prove Burnside's Theorem. Let $G$ be a group of order $p^a q^b$ for some primes 
$p$ and $q$. If $p = q$ or if either exponent is 0 then $G$ is nilpotent hence solvable. Thus we 
may assume this is not the case. Proceeding by induction let $G$ be a counterexample 
of minimal order. If $G$ has a proper, nontrivial normal subgroup $N$, then by induction 
both $N$ and $G/N$ are solvable, hence so is $G$ (cf. Section 3.4 or Proposition 6.10). Thus 
we may assume $G$ is a non-abelian simple group. Let $P \in Syl_p(G)$. By Theorem 8 
of Chapter 4 there exists $g \in Z(P)$ with $g \ne 1$. Since $P \le C_G(g)$, the order of the 
conjugacy class of $g$ (which equals $|G : C_G(g)|$) is prime to $p$, i.e., is a power of $q$. 
This violates Lemma 7 and so completes the proof of Burnside's Theorem. 

Philip Hall’s Theorem 

Recall that a subgroup of a finite group is called a Hall subgroup if its order and index are 
relatively prime. For any subgroup $H$ of a group $G$ a subgroup $K$ such that $G = HK$ 
and $H \cap K = 1$ is called a complement to $H$ in $G$. 

Theorem 8. (P. Hall) Let $G$ be a group of order $p_1^{a_1} p_2^{a_2} \cdots p_t^{a_t}$ where $p_1, \dots, p_t$ 
are distinct primes. If for each $i \in \{1, \dots, t\}$ there exists a subgroup $H_i$ of $G$ with 
$|G : H_i| = p_i^{a_i}$, then $G$ is solvable. 

Hall's Theorem can also be phrased: if for each $i \in \{1, \dots, t\}$ a Sylow $p_i$-subgroup 
of $G$ has a complement, then $G$ is solvable. The converse to Hall's Theorem is also true; 
this was Exercise 33 in Section 6.1. 

We shall first need some elementary lemmas. 
Lemma 9. If $G$ is solvable of order $> 1$, then there exists $P \trianglelefteq G$ with $P$ a nontrivial 
$p$-group for some prime $p$. 


Proof: This is a special case of the exercise on minimal normal subgroups of 
solvable groups at the end of Section 6.1. One can see this easily by letting $P$ be a 
nontrivial Sylow subgroup of the last nontrivial term, $G^{(n-1)}$, in the derived series of $G$ 
(where $G$ has solvable length $n$). In this case $G^{(n-1)}$ is abelian so $P$ is a characteristic 
subgroup of $G^{(n-1)}$, hence is normal in $G$. 


Lemma 10. Let $G$ be a group of order $p_1^{a_1} p_2^{a_2} \cdots p_t^{a_t}$ where $p_1, \dots, p_t$ are distinct 
primes. Suppose there are subgroups $H$ and $K$ of $G$ such that for each $i \in \{1, \dots, t\}$, 
either $p_i^{a_i}$ divides $|H|$ or $p_i^{a_i}$ divides $|K|$. Then $G = HK$ and $|H \cap K| = (|H|, |K|)$. 


Proof: Fix some $i \in \{1, \dots, t\}$ and suppose first that $p_i^{a_i}$ divides the order of $H$. 
Since $HK$ is a disjoint union of right cosets of $H$ and each of these right cosets has 
order equal to $|H|$, it follows that $p_i^{a_i}$ divides $|HK|$. Similarly, if $p_i^{a_i}$ divides $|K|$, since 
$HK$ is a disjoint union of left cosets of $K$, again $p_i^{a_i}$ divides $|HK|$. Thus $|G|$ divides $|HK|$ 
and so $G = HK$. Since 

$$|HK| = \frac{|H|\,|K|}{|H \cap K|},$$

it follows that $|H \cap K| = (|H|, |K|)$. 
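
The conclusion of Lemma 10 is easy to test in a small case. The sketch below is an illustration only and is not taken from the text: it assumes $G = S_4$ (of order $2^3 \cdot 3$), takes $H$ to be a dihedral Sylow 2-subgroup of order 8 and $K = A_4$ of order 12, and checks that $HK = G$ and $|H \cap K| = (8, 12) = 4$. The generators chosen and the helper `generate` are hypothetical names for this example.

```python
from math import gcd

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[k]] for k in range(len(q)))

def generate(generators):
    """Subgroup generated by the given permutations (closure in a finite group)."""
    identity = tuple(range(len(generators[0])))
    elements = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in generators:
            h = compose(s, g)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

# H: dihedral group of order 8 generated by a 4-cycle and a reflection of the square.
r, s = (1, 2, 3, 0), (2, 1, 0, 3)
# K: the alternating group A_4, generated by a 3-cycle and a double transposition.
c, d = (1, 2, 0, 3), (1, 0, 3, 2)

H = generate([r, s])
K = generate([c, d])
HK = {compose(h, k) for h in H for k in K}
```

Here $2^3$ divides $|H|$ and $3$ divides $|K|$, so the hypotheses of the lemma hold with $t = 2$.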


We now begin the proof of Hall's Theorem, proceeding by induction on $|G|$. Note 
that if $t = 1$ the hypotheses are trivially satisfied for any group ($H_1 = 1$) and if $t = 2$ 
the hypotheses are again satisfied for any group by Sylow's Theorem ($H_1$ is a Sylow 
$p_2$-subgroup of $G$ and $H_2$ is a Sylow $p_1$-subgroup of $G$). If $t = 1$, $G$ is nilpotent, hence 
solvable and if $t = 2$, $G$ is solvable by Burnside's Theorem. Assume therefore that 
$t \ge 3$. 

Fix $i$ and note that by the preceding lemma, for all $j \in \{1, \dots, t\} - \{i\}$, 

$$|H_i : H_i \cap H_j| = p_j^{a_j}.$$

Thus every Sylow $p_j$-subgroup of $H_i$ has a complement in $H_i$: $H_j \cap H_i$. By induction 
$H_i$ is solvable. 

By Lemma 9 we may choose $P \trianglelefteq H_1$ with $|P| = p_i^{a} > 1$ for some $i > 1$. Since 
$t \ge 3$ there exists an index $j \in \{1, \dots, t\} - \{1, i\}$. By Lemma 10 

$$|H_1 \cap H_j| = p_2^{a_2} \cdots p_{j-1}^{a_{j-1}}\, p_{j+1}^{a_{j+1}} \cdots p_t^{a_t}.$$

Thus $H_1 \cap H_j$ contains a Sylow $p_i$-subgroup of $H_1$. Since $P$ is a normal $p_i$-subgroup 
of $H_1$, $P$ is contained in every Sylow $p_i$-subgroup of $H_1$ and so $P \le H_1 \cap H_j$. By 
Lemma 10, $G = H_1 H_j$ so each $g \in G$ may be written $g = h_1 h_j$ for some $h_1 \in H_1$ and 
$h_j \in H_j$. Then 

$$g H_j g^{-1} = (h_1 h_j) H_j (h_1 h_j)^{-1} = h_1 H_j h_1^{-1}$$

and so 

$$\bigcap_{g \in G} g H_j g^{-1} = \bigcap_{h_1 \in H_1} h_1 H_j h_1^{-1}.$$

Now $P \le H_j$ and $h_1 P h_1^{-1} = P$ for all $h_1 \in H_1$. Thus 

$$1 \ne P \le \bigcap_{h_1 \in H_1} h_1 H_j h_1^{-1}.$$

Thus $N = \bigcap_{g \in G} g H_j g^{-1}$ is a nontrivial, proper normal subgroup of $G$. It follows that 
both $N$ and $G/N$ satisfy the hypotheses of the theorem (cf. the exercises in Section 
3.3). Both $N$ and $G/N$ are solvable by induction, so $G$ is solvable. This completes the 
proof of Hall's Theorem. 


EXERCISES 

1. Show that every character of the symmetric group $S_n$ is integer valued, for all $n$ (i.e., 
$\psi(g) \in \mathbb{Z}$ for all $g \in S_n$ and all characters $\psi$ of $S_n$). [See Exercise 22 in Section 18.3.] 

2. Let $G$ be a finite group with the property that every maximal subgroup has either prime 
or prime squared index. Prove that $G$ is solvable. (The simple group $GL_3(\mathbb{F}_2)$ has the 
property that every maximal subgroup has index either 7 or 8, i.e., either prime or prime 
cubed index; cf. Section 6.2.) [Let $p$ be the largest prime dividing $|G|$ and let $P$ be 
a Sylow $p$-subgroup of $G$. If $P \trianglelefteq G$, apply induction to $G/P$. Otherwise let $M$ be a 
maximal subgroup containing $N_G(P)$. Use Exercise 51 in Section 4.5 to show that $p = 3$ 
and deduce that $|G| = 2^a 3^b$.] 

3. Assume G is a finite group that possesses an abelian subgroup H whose index is a power 
of a prime. Prove that G is solvable. 

4. Repeat the preceding exercise with the word "abelian" replaced by "nilpotent". 

5. Use the ideas in the proof of Philip Hall's Theorem to prove Burnside's $p^a q^b$ Theorem in 
the special case when all Sylow subgroups are abelian (without use of character theory). 


19.3 INTRODUCTION TO THE THEORY OF INDUCED CHARACTERS 

Let $G$ be a finite group, let $H$ be a subgroup of $G$ and let $\varphi$ be a representation of 
the subgroup $H$ over an arbitrary field $F$. In this section we show how to obtain a 
representation of $G$, called the induced representation, from the representation $\varphi$ of its 
subgroup. We also determine a formula for the character of this induced representation, 
the induced character, in terms of the character of $\varphi$ and we illustrate this formula by 
computing some induced characters in specific groups. Finally, we apply the theory of 
induced characters to prove that there are no simple groups of order $3^3 \cdot 7 \cdot 13 \cdot 409$, 
a group order which was discussed at the end of Section 6.2 in the context of the 
existence problem for simple groups. The theory of induced representations and induced 
characters marks the beginning of more advanced representation theory. This section 
is intended as an introduction rather than as a comprehensive treatment, and the results 
we have included were chosen to serve this purpose. 

First observe that it may not be possible to extend a representation $\varphi$ of the subgroup 
$H$ to a representation $\Phi$ of $G$ in such a way that $\Phi|_H = \varphi$. For example, $A_3 \le S_3$ 
and $A_3$ has a faithful representation of degree 1 (cf. Section 1). Since every degree 1 
representation of $S_3$ contains $A_3 = S_3'$ in its kernel, this representation of $A_3$ cannot 
be extended to a representation of $S_3$. For another example of a representation of a 
subgroup which cannot be extended to the whole group take $G$ to be any simple group 
and let $\varphi$ be any representation of $H$ with the property that $\ker \varphi$ is a proper, nontrivial 
normal subgroup of $H$. If $\varphi$ extended to a representation $\Phi$ of $G$ then the kernel of $\Phi$ 
would be a proper, nontrivial normal subgroup of $G$, contrary to $G$ being a simple group. We 
shall see that the method of induced characters produces a representation $\Phi$ of $G$ from 
a given representation $\varphi$ of its subgroup $H$ but that $\Phi|_H \ne \varphi$ in general (indeed, unless 
$H = G$ the degree of $\Phi$ will be greater than the degree of $\varphi$). 

We saw in Example 5 following Corollary 9 in Section 10.4 that because $FH$ is a 
subring of $FG$, the ring $FG$ is an $(FG, FH)$-bimodule; and so for any left $FH$-module 
$V$, the abelian group $FG \otimes_{FH} V$ is a left $FG$-module (called the extension of scalars 
from $FH$ to $FG$ for $V$). In the representation theory of finite groups this extension is 
given a special name. 

Definition. Let $H$ be a subgroup of the finite group $G$ and let $V$ be an $FH$-module 
affording the representation $\varphi$ of $H$. The $FG$-module $FG \otimes_{FH} V$ is called the induced 
module of $V$ and the representation of $G$ it affords is called the induced representation 
of $\varphi$. If $\psi$ is the character of $\varphi$ then the character of the induced representation is called 
the induced character and is denoted by $\operatorname{Ind}_H^G(\psi)$. 

Theorem 11. Let $H$ be a subgroup of the finite group $G$ and let $g_1, \dots, g_m$ be 
representatives for the distinct left cosets of $H$ in $G$. Let $V$ be an $FH$-module affording 
the matrix representation $\varphi$ of $H$ of degree $n$. The $FG$-module $W = FG \otimes_{FH} V$ 
has dimension $nm$ over $F$ and there is a basis of $W$ such that $W$ affords the matrix 
representation $\Phi$ defined for each $g \in G$ by 

$$\Phi(g) = \begin{pmatrix}
\varphi(g_1^{-1} g g_1) & \cdots & \varphi(g_1^{-1} g g_m) \\
\vdots & \ddots & \vdots \\
\varphi(g_m^{-1} g g_1) & \cdots & \varphi(g_m^{-1} g g_m)
\end{pmatrix}$$

where each $\varphi(g_i^{-1} g g_j)$ is an $n \times n$ block appearing in the $i, j$ block position of $\Phi(g)$, 
and where $\varphi(g_i^{-1} g g_j)$ is defined to be the zero block whenever $g_i^{-1} g g_j \notin H$. 

Proof: First note that $FG$ is a free right $FH$-module: 

$$FG = g_1 FH \oplus g_2 FH \oplus \cdots \oplus g_m FH.$$

Since tensor products commute with direct sums (Theorem 17, Section 10.4), as abelian 
groups we have 

$$W = FG \otimes_{FH} V \cong (g_1 \otimes V) \oplus (g_2 \otimes V) \oplus \cdots \oplus (g_m \otimes V).$$

Since $F$ is in the center of $FG$ it follows that this is an $F$-vector space isomorphism as 
well. Thus if $v_1, v_2, \dots, v_n$ is a basis of $V$ affording the matrix representation $\varphi$, then 
$\{g_i \otimes v_j \mid 1 \le i \le m,\ 1 \le j \le n\}$ is a basis of $W$. This shows the dimension of $W$ is 
$mn$. Order the basis into $m$ sets, each of size $n$ as 

$$g_1 \otimes v_1,\ g_1 \otimes v_2,\ \dots,\ g_1 \otimes v_n,\ g_2 \otimes v_1,\ \dots,\ g_2 \otimes v_n,\ \dots,\ g_m \otimes v_n.$$
We compute the matrix representation $\Phi(g)$ of each $g$ acting on $W$ with respect to this 
basis. Fix $j$ and $g$, and let $g g_j = g_i h$ for some index $i$ and some $h \in H$. Then for 
every $k$ 

$$g(g_j \otimes v_k) = (g g_j) \otimes v_k = g_i \otimes h v_k = \sum_{t=1}^{n} a_{tk}(h)\,(g_i \otimes v_t)$$

where $a_{tk}$ is the $t, k$ coefficient of the matrix of $h$ acting on $V$ with respect to the basis 
$\{v_1, \dots, v_n\}$. In other words, the action of $g$ on $W$ maps the $j$th block of $n$ basis vectors 
of $W$ to the $i$th block of basis vectors, and then has the matrix $\varphi(h)$ on that block. Since 
$h = g_i^{-1} g g_j$, this describes the block matrix $\Phi(g)$ of the theorem, as needed. 

Corollary 12. In the notation of Theorem 11 

(1) if $\psi$ is the character afforded by $V$ then the induced character is given by 

$$\operatorname{Ind}_H^G(\psi)(g) = \sum_{i=1}^{m} \psi(g_i^{-1} g g_i)$$

where $\psi(g_i^{-1} g g_i)$ is defined to be 0 if $g_i^{-1} g g_i \notin H$, and 

(2) $\operatorname{Ind}_H^G(\psi)(g) = 0$ if $g$ is not conjugate in $G$ to some element of $H$. In particular, 
if $H$ is a normal subgroup of $G$ then $\operatorname{Ind}_H^G(\psi)$ is zero on all elements of $G - H$. 


Remark: Since the character $\psi$ of $H$ is constant on the conjugacy classes of $H$ we have 
$\psi(g) = \psi(h^{-1} g h)$ for all $h \in H$. As $h$ runs over all elements of $H$, $xh$ runs over 
all elements of the coset $xH$. Thus the formula for the induced character may also be 
written 

$$\operatorname{Ind}_H^G(\psi)(g) = \frac{1}{|H|} \sum_{x \in G} \psi(x^{-1} g x)$$

where the elements $x$ in each fixed coset give the same character value $|H|$ times (which 
accounts for the factor of $1/|H|$), and again $\psi(x^{-1} g x) = 0$ if $x^{-1} g x \notin H$. 

Proof: From the matrix of $g$ computed above, the blocks $\varphi(g_i^{-1} g g_i)$ down the 
diagonal of $\Phi(g)$ are zero except when $g_i^{-1} g g_i \in H$. Thus the trace of the block matrix 
$\Phi(g)$ is the sum of the traces of the matrices $\varphi(g_i^{-1} g g_i)$ for which $g_i^{-1} g g_i \in H$. Since 
the trace of $\varphi(g_i^{-1} g g_i)$ is $\psi(g_i^{-1} g g_i)$, part (1) holds. 

If $g_i^{-1} g g_i \notin H$ for all coset representatives $g_i$ then each term in the sum for 
$\operatorname{Ind}_H^G(\psi)(g)$ is zero. In particular, if $g$ is not in the normal subgroup $H$ then neither is 
any conjugate of $g$, so $\operatorname{Ind}_H^G(\psi)$ is zero on $g$. 
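
The averaged form of the induced character formula, $\frac{1}{|H|}\sum_{x \in G} \psi(x^{-1}gx)$, translates directly into a short computation. The following sketch is illustrative only: the function `induced_character` and the choice of test case ($G = S_3$, $H = A_3$, with $\psi$ sending a 3-cycle to a primitive cube root of unity) are assumptions made for this example, and permutations are stored as tuples.

```python
import cmath
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[k]] for k in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for k, image in enumerate(p):
        inv[image] = k
    return tuple(inv)

def induced_character(G, psi):
    """psi is a dict mapping each element of H to its character value.

    Returns the class function g -> (1/|H|) * sum over x in G of psi(x^{-1} g x),
    where terms with x^{-1} g x outside H contribute 0.
    """
    def Psi(g):
        total = 0
        for x in G:
            y = compose(compose(inverse(x), g), x)
            total += psi.get(y, 0)
        return total / len(psi)
    return Psi

S3 = list(permutations(range(3)))
e, x = (0, 1, 2), (1, 2, 0)           # identity and a 3-cycle generating A_3
zeta = cmath.exp(2j * cmath.pi / 3)   # a primitive cube root of unity
psi = {e: 1, x: zeta, compose(x, x): zeta ** 2}

Psi = induced_character(S3, psi)
transposition = (1, 0, 2)
```

Up to floating point rounding, `Psi(e)` is the degree $|G : H| \cdot \psi(1)$, `Psi(x)` is $\zeta + \zeta^2$, and `Psi(transposition)` vanishes because no conjugate of a transposition lies in the normal subgroup $A_3$.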


Examples 

(1) Let $G = D_{12} = \langle r, s \mid r^6 = s^2 = 1,\ rs = sr^{-1} \rangle$ be the dihedral group of order 12 and 
let $H = \{1, s, r^3, sr^3\}$, so that $H$ is isomorphic to the Klein 4-group and $|G : H| = 3$. 
Following the notation of Theorem 11 we exhibit the matrices for $r$ and $s$ of the induced 
representation of a specific representation $\varphi$ of $H$. Let the representation of $H$ on a 
2-dimensional vector space over $\mathbb{Q}$ with respect to some basis $v_1, v_2$ be given by 

$$\varphi(s) = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} = A, \qquad
\varphi(r^3) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} = B, \qquad
\varphi(sr^3) = \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = C$$

so $n = 2$, $m = 3$ and the induced representation $\Phi$ has degree $nm = 6$. Fix 
representatives $g_1 = 1$, $g_2 = r$, and $g_3 = r^2$ for the left cosets of $H$ in $G$, so that $g_k = r^{k-1}$. 
Then 

$$g_i^{-1} r g_j = r^{-(i-1)+1+(j-1)} = r^{j-i+1}, \quad \text{and} \quad
g_i^{-1} s g_j = s r^{(i-1)+(j-1)} = s r^{i+j-2}.$$

Thus the $6 \times 6$ matrices for the induced representation are seen to be 

$$\Phi(r) = \begin{pmatrix} 0 & 0 & B \\ I & 0 & 0 \\ 0 & I & 0 \end{pmatrix}, \qquad
\Phi(s) = \begin{pmatrix} A & 0 & 0 \\ 0 & 0 & C \\ 0 & C & 0 \end{pmatrix}$$

where the $2 \times 2$ matrices $A$, $B$ and $C$ are given above, $I$ is the $2 \times 2$ identity matrix 
and 0 denotes the $2 \times 2$ zero matrix. 

(2) If $H$ is any subgroup of $G$ and $\psi_1$ is the principal character of $H$, then $\operatorname{Ind}_H^G(\psi_1)(g)$ 
counts 1 for each coset representative $g_i$ such that $g_i^{-1} g g_i \in H$. Since $g_i^{-1} g g_i \in H$ 
if and only if $g$ fixes the left coset $g_i H$ under left multiplication, $\operatorname{Ind}_H^G(\psi_1)(g)$ is the 
number of points fixed by $g$ in the permutation representation of $G$ on the left cosets 
of $H$. Thus by Example 3 of Section 18.3 we see that: if $\psi_1$ is the principal character 
of $H$ then $\operatorname{Ind}_H^G(\psi_1)$ is the permutation character on the left cosets of $H$ in $G$. In the 
special case when $H = 1$, this implies if $\chi_1$ is the principal character of the trivial 
subgroup $H = 1$ then $\operatorname{Ind}_1^G(\chi_1)$ is the regular character of $G$. This also shows that an 
induced character is not, in general, irreducible even if the character from which it is 
induced is irreducible. 

(3) Let $G = S_3$ and let $\psi$ be a nonprincipal linear character of $A_3 = \langle x \rangle$, so that $\psi(x) = \zeta$ 
for some primitive cube root of unity $\zeta$ (the character tables of $A_3 = Z_3$ and $S_3$ appear 
in Section 1). Let $\Psi = \operatorname{Ind}_{A_3}^{S_3}(\psi)$. Thus $\Psi$ has degree $1 \cdot |S_3 : A_3| = 2$ and, by the 
corollary, $\Psi$ is zero on all transpositions. If $y$ is any transposition then $1, y$ is a set of 
left coset representatives of $A_3$ in $S_3$ and $y^{-1} x y = x^2$. Thus $\Psi(x) = \psi(x) + \psi(x^2)$ 
equals $\zeta + \zeta^2 = -1$. This shows that if $\psi$ is either of the two nonprincipal irreducible 
characters of $A_3$ then the induced character of $\psi$ is the (unique) irreducible character 
of $S_3$ of degree 2. In particular, different characters of a subgroup may induce the 
same character of the whole group. 

(4) Let $G = D_8$ have its usual generators and relations and let $H = \langle s \rangle$. Let $\psi$ be the 
nonprincipal irreducible character of $H$ and let $\Psi = \operatorname{Ind}_H^G(\psi)$. Pick left coset 
representatives $1, r, r^2, r^3$ for $H$. By Theorem 11, $\Psi(1) = 4$. Since $\psi(s) = -1$, one computes 
directly that $\Psi(s) = -2$. By Corollary 12(2) we obtain $\Psi(r) = \Psi(r^2) = \Psi(sr) = 0$. 
In the notation of the character table of $D_8$ in Section 1, by the orthogonality relations 
we obtain $\Psi = \chi_2 + \chi_4 + \chi_5$ (which may be checked by inspection). 
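
The block matrices of Example (1) can be generated mechanically from Theorem 11 and checked to define a homomorphism. The sketch below is an illustration under stated assumptions: elements of $D_{12}$ are encoded as pairs `(e, k)` standing for $s^e r^k$, and the matrices for $s$, $r^3$ and $sr^3$ are assumed to be $A = \operatorname{diag}(-1, 1)$, $B = \operatorname{diag}(1, -1)$ and $C = \operatorname{diag}(-1, -1)$. All names (`mul`, `inv`, `Phi`, and so on) are hypothetical helpers for this example.

```python
# Elements of D_12 are pairs (e, k) standing for s^e r^k,
# with r^6 = s^2 = 1 and r s = s r^{-1}.
def mul(a, b):
    (e, k), (f, l) = a, b
    # (s^e r^k)(s^f r^l) = s^(e+f) r^(±k + l), with sign -1 exactly when f = 1.
    return ((e + f) % 2, ((-k if f else k) + l) % 6)

def inv(a):
    e, k = a
    # Elements s r^k are involutions; rotations invert by negating the exponent.
    return (1, k % 6) if e else (0, -k % 6)

G = [(e, k) for e in (0, 1) for k in range(6)]

I2 = ((1, 0), (0, 1))
A = ((-1, 0), (0, 1))    # assumed matrix of s
B = ((1, 0), (0, -1))    # assumed matrix of r^3
C = ((-1, 0), (0, -1))   # assumed matrix of s r^3
phi = {(0, 0): I2, (1, 0): A, (0, 3): B, (1, 3): C}   # representation of H

reps = [(0, 0), (0, 1), (0, 2)]   # coset representatives 1, r, r^2

def Phi(g):
    """6x6 matrix with block (i, j) equal to phi(g_i^{-1} g g_j), or 0 if outside H."""
    M = [[0] * 6 for _ in range(6)]
    for i, gi in enumerate(reps):
        for j, gj in enumerate(reps):
            h = mul(mul(inv(gi), g), gj)
            if h in phi:
                for a in range(2):
                    for b in range(2):
                        M[2 * i + a][2 * j + b] = phi[h][a][b]
    return M

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(6)) for j in range(6)]
            for i in range(6)]

def is_homomorphism():
    return all(matmul(Phi(g), Phi(h)) == Phi(mul(g, h)) for g in G for h in G)
```

Calling `Phi((0, 1))` and `Phi((1, 0))` produces the matrices of $r$ and $s$ in block form, and `is_homomorphism()` confirms multiplicativity over all 144 pairs of group elements.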

For the remainder of this section the field F is taken to be the complex numbers: 

F = C. 

Before concluding with an application of induced characters to simple groups we 
compute the characters of an important class of groups. 
Definition. A finite group $G$ is called a Frobenius group with Frobenius kernel $Q$ if 
$Q$ is a proper, nontrivial normal subgroup of $G$ and $C_G(x) \le Q$ for all nonidentity 
elements $x$ of $Q$. 

In view of the application to simple groups mentioned at the beginning of this 
section we shall restrict attention to Frobenius groups $G$ of order $q^a p$, where $p$ and $q$ 
are distinct primes, such that the Frobenius kernel $Q$ is an elementary abelian $q$-group 
of order $q^a$ and the cyclic group $G/Q$ acts irreducibly by conjugation on $Q$. In other 
words, we shall assume $Q$ is a direct product of cyclic groups of order $q$ and the only 
normal subgroups of $G$ that are contained in $Q$ are 1 and $Q$, i.e., $Q$ is a minimal normal 
subgroup of $G$. For example, $A_4$ is a Frobenius group of this type with Frobenius kernel 
$V_4$, its Sylow 2-subgroup. Also, if $p$ and $q$ are distinct primes with $p < q$ and $G$ is a 
non-abelian group of order $pq$ (one always exists if $p \mid q - 1$) then $G$ is a Frobenius 
group whose Frobenius kernel is its Sylow $q$-subgroup (which is normal by Sylow's 
Theorem). We essentially determine the character table of these Frobenius groups. 
Analogous results on more general Frobenius groups appear in the exercises. 

Proposition 13. Let $G$ be a Frobenius group of order $q^a p$, where $p$ and $q$ are distinct 
primes, such that the Frobenius kernel $Q$ is an elementary abelian $q$-group of order $q^a$ 
and the cyclic group $G/Q$ acts irreducibly by conjugation on $Q$. Then the following 
hold: 

(1) $G = QP$ where $P$ is a Sylow $p$-subgroup of $G$. Every nonidentity element of 
$G$ has order $p$ or $q$. Every element of order $p$ is conjugate to an element of 
$P$ and every element of order $q$ belongs to $Q$. The nonidentity elements of $P$ 
represent the $p - 1$ distinct conjugacy classes of elements of order $p$ and each 
of these classes has size $q^a$. There are $(q^a - 1)/p$ distinct conjugacy classes of 
elements of order $q$ and each of these classes has size $p$. 

(2) $G' = Q$ so the number of degree 1 characters of $G$ is $p$ and every degree 1 
character contains $Q$ in its kernel. 

(3) If $\psi$ is any nonprincipal irreducible character of $Q$, then $\operatorname{Ind}_Q^G(\psi)$ is an 
irreducible character of $G$. Moreover, every irreducible character of $G$ of degree 
$> 1$ is equal to $\operatorname{Ind}_Q^G(\psi)$ for some nonprincipal irreducible character $\psi$ of $Q$. 
Every irreducible character of $G$ has degree either 1 or $p$ and the number of 
irreducible characters of degree $p$ is $(q^a - 1)/p$. 

Proof: Note that $QP$ equals $G$ by order consideration. By definition of a Frobenius 
group and because $Q$ is abelian, $C_G(h) = Q$ for every nonidentity element $h$ of $Q$. If 
$x$ were an element of order $pq$, then $x^p$ would be an element of order $q$, hence would 
lie in the unique Sylow $q$-subgroup $Q$ of $G$. But then $x$ would commute with $x^p$ and so 
$x$ would belong to $C_G(x^p) = Q$, a contradiction. Thus $G$ has no elements of order $pq$. 
By Sylow's Theorem every element of order $p$ is conjugate to an element of $P$ and every 
element of order $q$ lies in $Q$. No two distinct elements of $P$ are conjugate in $G$ because 
if $g^{-1} x g = y$ for some $x, y \in P$ then $\bar{g}^{-1} \bar{x} \bar{g} = \bar{y}$ in the abelian group $\bar{G} = G/Q$ 
and so $\bar{x} = \bar{y}$. Then $x = y$ because $\bar{P} \cong P$. Thus there are exactly $p - 1$ conjugacy 
classes of elements of order $p$ and these are represented by the nonidentity elements of 
$P$. If $x$ is a nonidentity element of $P$, then $C_G(x) = P$ and so the conjugacy class of 
$x$ consists of $|G : P| = q^a$ elements. Finally, if $h$ is a nonidentity element of $Q$, then 
$C_G(h) = Q$ and the conjugacy class of $h$ is $\{h, h^x, \dots, h^{x^{p-1}}\}$, where $P = \langle x \rangle$. This 
proves all parts of (1). 

Since $G/Q$ is abelian, $G' \le Q$. Since $G$ is non-abelian and $Q$ is, by hypothesis, 
a minimal normal subgroup of $G$ we must have $G' = Q$. Part (2) now follows from 
Corollary 11 in Section 18.2. 

Let $\psi$ be a nonprincipal irreducible character of $Q$ and let $\Psi = \operatorname{Ind}_Q^G(\psi)$. We use 
the orthogonality relations to show that $\Psi$ is irreducible. Let $1, x, \dots, x^{p-1}$ be coset 
representatives for $Q$ in $G$. By Corollary 12, $\Psi$ is zero on $G - Q$ so 

$$\begin{aligned}
\|\Psi\|^2 &= \frac{1}{|G|} \sum_{h \in Q} \Psi(h)\,\overline{\Psi(h)} \\
  &= \frac{1}{|G|} \sum_{h \in Q} \sum_{i=0}^{p-1} \psi(x^i h x^{-i})\,\overline{\psi(x^i h x^{-i})} \\
  &= \frac{p}{|G|} \sum_{h \in Q} \psi(h)\,\overline{\psi(h)} \\
  &= \frac{p}{|G|}\,|Q| = 1
\end{aligned}$$

where the second line follows from the definition of the induced character $\Psi$, the third 
line follows because each element of $Q$ appears exactly $p$ times in the sum in the second 
line, and the last line follows from the first orthogonality relation in $Q$ because $\psi$ is an 
irreducible character of $Q$. This proves $\Psi$ is an irreducible character of $G$. 

We prove that every irreducible character of $G$ of degree $> 1$ is the induced 
character of some nonprincipal degree 1 character of $Q$ by counting the number of distinct 
irreducible characters of $G$ obtained this way. By parts (1) and (2) the number of 
irreducible characters of $G$ (= the number of conjugacy classes) is $p + (q^a - 1)/p$ and the 
number of degree 1 characters is $p$. Thus the number of irreducible characters of $G$ of 
degree $> 1$ is $(q^a - 1)/p$. The group $P$ acts on the set $\mathcal{C}$ of nonprincipal irreducible 
characters of $Q$ as follows: for each $\psi \in \mathcal{C}$ and each $x \in P$ let $\psi^x$ be defined by 

$$\psi^x(h) = \psi(x h x^{-1}) \quad \text{for all } h \in Q.$$

Since $\psi$ is a nontrivial homomorphism from $Q$ into $\mathbb{C}^\times$ (recall that all irreducible 
characters of the abelian group $Q$ have degree 1) it follows easily that $\psi^x$ is also a 
homomorphism. Thus $\psi^x \in \mathcal{C}$ and so $P$ permutes the elements of $\mathcal{C}$. Now let $x$ be a 
generator for the cyclic group $P$. Then $1, x, \dots, x^{p-1}$ are representatives for the left 
cosets of $Q$ in $G$. By Corollary 12 applied with this set of coset representatives we see 
that if $\psi \in \mathcal{C}$ then the value of $\operatorname{Ind}_Q^G(\psi)$ on any element $h$ of $Q$ is given by the sum 
$\psi(h) + \psi^x(h) + \cdots + \psi^{x^{p-1}}(h)$. Thus when the induced character $\operatorname{Ind}_Q^G(\psi)$ is restricted 
to $Q$ it decomposes into irreducible characters of $Q$ as 

$$\operatorname{Ind}_Q^G(\psi)\big|_Q = \psi + \psi^x + \cdots + \psi^{x^{p-1}}.$$

If $\psi_1$ and $\psi_2$ are in different orbits of the action of $P$ on $\mathcal{C}$ then the induced characters 
$\operatorname{Ind}_Q^G(\psi_1)$ and $\operatorname{Ind}_Q^G(\psi_2)$ restrict to distinct characters of $Q$ (they have no irreducible 
constituents in common). Thus characters induced from elements of distinct orbits 
of $P$ on $\mathcal{C}$ are distinct irreducible characters of $G$. The abelian group $Q$ has $q^a - 1$ 
nonprincipal irreducible characters (i.e., $|\mathcal{C}| = q^a - 1$) and $|P| = p$ so there are at 
least $(q^a - 1)/p$ orbits of $P$ on $\mathcal{C}$ and hence at least this number of distinct irreducible 
characters of $G$ of degree $p$. Since $G$ has exactly $(q^a - 1)/p$ irreducible characters of 
degree $> 1$, every irreducible character of $G$ of degree $> 1$ must have degree $p$ and 
must be an induced character from some element of $\mathcal{C}$. The proof is complete. 
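
The class counts in part (1) of Proposition 13 can be confirmed by brute force for the smallest example mentioned above, $A_4$ with $q = 2$, $a = 2$, $p = 3$. The sketch below is only an illustration; it represents $A_4$ as the even permutations of four points, and its helper names are hypothetical.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[k]] for k in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for k, image in enumerate(p):
        inv[image] = k
    return tuple(inv)

def is_even(p):
    inversions = sum(1 for a in range(len(p))
                     for b in range(a + 1, len(p)) if p[a] > p[b])
    return inversions % 2 == 0

A4 = [g for g in permutations(range(4)) if is_even(g)]   # order 12 = 2^2 * 3

def order(g):
    identity = tuple(range(len(g)))
    n, h = 1, g
    while h != identity:
        h = compose(h, g)
        n += 1
    return n

def class_sizes_by_order(G):
    """Map each element order to the sorted list of sizes of classes of that order."""
    classes = {frozenset(compose(compose(inverse(x), g), x) for x in G) for g in G}
    table = {}
    for K in classes:
        table.setdefault(order(next(iter(K))), []).append(len(K))
    return {k: sorted(v) for k, v in table.items()}

q, a, p = 2, 2, 3
sizes = class_sizes_by_order(A4)
```

The dictionary `sizes` lists, for each element order, the sizes of the conjugacy classes of elements of that order; these can be compared with the counts $p - 1$ classes of size $q^a$ and $(q^a - 1)/p$ classes of size $p$.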

For the final example we shall require two properties of induced characters. These 
properties are listed in the next proposition and the proofs are straightforward exercises 
which follow easily from the formula for induced characters or from the definition of 
induced modules together with properties of tensor products. 

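Proposition 14. Let $G$ be a group, let $H$ be a subgroup of $G$ and let $\psi$ and $\psi'$ be 
characters of $H$. 

(1) (Induction of characters is additive) $\operatorname{Ind}_H^G(\psi + \psi') = \operatorname{Ind}_H^G(\psi) + \operatorname{Ind}_H^G(\psi')$. 

(2) (Induction of characters is transitive) If $H \le K \le G$ then 

$$\operatorname{Ind}_K^G(\operatorname{Ind}_H^K(\psi)) = \operatorname{Ind}_H^G(\psi).$$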


It follows from part (1) of Proposition 14 that if $\sum_{i=1}^{n} n_i \psi_i$ is any integral linear 
combination of characters of $H$ with $n_i \ge 0$ for all $i$ then 

$$\operatorname{Ind}_H^G\Big(\sum_{i=1}^{n} n_i \psi_i\Big) = \sum_{i=1}^{n} n_i \operatorname{Ind}_H^G(\psi_i). \tag{$*$}$$

A class function of $H$ of the form $\sum_{i=1}^{n} n_i \psi_i$ where the coefficients are any integers 
(not necessarily nonnegative) is called a generalized character or virtual character of 
$H$. For a generalized character of $H$ we define its induced generalized character of $G$ 
by equation (*), allowing now negative coefficients $n_i$ as well. In this way the function 
$\operatorname{Ind}_H^G$ becomes a group homomorphism from the additive group of generalized characters 
of $H$ to the additive group of generalized characters of $G$ (which maps characters to 
characters). This implies that the formula for induced characters in Corollary 12 holds 
also if $\psi$ is a generalized character of $H$. 

Application to Groups of Order $3^3 \cdot 7 \cdot 13 \cdot 409$ 

We now conclude with a proof of the following result: 

there are no simple groups of order $3^3 \cdot 7 \cdot 13 \cdot 409$. 

As mentioned at the beginning of this section, simple groups of this order were discussed 
at the end of Section 6.2 in the context of the existence problem for simple groups. It is 
possible to prove that there are no simple groups of this order by arguments involving a 
permutation representation of degree 819 (cf. the exercises in Section 6.2). We include 
a character-theoretic proof of this since the methods illustrate some important ideas in 
the theory of finite groups. The approach is based on M. Suzuki's seminal paper The 
nonexistence of a certain type of simple group of odd order, Proc. Amer. Math. Soc., 
8(1957), pp. 686-695, which treats much more general groups. Because we are dealing 
with a specific group order, our arguments are simpler and numerically more explicit, 
yet they retain some of the key ideas of Suzuki's work. Moreover, Suzuki's paper and its 
successor, Finite groups in which the centralizer of any non-identity element is nilpotent, 
by W. Feit, M. Hall and J. Thompson, Math. Zeit., 74(1960), pp. 1-17, are prototypes 
for the lengthy and difficult Feit-Thompson Theorem (cf. Section 3.4). Our discussion 
also conveys some of the flavor of these fundamental papers. In particular, each of 
these papers follows the basic development in which the structure and embedding of 
the Sylow subgroups is first determined and then character theory (with heavy reliance 
on induced characters) is applied. 

For the remainder of this section we assume $G$ is a simple group of order $3^3 \cdot 7 \cdot 13 \cdot 409$. 
We list some properties of $G$ which may be verified using the methods stemming 
from Sylow's Theorem discussed in Section 6.2. The details are left as exercises. 

(1) Let $q_1 = 3$, let $Q_1$ be a Sylow 3-subgroup of $G$ and let $N_1 = N_G(Q_1)$. Then $Q_1$ 
is an elementary abelian 3-group of order $3^3$ and $N_1$ is a Frobenius group of order 
$3^3 \cdot 13$ with Frobenius kernel $Q_1$ and with $N_1/Q_1$ acting irreducibly by conjugation 
on $Q_1$. 

(2) Let $q_2 = 7$, let $Q_2$ be a Sylow 7-subgroup of $G$ and let $N_2 = N_G(Q_2)$. Then $Q_2$ is 
cyclic of order 7 and $N_2$ is the non-abelian group of order $7 \cdot 3$ (so $N_2$ is a Frobenius 
group with Frobenius kernel $Q_2$). 

(3) Let $q_3 = 13$, let $Q_3$ be a Sylow 13-subgroup of $G$ and let $N_3 = N_G(Q_3)$. Then 
$Q_3$ is cyclic of order 13 and $N_3$ is the non-abelian group of order $13 \cdot 3$ (so $N_3$ is a 
Frobenius group with Frobenius kernel $Q_3$). 

(4) Let $q_4 = 409$, let $Q_4$ be a Sylow 409-subgroup of $G$ and let $N_4 = N_G(Q_4)$. Then 
$Q_4$ is cyclic of order 409 and $N_4$ is the non-abelian group of order $409 \cdot 3$ (so $N_4$ 
is a Frobenius group with Frobenius kernel $Q_4$). 

(5) Every nonidentity element of $G$ has prime order and $Q_i \cap Q_i^g = 1$ for every 
$g \in G - N_i$, for each $i = 1, 2, 3, 4$. The nonidentity conjugacy classes of $G$ are: 

(a) 2 classes of elements of order 3 (each of these classes has size $7 \cdot 13 \cdot 409$) 

(b) 2 classes of elements of order 7 (each of these classes has size $3^3 \cdot 13 \cdot 409$) 

(c) 4 classes of elements of order 13 (each of these classes has size $3^3 \cdot 7 \cdot 409$) 

(d) 136 classes of elements of order 409 (each of these classes has size $3^3 \cdot 7 \cdot 13$), 

and so there are 145 conjugacy classes in $G$. 

Since each of the groups $N_i$ is a Frobenius group satisfying the hypothesis of Proposition 
13, the number of characters of $N_i$ of degree $> 1$ may be read off from that proposition: 

(i) $N_1$ has 2 irreducible characters of degree 13 

(ii) $N_2$ has 2 irreducible characters of degree 3 

(iii) $N_3$ has 4 irreducible characters of degree 3 

(iv) $N_4$ has 136 irreducible characters of degree 3. 
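
The numbers in the two lists above are straightforward arithmetic consequences of Proposition 13 and of the class equation, and they can be rechecked as follows. This is a sketch for verification only; the function `frobenius_counts` and the variable names are hypothetical and simply encode the counts $p$ and $(q^a - 1)/p$ from the proposition.

```python
def frobenius_counts(q, a, p):
    """Counts from Proposition 13 for a Frobenius group of order q^a * p."""
    assert (q ** a - 1) % p == 0
    degree_p = (q ** a - 1) // p          # irreducible characters of degree p
    return {"linear": p, "degree_p": degree_p, "classes": p + degree_p}

order_G = 3 ** 3 * 7 * 13 * 409

# (q, a, p) for the Frobenius groups N_1, ..., N_4 of orders 3^3*13, 7*3, 13*3, 409*3.
normalizers = {1: (3, 3, 13), 2: (7, 1, 3), 3: (13, 1, 3), 4: (409, 1, 3)}
nonlinear_counts = {i: frobenius_counts(*data)["degree_p"]
                    for i, data in normalizers.items()}

# (number of classes, class size) for elements of order 3, 7, 13, 409, as listed in (5).
listed_classes = [
    (2, 7 * 13 * 409),
    (2, 3 ** 3 * 13 * 409),
    (4, 3 ** 3 * 7 * 409),
    (136, 3 ** 3 * 7 * 13),
]
number_of_classes = 1 + sum(n for n, _ in listed_classes)
class_equation_total = 1 + sum(n * size for n, size in listed_classes)
```

Note that the number of classes of elements of order $q_i$ in $G$ coincides with the number of nonlinear irreducible characters of $N_i$, and that the listed class sizes account for every element of $G$.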

From now on, to simplify notation, for any subgroup $H$ of $G$ and any generalized 
character $\mu$ of $H$ let 

$$\mu^* = \operatorname{Ind}_H^G(\mu)$$

so a star will always denote induction from a subgroup to the whole group $G$ and the 
subgroup will be clear from the context. 
The following lemma is a key point in the proof. It shows how the vanishing 
of induced characters described in Corollary 12 (together with the trivial intersection 
property of the Sylow subgroups $Q_i$, namely the fact that $Q_i \cap Q_i^g = 1$ for all 
$g \in G - N_G(Q_i)$) may be used to relate inner products of certain generalized characters to 
the inner products of their induced generalized characters. For these computations it is 
important that the generalized characters are zero on the identity (which explains why 
we are considering differences of characters of the same degree). 

Lemma 15. For any $i \in \{1, 2, 3, 4\}$ let $q = q_i$, let $Q = Q_i$, let $N = N_i$ and let
$p = |N : Q|$. Let $\psi_1, \dots, \psi_4$ be any irreducible characters of $N$ of degree $p$ (not
necessarily distinct) and let $\alpha = \psi_1 - \psi_2$ and $\beta = \psi_3 - \psi_4$. Then $\alpha$ and $\beta$ are
generalized characters of $N$ which are zero on every element of $N$ of order not equal
to $q$. Furthermore, $\alpha^*$ and $\beta^*$ are generalized characters of $G$ which are zero on every
element of $G$ of order not equal to $q$ and

$$(\alpha^*, \beta^*)_G = (\alpha, \beta)_N$$

(where $(\ ,\ )_H$ denotes the usual Hermitian product of class functions computed in the
group $H$). In other words, induction from $N$ to $G$ is an inner product preserving map
on such generalized characters $\alpha$, $\beta$ of $N$.


Proof: By Proposition 13, there are nonprincipal characters $\lambda_1, \dots, \lambda_4$ of $Q$ of
degree 1 such that $\psi_j = \operatorname{Ind}_Q^N(\lambda_j)$ for $j = 1, \dots, 4$. By Corollary 12 therefore, each $\psi_j$
vanishes on $N - Q$, hence so do $\alpha$ and $\beta$. Note that since $\psi_j(1) = p$ for all $j$ we have
$\alpha(1) = \beta(1) = 0$. By the transitivity of induction, $\psi_j^* = \operatorname{Ind}_N^G(\psi_j) = \operatorname{Ind}_Q^G(\lambda_j)$ for all
$j$. Again by Corollary 12 applied to the latter induced character we see that $\psi_j^*$ vanishes
on all elements not conjugate in $G$ to some element of $Q$, hence so do both $\alpha^*$ and $\beta^*$.
Since the induced characters $\psi_j^*$ all have degree $|G : Q|$, the generalized characters
$\alpha^*$ and $\beta^*$ are zero on the identity. Thus $\alpha^*$ and $\beta^*$ vanish on all elements of $G$ which
are not of order $q$. Finally, if $g_1, \dots, g_m$ are representatives for the left cosets of $N$ in
$G$ with $g_1 = 1$, then because $Q \cap Q^{g_k} = 1$ for all $k > 1$ (by (5) above), it follows
immediately from the formula for induced (generalized) characters that $\alpha^*(x) = \alpha(x)$
and $\beta^*(x) = \beta(x)$ for all nonidentity elements $x \in Q$ (i.e., for all elements $x \in N$ of
order $q$). Furthermore, by Sylow’s Theorem every element of $G$ of order $q$ lies in a
conjugate of $Q$, hence the collection of $G$-conjugates of the set $Q - \{1\}$ partition the
elements of order $q$ in $G$ into $|G : N|$ disjoint subsets. Since $\alpha^*$ and $\beta^*$ are class
functions on $G$, the sum of $\alpha^*(x)\overline{\beta^*(x)}$ as $x$ runs over any of these subsets is the same.
These facts imply

$$
\begin{aligned}
(\alpha^*, \beta^*)_G &= \frac{1}{|G|} \sum_{x \in G} \alpha^*(x)\overline{\beta^*(x)}
 = \frac{1}{|G|} \sum_{\substack{x \in G \\ |x| = q}} \alpha^*(x)\overline{\beta^*(x)} \\
&= \frac{1}{|G|} \sum_{\substack{x \in N \\ |x| = q}} |G : N|\, \alpha^*(x)\overline{\beta^*(x)} \\
&= \frac{1}{|N|} \sum_{x \in N} \alpha(x)\overline{\beta(x)} = (\alpha, \beta)_N .
\end{aligned}
$$

This completes the proof.

The next lemma sets up a correspondence between the irreducible characters of $N_i$
of degree $> 1$ and some nonprincipal irreducible characters of $G$.

Lemma 16. For any $i \in \{1, 2, 3, 4\}$ let $q = q_i$, let $Q = Q_i$, let $N = N_i$ and let
$p = |N : Q|$. Let $\psi_1, \dots, \psi_k$ be the distinct irreducible characters of $N$ of degree
$p$. Then there are distinct irreducible characters $\chi_1, \dots, \chi_k$ of $G$, all of which have
the same degree, and a fixed sign $\epsilon = \pm 1$ such that $\psi_1^* - \psi_j^* = \epsilon(\chi_1 - \chi_j)$ for all
$j = 2, 3, \dots, k$.


Proof: Let $\alpha_j = \psi_1 - \psi_j$ for $j = 2, 3, \dots, k$, so $\alpha_j$ satisfies the hypothesis of
Lemma 15. Since $\psi_1 \neq \psi_j$, by Lemma 15

$$2 = \|\alpha_j\|^2 = (\alpha_j, \alpha_j)_N = (\alpha_j^*, \alpha_j^*)_G = \|\alpha_j^*\|^2$$

for all $j$. Thus $\alpha_j^*$ must have two distinct irreducible characters of $G$ as its irreducible
constituents. Since $\alpha_j^*(1) = 0$ it must be a difference of two distinct irreducible
characters, both of which have the same degree. In particular, the lemma holds if $k = 2$
(which is the case for $q = 3$ and $q = 7$). Assume therefore that $k > 2$ and write

$$\alpha_2^* = \psi_1^* - \psi_2^* = \epsilon(\chi - \chi')$$

$$\alpha_3^* = \psi_1^* - \psi_3^* = \epsilon'(\theta - \theta')$$

for some irreducible characters $\chi, \chi', \theta, \theta'$ of $G$ and some signs $\epsilon, \epsilon'$. As proved above,
$\chi \neq \chi'$ and $\theta \neq \theta'$. Interchanging $\theta$ and $\theta'$ if necessary, we may assume $\epsilon = \epsilon'$. Thus

$$\alpha_3^* - \alpha_2^* = \psi_2^* - \psi_3^* = \epsilon(\theta - \theta' - \chi + \chi').$$

By Lemma 15, $\psi_2^* - \psi_3^* = (\psi_2 - \psi_3)^*$ also has exactly two distinct irreducible
constituents, hence either $\theta = \chi$ or $\theta' = \chi'$. Replacing $\epsilon$ by $-\epsilon$ if necessary we may
assume that $\theta = \chi$ so that now we have

$$\alpha_2^* = \psi_1^* - \psi_2^* = \epsilon(\chi - \chi')$$

$$\alpha_3^* = \psi_1^* - \psi_3^* = \epsilon(\chi - \theta')$$

where $\chi$, $\chi'$ and $\theta'$ are distinct irreducible characters of $G$ and the sign $\epsilon$ is determined.
Label $\chi = \chi_1$, $\chi' = \chi_2$ and $\theta' = \chi_3$. Now one similarly checks that for each $j \ge 3$
there is an irreducible character $\chi_j$ of $G$ such that

$$\alpha_j^* = \psi_1^* - \psi_j^* = \epsilon(\chi_1 - \chi_j)$$

and $\chi_1, \dots, \chi_k$ are distinct. Since all $\chi_j$’s have the same degree as $\chi_1$, the proof is
complete.

We remark that it need not be the case that $\chi_j = \psi_j^*$ for any $j$, but only that the
differences of irreducible characters of $N$ induce to differences of irreducible characters
of $G$.

The irreducible characters $\chi_j$ of $G$ obtained via Lemma 16 are called exceptional
characters associated to $Q$.

Lemma 17. The exceptional characters associated to $Q_i$ are all distinct from the
exceptional characters associated to $Q_j$ for $i$ and $j$ distinct elements of $\{1, 2, 3, 4\}$.

Proof: Let $\chi$ be an exceptional character associated to $Q_i$ and let $\theta$ be an exceptional
character associated to $Q_j$. By construction, there are distinct irreducible characters $\psi$
and $\psi'$ of $N_i$ such that $\psi^* - \psi'^* = \chi - \chi'$ and there are distinct irreducible characters
$\lambda$ and $\lambda'$ of $N_j$ such that $\lambda^* - \lambda'^* = \theta - \theta'$. Let $\alpha = \psi - \psi'$ and let $\beta = \lambda - \lambda'$. By
Lemma 15, $\alpha^*$ is zero on all elements of $G$ whose order is not equal to $q_i$ (including
the identity) and $\beta^*$ is zero on all elements of $G$ whose order is not equal to $q_j$. Thus
clearly $(\alpha^*, \beta^*) = 0$. It follows easily that the two irreducible constituents of $\alpha^*$ are
pairwise orthogonal to those of $\beta^*$ as well. This establishes the lemma.

It is now easy to show that such a simple group $G$ does not exist. By Lemma 16
and properties (i) to (iv) of $G$ we can count the number of exceptional characters:

(i) there are 2 exceptional characters associated to $Q_1$

(ii) there are 2 exceptional characters associated to $Q_2$

(iii) there are 4 exceptional characters associated to $Q_3$

(iv) there are 136 exceptional characters associated to $Q_4$.

Denote the common degree of the exceptional characters associated to $Q_i$ by $d_i$ for
$i = 1, \dots, 4$. By Lemma 17, the exceptional characters account for 144 nonprincipal
irreducible characters of $G$, hence these, together with the principal character, are all the
irreducible characters of $G$ (the number of conjugacy classes of $G$ is 145). The sum of
the squares of the degrees of the irreducible characters is the order of $G$:

$$1 + 2d_1^2 + 2d_2^2 + 4d_3^2 + 136d_4^2 = 1004913.$$

Simplifying this, we obtain

$$d_1^2 + d_2^2 + 2d_3^2 + 68d_4^2 = 502456. \tag{19.4}$$

Finally, since each nonprincipal irreducible representation of the simple group $G$ is
faithful and since the smallest degree of a faithful representation of $N_1$ is 13, each
$d_i \ge 13$. Since $d_4 < \sqrt{502456/68} < 86$ and $d_4$ divides $|G|$, it follows that

$$d_4 \in \{13, 21, 27, 39, 63\}.$$

Furthermore, each $d_i$ divides $|G|$ by Corollary 5 and so there are a small number of possibilities
for each $d_i$. One now checks that equation (19.4) has no solution (this is particularly easy
to do by computer). This contradiction completes the proof.
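The computer check mentioned above can be done by exhaustive search. The following Python sketch uses only the two constraints stated in the text (each $d_i$ divides $|G|$ and each $d_i \ge 13$) and looks for solutions of the simplified equation; the names `ORDER`, `TARGET`, `candidate_degrees` and `solutions` are ours.

```python
from itertools import product
from math import isqrt

ORDER = 3**3 * 7 * 13 * 409      # |G| = 1004913
TARGET = (ORDER - 1) // 2        # right-hand side of the simplified equation


def candidate_degrees():
    """Divisors d of |G| with d >= 13 and d^2 <= TARGET (larger d cannot occur)."""
    return [d for d in range(13, isqrt(TARGET) + 1) if ORDER % d == 0]


def solutions():
    """All (d1, d2, d3, d4) with d1^2 + d2^2 + 2*d3^2 + 68*d4^2 == TARGET."""
    cands = candidate_degrees()
    return [
        (d1, d2, d3, d4)
        for d1, d2, d3, d4 in product(cands, repeat=4)
        if d1 * d1 + d2 * d2 + 2 * d3 * d3 + 68 * d4 * d4 == TARGET
    ]


if __name__ == "__main__":
    print(candidate_degrees())
    print(solutions())
```

The search space is small (eleven candidate degrees, so fewer than 15,000 quadruples), and the list of solutions comes back empty, in agreement with the text.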

EXERCISES 

Throughout the exercises all representations are over the complex numbers. 

1. Let $G = S_3$, let $H = A_3$ and let $V$ be the 3-dimensional $\mathbb{C}H$-module which affords the
natural permutation representation of $A_3$. More explicitly, let $V$ have basis $e_1, e_2, e_3$ and
let $\sigma \in A_3$ act on $V$ by $\sigma \cdot e_i = e_{\sigma(i)}$. Let 1 and (1 2) be coset representatives for the left
cosets of $A_3$ in $S_3$ and write out the explicit matrices described in Theorem 11 for the
action of $S_3$ on the induced module $W$, for each of the elements of $S_3$.

2. In each of parts (a) to (f) a character $\psi$ of a subgroup $H$ of a particular group $G$ is specified.
Compute the values of the induced character $\operatorname{Ind}_H^G(\psi)$ on all the conjugacy classes of $G$ and
use the character tables in Section 1 to write $\operatorname{Ind}_H^G(\psi)$ as a sum of irreducible characters:

(a) $\psi$ is the unique nonprincipal degree 1 character of the subgroup $\langle (1\ 2) \rangle$ of $S_3$

(b) $\psi$ is the degree 1 character of the subgroup $\langle r \rangle$ of $D_8$ defined by $\psi(r) = i$, where
$i \in \mathbb{C}$ is a square root of $-1$

(c) $\psi$ is the degree 1 character of the subgroup $\langle r \rangle$ of $D_8$ defined by $\psi(r) = -1$

(d) $\psi$ is any of the nonprincipal degree 1 characters of the subgroup $V_4 = \langle (1\ 2), (3\ 4) \rangle$
of $S_4$

(e) $\psi = \chi_4$ is the first of the two characters of degree 3 in the character table of $H = S_4$
in Section 1 and $H$ is a subgroup of $G = S_5$

(f) $\psi$ is any of the nonprincipal degree 1 characters of the subgroup $V_4 = \langle (1\ 2), (3\ 4) \rangle$
of $S_5$.

3. Use Proposition 13 to explicitly write out the character table of each of the following 
groups: 

(a) the dihedral group of order 10 

(b) the non-abelian group of order 57 

(c) the non-abelian group of order 56 which has a normal, elementary abelian Sylow 
2-subgroup. 

4. Let $H$ be a subgroup of $G$, let $\varphi$ be a representation of $H$ and suppose that $N$ is a normal
subgroup of $G$ with $N \le H$ and $N$ contained in the kernel of $\varphi$. Prove that $N$ is also
contained in the kernel of the induced representation of $\varphi$.

5. Let $N$ be a normal subgroup of $G$ and let $\psi_1$ be the principal character of $N$. Let $\Psi$ be
the induced character $\operatorname{Ind}_N^G(\psi_1)$ so that by the preceding exercise we may consider $\Psi$ as
the character of a representation of $G/N$. Prove that $\Psi$ is the character of the regular
representation of $G/N$.


6. Let $Z$ be any subgroup of the center of $G$, let $|G : Z| = m$ and let $\psi$ be a character of $Z$.
Prove that

$$\operatorname{Ind}_Z^G(\psi)(g) = \begin{cases} m\,\psi(g) & \text{if } g \in Z \\ 0 & \text{if } g \notin Z. \end{cases}$$

7. Let $\varphi$ be a matrix representation of the subgroup $H$ of $G$ and define matrices $\Phi(g)$ for
every $g \in G$ by the displayed formula in the statement of Theorem 11. Prove directly that
$\Phi$ is a representation by showing that $\Phi(xy) = \Phi(x)\Phi(y)$ for all $x, y \in G$.

8. Let $G$ be a Frobenius group with Frobenius kernel $Q$. Assume that both $Q$ and $G/Q$ are
abelian but $G$ is not abelian (i.e., $G \neq Q$). Let $|Q| = n$ and $|G : Q| = m$.

(a) Prove that $G/Q$ is cyclic and show that $G = QC$ for some cyclic subgroup $C$ of $G$
with $C \cap Q = 1$ (i.e., $G$ is a semidirect product of $Q$ and $C$ and $|C| = m$). [Let $q$
be a prime divisor of $n$ and let $G/Q$ act by conjugation on the elementary abelian
$q$-group $\{h \in Q \mid h^q = 1\}$. Apply Exercise 14(e) of Section 18.1 and the definition
of a Frobenius group to an irreducible constituent of this $\mathbb{F}_q(G/Q)$-module.]

(b) Prove that $n$ and $m$ are relatively prime. [If a prime $p$ divides both the order and
index of $Q$, let $P$ be a Sylow $p$-subgroup of $G$. Then $P \cap Q \trianglelefteq P$ and $P \cap Q$ is a
Sylow $p$-subgroup of $Q$. Consider the centralizer in $G$ of the subgroup $Z(P) \cap Q$
(this intersection is nontrivial by Theorem 1 of Section 6.1).]

(c) Show that $G$ has no elements of order $qp$, where $q$ is any nontrivial divisor of $n$ and
$p$ is any nontrivial divisor of $m$. [Argue as in Proposition 13.]

(d) Prove that the number of nonidentity conjugacy classes of $G$ contained in $Q$ is
$(n - 1)/m$ and that each of these classes has size $m$. [Argue as in Proposition 13.]

(e) Prove that no two distinct elements of $C$ are conjugate in $G$. Deduce that the
nonidentity elements of $C$ are representatives for $m - 1$ distinct conjugacy classes of $G$
and that each of these classes has size $n$. Deduce then that every element of $G - Q$
is conjugate to some element of $C$ and that $G$ has $m + (n - 1)/m$ conjugacy classes.

(f) Prove that $G' = Q$ and deduce that $G$ has $m$ distinct characters of degree 1. [To
show $Q \le G'$ let $C = \langle x \rangle$ and argue that the map $h \mapsto [h, x] = x^{-1}h^{-1}xh$ is a
homomorphism from $Q$ to $Q$ whose kernel is trivial, hence this map is surjective.]

(g) Show that if $\psi$ is any nonprincipal irreducible character of $Q$, then $\operatorname{Ind}_Q^G(\psi)$ is an
irreducible character of $G$. Show that every irreducible character of $G$ of degree $> 1$
is equal to $\operatorname{Ind}_Q^G(\psi)$ for some nonprincipal irreducible character $\psi$ of $Q$. Deduce
that every irreducible character of $G$ has degree either 1 or $m$ and the number of
irreducible characters of degree $m$ is $(n - 1)/m$. [Check that the proof of Proposition
13(3) establishes this more general result with the appropriate changes to the numbers
involved.]

9. Use the preceding exercise to explicitly write out the character table of
$\langle (1\ 2\ 3\ 4\ 5), (2\ 3\ 5\ 4) \rangle$, which is the normalizer in $S_5$ of a Sylow 5-subgroup (this group
is a Frobenius group of order 20).

10. Let $N$ be a normal subgroup of $G$, let $\psi$ be a character of $N$ and let $g \in G$. Define $\psi^g$ by
$\psi^g(h) = \psi(ghg^{-1})$ for all $h \in N$.

(a) Prove that $\psi^g$ is a character of $N$ ($\psi$ and $\psi^g$ are called $G$-conjugate characters of $N$).
Prove that $\psi^g$ is irreducible if and only if $\psi$ is irreducible.

(b) Prove that the map $\psi \mapsto \psi^g$ is a right group action of $G$ on the set of characters of
$N$ and $N$ is in the kernel of this action.

(c) Prove that if $\psi_1$ and $\psi_2$ are $G$-conjugate characters of $N$, then $\operatorname{Ind}_N^G(\psi_1) = \operatorname{Ind}_N^G(\psi_2)$.
Prove also that if $\psi_1$ and $\psi_2$ are characters of $N$ that are not $G$-conjugate then
$\operatorname{Ind}_N^G(\psi_1) \neq \operatorname{Ind}_N^G(\psi_2)$. [Use the argument in the proof of Proposition 13(3).]

11. Show that if $G = A_4$ and $N = V_4$ is its Sylow 2-subgroup then any two nonprincipal
irreducible characters of $N$ are $G$-conjugate (cf. the preceding exercise).

12. Let $G = D_{2n}$ be presented by its usual generators and relations. Prove that if $\psi$ is any
degree 1 character of $H = \langle r \rangle$ such that $\psi \neq \psi^s$, then $\operatorname{Ind}_H^G(\psi)$ is an irreducible character
of $D_{2n}$. Show that every irreducible character of $D_{2n}$ is the induced character of some
degree 1 character of $\langle r \rangle$.

13. Prove both parts of Proposition 14. 

14. Prove the following result known as Frobenius Reciprocity: let $H \le G$, let $\psi$ be any
character of $H$ and let $\chi$ be any character of $G$. Then

$$(\psi, \chi|_H)_H = (\operatorname{Ind}_H^G(\psi), \chi)_G.$$

[Expand the right hand side using the formula for the induced character $\operatorname{Ind}_H^G(\psi)$ or follow
the proof of Shapiro’s Lemma in Section 17.2.]

15. Assume $G$ were a simple group of order $3^3 \cdot 7 \cdot 13 \cdot 409$ whose Sylow subgroups and their
normalizers are described by properties (1) to (5) in this section. Prove that the permutation
character of degree 819 obtained from the action of $G$ on the left cosets of the subgroup
$N_4$ decomposes as $\chi_0 + \gamma + \gamma'$ where $\chi_0$ is the principal character of $G$ and $\gamma$ and $\gamma'$
are distinct irreducible characters of $G$ of degree 409. [Use Exercise 9 in Section 18.3 to
show that this permutation character $\pi$ has $\|\pi\|^2 = 3$.]

APPENDIX I


Cartesian Products
and Zorn’s Lemma


Section 1 of this appendix contains the definition of the Cartesian product of an arbitrary
collection of sets. In the text we shall primarily be interested in products of finitely
many (or occasionally countably many) sets. We indicate how the general definition
agrees with the familiar “ordered $n$-tuple” notion of a Cartesian product in these cases.
Section 2 contains a discussion of Zorn’s Lemma and related topics.

1. CARTESIAN PRODUCTS

A set $I$ is called an indexing set or index set if the elements of $I$ are used to index
some collection of sets. In particular, if $A$ and $I$ are sets, we can form the collection
$\{A_i \mid i \in I\}$ by specifying that $A_i = A$ for all $i \in I$. Thus any set can be an indexing
set; we use this term to emphasize that the elements are used as indices.

Definition.

(1) Let $I$ be an indexing set and let $\{A_i \mid i \in I\}$ be a collection of sets. A choice
function is any function

$$f : I \to \bigcup_{i \in I} A_i$$

such that $f(i) \in A_i$ for all $i \in I$.

(2) Let $I$ be an indexing set and for all $i \in I$ let $A_i$ be a set. The Cartesian product
of $\{A_i \mid i \in I\}$ is the set of all choice functions from $I$ to $\bigcup_{i \in I} A_i$ and is denoted
by $\prod_{i \in I} A_i$ (where if either $I$ or any of the sets $A_i$ are empty the Cartesian
product is the empty set). The elements of this Cartesian product are written as
$\prod_{i \in I} a_i$, where this denotes the choice function $f$ such that $f(i) = a_i$ for each
$i \in I$.

(3) For each $j \in I$ the set $A_j$ is called the $j^{\text{th}}$ component of the Cartesian product
$\prod_{i \in I} A_i$ and $a_j$ is the $j^{\text{th}}$ coordinate of the element $\prod_{i \in I} a_i$.

(4) For $j \in I$ the projection map of $\prod_{i \in I} A_i$ onto the $j^{\text{th}}$ coordinate, $A_j$, is defined
by $\prod_{i \in I} a_i \mapsto a_j$.

Each choice function $f$ in the Cartesian product $\prod_{i \in I} A_i$ may be thought of as a
way of “choosing” an element $f(i)$ from each set $A_i$.

If $I = \{1, 2, \dots, n\}$ for some $n \in \mathbb{Z}^+$ and if $f$ is a choice function from $I$ to
$A_1 \cup \cdots \cup A_n$, where each $A_i$ is nonempty, we can associate to $f$ a unique (ordered)
$n$-tuple:

$$f \mapsto (f(1), f(2), \dots, f(n)).$$

Note that by definition of a choice function, $f(i) \in A_i$ for all $i$, so the $n$-tuple above
has an element of $A_i$ in the $i^{\text{th}}$ position for each $i$.

Conversely, given an $n$-tuple $(a_1, a_2, \dots, a_n)$, where $a_i \in A_i$ for all $i \in I$, there is
a unique choice function, $f$, from $I$ to $\bigcup_{i \in I} A_i$ associated to it, namely

$$f(i) = a_i, \quad \text{for all } i \in I.$$

It is clear that this map from $n$-tuples to choice functions is the inverse to the map
described in the preceding paragraph. Thus there is a bijection between ordered
$n$-tuples and elements of $\prod_{i \in I} A_i$. Henceforth when $I = \{1, 2, \dots, n\}$ we shall write

$$\prod_{i=1}^{n} A_i \quad \text{or} \quad A_1 \times A_2 \times \cdots \times A_n$$

for the Cartesian product and we shall describe the elements as ordered $n$-tuples.
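The correspondence between choice functions and ordered $n$-tuples can be made concrete for small finite families. The following Python sketch models a choice function as a dictionary $i \mapsto f(i)$; the helper names `choice_functions`, `to_tuple` and `from_tuple` and the sample family are ours, and only a finite index set $\{1, \dots, n\}$ is handled.

```python
from itertools import product


def choice_functions(family):
    """All choice functions for a finite family {i: A_i}, each returned as a dict
    i -> f(i) with f(i) in A_i. The result is empty if some A_i is empty."""
    index = sorted(family)
    return [
        dict(zip(index, values))
        for values in product(*(sorted(family[i]) for i in index))
    ]


def to_tuple(f):
    """Send a choice function on I = {1, ..., n} to the n-tuple (f(1), ..., f(n))."""
    return tuple(f[i] for i in range(1, len(f) + 1))


def from_tuple(t):
    """Inverse map: the n-tuple (a_1, ..., a_n) gives the function i -> a_i."""
    return {i: a for i, a in enumerate(t, start=1)}


family = {1: {"a", "b"}, 2: {0, 1, 2}}
functions = choice_functions(family)

# The two maps are mutually inverse on this family.
assert all(from_tuple(to_tuple(f)) == f for f in functions)
```

Each dictionary is one element of the Cartesian product in the sense of the Definition, and `to_tuple` is the map $f \mapsto (f(1), \dots, f(n))$ described above.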

If $I = \mathbb{Z}^+$, we shall similarly write: $\prod_{i=1}^{\infty} A_i$ or $A_1 \times A_2 \times \cdots$ for the Cartesian
product of the $A_i$’s. We shall write the elements as ordered tuples: $(a_1, a_2, \dots)$, i.e., as
infinite sequences whose terms are in $A_i$.

Note that when $I = \{1, 2, \dots, n\}$ or $I = \mathbb{Z}^+$ we have used the natural ordering on
$I$ to arrange the elements of our Cartesian products into $n$-tuples. Any other ordering
of $I$ (or any ordering on a finite or countable index set) gives a different representation
of the elements of the same Cartesian product.

Examples 

(1) $A \times B = \{(a, b) \mid a \in A,\ b \in B\}$.

(2) $\mathbb{R}^n = \mathbb{R} \times \mathbb{R} \times \cdots \times \mathbb{R}$ ($n$ factors) is the usual set of $n$-tuples with real number entries,
Euclidean $n$-space.

(3) Suppose $I = \mathbb{Z}^+$ and $A_i$ is the same set $A$ for all $i \in I$. The Cartesian product $\prod_{i \in \mathbb{Z}^+} A$
is the set of all (infinite) sequences $a_1, a_2, a_3, \dots$ of elements of $A$. In particular, if
$A = \mathbb{R}$, then the Cartesian product $\prod_{i \in \mathbb{Z}^+} \mathbb{R}$ is the set of all real sequences.

(4) Suppose $I$ is any indexing set and $A_i$ is the same set $A$ for all $i \in I$. The Cartesian
product $\prod_{i \in I} A$ is just the set of all functions from $I$ to $A$, where the function $f :
I \to A$ corresponds to the element $\prod_{i \in I} f(i)$ in the Cartesian product. This Cartesian
product is often (particularly in topology books) denoted by $A^I$. Note that for each
fixed $j \in I$ the projection map onto the $j^{\text{th}}$ coordinate sends the function $f$ to $f(j)$,
i.e., is evaluation at $j$.

(5) Let $R$ be a ring and let $x$ be an indeterminate over $R$. The definition of the ring $R[x]$ of
polynomials in $x$ with coefficients from $R$ may be given in terms of Cartesian products
rather than in the more intuitive and familiar terms of “formal sums” (in Chapters 7
and 9 we introduced them in the latter form since this is the way we envision and
work with them). Let $I$ be the indexing set $\mathbb{Z}^+ \cup \{0\}$ and let $R[x]$ be the subset
of the Cartesian product $\prod_{i=0}^{\infty} R$ consisting of elements $(a_0, a_1, a_2, \dots)$ such that
only finitely many of the $a_i$’s are nonzero. If $(a_0, a_1, a_2, \dots, a_n, 0, 0, \dots)$ is such a
sequence we represent it by the more familiar “formal sum” $\sum_{i=0}^{n} a_i x^i$. Addition and
multiplication of these sequences is defined so that the usual rules for addition and
multiplication of polynomials hold.
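Example (5) can be illustrated by storing a polynomial as the finite list of its coefficients $(a_0, a_1, \dots, a_n)$, with all later coordinates understood to be zero. The sketch below is a minimal illustration under the assumption that the coefficient ring is the integers; the function names `trim`, `add` and `mul` are ours.

```python
def trim(coeffs):
    """Drop trailing zeros so each polynomial has one canonical coefficient list."""
    coeffs = list(coeffs)
    while coeffs and coeffs[-1] == 0:
        coeffs.pop()
    return coeffs


def add(p, q):
    """Coordinatewise sum of two finitely supported sequences."""
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim(a + b for a, b in zip(p, q))


def mul(p, q):
    """Convolution product: the coefficient of x^k is the sum of p[i] * q[j] over i + j = k."""
    if not p or not q:
        return []
    result = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            result[i + j] += a * b
    return trim(result)


# (1 + x)(1 - x) = 1 - x^2, written as coefficient sequences.
print(mul([1, 1], [1, -1]))
```

The zero polynomial is the empty list here, matching the sequence all of whose coordinates are zero.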
Proposition 1. Let $I$ be a nonempty countable set and for each $i \in I$ let $A_i$ be a set.
The cardinality of the Cartesian product is the product of the cardinalities of the sets
$A_i$, i.e.,

$$\Bigl|\prod_{i \in I} A_i\Bigr| = \prod_{i \in I} |A_i|$$

(where if some $A_i$ is an infinite set or if $I$ is infinite and an infinite number of $A_i$’s have
cardinality $\ge 2$, both sides of this equality are infinity). In particular,

$$|A_1 \times A_2 \times \cdots \times A_n| = |A_1| \times |A_2| \times \cdots \times |A_n|.$$


Proof: In order to count the number of choice functions note that each $i \in I$ may be
mapped to any of the $|A_i|$ elements of $A_i$ and for $i \neq j$ the values of choice functions at
$i$ and $j$ may be chosen completely independently. Thus the number of choice functions
is the product of the cardinalities of the $A_i$’s, as claimed.

For Cartesian products of finitely many sets, $A_1 \times A_2 \times \cdots \times A_n$, one can see this
easily from the $n$-tuple representation: the elements of $A_1 \times A_2 \times \cdots \times A_n$ are $n$-tuples
$(a_1, a_2, \dots, a_n)$ and each $a_i$ may be chosen as any of the $|A_i|$ elements of $A_i$. Since
these choices are made independently for $i \neq j$, there are $|A_1| \cdot |A_2| \cdots |A_n|$ elements
in the Cartesian product.
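For small finite sets the count in Proposition 1 can be confirmed by direct enumeration. The following Python sketch does so for one sample family; the function name `cartesian_product_size` and the sample sets are ours.

```python
from itertools import product
from math import prod


def cartesian_product_size(sets):
    """Count the tuples in A_1 x ... x A_n by enumerating them one at a time."""
    return sum(1 for _ in product(*sets))


sets = [{1, 2, 3}, {"x", "y"}, {True, False}, {0.5}]

# Enumeration agrees with the product of the cardinalities.
assert cartesian_product_size(sets) == prod(len(a) for a in sets)
```

If one of the sets is empty the enumeration yields nothing, in line with the convention that the Cartesian product is then empty.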


EXERCISE 

1. Let $I$ and $J$ be any two indexing sets and let $A$ be an arbitrary set. For any function
$\varphi : J \to I$ define

$$\varphi^* : \prod_{i \in I} A \to \prod_{j \in J} A \quad \text{by} \quad \varphi^*(f) = f \circ \varphi \ \text{ for all choice functions } f \in \prod_{i \in I} A.$$

(a) Let $I = \{1, 2\}$, let $J = \{1, 2, 3\}$ and let $\varphi : J \to I$ be defined by $\varphi(1) = 2$, $\varphi(2) = 2$
and $\varphi(3) = 1$. Describe explicitly how an ordered pair in $A \times A$ maps to a 3-tuple
in $A \times A \times A$ under this $\varphi^*$.

(b) Let $I = J = \{1, 2, \dots, n\}$ and assume $\varphi$ is a permutation of $I$. Describe in terms of
$n$-tuples in $A \times A \times \cdots \times A$ the function $\varphi^*$.


2. PARTIALLY ORDERED SETS AND ZORN’S LEMMA 

We shall have occasion to use Zorn’s Lemma as a form of “infinite induction” in a 
few places in the text where it is desirable to know the existence of some set which is 
maximal with respect to certain specified properties. For example, Zorn’s Lemma is 
used to show that every vector space has a basis. In this situation a basis of a vector space 
V is a subset of V which is maximal as a set consisting of linearly independent vectors 
(the maximality ensures that these vectors span V). For finite dimensional spaces this
can be proved by induction; however, for spaces of arbitrary dimension Zorn’s Lemma
is needed to establish this. By having results which hold in full generality the theory
often becomes a little neater in places, although the main results of the text do not
require its use.

A specific instance in the text where a maximal object which helps to simplify
matters is constructed by Zorn’s Lemma is the algebraic closure of a field. An algebraic
closure of a field F is an extension of F which is maximal among any collection of
algebraic extensions. Such a field contains (up to isomorphism) all elements which
are algebraic over F, hence all manipulations involving such algebraic elements can
be effected in this one larger field. In any particular situation the use of an algebraic
closure can be avoided by adjoining the algebraic elements involved to the base field
F; however, this becomes tedious (and often obscures matters) in complicated proofs.
For the specific fields appearing as examples in this text the use of Zorn’s Lemma
to construct an algebraic closure can be avoided (for example, the construction of an
algebraic closure of any subfield of the complex numbers or of any finite field does not
require it).

The first example of the use of Zorn’s Lemma appears in the proof of Proposition
11 in Section 7.4.

In order to state Zorn’s Lemma we need some terminology.

Definition. A partial order on a nonempty set $A$ is a relation $\le$ on $A$ satisfying

(1) $x \le x$ for all $x \in A$ (reflexive),

(2) if $x \le y$ and $y \le x$ then $x = y$ for all $x, y \in A$ (antisymmetric),

(3) if $x \le y$ and $y \le z$ then $x \le z$ for all $x, y, z \in A$ (transitive).

We shall usually say that $A$ is a partially ordered set under the ordering $\le$ or that
$A$ is partially ordered by $\le$.

Definition. Let the nonempty set $A$ be partially ordered by $\le$.

(1) A subset $B$ of $A$ is called a chain if for all $x, y \in B$, either $x \le y$ or $y \le x$.

(2) An upper bound for a subset $B$ of $A$ is an element $u \in A$ such that $b \le u$, for
all $b \in B$.

(3) A maximal element of $A$ is an element $m \in A$ such that if $m \le x$ for any $x \in A$,
then $m = x$.

In the literature a chain is also called a tower or called a totally ordered or linearly
ordered or simply ordered subset.

Some examples below highlight the distinction between upper bounds and maximal
elements. Also note that if $m$ is a maximal element of $A$, it is not necessarily the case
that $x \le m$ for all $x \in A$ (i.e., $m$ is not necessarily a maximum element).

Examples 

(1) Let $A$ be the power set (i.e., set of all subsets) of some set $X$ and $\le$ be set containment:
$\subseteq$. Notice that this is only a partial ordering since some subsets of $X$ may not be
comparable, e.g. singletons: if $x \neq y$ then $\{x\} \not\subseteq \{y\}$ and $\{y\} \not\subseteq \{x\}$. In this situation
an example of a chain is a collection of subsets of $X$ such as

$$X_1 \subseteq X_2 \subseteq X_3 \subseteq \cdots .$$

Any subset $B$ of $A$ has an upper bound, $b$, namely,

$$b = \bigcup_{x \in B} x .$$

This partially ordered set $A$ has a (unique) maximal element, $X$.

In many instances the set $A$ consists of some (but not necessarily all) subsets of
a set $X$ (i.e., $A$ is a subset of the power set of $X$) and with the ordering on $A$ again
being inclusion. The existence of upper bounds and maximal elements depends on
the nature of $A$.

(2) Let $A$ be the collection of all proper subsets of $\mathbb{Z}^+$ ordered under $\subseteq$. In this situation,
chains need not have maximal elements, e.g. the chain

$$\{1\} \subseteq \{1, 2\} \subseteq \{1, 2, 3\} \subseteq \cdots$$

does not have an upper bound. The set $A$ does have maximal elements: for example
$\mathbb{Z}^+ - \{n\}$ is a maximal element of $A$ for any $n \in \mathbb{Z}^+$.

(3) Let $A = \mathbb{R}$ under the usual $\le$ relation. In this example every subset of $A$ is a chain
(including $A$ itself). The notion of a subset of $A$ having an upper bound is the same as
the usual notion of a subset of $\mathbb{R}$ being bounded above by some real number (so some
sets, such as intervals of finite length, have upper bounds and others, such as the set
of positive reals, do not). The set $A$ does not have a maximal element.
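A finite analogue of Examples (1) and (2) shows the difference between maximal elements and upper bounds by direct computation. The sketch below replaces $\mathbb{Z}^+$ by the three-element set $\{1, 2, 3\}$, which is our choice, as are the function names. Since a finite nonempty chain always contains its own largest member, the failure of chains to have upper bounds in Example (2) cannot be reproduced this way; what the finite case does show is several maximal elements with no maximum, and a subset with no upper bound.

```python
from itertools import combinations


def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]


def maximal_elements(poset):
    """Elements m such that m <= x forces m == x (here <= is set inclusion)."""
    return [m for m in poset if all(not (m <= x) or m == x for x in poset)]


def upper_bounds(subset, poset):
    """Elements u of the poset with b <= u for every b in the given subset."""
    return [u for u in poset if all(b <= u for b in subset)]


def is_chain(subset):
    """True if any two elements of the subset are comparable under inclusion."""
    return all(a <= b or b <= a for a in subset for b in subset)


X = {1, 2, 3}
power_set = subsets(X)
proper = [s for s in power_set if s != frozenset(X)]  # proper subsets of X

print(maximal_elements(power_set))   # the single maximal element X
print(maximal_elements(proper))      # the three 2-element subsets
print(upper_bounds([frozenset({1, 2}), frozenset({3})], proper))  # no upper bound
```

In the poset of proper subsets, each 2-element subset is maximal but none contains all the others, so there is no maximum element.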

Zorn’s Lemma. If $A$ is a nonempty partially ordered set in which every chain has
an upper bound then $A$ has a maximal element.

It is a nontrivial result that Zorn’s Lemma is independent of the usual (Zermelo-Fraenkel)
axioms of set theory¹ in the sense that if the axioms of set theory are
consistent,² then so are these axioms together with Zorn’s Lemma; and if the axioms of
set theory are consistent, then so are these axioms together with the negation of Zorn’s
Lemma. The use of the term “lemma” in Zorn’s Lemma is historical.

For the sake of completeness (and to relate Zorn’s Lemma to formulations found
in other courses) we include two other equivalent formulations of Zorn’s Lemma.

The Axiom of Choice. The Cartesian product of any nonempty collection of nonempty
sets is nonempty. In other words, if $I$ is any nonempty (indexing) set and $A_i$ is a
nonempty set for all $i \in I$, then there exists a choice function from $I$ to $\bigcup_{i \in I} A_i$.

Definition. Let $A$ be a nonempty set. A well ordering on $A$ is a total ordering on $A$
such that every nonempty subset of $A$ has a minimum (or smallest) element, i.e., for
each nonempty $B \subseteq A$ there is some $s \in B$ such that $s \le b$, for all $b \in B$.

The Well Ordering Principle. Every nonempty set $A$ has a well ordering.

Theorem 2. Assuming the usual (Zermelo-Fraenkel) axioms of set theory, the following
are equivalent:

(1) Zorn’s Lemma

(2) the Axiom of Choice

(3) the Well Ordering Principle.

Proof: This follows from elementary set theory. We refer the reader to Real and
Abstract Analysis by Hewitt and Stromberg, Springer-Verlag, 1965, Section 3 for these
equivalences and some others.

¹ See P. J. Cohen’s papers in: Proc. Nat. Acad. Sci., 50 (1963), and 51 (1964).

² This is not known to be the case!

EXERCISES


1. Let $A$ be the collection of all finite subsets of $\mathbb{R}$ ordered by inclusion. Discuss the
existence (or nonexistence) of upper bounds, minimal and maximal elements (where minimal
elements are defined analogously to maximal elements). Explain why this is not a well
ordering.

2. Let $A$ be the collection of all infinite subsets of $\mathbb{R}$ ordered by inclusion. Discuss the
existence (or nonexistence) of upper bounds, minimal and maximal elements. Explain
why this is not a well ordering.

3. Show that the following partial orderings on the given sets are not well orderings:

(a) $\mathbb{R}$ under the usual relation $\le$.

(b) $\mathbb{R}^+$ under the usual relation $\le$.

(c) $\mathbb{R}^+ \cup \{0\}$ under the usual relation $\le$.

(d) $\mathbb{Z}$ under the usual relation $\le$.

4. Show that $\mathbb{Z}^+$ is well ordered under the usual relation $\le$.

APPENDIX II


Category Theory


Category theory provides the language and the mathematical foundations for discussing
properties of large classes of mathematical objects such as the class of “all sets” or “all
groups” while circumventing problems such as Russell’s Paradox. In this framework
one may explore the commonality across classes of concepts and methods used in
the study of each class: homomorphisms, isomorphisms, etc., and one may introduce
tools for studying relations between classes: functors, equivalence of categories, etc.
One may then formulate precise notions of a “natural” transformation and “natural”
isomorphism, both within a given class or between two classes. (In the text we described
“natural” as being “coordinate free.”) A prototypical example of natural isomorphisms
within a class is the isomorphism of an arbitrary finite dimensional vector space with its
double dual in Section 11.3. In fact one of the primary motivations for the introduction
of categories and functors by S. Eilenberg and S. MacLane in 1945 was to give a precise
meaning to the notions of “natural” in cases such as this. Category theory has also played
a foundational role for formalizing new concepts such as schemes (cf. Section 15.5) that
are fundamental to major areas of contemporary research (e.g., algebraic geometry).
Pioneering work of this nature was done by A. Grothendieck, K. Morita and others.

Our treatment of category theory should be viewed more as an introduction to some 
of the basic language. Since we have not discussed the Zermelo-Fraenkel axioms of set
theory or the Gödel-Bernays axioms of classes we make no mention of the foundations
of category theory. To remain consistent with the set theory axioms, however, we 
implicitly assume that there is a universe set U which contains all the sets, groups, 
rings, etc. that one would encounter in “ordinary” mathematics (so that the category 
of “all sets” implicitly means “all sets in U”). The reader is referred to books on set 
theory, logic, or category theory such as Categories for the Working Mathematician by 
S. MacLane, Springer-Verlag, 1971 for further study. 

We have organized this appendix so that wherever possible the examples of each 
new concept use terminology and structures in the order that these appear in the body 
of the text. For instance, the first example of a functor involves sets and groups, the
second example uses rings, etc. In this way the appendix may be read early on in one’s 
study, and a greater appreciation may be gained through rereading the examples as one 
becomes conversant with a wider variety of mathematical structures. 

1. CATEGORIES AND FUNCTORS 

We begin with the basic concept of this appendix. 

Definition. A category C consists of a class of objects and sets of morphisms between
those objects. For every ordered pair A, B of objects there is a set Hom_C(A, B) of
morphisms from A to B, and for every ordered triple A, B, C of objects there is a law
of composition of morphisms, i.e., a map

Hom_C(A, B) × Hom_C(B, C) → Hom_C(A, C)

where (f, g) ↦ gf, and gf is called the composition of g with f. The objects and
morphisms satisfy the following axioms: for objects A, B, C and D

(i) if A ≠ C or B ≠ D, then Hom_C(A, B) and Hom_C(C, D) are disjoint sets,

(ii) composition of morphisms is associative, i.e., h(gf) = (hg)f for every f in
Hom_C(A, B), g in Hom_C(B, C) and h in Hom_C(C, D),

(iii) each object has an identity morphism, i.e., for every object A there is a
morphism 1_A ∈ Hom_C(A, A) such that f 1_A = f for every f ∈ Hom_C(A, B) and
1_A g = g for every g ∈ Hom_C(B, A).

Morphisms are also called arrows. It is an exercise to see that the identity morphism
for each object is unique (by the same argument that the identity of a group is unique).
We shall write Hom(A, B) for Hom_C(A, B) when the category is clear from the context.

The terminology we use throughout the text is common to all categories: a
morphism from A to B will be denoted by f : A → B or A --f--> B. The object A is the
domain of f and B is the codomain of f. A morphism from A to A is an endomorphism
of A. A morphism f : A → B is an isomorphism if there is a morphism g : B → A
such that gf = 1_A and fg = 1_B.

There is a natural notion of a subcategory C of D, i.e., when every object
of C is also an object in D, and for objects A, B in C we have the containment
Hom_C(A, B) ⊆ Hom_D(A, B).

Examples 

In each of the following examples we leave the details of the verification of the axioms for
a category as exercises.

(1) Set is the category of all sets. For any two sets A and B, Hom(A, B) is the set of
all functions from A to B. Composition of morphisms is the familiar composition of
functions: gf = g ∘ f. The identity in Hom(A, A) is the map 1_A(a) = a, for all
a ∈ A. This category contains the category of all finite sets as a subcategory.

(2) Grp is the category of all groups, where morphisms are group homomorphisms. Note
that the composition of group homomorphisms is again a group homomorphism. A
subcategory of Grp is Ab, the category of all abelian groups. Similarly, Ring is the
category of all nonzero rings with 1, where morphisms are ring homomorphisms that
send 1 to 1. The category CRing of all commutative rings with 1 is a subcategory of 
Ring. 

(3) For a fixed ring R, the category R-mod consists of all left R-modules with morphisms
being R-module homomorphisms.

(4) Top is the category whose objects are topological spaces and morphisms are continuous 
maps between topological spaces (cf. Section 15.2). Note that the identity (set) map 
from a space to itself is continuous in every topology, so Hom(A, A) always has an 
identity. 

(5) Let 0 be the empty category, with no objects and no morphisms. Let 1 denote
the category with one object, A, and one morphism: Hom(A, A) = {1_A}. Let 2
be the category with two objects, A_1 and A_2, and only one nonidentity morphism:
Hom(A_1, A_2) = {f} and Hom(A_2, A_1) = ∅. Note that the objects A_1 and A_2 and the
morphism f are “primitives” in the sense that A_1 and A_2 are not defined to be sets
and f is simply an arrow (literally) from A_1 to A_2; it is not defined as a set map on the
elements of some set. One can continue this way and define N to be the category with
N objects A_1, A_2, ..., A_N with the only nonidentity morphisms being a unique arrow
from A_i to A_j for every j > i (so that composition of arrows is uniquely determined).

(6) If G is a group, form the category G as follows. The only object is G and Hom(G, G) =
G; the composition of two morphisms f and g is the product gf in the group G. Note
that Hom(G, G) has an identity morphism: the identity of the group G.
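The category N of Example 5 is small enough to check the axioms by machine. The following is a minimal sketch, not part of the text: it assumes each arrow from A_i to A_j is encoded as the pair (i, j), so that Hom sets for different pairs of objects are automatically disjoint, and the helper names (make_chain_category, compose, check_axioms) are hypothetical.

```python
from itertools import product

def make_chain_category(n):
    """Objects 1..n; exactly one arrow (i, j) whenever i <= j (identities are (i, i))."""
    objects = range(1, n + 1)
    arrows = {(i, j) for i in objects for j in objects if i <= j}
    return objects, arrows

def compose(g, f):
    """Composite gf of f : A_i -> A_j followed by g : A_j -> A_k."""
    if f[1] != g[0]:
        raise ValueError("codomain of f must equal domain of g")
    return (f[0], g[1])

def check_axioms(n):
    """Verify associativity and the identity laws; return the number of arrows."""
    objects, arrows = make_chain_category(n)
    for f, g, h in product(arrows, repeat=3):
        if f[1] == g[0] and g[1] == h[0]:
            # associativity: h(gf) = (hg)f
            assert compose(h, compose(g, f)) == compose(compose(h, g), f)
    for f in arrows:
        # identity laws: f 1_A = f and 1_B f = f
        assert compose(f, (f[0], f[0])) == f
        assert compose((f[1], f[1]), f) == f
    return len(arrows)
```

For N = 4 the count returned includes the four identity arrows together with the six nonidentity arrows A_i → A_j, j > i.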

Definition. Let C and D be categories. 

(1) We say ℱ is a covariant functor from C to D if

(a) for every object A in C, ℱA is an object in D, and

(b) for every f ∈ Hom_C(A, B) we have ℱ(f) ∈ Hom_D(ℱA, ℱB),
such that the following axioms are satisfied:

(i) if gf is a composition of morphisms in C, then ℱ(gf) = ℱ(g)ℱ(f)
in D, and

(ii) ℱ(1_A) = 1_ℱA.

(2) We say ℱ is a contravariant functor from C to D if the conditions in (1) hold
but property (b) and axiom (i) are replaced by:

(b') for every f ∈ Hom_C(A, B), ℱ(f) ∈ Hom_D(ℱB, ℱA),

(i') if gf is a composition of morphisms in C, then ℱ(gf) = ℱ(f)ℱ(g)
in D

(i.e., contravariant functors reverse the arrows). 

Examples 

In each of these examples the verification of the axioms for a functor is left as an exercise.

Additional examples of functors appear in the exercises at the end of this section. 

(1) The identity functor ℐ_C maps any category C to itself by sending objects and
morphisms to themselves. More generally, if C is a subcategory of D, the inclusion functor
maps C into D by sending objects and morphisms to themselves.

(2) Let ℱ be the functor from Grp to Set that maps any group G to the same set G and
any group homomorphism φ to the same set map φ. This functor is called the forgetful
functor since it “removes” or “forgets” the structure of the groups and the
homomorphisms between them. Likewise there are forgetful functors from the categories Ab,
R-mod, Top, etc., to Set.

(3) The abelianizing functor maps Grp to Ab by sending each group G to the abelian
group G^ab = G/G', where G' is the commutator subgroup of G (cf. Section 5.4).
Each group homomorphism φ : G → H is mapped to the induced homomorphism on
quotient groups:

φ̄ : G^ab → H^ab by φ̄(xG') = φ(x)H'.

The definition of the commutator subgroup ensures that φ̄ is well defined and the
axioms for a functor are satisfied.

(4) Let R be a ring and let D be a left R-module. For each left R-module N the
set Hom_R(D, N) is an abelian group, and is an R-module if R is commutative
(cf. Proposition 2 in Section 10.2). If φ : N_1 → N_2 is an R-module
homomorphism, then for every f ∈ Hom_R(D, N_1) we have φ ∘ f ∈ Hom_R(D, N_2). Thus
φ' : Hom_R(D, N_1) → Hom_R(D, N_2) by φ'(f) = φ ∘ f. This shows the map

Hom(D, __) : N → Hom_R(D, N)

Hom(D, __) : φ → φ'

is a covariant functor from R-Mod to Grp. If R is commutative, it maps R-Mod to
itself.

(5) In the notation of the preceding example, we observe that if φ : N_1 → N_2, then for
every g ∈ Hom_R(N_2, D) we have g ∘ φ ∈ Hom_R(N_1, D). Thus φ' : Hom_R(N_2, D) →
Hom_R(N_1, D) by φ'(g) = g ∘ φ. In this case the map

Hom(__, D) : N → Hom_R(N, D)

Hom(__, D) : φ → φ'

defines a contravariant functor.

(6) When D is a right R-module the map D ⊗_R __ : N → D ⊗_R N defines a covariant
functor from R-Mod to Ab (or to R-Mod when R is commutative). Here the
morphism φ : N_1 → N_2 maps to the morphism 1 ⊗ φ.

Likewise when D is a left R-module __ ⊗_R D : N → N ⊗_R D defines a
covariant functor from the category of right R-modules to Ab (or to R-Mod when R is
commutative), where the morphism φ maps to the morphism φ ⊗ 1.

(7) Let K be a field and let K-fdVec be the category of all finite dimensional vector spaces
over K, where morphisms in this category are K-linear transformations. We define
the double dual functor 𝒟² from K-fdVec to itself. Recall from Section 11.3 that
the dual space, V*, of V is defined as V* = Hom_K(V, K); the double dual of V is
V** = Hom_K(V*, K). Then 𝒟² is defined on objects by mapping a vector space V
to its double dual V**. If φ : V → W is a linear transformation of finite dimensional
spaces, then

𝒟²(φ) : V** → W** by 𝒟²(φ)(E_v) = E_φ(v),

where E_v denotes “evaluation at v” for each v ∈ V. By Theorem 19 in Section 11.3,
E_v ∈ V**, and each element of V** is of the form E_v for a unique v ∈ V. Since
φ(v) ∈ W we have E_φ(v) ∈ W**, so 𝒟²(φ) is well defined.
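The difference between Examples 4 and 5 (composing on the left versus composing on the right) can be seen concretely. The sketch below is illustrative only and makes simplifying assumptions: it works with plain set maps from the integers to the integers rather than R-module homomorphisms, compares maps on a finite sample of inputs, and uses hypothetical names (hom_from, hom_into, same).

```python
def compose(g, f):
    """Return the composite g o f (apply f first, then g)."""
    return lambda x: g(f(x))

def hom_from(phi):
    """Hom(D, __) on a morphism phi : N1 -> N2, i.e. f |-> phi o f."""
    return lambda f: compose(phi, f)

def hom_into(phi):
    """Hom(__, D) on a morphism phi : N1 -> N2, i.e. g |-> g o phi."""
    return lambda g: compose(g, phi)

# Hypothetical maps Z -> Z, chosen only so that they do not commute.
phi = lambda x: x + 1
psi = lambda x: 2 * x
h = lambda x: x * x
sample = range(-3, 4)

def same(u, v):
    """Compare two maps on the finite sample of inputs."""
    return all(u(x) == v(x) for x in sample)

psi_phi = compose(psi, phi)

# Covariant: Hom(D, psi phi) = Hom(D, psi) Hom(D, phi).
covariant_ok = same(hom_from(psi_phi)(h), hom_from(psi)(hom_from(phi)(h)))
# Contravariant: Hom(psi phi, D) = Hom(phi, D) Hom(psi, D), order reversed.
contravariant_ok = same(hom_into(psi_phi)(h), hom_into(phi)(hom_into(psi)(h)))
# Keeping the original order fails for these particular maps.
unreversed_ok = same(hom_into(psi_phi)(h), hom_into(psi)(hom_into(phi)(h)))
```

The last comparison fails because precomposition reverses the order of composition, which is exactly the content of axiom (i') for a contravariant functor.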

The functor ℱ from C to D is called faithful (or is called full) if for every pair
of objects A and B in C the map ℱ : Hom(A, B) → Hom(ℱA, ℱB) is injective (or
surjective, respectively). Thus, for example, the forgetful functor is faithful but not full.
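The claim about the forgetful functor from Grp to Set can be checked by brute force in the smallest interesting case. This is a sketch under the assumption that the group is Z/2Z written additively as {0, 1}, with a map stored as the tuple of its values; the names set_maps, is_homomorphism and group_homs are hypothetical.

```python
from itertools import product

n = 2
elements = range(n)  # underlying set of Z/2Z

# Every set map Z/2Z -> Z/2Z, stored as the tuple of images (m[0], m[1]).
set_maps = list(product(elements, repeat=n))

def is_homomorphism(m):
    """Check m(a + b) = m(a) + m(b) for all a, b in Z/2Z."""
    return all(m[(a + b) % n] == (m[a] + m[b]) % n
               for a in elements for b in elements)

# The image of Hom_Grp(Z/2Z, Z/2Z) inside Hom_Set(Z/2Z, Z/2Z).
group_homs = [m for m in set_maps if is_homomorphism(m)]

# Faithful: distinct homomorphisms remain distinct as set maps.
faithful_here = len(set(group_homs)) == len(group_homs)
# Not full: some set maps are not homomorphisms.
full_here = len(group_homs) == len(set_maps)
```

Only the zero map and the identity are homomorphisms, while there are four set maps in all, so the map on Hom sets is injective but not surjective.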

EXERCISES 

1. Let N be a group and let Nor-N be the collection of all groups that contain N as a normal
subgroup. A morphism between objects A and B is any group homomorphism that maps
N into N.

(a) Prove that Nor-N is a category.

(b) Show how the projection homomorphism G → G/N may be used to define a functor
from Nor-N to Grp.

2. Let H be a group. Define a map ℋ× from Grp to itself on objects and morphisms as
follows:

ℋ× : G → H × G, and

if φ : G_1 → G_2 then ℋ×(φ) : H × G_1 → H × G_2 by (h, g) ↦ (h, φ(g)).

Prove that ℋ× is a functor.

3. Show that the map Ring to Grp by mapping a ring to its group of units (i.e., R ↦ R^×)
defines a functor. Show by explicit examples that this functor is neither faithful nor full.

4. Show that for each n ≥ 1 the map 𝒢ℒ_n : R → GL_n(R) defines a functor from CRing to
Grp. [Define 𝒢ℒ_n on morphisms by applying each ring homomorphism to the entries of
a matrix.]

5. Supply the details that show the double dual map described in Example 7 satisfies the 
axioms of a functor. 


2. NATURAL TRANSFORMATIONS AND UNIVERSALS 

As mentioned in the introduction to this appendix, one of the motivations for the in- 
ception of category theory was to give a precise definition of the notion of “natural” 
isomorphism. We now do so, and see how some natural maps mentioned in the text 
are instances of the categorical concept. We likewise give the categorical definition of
“universal arrows” and view some occurrences of universal properties in the text in this
light.

Definition. Let C and D be categories and let ℱ, 𝒢 be covariant functors from C to
D. A natural transformation or morphism of functors from ℱ to 𝒢 is a map η that
assigns to each object A in C a morphism η_A in Hom_D(ℱA, 𝒢A) with the following
property: for every pair of objects A and B in C and every f ∈ Hom_C(A, B) we have
𝒢(f)η_A = η_B ℱ(f), i.e., the following diagram commutes:

ℱA --η_A--> 𝒢A
 |            |
ℱ(f)        𝒢(f)
 |            |
 v            v
ℱB --η_B--> 𝒢B

If each η_A is an isomorphism, η is called a natural isomorphism of functors.

Consider the special case where C = D and C is a subcategory of Set, and where
ℱ is the identity functor. There is a natural transformation η from the identity functor
to 𝒢 if whenever 𝒢 maps the object A to the object 𝒢A there is a morphism η_A from
A to 𝒢A, and whenever there is a morphism f from A to B the morphism 𝒢(f) is
compatible with f as a map from 𝒢A to 𝒢B. In fact 𝒢(f) is uniquely determined by
f as a map from the subset η_A(A) in 𝒢A to the subset η_B(B) of 𝒢B. If η is a natural
isomorphism, then the value of 𝒢 on every morphism is completely determined by η,
namely 𝒢(f) = η_B f η_A^(-1). In this case the functor 𝒢 is entirely specified by η. We shall
see that some of the examples of functors in the preceding section arise this way.

Examples 

(1) For any categories C and D and any functor ℱ from C to D the identity is a natural
isomorphism from ℱ to itself: η_A = 1_ℱA for every object A in C.

(2) Let R be a ring and let ℱ be any functor from R-Mod to itself. The zero map is a
natural transformation from ℱ to itself: η_A = 0_A for every R-module A, where 0_A is
the zero map from ℱA to itself. This is not a natural isomorphism.

(3) Let ℱ be the identity functor from Grp to itself, and let 𝒢 be the abelianizing functor
(Example 3) considered here as a map from Grp to itself. For each group G let
η_G : G → G/G' be the usual projection map onto the quotient group. Then η is a
natural transformation (but not an isomorphism) with respect to these two functors.
(We call the maps η_G the natural projection maps.)

(4) Let 𝒢 = 𝒟² be the double dual functor from the category of finite dimensional vector
spaces over a field K to itself (Example 7). Then there is a natural isomorphism η
from the identity functor to 𝒢 given by

η_V : V → V** by η_V(v) = E_v

where E_v is “evaluation at v” for every v ∈ V.

(5) Let 𝒢ℒ_n be the functor from CRing to Grp defined as follows. Each object
(commutative ring) R is mapped by 𝒢ℒ_n to the group GL_n(R) of n × n invertible matrices
with entries from R. For each ring homomorphism f : R → S let 𝒢ℒ_n(f) be the
map of matrices that applies f to each matrix entry. Since f sends 1 to 1 it follows
that 𝒢ℒ_n(f) sends invertible matrices to invertible matrices (cf. Exercise 4 in Section
1). Let 𝒢 be the functor from CRing to Grp that maps each ring R to its group of
units R^×, and each ring homomorphism f to its restriction to the groups of units (also
denoted by f). The determinant is a natural transformation from 𝒢ℒ_n to 𝒢 because
the determinant is defined by the same polynomial for all rings so that the following
diagram commutes:

GL_n(R) --det--> R^×
   |              |
𝒢ℒ_n(f)           f
   |              |
   v              v
GL_n(S) --det--> S^×
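The commutativity of the determinant square can be confirmed exhaustively in a small case. The following sketch is an illustration under stated assumptions, not a construction from the text: it takes n = 2, R = Z/10Z, S = Z/5Z and f the reduction map modulo 5 (a ring homomorphism sending 1 to 1, since 5 divides 10); a 2 × 2 matrix is stored as a tuple (a, b, c, d), and the function names are hypothetical.

```python
from itertools import product
from math import gcd

R_MOD, S_MOD = 10, 5  # R = Z/10Z, S = Z/5Z, f = reduction mod 5

def det2(m, modulus):
    """Determinant ad - bc of the matrix (a, b, c, d), reduced mod `modulus`."""
    a, b, c, d = m
    return (a * d - b * c) % modulus

def is_unit(x, modulus):
    """x is a unit of Z/modulus exactly when gcd(x, modulus) = 1."""
    return gcd(x, modulus) == 1

def reduce_matrix(m, modulus):
    """Apply the reduction homomorphism to every entry of the matrix."""
    return tuple(x % modulus for x in m)

def check_det_naturality():
    """Check f(det M) = det(GL_2(f)(M)) for every M in GL_2(Z/10Z)."""
    checked = 0
    for m in product(range(R_MOD), repeat=4):
        if not is_unit(det2(m, R_MOD), R_MOD):
            continue  # keep only invertible matrices over Z/10Z
        image = reduce_matrix(m, S_MOD)
        # GL_2(f) sends invertible matrices to invertible matrices
        assert is_unit(det2(image, S_MOD), S_MOD)
        # the square commutes: det after GL_2(f) equals f after det
        assert det2(image, S_MOD) == det2(m, R_MOD) % S_MOD
        checked += 1
    return checked
```

The function returns the number of matrices checked, which is the order of GL_2(Z/10Z); by the Chinese Remainder Theorem this is |GL_2(Z/2Z)| · |GL_2(Z/5Z)| = 6 · 480.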

Let C, D and E be categories, let ℱ be a functor from C to D, and let 𝒢 be a
functor from D to E. There is an obvious notion of the composition of functors 𝒢ℱ
from C to E. When E = C the composition 𝒢ℱ maps C to itself and ℱ𝒢 maps D
to itself. We say C and D are isomorphic if for some ℱ and 𝒢 we have 𝒢ℱ is the
identity functor ℐ_C, and ℱ𝒢 = ℐ_D. By the discussion in Section 10.1 the categories
ℤ-Mod and Ab are isomorphic. It also follows from observations in Section 10.1 that the
categories of elementary abelian p-groups and vector spaces over 𝔽_p are isomorphic. In
practice we tend to identify such isomorphic categories. The following generalization 
of isomorphism between categories gives a broader and more useful notion of when 
two categories are “similar.” 

Definition. Categories C and D are said to be equivalent if there are functors ℱ from
C to D and 𝒢 from D to C such that the functor 𝒢ℱ is naturally isomorphic to ℐ_C (the
identity functor of C) and ℱ𝒢 is naturally isomorphic to the identity functor ℐ_D.

It is an exercise that equivalence of categories is reflexive, symmetric and
transitive. The example of affine k-algebras in Section 15.5 is an equivalence of categories
(where one needs to modify the direction of the arrows in the definition of a natural
transformation to accommodate the contravariant functors in this example). Another
example (which requires some proving) is that for R a commutative ring with 1 the
categories of left modules R-Mod and M_{n×n}(R)-Mod are equivalent.

Finally, we introduce the concepts of universal arrows and universal objects.

Definition.

(1) Let C and D be categories, let ℱ be a functor from C to D, and let X be an
object in D. A universal arrow from X to ℱ is a pair (U(X), ι), where U(X) is
an object in C and ι : X → ℱU(X) is a morphism in D satisfying the following
property: for any object A in C if φ is any morphism from X to ℱA in D, then
there exists a unique morphism Φ : U(X) → A in C such that ℱ(Φ)ι = φ,
i.e., the following diagram commutes:

X --ι--> ℱU(X)
  \        |
   φ     ℱ(Φ)
    \      |
     v     v
       ℱA

(2) Let C be a category and let ℱ be a functor from C to the category Set of all
sets. A universal element of the functor ℱ is a pair (U, ι), where U is an object
in C and ι is an element of the set ℱU satisfying the following property: for
any object A in C and any element g in the set ℱA there is a unique morphism
φ : U → A in C such that ℱ(φ)(ι) = g.

Examples 

(1) (Universal Arrow: Free Objects) Let R be a ring with 1. We translate into the language
of universal arrows the statement that if U(X) is the free R-module on a set X then any
set map from X to an R-module A extends uniquely by R-linearity to an R-module
homomorphism from U(X) to A (cf. Theorem 6, Section 10.3): Let ℱ be the forgetful
functor from R-Mod to Set, so that ℱ maps an R-module A to the set A, i.e., A = ℱA
as sets. Let X be any set (i.e., an object in Set), let U(X) be the free R-module with
basis X, and let ι : X → ℱU(X) be the set map which sends each b ∈ X to the basis
element b in U(X). Then the universal property of free R-modules is precisely the
result that (U(X), ι) is a universal arrow from X to the forgetful functor ℱ.

Similarly, free groups, vector spaces (which are free modules over a field),
polynomial algebras (which are free R-algebras) and the like are all instances of universal
arrows.

(2) (Universal Arrow: Fields of Fractions) Let ℱ be the forgetful functor from the category
of fields to the category of integral domains, where the morphisms in both categories
are injective ring homomorphisms. For any integral domain X let U(X) be its field
of fractions and let ι be the inclusion of X into U(X). Then (U(X), ι) is a universal
arrow from X to the functor ℱ (cf. Theorem 15(2) in Section 7.5).

(3) (Universal Object: Tensor Products) This example refers to the construction of the
tensor product of two modules in Section 10.4. Let C = R-Mod be the category of
R-modules over the commutative ring R, and let M and N be R-modules. For each
R-module A let Bilin(M, N; A) denote the set of all R-bilinear functions from M × N
to A. Define a functor from R-Mod to Set on objects by

ℱ : A → Bilin(M, N; A),

and if φ : A → B is an R-module homomorphism then

ℱ(φ)(h) = φ ∘ h for every h ∈ Bilin(M, N; A).

Let U = M ⊗_R N and let ι be the bilinear function

ι : M × N → M ⊗_R N by ι(m, n) = m ⊗ n,

so ι is an element of the set Bilin(M, N; M ⊗_R N) = ℱU. Then (M ⊗_R N, ι)
is a universal element of ℱ because for any R-module A and for any bilinear map
g : M × N → A (i.e., any element of ℱA) there is a unique R-module homomorphism
φ : M ⊗_R N → A such that g = φ ∘ ι = ℱ(φ)(ι).
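The universal arrow of Example 1 (free objects) can be made concrete for R = ℤ. The sketch below rests on several assumptions of its own: an element of the free ℤ-module U(X) on a finite set X is stored as a dictionary of integer coefficients, the target module A is ℤ itself, and the names iota, extend and add are hypothetical. It shows that a set map φ on X determines a linear map Φ with Φ ∘ ι = φ; uniqueness holds because a linear map is determined by its values on the basis.

```python
def iota(x):
    """Set map X -> U(X): send x to the basis element, stored as {x: 1}."""
    return {x: 1}

def extend(phi):
    """Return the Z-linear map Phi : U(X) -> Z with Phi(x) = phi(x) on basis elements."""
    def Phi(element):
        return sum(coeff * phi(x) for x, coeff in element.items())
    return Phi

def add(u, v):
    """Sum of two elements of the free module, as coefficient dictionaries."""
    out = dict(u)
    for x, c in v.items():
        out[x] = out.get(x, 0) + c
    return out

X = ["a", "b", "c"]
phi = {"a": 4, "b": -1, "c": 7}.get  # an arbitrary set map X -> Z
Phi = extend(phi)

# The triangle commutes: Phi(iota(x)) = phi(x) for every x in X.
triangle_commutes = all(Phi(iota(x)) == phi(x) for x in X)

# Phi is additive on a sample pair of elements.
u = {"a": 2, "b": 3}
v = {"b": 1, "c": -1}
additive_on_sample = Phi(add(u, v)) == Phi(u) + Phi(v)
```

Here Φ(u) = 2·4 + 3·(−1) = 5 and Φ(v) = −1 − 7 = −8, and Φ(u + v) = −3 as additivity requires.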


EXERCISES 

1. Let Nor-N be the category described in Exercise 1 of Section 1, and let ℱ be the inclusion
functor from Nor-N into Grp. Describe a functor 𝒢 from Nor-N into Grp such that the
transformation η defined by η_G : G → G/N is a natural transformation from ℱ to 𝒢.

2. Let H and K be groups and let ℋ× and 𝒦× be functors from Grp to itself described in
Exercise 2 of Section 1. Let φ : H → K be a group homomorphism.

(a) Show that the maps η_A : H × A → K × A by η_A(h, a) = (φ(h), a) determine a
natural transformation η from ℋ× to 𝒦×.

(b) Show that the transformation η is a natural isomorphism if and only if φ is a group
isomorphism.

3. Express the universal property of the commutator quotient group (described in
Proposition 7(5) of Section 5.4) as a universal arrow for some functor ℱ.
A

1-parameter subgroup, 505

2-stage Euclidean Domain, 294
A.C.C. — see ascending chain condition 
abelian, 17 

abelian categories, 791 
abelian extensions of Q, 599# 
abelian group, 17, 84, 15f#, 196, 339, 468 
representation of, 861 

Abel’s Theorem (insolvability of quintic), 625
absolutely flat, 797 
action, faithful, 43, 112# 
group ring, 842 
group, 41#, 112#, 451 
left vs. right, 128. 156 
Adjoint Associativity, 401, 804, 81 1 
affine algebraic sets, 658_# 
affine curve, 726 
affine A -algebra, 734 
affine n-space, 338, 658 
affine scheme, 742 
affords a representation, 1 14, 843 
algebra, 342#, 657 
algebraic, element, 520#, 527 
extension, 520#, 527 
integer, 695#, 887 
number, 527 
algebraic closure, 543 
of a finite field, 588 
algebraic conjugate — see conjugate 
algebraic geometry, 330, 655#, 658, 742, 745, 
760,762,911 
algebraically closed, 543 
algebraically conjugate characters, 878 
algebraically independent, 645, 699 
algebraically indistinguishable, 518 
algorithm, for Jordan Canonical Form, 496
for rational canonical form, 481 
alternating form, 437 
alternating group, 107#, 611 

A j, 110, 111 

simplicity of, 127, 145 
characters of, 883 
simplicity of. 110, 149# 


alternating, function, 436, 446 
tensor, 451

angle trisecting, 535, 535 
annihilated by, 338 
annihilator, 249

of a submodule, 344, 460 
of a subspace, 434, 435 
arrow, 912 

Artin-Schreier extensions, 589, 636 

Artin-Schreier map, 623 

Artinian, 657, 750#, 855 

ascending chain condition (A.C.C.), 458, 656# 

assassin, 670 

associate, 284# 

associated primes, of a module, 670, 730, 748 
of a prime ideal, 685 
of an ideal, 682 
associative, 16 
asymptotic behavior, 508 
augmentation, ideal, 245, 253, 255, 258, 846 
map, 245, 255. 799,811 
augmented matrix, 424 
Aut(lR/Q), 567 
automorphism, 41, 133# 
group, 41, 133# 
of Dg, 136,220 
of <2s. 136,220# 
of Sg» 221 
of S„. 136# 

of a cyclic group, 61, 135, 136,314 
of a field extension, 558# 
of a field, 55^# 

of an elementary abelian group, 1 36 
autonomous system, 507 

B 

B n (G, A) — see coboundaries 

Baer’s Criterion, 396 

balanced map, 365# 

bar resolution, 799 

base field, 511 

basic open set, 738 

basis, 354 





free, 218, 354 

of a field extension, 513 

of a vector space, 408 

Bass’ Characterization of Noetherian Rings, 793 
belongs to an ideal, 682 

Berlekamp’s Factorization Algorithm, 311, 589#
Betti number, 159, 464 
Bezout Domain, 274, 283, 294, 302, 307, 775 
bijection, 2 

bilinear, 368#, 372, 436 
bimodule, 366, 404 
binary, operation, 16 
relation, 3 

Binomial Theorem, 60, 249, 548 
biquadratic, extension, 530, 582, 589 
polynomial, 617 
block, 1 17 

diagonal, 423, 475 
upper triangular, 423 

Boolean ring, 23 1 , 232, 249, 250, 258, 267 
Brauer group, 836 
Buchberger’s Algorithm, 324# 

Buchberger's Criterion, 324#, 332 
building, 212
Building-Up Lemma, 411 
Burnside’s Basis Theorem, 199 
Burnside’s Lemma, 877 
Burnside’s N/C-Theorem, 213
Burnside’s p^a q^b Theorem, 196, 886#

C

C" (G\ A) — see cochains 
cancellation laws, 20 
canonical forms, 457, 472 
canonical model, 734 
Cardano’s Formulas, 630//., 638# 
cardinality, 1 

Cartesian product, 1 , 905# 

Castelnuovo’s Theorem, 646 
Casus irreducibilis, 633, 637 
category, 391 , 91 1# 

Cauchy’s Theorem, 93, 96, 102, 146 
Cayley-Hamilton Theorem, 478 
Cayley’s Theorem, 118# 
center, of a group, 50, 84, 89, 124, 134, 198 
of a group ring, 239 
of a matrix ring, 239, 834, 856 
of a p-group, 125, 188 
of a ring, 23 1 , 23 1 , 344, 832#, 856 
central idempotent, 357, 856
central product, 157, 169 
central simple algebra, 832# 
centralize, 94 


centralizer, 49#, 123#, 133# 
of a cycle, 173 
of a representation, 853 
chain complex, 777 
homotopy, 782 
change of basis, 40, 419 
changing the base — see extension of scalars 
character, of a group, 568, 866 
of a representation, 866 
character table, 880# 
of A4, 883 
of Dg, 881 
of Q S , 882 
of S3, 881 
of £4, 883 
of £5, 884 
of Z/2Z, 880 
ofZ/3Z, 881 

characteristic, of a field, 5 1 0 
of a ring, 250 

characteristic function, 249 
characteristic p fields, 510 
characteristic polynomial, 473 
characteristic subgroup, 135#, 174 
Chinese Remainder Theorem, 246. 265#, 313, 357, 
768 

choice function, 905 

class equation, 122#, 556 

class field theory, 600 

class function, 866, 870 

class group, 761, 774 

class number, 761 

Classical Greek Problems, 531# 

classification theorems, 38, 142#, 1 81# 

closed, topologically, 676 

under an operation, 16, 242, 528 
closed points, 733 
coboundaries, 800 
cochain, 777, 799, 808 
cochain complex, 777 
cochain homotopy, 792 
cocycle, 800
codomain, 1 
coefficient matrix, 424 
cofactor, 439 

Expansion Formula, 439 
Formula for the Inverse of a Matrix, 440 
coherent module sheaf, 748 
cohomologically trivial, 802, 804, 812 
cohomology group, 777, 798# 
coinduced module, 803, 8 1 1, 812 
cokernel, 792
coloring graphs, 335 
column rank, 418, 427, 434 
comaximal ideals, 265
commutative, 16, 223 
diagram, 100 
commutator, 89, 169 
commutator series — see derived series 
commutator subgroup, 89, 169, 195# 
commute, diagram, 100 
compact, 688 
support, 225 
companion matrix, 475 
compatible homomorphisms, 805 
complement, 1 80, 453, 454, 820, 829, 890 
complete, 759# 
complete preimage, 83 
completely reducible, 847 
completion, 759# 

complex conjugation, 345, 567, 603, 618, 654, 872 
complex numbers, 1, 512, 515, 654 
component of a direct product, 155, 338 
composite extensions, 529, 591.# 
of fields, 528 
composition factors. 103 
composition series, 103,# 
computing A-algebra homomorphisms, 664# 
computing Galois groups, 640# 
congruence class, 8# 
congruent, 8 

conjugacy class, 123#. 489. 860 
conjugate, algebraic, 573 
field, 573 

of a field element, 573 
of a group element, 82, 123,# 
of a set, 123j# 
of a subgroup, 134, 139# 
conjugation, 45, 52, 122#, 133 
in A„, 127, 131 
in 125# 
connected, 687 

connecting homomorphisms, 778, 791 
constituent of a module, 847 
constructible, 532#

constructibility of a regular n-gon, 534#, 601# 
construction of cube roots, 535 
construction of the regular 17-gon, 602# 
continuous cohomology groups. 809 
continuous group action, 808# 
contracting homomorphisms, 809 
contraction of ideals, 693, 708# 
contravariant, 659
converge, 503 
coordinate ring, 661 
coprime — see relatively prime 
corestriction homomorphism, 806, 807 
corresponding group actions, 129 


coset, 77#. 89# 
representatives, 77 
Cramer’s Rule, 438 

Criterion for the Solvability of a Quintic, 639 
crossed homomorphisms, 814# 
crossed product algebra, 833# 
cubic equations, formulas for roots, 630# 
curve. 726 

cycle, 29, 30, 33. 106#, 173 
cycle decomposition. 29. 30, 115#, 641 
algorithm, 30# 
cycle type. 126# 

of automorphisms. 640 
cyclic extensions, 625, 636 
cyclic group, 22. 54#. 90, 149. 192, 198, 539 
characters of, 880, 881 
cohomology of. 801, 811 
cyclic module, 351, 462 
cyclotomic extensions, 552#, 596# 
cyclotomic field, 540#. 698 
cyclotomic polynomial, 310, 489, 552# 
cyclotomy, 598 

D 

D.C.C. — see descending chain condition 
decomposable module, 847 
Dedekind Domain, 764# 
modules over, 769# 

Dedekind-Hasse Criterion, 281 
Dedekind— Hasse norm, 281, 289, 294 
degree, of a character, 866 
of a field element, 520 
of a field extension. 512 
of a monomial, 62 1 
of a polynomial, 234, 295, 297 
of a representation. 840 
of a symmetric group, 29 
degree ordering, 331 
dense, 677. 687 
density of primes. 642 
derivative, of a polynomial, 312, 546 
of a power series, 505
derived functors. 785 
derived series, 195# 

descending chain condition (D.C.C), 33 1 , 657. 75 1 , 
855 

determinant, 248, 435#, 450, 488 
computing, 441 
determinant ideal, 67 1 
diagonal subgroup, 49, 89 
diagonalizable matrices criterion, 493, 494 
Dickson’s Lemma, 334 
differential, 723 
of a morphism, 728
dihedral group. 23# 
as Galois group, 617# 
characters of, 88 1 , 885 
commutator subgroup of, 171 
conjugacy classes in, 132 
dimension, of a ring, 750, 754# 
of a tensor product, 421 
of a variety, 681, 729 
of a vector space, 408, 41 1 
of S*(V), 446 
of7>(V),443 
of/\*(V),449 
dimension shifting, 802 
Diophantine Equations, 14, 245, 276, 278
direct factor, 455 
direct limit, 268, 358, 741 
direct product, characters of. 879 
infinite, 157, 357, 414 
of free modules, 358 
of groups, 18, 15^#, 385, 593 
of injective modules, 793 
of injective resolutions. 793 
of modules, 353, 357, 358, 385 
of rings, 23 1 , 233, 265# 
direct sum, infinite, 158. 357, 414 
of injective modules, 403 
of modules, 351#, 357, 385 
of projective modules, 392, 403, 793 
of projective resolutions, 793 
of rings, 232 

direct summand, 373, 385, 451 
directed set, 268 

Dirichlet’s Theorem on Primes in Arithmetic 
Progressions, 557 
discrete G-module, 808 
discrete cohomology groups, 808# 
discrete valuation. 232, 238, 272, 755 
Discrete Valuation Ring, 232, 272, 755#, 762 
discriminant, 610 
as resultant, 621 
of a cubic, 612 
of a polynomial, 610 
of a quadratic, 6 1 1 
of a quartic. 614 

of cyclotomic polynomial, 621 
distributive laws, 34, 223 
divides, 4, 252, 274 
divisibility of ideals, 767 
divisible, group, 66, 86, 167 
module, 397 

Division Algorithm, 4, 270, 299 
division ring, 224, 225, 834 
divisor, 274 


domain, 1 

double coset, 1 17 

double dual, 432, 823, 914 

Doubling the Cube impossibility of, 531# 

doubly transitive, 1 17, 877 

dual basis, 432 

dual group, 167, 815. 823 

dual module, 404, 404 

dual numbers, 729 

dual vector space, 431 

E 

echelon, 425 
eigenspace, 473 
eigenvalue, 414, 423, 472
eigenvector, 414, 423, 472
Eisenstein’s Criterion, 309#, 312 
elementary abelian group, 136, 155, 339, 654 
elementary divisor, 161#, 465# 
decomposition. 161#, 464 
decomposition algorithm, 495 
elementary Jordan matrix, 492 
elementary row and column operations, 424, 470#, 
479# 

elementary symmetric functions, 607 
elimination ideal, 328# 
elimination theory, 327# 
elliptic, curve, 14 
function. 600 
function field, 653 
integral, 14 

embedded prime ideal, 685 
embedding. 83, 359, 569 
endomorphism, 347 
ring, 347 

equivalence class, 3, 45, 1 14 
equivalence of categories, 734, 916 
equivalence of short exact sequences, 38 1 
equivalence relation, 3, 45, 114 
equivalent extensions, 38 1 , 787, 824 
equivalent representations, 846, 869 
Euclidean Algorithm, 5. 27 1 
Euclidean Domain, 270#, 299 
modules over, 470, 490 

Euler φ-function, 7, 8, 11, 267, 315, 539#, 589
Euler’s Theorem, 13, 96 
evaluation homomorphism, 244, 255, 432# 
exact, functor, 391, 396 
sequence, 378 

exactness, of Horn, 385#, 393# 
of tensor products, 399 
exceptional characters, 901 
exponent of a group, 165#, 626 
exponential map, 86
exponential notation, 20, 22 
exponential of a matrix, 503# 

Ext^(A,£). 779# 
extension, of a map, 3, 386, 393 
of ideals, 693, 708# 
of modules, 378 

of scalars, 359#, 363#, 369. 373 
extension field, 511# 
extension problem. 104, 378, 776 
Extension Theorem, for Isomorphisms of Fields, 
519,541 

exterior algebra, 446 
exterior power, 446 
exterior product — see wedge product 
external, direct product, 172 
direct sum, 353 

F 

F-algebra — see algebra 
factor group — see quotient group 
factor set, 824# 

factor through, homomorphism, 100, 365 
factorial variety, 726 
factorization, 283# 
faithful, action, 43, 1 12# 
functor, 914 
representation, 840 
Fano Plane, 210 

Feit-Thompson Theorem, 104, 106, 149, 196, 212, 
899 

Fermat primes, 601 
Fermat's Little Theorem, 96 
Fermat’s Theorem on sums of squares, 291 
fiber, 2, 73#. 240# 
fiber product of homomorphisms, 407
fiber sum of homomorphisms, 407
field, 34. 224, 226, 510# 
of fractions, 260# 
of p-adic numbers, 759 
of rational functions, 264, 516, 530, 567, 585, 
647#, 681, 721 
field extension, 511# 
field generated by, 51 1 , 5 16 
field norm, 229 
finite covering, 704 
finite dimensional, 408, 411 
finite extensions, 51?#, 521. 526 
finite fields, 34, 301, 529 
algebraic closure of, 588 
existence and uniqueness of, 549# 

Galois groups of, 566, 586 
of four elements, 516, 653 


subfields of, 588 
finite group, 17 

finitely generated, field extension, 524#. 646 
group, 65, 158, 218# 
ideal, 251, 317 
A: -algebra, 657 
module, 351#, 458 
finitely presented, group, 218# 
module, 795# 

First Order Diophantine Equation, 276, 278 
First Orthogonality Relation, 872 
Fitting ideal, 67 1 
Fitting’s Lemma, 668 
fixed, element, 558 
field, 560 
set, 131,798 
fixed point free, 41, 132 
flat module, 400#, 405#, 790, 795 
form, 297 

formal Laurent series, 238, 265, 756, 759 

formal power series. 238. 258. 265, 668 

formally real fields, 530 

Fourier Analysis, 875# 

fractional ideal, 760# 

fractional linear transformations, 567, 647 

Frattini subgroup, 19§# 

Frattini's Argument, 193
free, abelian group, 158, 355 
group, 215# 

module, 338, 352, 354#, 358, 400 
nilpotent group, 221
free generators, 218 
of a module, 354 
free rank, 159, 218, 355, 460, 464 
Frobenius automorphism. 549. 556. 566. 586, 589, 
604 

Frobenius group, 168, 638, 643#, 896 
as Galois group, 638 
characters of, 896 
Frobenius kernel, 896 
Frobenius Reciprocity, 904 
full functor, 914 
function, 1 

function field, 646, 653 
functor, 391, 396, 398, 913
contravariant, 395, 913 
covariant, 391, 398, 913 
fundamental matrix, 506 
Fundamental Theorem, of Algebra, 545, 615# 
of Arithmetic, 6, 289 

of Finitely Generated Abelian Groups, 158#, 
196, 468

of Finitely Generated Modules over a
Dedekind Domain, 769ff.
of Finitely Generated Modules over a P.I.D., 462,
464, 466 

of Galois Theory, 574# 
on Symmetric Functions, 608 

G 

G-invariant, 843

G-module, 798 

G-stable, 843 

Galois closure, 594 

Galois cohomology groups, 809# 

Galois conjugates, 573 
Galois extension, 562, 572# 

Galois group, 562#, 574# 
of $\mathbb{F}_{p^n}$, 566, 586
of $\mathbb{Q}(2^{1/8}, i)$ or $x^8 - 2$, 577ff.
of $\mathbb{Q}(2^{1/8}, i)$ over quadratic subfields, 581

of $\mathbb{Q}(\sqrt{(2 + \sqrt{2})(3 + \sqrt{3})})$, 584

of $\mathbb{Q}(\sqrt{2 + \sqrt{2}})$, 582
of $\mathbb{Q}(\sqrt{2})$, 563
of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$, 563ff., 567, 576
of $\mathbb{Q}(\sqrt{D_1}, \sqrt{D_2})$, 582

of $\mathbb{Q}(\zeta_{13})$, 598ff.
of $\mathbb{Q}(\zeta_5)$, 597
of $\mathbb{Q}(\zeta_n + \zeta_n^{-1})$, 601, 603
of $\mathbb{Q}(\zeta_n)$, 596ff.
of $\mathbb{Q}(\zeta_p)$, 597
of $x^3 - 2$, 564ff., 568, 576
of $x^4 + 1$, 579ff.
of $x^4 - 2x^2 - 2$, 582
of $x^6 - 2x^3 - 2$, 623, 644
of $x^n - a$, 636
of $x^p - x - a$, 589
of a biquadratic, 582 
of a composite extension, 592 
of a cubic, 612 
of a cyclotomic field, 599 
of a general polynomial, 609 
of a quadratic, 563 
of a quartic, 615, 618 
Galois groups, of polynomials, 606# 
infinite, 651# 
over Q, 640# 

Galois Theory, 14, 105, 558# 

Gaschütz’s Theorem, 838

Gauss’ Lemma, 303, 530, 819, 824 

Gauss-Jordan elimination, 327, 424ff.

Gauss sum, 637 

Gaussian integers, 229#, 271, 278, 289#, 377 
general linear group, 35, 89, 236, 413, 418 
general polynomial, 607, 609, 629, 646 
general polynomial division, 320#, 331 


generalized associative law, 18 
generalized character, 898 
generalized eigenspace, 501 
generalized quaternion group, 178 
generating set, 61# 
generator, 25#, 54, 218# 
of $S_n$, 64, 107ff., 219
of $S_p$, 111

of a cyclic group, 57 
of a free module, 354 
of a subgroup, 61# 
of a submodule, 351 
of an ideal, 251 
generic point, 733 
germs of continuous functions, 269
$GL_3(\mathbb{F}_2)$, 211ff., 489, 644
global sections, 740 
globally asymptotically stable, 508 
Going-down Theorem, 694, 728 
Going-up Theorem, 694, 720 
graded, ordering, 331 
ring, 443 
graded ideal, 443 

graded lexicographic ordering (grlex), 331
graph, 210, 669, 687 
coloring, 335# 

greatest common divisor (g.c.d.), 4, 252, 274ff., 287
of ideals, 767 

grevlex monomial ordering, 331 
Gröbner basis, 315ff., 319ff., 664ff., 702, 712
in field extensions, 672 
group, 13, 16# 

of $n^{\text{th}}$ roots of unity — see root of unity
of units in a ring, 226 
group extensions, 824# 
group ring, 236#, 798, 840 
group table, 21 
groups, of order 12, 144, 182 
of order 30, 143, 182 
of order 56, 185 
of order 60, 145#, 186 
of order 75, 185 
of order 147, 185 
of order 168, 207# 
of order $3^3 \cdot 7 \cdot 13 \cdot 409$, 212ff., 898ff.
of order $p^2$, 125, 137
of order $p^3$, 179, 183, 198, 199ff., 886
of order $2p^2$, 186
of order $4p$, 186
of order $pq$, 143, 179, 181
of order $p^2q$, 144
groups, table of small order, 167# 
H

$H^n(G; A)$ — see cohomology group

Hall subgroup, 101, 200, 829, 890

Hall’s Theorem, 105, 196, 890 

Hamilton Quaternions, 224ff., 231, 237, 249, 299

Harmonic Analysis, 875 

Heisenberg group, 35, 53, 174, 179, 187

Hilbert’s Basis Theorem, 316, 334, 657 

Hilbert's Nullstellensatz, 675, 700ff.

Hilbert’s Specialization Theorem, 648 
Hilbert’s Theorem 90, 583, 814
additive form, 584, 815
Hilbert’s Zahlbericht, 815
Hölder Program, 103ff.
holomorph, 179, 186 
Hom, of direct products, 404
of direct sums, 388, 388, 404
$\operatorname{Hom}_F(V, W)$, 416
$\operatorname{Hom}_R(M, N)$, 345ff., 385ff.
homeomorphism, 738
homogeneous cochains, 810
homogeneous component, of a polynomial, 297 
of a graded ring, 443 
homogeneous ideal, 299 
homogeneous of degree m, 621 
homogeneous polynomial, 297
homological algebra, 391, 655, 776# 
homology groups, 777
homomorphism, of algebras, 343, 657 
of complexes, 777 
of fields, 253, 512 
of graded rings, 443 
of groups, 36, 73#, 215 
of modules, 345# 
of rings, 239# 

of short exact sequences, 381ff.
of tensor algebras, 450 
homotopic, 792 
hypernilpotent group, 191
hypersurface, 659

I 

icosahedron — see Platonic solids 
ideal quotient, 333, 691 
ideal, 242ff. 

generated by set, 251 
idempotent, 267, 856 
idempotent linear transformation, 423
identity, of a group, 17 
matrix, 236 
of a ring, 223 
image, of a map, 2 


of a k-algebra homomorphism, computing, 665ff.
of a linear transformation, computing, 429 
implicitization, 678 
incidence relation, 210 
indecomposable module, 847 
independence of characters, 569, 872 
independent transcendentals, 645
index, of a subgroup, 90ff.
of a field extension, 512 
induced, character, 892ff., 898
module, 363, 803, 811, 812, 893 
representation, 893 
inductive limit — see direct limit
inequivalent extensions, 379# 
inert prime, 749, 775 
infinite cyclic group, 57, 811
infinite Galois groups, 651ff.
inflation homomorphism, 806
inhomogeneous cochains, 810 
injective envelope — see injective hull 
injective hull, 398, 405, 405 
injective map, 2 

injective module, 395#, 403#, 784 
injective resolution, 786 
injectively equivalent, 407 
inner automorphism, 134 
inner product of characters, 870ff.
inseparable degree, of a polynomial, 550 
of a field extension, 650 
inseparable extension, 551, 566
inseparable polynomial, 546 
insolvability of the quintic, 625, 629
integer, 1, 695ff.
integers mod n — see Z/nZ
integral basis, 698, 775 
integral closure, 229, 691ff.
integral domain, 228, 235 
integral element, 691 
integral extension, 691# 
integral group ring (Z G), 237, 798 
integral ideal, 760 
integral Quaternions, 229 
integrally closed, 691ff.
internal, direct product, 172 
direct sum, 354 

intersection of ideals, computing, 330ff.
intertwine, 847 

invariant factor, 159#, 464, 774 
decomposition, 159#, 462# 
of a matrix, 475, 477

Invariant Factor Decomposition Algorithm, 480 
invariant subspace, 341, 843 
inverse, of a map, 2 

of an element in a group, 17 
inverse image, 2
inverse limit, 268, 358, 652ff.
inverse of a fractional ideal, 760 
inverse of matrices, 427, 440 
invertible fractional ideal, 760 
irreducibility, criteria, 307# 

of a cyclotomic polynomial, 310 
irreducible algebraic set, 679 
irreducible character, 866, 870, 873 
irreducible element, 284 
in Z[i], 289# 
irreducible ideal, 683 
irreducible module, 356, 847 
irreducible polynomial, 287, 512#, 572 
of degree $n$ over $\mathbb{F}_p$, 301, 586
irreducible topological space, 733 
isolated prime ideal, 685 
isomorphism, classes, 37 
of algebras, 343
of cyclic groups, 56
of groups, 37 
of modules, 345 
of rings, 239 

of short exact sequences, 381 
of vector spaces, 408 
Isomorphism Theorems, for groups, 97# 
for modules, 349 
for rings, 243, 246 
isomorphism type, 37 
isotypic component, 869 

J 

Jacobson radical, 259, 750 
join, 67, 88
Jordan block, 492 

Jordan canonical form, 457, 472, 492# 
Jordan-Holder Theorem, 103# 

K 

k-stage Euclidean Domains, 294
k-tensors, 442

kernel, of a group action, 43, 51, 112# 
of a homomorphism, 40, 75, 239, 345
of a k-algebra homomorphism, computing, 665ff.
of a k -algebra homomorphism, 678 
of a linear transformation, computing, 429 
Klein 4-group (Viergruppe), 68, 136, 155 
Kronecker product, 421#, 431 
Kronecker-Weber Theorem, 600
Krull dimension, 704, 750ff., 754
Krull topology, 652 


Krull's Theorem, 652 
Kummer extensions, 627, 817 
Kummer generators for cyclic extensions, 636 
Kummer theory, 626, 816, 823

L 

Lagrange resolvent, 626 
Lagrange's Theorem, 13, 45, 89ff., 460
lattice of subfields, 574
of $\mathbb{Q}(\sqrt[3]{2}, \rho)$, 568
of $\mathbb{Q}(\zeta_{13})$, 598
of $\mathbb{Q}(2^{1/8}, i)$, 581
lattice of subgroups, 66# 

of $A_4$, 111
of $D_8$, 69, 99
of $D_{16}$, 70
of $Q_8$, 69, 99
of $QD_{16}$, 72, 580
of $S_3$, 69
of Z/2Z, 67
of Z/4Z, 67
of Z/6Z, 68
of Z/8Z, 67
of Z/12Z, 68
of Z/nZ, 67
of $\mathbb{Z}/p^n\mathbb{Z}$, 68

of Z/2Z x Z/2Z (Klein 4-group), 68 
of Z/2Z x Z/4Z, 71# 
of Z/2Z x Z/8Z, 72 
of the modular group of order 16, 72 
lattice of subgroups for quotient group, 98# 
Laurent series — see formal Laurent series
leading coefficient, 234, 295 
leading term, 234, 295, 318 
ideal of, 318# 

least common multiple (l.c.m.), 4, 279, 293 
least residue, 9
left derived functor, 788 
left exact, 391, 395, 402 
left group action, 43 
left ideal, 242, 251, 256 
left inverse, in a ring, 233 
of a map, 2 
left module, 337 

left multiplication, 44, 118ff., 531
left Principal Ideal Domain, 302 
left regular representation, 44, 120 
left translation, 44 
left zero divisor, 233 
Legendre symbol, 818 
length of a cycle, 30 

lexicographic monomial ordering, 317#, 622 
Lie groups, 505, 876 
lifts, 386

linear algebraic sets, 659 
linear character, 569 
linear combination, 5, 275, 280, 408 
linear equations, solving, 425ff.
linear functional, 431
linear representation, 840
linear transformation, 340ff., 346, 408
linearly independent, characters, 569, 872 
vectors, 409 

local homomorphism, 723, 744
local ring, 259, 717, 752#, 755 
of an affine variety, 721# 
localization, 706ff., 795, 796
at a point in a variety, 722 
at a prime, 708#, 718 
of a module, 714# 
locally ringed spaces, 745 
locus, 659

Long Exact Sequence, 778, 789 
in Group Cohomology, 802 
lower central series, 193 
Lüroth’s Theorem, 647

M 

map, 1, 215

Maschke’s Theorem, 453, 849
matrix, 34, 235, 415# 
of a composition, 418 
of a linear transformation, 415# 
matrix representation, 840 
matrix ring, 235#, 418 
ideals of, 249 

maximal ideal, 253#, 280, 512 
maximal order, 232

maximal real subfield of a cyclotomic field, 603 
maximal spectrum, 731
of $k[x]$, 735
of $k[x, y]$, 735
of Z[i], 735
of Z[x], 736

maximal subgroup, 65, 117, 131, 188, 198
of solvable groups, 200 
middle linear map — see balanced map 
minimal element, 4 
minimal Gröbner basis, 325ff.
minimal normal subgroup, 200
minimal polynomial, 474
of a field element, 520 
of a field element, computing, 667 
minimal prime ideal, 298, 688 
minimal primary decomposition, 683 
minimum condition, 855 


Minkowski’s Criterion, 441 
minor, 439

Möbius inversion formula, 555, 588
modular arithmetic, 9, 224
modular group of order 16, 72, 186
modular representations, 846 
module, 337# 
over Z, 339, 456ff.
over F[x], 340ff., 456ff.
over a Dedekind Domain, 769ff.
over a group ring, 798ff., 843ff.
over a P.I.D., 456ff.
sheaf of, 748
module of fractions, 714 
monic, 234 
monomial, 297 

monomial ideal, 318, 332, 334 
monomial ordering, 317 
monomial part, 297 
monomial term, 297 
Monster simple group, 865 
morphism, 911

of affine algebraic sets, 662 
of affine schemes, 743 
multidegree, 297, 318 
multilinear form, 435 
multilinear map, 372, 435 
multiple, 252, 274 

multiple root of a polynomial, 312, 545, 547
multiplicative field norm, 230, 582 
multiplicative function, 7, 267 
multiplicative subgroup of a field, 314 
multiplicativity of extension degrees, 523, 529
multiplicity of a root, 313, 545 

N 

Nakayama's Lemma, 751
natural, 83, 167, 432, 911# 
projection, 83, 243, 348, 916
Newton’s Formulas, 618 
nilpotence class, 190 
nilpotent, element, 231, 250, 596, 689
group, 190ff., 198
ideal, 251, 258, 674
matrix, 502 

nilradical, 250, 258, 673, 674 
Noetherian, module, 458, 469 
ring, 316, 458, 656#, 793 
Noether’s Normalization Lemma, 699# 
noncommutative polynomial algebra, 302, 443
nonfinitely generated ideal, 298, 657
nongenerator, 199 
nonpivotal, 425 
nonprincipal ideal, 252, 273, 298
nonsimple field extension, 595 
nonsingular, point, 725, 742, 763 
variety, 725 

nonsingular, linear transformation, 413 
matrix, 417 

nonsingular curve, 775 
nonsingular model, 726 
norm, 232, 270, 299
of a character, 872 
of an element in a field, 582, 585 
normal basis, 815 
normal complement, 385 
normal extension, 537, 650 
normal ring, 691
normal subgroup, 82ff.
normal variety, 726 
normalization, 691, 726 
normalize, 82, 94 
normalized, cocycle, 827 
factor set, 825 
section, 825 

normalizer, 50ff., 123ff., 134, 147, 206ff.
null space, 413 
nullity, 413 
number fields, 696

O

object, 911 
opposite algebra, 834 
orbit, 45, 115#, 877 
order, of a permutation, 32 
of a set, 1 

of an element in a group, 20, 55, 57, 90
order of conductor $f$, 232
order of zero or pole, 756, 763 
ordered basis, 409 
orthogonal characters, 872 
orthogonal idempotents, 377, 856, 870 
orthogonality relations, 872 
outer automorphism group, 137 

P 

p-adic integers, 269, 652, 758# 
p-adic Laurent series, 759
p-adic valuation, 759 
p-extensions, 596, 638 
p-group, 139, 188 
characters of, 886 
representations of, 854, 864 
p-primary component, 142, 358, 465 
$p^{\text{th}}$-power map, 166, 174


P.I.D. — see Principal Ideal Domain
parabolic subgroup, 212 
partition, of a set, 3 
of n, 126, 162 
Pell's equation, 230 
perfect field, 549 
perfect group, 174 

periods in cyclotomic fields, 598, 602, 604 
permutation, 3, 29, 42 
even, 108# 
odd, 108# 
sign of, 108#, 436# 
permutation character, 866, 877, 895 
permutation group, 116, 120 
permutation matrix, 157 
permutation module, 803 
permutation representation, 43, 112#, 203#, 840, 
844, 852, 877 
pivotal element, 425 

Platonic solids, symmetries of, 28, 45, 92, 111, 148
pole, 756
polynomial, 234 
map, 299, 662 
ring, 234ff., 295ff.

polynomials with $S_n$ as Galois group, 642ff.

Pontriagin dual group, 787 

positive norm, 270 

Postage Stamp Problem, 278 

power of an ideal, 247 

power series of matrices, 502# 

power set, 232 

preimage, 2 

presentation, 26#, 39, 218#, 380 
primary component — see p-primary component
Primary Decomposition Theorem, for abelian 
groups, 161 
for ideals, 681#, 716# 
for modules, 357, 465, 772 
primary ideal, 260, 298, 748 
prime, 6 

prime element in a ring, 284 
prime factorization, 6 
for ideals, 765# 
prime ideal, 255#, 280, 674 

algorithm for determining, 710# 
prime spectrum, 731# 
prime subfield, 264, 511, 558 
primes associated, to a module, 670 
to an ideal, 670 

primitive central idempotent, 856, 870 
primitive element, 517, 594 
Primitive Element Theorem, 595 
primitive idempotent, 856 
primitive permutation group, 117
primitive roots of unity, 539ff.
principal character, 866
principal crossed homomorphisms, 814
principal fractional ideal, 760 
principal ideal, 251 

Principal Ideal Domain (P.I.D.), 279ff., 284, 459
characterization of, 281, 289, 294 
that is not Euclidean, 282 
principal open set, 687, 738 
product, of ideals, 247, 250 
of subgroups, 93ff.
profinite, 809, 813 
projection, 83, 423, 453 
homomorphism, 153ff.
projections of algebraic sets, 679 
projective limit — see inverse limit 
projective module, 390ff., 400, 403ff., 761, 773, 786
projective plane, 210 
projective resolution, 779 
projectively equivalent, 407
Public Key Code, 279 
pullback of a homomorphism, 407 
purely inseparable, 649 
purely transcendental, 646 
pushout of a homomorphism, 407 
Pythagoras' equation rational solutions, 584 

Q 

Q, subgroups of, 65, 198 
Q/Z, 86

quadratic, equation, 522, 533 
extensions, 522, 533 
field, 227, 698 

subfield of cyclic quartic fields, criterion, 638 
subfield of $\mathbb{Q}(\zeta_p)$, 621, 637
quadratic integer rings, 229ff., 248, 271, 278, 286,
293ff., 698, 749
that are Euclidean, 278
that are P.I.D.s, 278
Quadratic Reciprocity Law, 819 
quadratic residue symbol, 818 
quartic equations, formulas for roots, 634ff.
quasicompact, 688, 738, 746
quasidihedral group, 71ff., 186
as Galois group, 579 
quaternion group, 36 
as Galois group, 584 
characters of, 882 
generalized, 178 
representations of, 845, 852 
Quaternion ring, 224, 229, 258 

(see also Hamilton Quaternions) 
quintic, insolvability, 625, 629 


quotient, computations in k-algebras, 672
group, 15, 73ff., 76, 574
module, 348
ring, 241ff.

vector space, 408, 412 
quotient field, 260ff.

R 

radical extension, 625ff.
radical ideal, 258, 673, 689
radical of an ideal, 258, 673ff., 701
computing, 701

radical of a zero-dimensional ideal, 706ff.

radicals, 625 

ramified prime, 749, 775 

range, 2 

rank, of a free module, 338, 354, 356, 358, 459 
of a group, 165, 218, 355 
of a linear transformation, 413 
of a module, 460, 468, 469, 471, 719, 773 
rational canonical form, 457, 472ff.
computing, 481ff.

rational functions — see field of rational functions 
rational group ring, 237 
rational numbers, 1, 260
rational valued characters, 879 
real numbers, 1 
modulo 1, 21, 86 
reciprocity, 229, 621 
recognition theorem, 171, 180 
reduced Gröbner basis, 326ff.
reduced row echelon form, 425
reduced word, 216ff.
reducible character, 866 
reducible element, 284 
reducible module, 847 

reduction homomorphism, 245, 296, 300, 586 

reduction mod n, 10, 243, 296, 640 

reduction of polynomials mod p , 586, 589 

reflexive, 3 

regular at a point, 721 

regular local ring, 725, 755 

regular map, 662, 722 

regular representation, 844, 862ff.

relations, 25ff., 218ff., 380

relations matrix, 470 

relative Brauer group, 836 

relative degree of a field extension, 512

relative integral basis, 775 

relatively prime, 4, 282 

remainder, 5, 270, 320ff.

Replacement Theorem, 410, 645
representation, 840ff.
permutation, 43, 112ff., 203ff., 840, 844, 852,
877 

representative, 3, 9, 77 
residue class, 8 
resolvent cubic, 614, 623 
resolvent polynomials, 642 
restricted direct product, 158 
restriction homomorphism, 269, 805, 807 
restriction maps, 269, 740 
restriction of scalars, 359 
resultant, 619ff.
reverse of a polynomial, 312 
right derived functor, 785 
right Euclidean Domain, 302 
right exact, 400, 402 
right group action, 43, 128, 844, 852 
right ideal, 242, 251
right inverse, in a ring, 233 
of a map, 2 
right module, 337 
right regular representation, 132 
right zero divisor, 233 
ring, 223 

of algebraic integers, 695ff.
of continuous functions, 225, 227, 259
of dual numbers, 729
of fractions, 260ff., 708
of integers, 229
of sets, 232
root, 310, 521
root extension, 627
root of a polynomial, 307ff., 512
root of unity, 22, 66, 86, 539ff., 552
row equivalent, 425 
row rank, 418, 427, 434 
row reduced, 424 

ruler and compass constructions, 534 

S

saturated, 710 

saturation of an ideal, 710ff.

scalar, 408 

scalar matrix, 236 

scalar transformations, 348 

Schanuel’s Lemma, 407 

scheme, 745 

Schur multiplier, 838 

Schur’s Lemma, 356, 853, 856

Schur’s Theorem, 829 

second dual — see double dual 

Second Orthogonality Relation, 872 

section, 384, 740 

semidihedral group — see quasidihedral group


semidirect product, 175ff., 383, 385, 821, 829
semisimple, 855
separable, 551 

extension, 551, 572, 594ff.
polynomial, 546, 562, 572
separable degree, of a field extension, 650
of a polynomial, 550
separating transcendence base, 650 
Shapiro’s Lemma, 804 
short exact sequence, 379 
of complexes, 778 
Short Five Lemma, 383
similar, linear transformations, 419, 476
matrices, 419, 476, 493ff.
similar central simple algebras, 835
similar representations, 846 
similarity, 40 
simple algebra, 832 
simple extensions, 517, 586, 594 
simple group, 91, 102ff., 149ff., 201ff., 212
classification of, 103, 212
of order 168, 207ff.
sporadic, 104, 865 

simple module — see irreducible module 
simple radical extension, 625 
simple ring, 253, 863 
simple tensor, 360 
Simultaneous Resolution, 783 
singular point, 725 
skew field — see division ring
skew-symmetrization, 452 
Smith Normal Form, 479 
smooth, 725, 742 
Snake Lemma, 792 
solution, of cubic equations, 630 
of quartic equations, 634ff.
solvability of a quintic, criterion, 630, 639 
solvability of groups of odd order — see 
Feit-Thompson Theorem 
solvable by radicals, 627ff.
solvable extensions, 625ff.
solvable group, 105, 149, 196ff., 628, 886, 890
solvable length, 195ff.
solving algebraic equations, 327ff.
solving linear equations, 425ff.
span, 62, 351, 408, 427 
special linear group, 48, 89, 101, 669 
specialization, 648 
spectral sequences, 808 

spectrum — see also prime spectrum and maximal 
spectrum 
of $k[x]$, 735
of $k[x, y]$, 735
of Z[Z/2Z], 747
of Z[i], 735
of Z[x], 736 
split algebra, 835 
split exact sequence, 384, 388# 
split extension, 384 
split prime, 749, 775 
splits completely, 536 
splitting field, 513, 536#, 562, 572 
of $(x^2 - 2)(x^2 - 3)$, 537
of $x^2 - 2$, 537
of $x^2 - t$ over $k(t)$, 516
of $x^2 + 1$, 515
of $x^2 + x + 1$ over $\mathbb{F}_2$, 516
of $x^3 - 2$, 537
of $x^4 - px + q$, 618
of $x^4 - px^2 + q$, 618
of $x^4 + 4$, 538
of $x^4 + 8$, 581
of $x^4 - 2x^2 - 2$, 582
of $x^6 - 2x^3 - 2$, 623
of $x^8 - 2$, 577ff.
of $x^n - 1$, 539ff.
of $x^p - 2$, 541
of $x^p - x - a$ over $\mathbb{F}_p$, 589
splitting homomorphism, 384 
splitting of polynomials in Galois extensions, 572, 
584, 595

sporadic simple group — see simple group, sporadic 
square root of a matrix, 502 
squarefree part, 227

Squaring the Circle, impossibility of, 531# 

stability group, 819 

stabilizer, 44, 51#, 112#, 123# 

stable subspace, 341, 843 

stalk, 741 

standard bimodule structure, 367 

standard resolution, 799 

steady states, 507 

Steinitz class, 773 

Stone-Čech compactification, 259

straightedge and compass constructions, 531#, 602 

structure sheaf, 740# 

Sturm’s Theorem, 624 
subfield, 511, 516 
subgroup, 22, 46# 
criterion, 47 
of cyclic groups, 58# 
of index 2, 91, 120, 122 
sublattice, 70 
submodule, 337 
criterion, 342 
subring, 228 
subspace topology, 677 
sum, of ideals, 247, 250 


of submodules, 349, 351
support, 729# 
surjective, 2 

Sylow p-subgroup, 101, 139#, 161 
Sylow's Theorem, 93, 105, 139#, 617 
symmetric algebra, 444 
symmetric function, 436, 608
symmetric group, 29ff.

as Galois group, 642#, 649# 
characters of, 879, 881, 883, 884 
conjugation in — see conjugation 
isomorphisms between, 37, 40 
Sylow p-subgroups of, 168, 187 
symmetric polynomials, 608, 621# 
symmetric relation, 3 
symmetric tensor, 451 
symmetrization, 452 

T 

table, group, 21 
tangent space, 724ff., 741ff.
Tchebotarov Density Theorem, 642
tensor algebra, 443 
tensor product, 359#, 788# 
associativity of, 371 
of algebras, 374 
of direct products, 376 
of direct sums, 373, 376 
of fields, 377, 531, 596 
of free modules, 404 
of homomorphisms, 370 
of ideals, 377 
of matrices, 421 
of projective modules, 402, 404 
of vector spaces, 420 
tensors, 360, 364 
tetrahedron — see Platonic solids 
Thompson subgroup, 139
Thompson Transfer Lemma, 822 
Thompson’s Theorem, 196 
topological space, 676# 

$\operatorname{Tor}_n^R(A, B)$, 788ff.
torsion, element, 344 
module, 356, 460, 463 
subgroup, 48 
submodule, 344 
torsion free, 406, 460
trace, of a field element, 583, 585 
of a matrix, 248, 431, 431, 488, 866 
trace ideal of a group ring, 846
transcendence base, 645 
transcendence degree, 645 
transcendental, element, 520, 527, 534
extension, 645ff.

transfer homomorphism, 817, 822
transgression homomorphism, 807 
transition matrix, 419 
transitive, action, 115, 606, 640
subgroups of $S_5$, 643
subgroups of $S_n$, 640
transitive relation, 3 
transpose, 434, 501 
transposition, 107# 
trilinear, 372, 436 

Trisecting an Angle, impossibility of, 531ff.
trivial, action, 43 
homomorphism, 79
ideal, 243 
representation, 844 
ring, 224 
subgroup, 47 
submodule, 338 
twisted polynomial ring, 302 
two-sided ideal, 242, 251 
two-sided inverse, 2

U

U.F.D. — see Unique Factorization Domain 
ultrametric, 759 
uniformizing parameter, 756 
unipotent radical, 212 

Unique Factorization Domain (U.F.D.), 283#, 303#, 
690, 698, 769

unique factorization of ideals, 767 
uniqueness of splitting fields, 542 
unital module, 337 
units, 226 

in Z/nZ, 10, 17, 61, 135, 267, 314, 596
universal property, of direct limits, 268
of free groups, 215ff.
of free modules, 354 
of inverse limits, 269 
of multilinear maps, 372, 442, 445, 447 
of tensor products, 361, 365 
universal side divisor, 277 
universe, 911
upper central series, 190 

upper triangular matrices, 49, 174, 187, 236, 502


V 

valuation ring, 232, 755# 
value of $f$ in Spec R, 732
Vandermonde determinant, 619
variety, 679# 

vector space, 338, 408#, 512 
Verlagerungen — see transfer homomorphism
virtual character, 898 

W

Wedderburn components, 855
Wedderburn decomposition, 855
Wedderburn’s Theorem on Finite Division Rings,
556ff.

Wedderburn’s Theorem on Semisimple Rings, 854ff.
wedge product, 447
of ideals, 449, 455
of a monomial, 621 
well defined, 1, 77, 100 
Well Ordering of Z, 4, 8, 273, 909 
Wilson’s Theorem, 551 
word, 215 
wreath product, 187

Z

$Z^n(G; A)$ — see cocycles
Z[i] — see Gaussian integers
$\mathbb{Z}[\sqrt{2}]$, 278, 311
$\mathbb{Z}[\sqrt{-5}]$, 273, 279, 283ff.

$\mathbb{Z}[(1 + \sqrt{-19})/2]$, 277, 280, 282

Z/nZ, 8#, 17, 56, 75#, 226, 267 

$(\mathbb{Z}/n\mathbb{Z})^\times$, 10, 18, 61, 135, 267, 314, 596

Zariski closed set, 676

Zariski closure, 677#, 691 

Zariski dense, 677, 687 

Zariski topology, 676#, 733 

zero divisor, 226, 689 

zero ring, 224 

zero set, 659

zero-dimensional ideal, 705# 

Zorn's Lemma, 65, 254, 414, 645, 907# 